A convolutional neural network has an input layer, an output layer, and hidden layers. The hidden layers consist of convolutional layers, a flattening layer, and a fully connected layer. Fig. 2 shows the architecture of the proposed convolutional neural network.
Deep learning is a rapidly growing branch of machine learning based on learning multiple levels of representation. The convolutional neural network (CNN) is one kind of deep neural network, and it can be trained in parallel. This work presents a detailed analysis of the CNN algorithm, covering both the forward pass and back-propagation. The network is then applied to a typical diabetic data set, implemented in Java with Weka. In addition, by measuring the actual time of the forward and backward evaluation, the maximal speed-up and parallel efficiency are analysed theoretically.
4.1 Role of Convolutional Neural Networks
In general, a CNN comprises two kinds of layers. The first is the feature extraction layer: the input of every neuron is connected to the local receptive field of the previous layer, from which the local feature is extracted. Once a local feature has been extracted, its positional relationship to the other features is evaluated. The second is the feature map layer: each computational layer of the network consists of multiple feature maps, every feature map is a plane, and the neurons in a plane share the same weights.
The feature map uses the sigmoid function as the activation function of the convolutional network, which makes the extracted features shift-invariant and reduces the number of free parameters in the network. Each convolutional layer in the CNN is followed by a computational layer that calculates the local average and performs a second extraction; this two-stage feature extraction structure reduces the size of the data matrix. Multi-dimensional input vectors from the heart disease data set can enter the network directly, which avoids the complexity of data reconstruction in the feature extraction and classification procedure.
4.2 Feature Selection Algorithm
Several feature ranking and feature selection algorithms have been proposed in the machine learning literature. The purpose of these algorithms is to remove unfit or unnecessary features from the feature vector. In this work, feature ranking and selection follow the first two steps of the overall architecture: subset generation and subset evaluation, which rank all features in each data set. A filter approach was used to evaluate the subsets, as sketched below.
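The following is a minimal Weka sketch of this filter-based ranking step, under the assumption that the data set is available as an ARFF file (the file name here is illustrative); InfoGainAttributeEval is used as the filter measure in line with Sect. 4.3.

```java
import weka.attributeSelection.AttributeSelection;
import weka.attributeSelection.InfoGainAttributeEval;
import weka.attributeSelection.Ranker;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

public class FeatureRanking {
    public static void main(String[] args) throws Exception {
        // Subset generation input: the full feature vector of the data set
        Instances data = new DataSource("heart-disease.arff").getDataSet();
        data.setClassIndex(data.numAttributes() - 1);

        // Subset evaluation in filter mode: rank every feature by its merit
        AttributeSelection selector = new AttributeSelection();
        selector.setEvaluator(new InfoGainAttributeEval());   // filter measure
        selector.setSearch(new Ranker());                     // ranking search
        selector.SelectAttributes(data);

        // rankedAttributes() returns {attribute index, merit} pairs, best first
        for (double[] row : selector.rankedAttributes()) {
            System.out.printf("%-20s %.4f%n",
                    data.attribute((int) row[0]).name(), row[1]);
        }
    }
}
```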
4.3 Information Gain
In the proposed feature selection, both the class membership and the presence or absence of a particular term are treated as random variables, and one evaluates how much information about the class membership is gained by knowing the presence/absence statistics, as in decision tree induction. So, if the class membership is taken as a random variable C with two values, positive and negative, and a term is likewise treated as a random variable T with two values, present and absent, then applying the information-theoretic definition of mutual information gives,
\(\mathrm{IG}(T) = H(C) - H(C \mid T) = \sum_{\tau,\,c} P(C = c, T = \tau)\,\ln\!\left[\dfrac{P(C = c,\, T = \tau)}{P(C = c)\,P(T = \tau)}\right]\) (1)
Here, τ ranges over {present, absent} and c ranges over {c+, c−}. As pointed out above, this is the measure of information about C (the class label) that is gained by knowing T (the presence or absence of a term).
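As an illustration of Eq. (1), the following is a minimal Java sketch that computes the information gain of a binary term from the 2×2 class/term contingency counts; the method name and argument layout are ours, and the natural logarithm matches the ln in Eq. (1).

```java
/** Information gain IG(T) = H(C) - H(C|T) of a binary term T for a binary
 *  class C, computed from 2x2 contingency counts.
 *  counts[c][t]: c = 0 (negative) or 1 (positive) class,
 *  t = 0 (term absent) or 1 (term present). */
static double informationGain(double[][] counts) {
    double total = counts[0][0] + counts[0][1] + counts[1][0] + counts[1][1];
    double hC = 0.0;          // class entropy H(C)
    double hCgivenT = 0.0;    // conditional entropy H(C|T)
    for (int c = 0; c < 2; c++) {
        double pC = (counts[c][0] + counts[c][1]) / total;
        if (pC > 0) hC -= pC * Math.log(pC);
    }
    for (int t = 0; t < 2; t++) {
        double pT = (counts[0][t] + counts[1][t]) / total;
        for (int c = 0; c < 2; c++) {
            double pCT = counts[c][t] / total;           // joint P(C=c, T=t)
            if (pCT > 0) hCgivenT -= pCT * Math.log(pCT / pT);
        }
    }
    return hC - hCgivenT;
}
```

For example, `informationGain(new double[][]{{30, 10}, {5, 55}})` returns the gain for a term that is mostly present in positive instances and absent in negative ones.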
4.4 Back Propagation Algorithm
The CNN is a type of multilayer perceptron that is specifically designed for the recognition of two-dimensional data. It has several layers: an input layer, convolution layers, sub-sampling layers, and an output layer. The CNN algorithm has two primary procedures: convolution and sub-sampling. The convolution procedure applies a trainable filter Fx to convolve the input data (in the first stage the input is the raw data; for later convolutions the input is the feature map of the preceding layer), then adds a bias bx to obtain the convolution layer Cx. The sub-sampling procedure pools the n points of each neighbourhood into a single point, weights the result by the scalar Wx+1, adds a bias bx+1, and passes it through an activation function, producing a feature map Sx+1 that is n times smaller.
The central ideas of the CNN are local receptive fields, weight sharing, and sub-sampling in time or space; together they extract features while reducing the number of training parameters. The benefit of the CNN technique is that it avoids explicit feature extraction and learns the features implicitly from the training data. Because the neurons on the same feature map share weights, the network can learn in parallel and its complexity is reduced. Sub-sampling in time or space gives the network a degree of robustness to scale and distortion. The input data and the network topology can match very well, which gives the CNN particular advantages in speech recognition and data set processing. The output of the neuron at row x, column y in the lth convolution layer and kth feature map is:
\(O_{x,y}^{l,k} = \tanh\left(\sum_{t=0}^{f-1}\sum_{r=0}^{k_h}\sum_{c=0}^{k_w} W_{(r,c)}^{(k,t)}\, O_{(x+r,\, y+c)}^{(l-1,\, t)} + \mathrm{Bias}^{(l,k)}\right)\) (2)
Here, f is the number of convolution kernels in a feature map. The output of the neuron at row x, column y in the lth sub-sampling layer and kth feature map is:
\(O_{x,y}^{l,k} = \tanh\left(W^{k}\sum_{r=0}^{s_h}\sum_{c=0}^{s_w} O_{(x\times s_h + r,\; y\times s_w + c)}^{(l-1,\, k)} + \mathrm{Bias}^{(l,k)}\right)\) (3)
The output of the jth neuron in the lth hidden layer H is:
\(O_{(l,j)} = \tanh\left(\sum_{k=0}^{s-1}\sum_{x=0}^{s_h}\sum_{y=0}^{s_w} W_{(x,y)}^{(j,k)}\, O_{(x,y)}^{(l-1,\, k)} + \mathrm{Bias}^{(l,j)}\right)\) (4)
Here, s is the number of feature maps in the sub-sampling layer. The output of the ith neuron in the lth output layer F is:
\(O_{(l,i)} = \tanh\left(\sum_{j=0}^{H} O_{(l-1,\, j)}\, W_{(i,j)}^{l} + \mathrm{Bias}^{(l,i)}\right)\) (5)
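To make Eqs. (2) and (3) concrete, the following is a minimal Java sketch of the forward pass of one convolution layer and one sub-sampling layer for a single feature map; the array shapes, method names, and the use of average pooling with a single scalar weight are our assumptions, not part of the original formulation.

```java
/** Convolution forward pass for one feature map (Eq. (2), single input map):
 *  slide the kernel over the input, add the bias, apply tanh. */
static double[][] convolveForward(double[][] in, double[][] kernel, double bias) {
    int kh = kernel.length, kw = kernel[0].length;
    int oh = in.length - kh + 1, ow = in[0].length - kw + 1;
    double[][] out = new double[oh][ow];
    for (int x = 0; x < oh; x++)
        for (int y = 0; y < ow; y++) {
            double s = bias;
            for (int r = 0; r < kh; r++)
                for (int c = 0; c < kw; c++)
                    s += kernel[r][c] * in[x + r][y + c];   // weighted local receptive field
            out[x][y] = Math.tanh(s);
        }
    return out;
}

/** Sub-sampling forward pass (Eq. (3)): pool each sh-by-sw neighbourhood,
 *  weight by the scalar w, add the bias, apply tanh. */
static double[][] subSampleForward(double[][] in, int sh, int sw, double w, double bias) {
    int oh = in.length / sh, ow = in[0].length / sw;
    double[][] out = new double[oh][ow];
    for (int x = 0; x < oh; x++)
        for (int y = 0; y < ow; y++) {
            double s = 0;
            for (int r = 0; r < sh; r++)
                for (int c = 0; c < sw; c++)
                    s += in[x * sh + r][y * sw + c];        // sum over the neighbourhood
            out[x][y] = Math.tanh(w * s + bias);
        }
    return out;
}
```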
4.5 Modified Back-Propagation
The training speed depends on the technique used to evaluate the output of the neural network; in particular, matrix notation is a convenient way of expressing the computations used in back-propagation. Back-propagation is a neural network learning algorithm. The neural networks field was originally developed by psychologists and neurobiologists who wanted to build and test computational analogues of neurons. A neural network is a set of connected input/output units in which every connection has a weight associated with it. During the learning phase, the network learns by adjusting the weights so that it can predict the correct class label of the input tuples.
Output deviation of the kth neuron in output layer O:
d(\({\text{O}}_{\text{k}}^{0})= {\text{y}}^{\text{k}}- {{\tau }}^{\text{k}}\) (6)
Input deviation of the kth neuron in output layer:
\(d(I_{k}^{0}) = (y^{k} - \tau^{k})\,\varphi(v_{k}) = d(O_{k}^{0})\,\varphi(v_{k})\) (7)
Weight and bias variation of the kth neuron in the output layer O:
∆\({\text{W}}_{\text{k},\text{x}}^{0}\) = d(\({\text{I}}_{\text{k}}^{0})\) \({\text{y}}_{\text{k},\text{x}}\) (8)
∆\({\text{B}\text{i}\text{a}\text{s}}_{\text{k}}^{0 }=\)d(\({\text{I}}_{\text{k}}^{0})\) (9)
Output bias of the kth neuron in the hidden layer H:
\(d(O_{k}^{H}) = \sum_{i} d(I_{i}^{0})\, W_{i,k}\) (10)
where the sum runs over all neurons i of the output layer. Input bias of the kth neuron in the hidden layer H:
d(\({\text{I}}_{\text{k}}^{\text{H}})\) = φ(\({\text{v}}_{\text{k}}\)) d(\({\text{O}}_{\text{k}}^{\text{H}})\) (11)
Weight and bias variation for the connection from the neuron at row x, column y of the mth feature map in the previous layer to the kth neuron in the hidden layer H:
∆\({\text{W}}_{\text{m},\text{x},\text{y}}^{\text{H},\text{k}}\) = d(\({\text{I}}_{\text{k}}^{\text{H}})\) \({\text{y}}_{\text{x}.\text{y}}^{\text{m}}\) (12)
∆\({\text{B}\text{i}\text{a}\text{s}}_{\text{k}}^{\text{H} }=\)d(\({\text{I}}_{\text{k}}^{\text{H}})\) (13)
Output bias of the neuron at row x, column y in the mth feature map of the sub-sampling layer S:
\(d(O_{x,y}^{S,m}) = \sum_{k} d(I_{k}^{H})\, W_{m,x,y}^{H,k}\) (14)
Input bias of the neuron at row x, column y in the mth feature map of the sub-sampling layer S:
d(\({\text{I}}_{\text{x},\text{y}}^{\text{S},\text{m}})\) = φ(\({\text{v}}_{\text{k}}\)) d(\({\text{O}}_{\text{x},\text{y}}^{\text{S},\text{m}})\) (15)
Weight and bias variation of the mth feature map in the sub-sampling layer S:
∆\(W^{S,m} = \sum_{x=0}^{f_h}\sum_{y=0}^{f_w} d\!\left(I_{\frac{x}{2},\frac{y}{2}}^{S,m}\right) O_{x,y}^{C,m}\) (16)
Here, C denotes the convolution layer:
∆\({\text{B}\text{i}\text{a}\text{s}}^{\text{S},\text{m}}= \sum _{\text{x}=0}^{\text{f}\text{h}}\sum _{\text{y}=0}^{\text{f}\text{w}}\text{d}\)(\({\text{O}}_{\text{x},\text{y}}^{\text{S},\text{m}})\) (17)
Output bias of the neuron at row x, column y in the kth feature map of the convolution layer C:
\(d(O_{x,y}^{C,k}) = d\!\left(I_{\frac{x}{2},\frac{y}{2}}^{S,k}\right) W^{k}\) (18)
Input bias of the neuron at row x, column y in the kth feature map of the convolution layer C:
\({\text{d}(\text{I}}_{\text{x},\text{y}}^{\text{c},\text{k}})\) = φ(\({\text{v}}_{\text{k}}\)) d(\({\text{O}}_{\text{x},\text{y}}^{\text{C},\text{k}})\) (19)
Weight variation of row r, column c in the mth convolution kernel, corresponding to the kth feature map in the lth convolution layer C:
∆\(W_{r,c}^{k,m} = \sum_{x=0}^{f_h}\sum_{y=0}^{f_w} d\!\left(I_{x,y}^{C,k}\right) O_{x+r,\, y+c}^{l-1,\, m}\) (20)
Total bias variation of the convolution kernel:
∆\({\text{B}\text{i}\text{a}\text{s}}^{\text{C},\text{k}}= \sum _{\text{x}=0}^{\text{f}\text{h}}\sum _{\text{y}=0}^{\text{f}\text{w}}\text{d}\)(\({\text{I}}_{\text{x},\text{y}}^{\text{C},\text{k}})\) (21)
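As a minimal illustration of Eqs. (6)–(9), the sketch below applies the output-layer deltas and the corresponding weight and bias updates in plain Java; tanh is assumed as the activation φ (so its derivative is 1 − tanh²(v)), and the variable names and learning-rate step are illustrative rather than part of the original formulation.

```java
/** Output-layer back-propagation step, Eqs. (6)-(9):
 *  delta = (y - target) * phi'(v), then dW = delta * hidden input, dBias = delta.
 *  output  : network outputs y^k          target : desired outputs tau^k
 *  preAct  : pre-activation values v_k    hidden : outputs of the hidden layer
 *  w, bias : output-layer parameters      lr     : learning rate */
static void outputLayerUpdate(double[] output, double[] target, double[] preAct,
                              double[] hidden, double[][] w, double[] bias, double lr) {
    for (int k = 0; k < output.length; k++) {
        double dOut = output[k] - target[k];                          // Eq. (6)
        double t = Math.tanh(preAct[k]);
        double dIn = dOut * (1.0 - t * t);                            // Eq. (7), tanh derivative
        for (int x = 0; x < hidden.length; x++)
            w[k][x] -= lr * dIn * hidden[x];                          // Eq. (8), gradient step
        bias[k] -= lr * dIn;                                          // Eq. (9), bias step
    }
}
```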
4.6 CNN Algorithm Process Steps
Step 1: Load the ARFF data set
Step 2: Select features using information gain and ranking
Step 3: Apply the classification algorithm
Step 4: Compute the filter response Fx of the input layer for every feature
Step 5: Calculate the bias bx for every feature
Step 6: Pass the resulting feature map forward from the input layer
Step 7: Evaluate the convolution kernels of each feature map
Step 8: Compute the sub-sampling layer and its feature values
Step 9: Back-propagate the input deviation of the kth neuron in the output layer
Step 10: Finally, output the selected features and the classification results
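A minimal end-to-end sketch of these steps in Java with Weka is given below; the ARFF file name is illustrative, and since stock Weka does not ship a CNN classifier, a MultilayerPerceptron is used here only as a stand-in for the convolutional network described above.

```java
import java.util.Random;
import weka.attributeSelection.AttributeSelection;
import weka.attributeSelection.InfoGainAttributeEval;
import weka.attributeSelection.Ranker;
import weka.classifiers.Evaluation;
import weka.classifiers.functions.MultilayerPerceptron;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

public class CnnPipelineSketch {
    public static void main(String[] args) throws Exception {
        // Step 1: load the ARFF data set (file name is illustrative)
        Instances data = new DataSource("heart-disease.arff").getDataSet();
        data.setClassIndex(data.numAttributes() - 1);

        // Step 2: information-gain ranking and feature selection
        AttributeSelection selector = new AttributeSelection();
        selector.setEvaluator(new InfoGainAttributeEval());
        selector.setSearch(new Ranker());
        selector.SelectAttributes(data);
        Instances reduced = selector.reduceDimensionality(data);

        // Steps 3-10: train and evaluate the classifier on the reduced feature set
        // (MultilayerPerceptron stands in for the CNN described in Sect. 4.4-4.5).
        MultilayerPerceptron net = new MultilayerPerceptron();
        Evaluation eval = new Evaluation(reduced);
        eval.crossValidateModel(net, reduced, 10, new Random(1));
        System.out.println(eval.toSummaryString());
    }
}
```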