Towards securing Wireless insulin pump system using unsupervised deep learning technique

doi:10.21203/rs.3.rs-2109728/v1

With the advent of Internet of things (IoT) technology across various fields give arise to occurrence of many smart objects/things. These kind of smart objects involves even in medical area to achieve smart health care monitoring system, wearable devices, and medical implanted devices, in general those kind of systems were called as internet of medical things. Hence the tremendous increase in smart devices among medical domain has both pros and cons. Among many existing problems, security issues sounds to be most addressable problem in the IoT based medical application. These IoMT devices were considered to be resource constrained, so it does not possess enough security framework to fight against all sorts of malicious attack as well as data privacy of patients. The malicious attack in IoMT system can bring huge data loss and life threat to patients. The existing solutions suggested for IoMT security issues are relies on supervised learning, so this work is heavily based on unsupervised learning to improve the efficiency of the designed security model. In this paper, an intrusion detection system has been designed for most significant IoMT device namely Insulin pump system for diabetes treatment using deep learning technique in an unsupervised manner. In this model deep autoencoder has been utilized to classify the unauthorized insulin value from the legitimate insulin value and this model used insulin logs of several patients as its dataset. The performance of the designed model has been evaluated using the quality metrics like accuracy, precision, F1-measure, and recall. Furthermore the resultant model is compared and analyzed against existing methodology as well as traditional machine learning classifiers.

Medical devices

IoT

intrusion detection

IoMT

Deep autoencoders

The Internet of things (IoT) can be defined as the network of multiple network which comprises of numerous different technologies such as cloud/fog, wireless sensor network, Software defined networking[1], RFID, Bluetooth etc., Internet of things (IoT) capable of connecting all kinds of objects under single umbrella (network) using aforementioned technologies which makes normal things as smart objects/devices. The term Internet of things can be rephrased based on the device in which it belongs to it. Suppose if an object connected to the IoT which belongs to automobile industry then the technology is renamed as Internet of vehicles (IoV)[2]. In this same way, suppose if any medical devices are converted as smart devices then it is said to be Internet of Medical things (IoMT). Likewise if the connected devices which comes under defense field means then that technology can be identified as Internet of Battlefield Things (IoBT)[3]. This phenomenon shows how the technology “IoT” has play huge role across almost all kinds of field. Obviously, this fame brings the security threat to the application in which it incorporates IoT technology [].

The most sensitive field of IoT application is considered to be medical or health care domain since it involves the life of the human and the number of IoMT devices seems to increase by the year of 2025[4]. This scenario sounds more efficient and reliable security framework for IoT enabled health care system or IoMT(internet of medical things) is necessary[5] and many researchers have done their towards this issue[6]. There are numerous IoMT application/devices has been developed namely Insulin pump system, implantable cardioverter defibrillator etc and these devices facilitate the workload reduction for smart health care system [7]. Out of these devices, insulin pump system is quite popular so in this work, an effective security model is specifically designed and many security issues have been discussed in detailed manner for insulin pump system[8]. The existing solution designed for securing IoMT devices relies on blockchain based architecture[9],access based control mechanisms[10],biometric based scheme[11] and some of them belongs to Machine/deep learning techniques

The main contribution of this work is listed below

Generating three types of attack as adversarial sample in the dataset.

To deploy deep auto encoder to distinguish fake dosage from the genuine dosage and it has been carried out in two ways viz.,binary classification and multilabel classification

To compare and evaluated the designed model with the basic machine learning classifiers.

Ahmed et al[12] proposed an authentication model using machine learning techniques along with trust management for IoMT environment and to ensure the privacy of the patient’s sensitive data. A challenge-response protocol is introduced on both server and client side to provide data security. Sida Gao et al[13] suggested an security framework for the implanted medical security using machine learning technique. Author created appropriate the feature set to determine normal as well as malicious behavior of the medical device security using decision tree learning. Mohamed et al[14] proposed a malware detection system for IoMT environment. Author has deployed learning based Deep Q-learning networks to secure patient data in IoMT platform for authentication process. Usman et al[15] proposed the detection of malicious attack establish in Insulin pump system with the help of deep learning along with gesture recognition system. Author utilized three month log of insulin dosage from patient’s history and trained LSTM RNN with that data. So it is used to predict the future dosage of insulin. In case of any major deviation in the predicted value, that communication is considered to be malicious and then the patient will establish the new request based on the request. Heena et al[16] suggested an anomaly detection system for Deep brain stimulator using deep learning techniques. Deep brain stimulator is consider to be the most prominent devices among all the implantable medical devices. Because this device is used to stimulate the brain by providing pulse to handle brain disorders like movement disorder, epilepsy etc., In this regard, attacker may tries to stimulate pulse by sending wrong commands to the implantable device which is in patient’s brain. This scenario may leads even to death. Here Author utilized LSTM to predict the rest tremor velocity. Thereby the fake stimulation can be determined at high accuracy. AKM Iqtidar et al[17] illustrated a malware detection framework for Smart health care system using machine learning techniques so called Healthgaurd. This model observes the different sign from various medical devices to extract the pattern of the normal as well as malicious activities. This model incorporates using four machine learning techniques such as Artificial Neural Network, DecisionTree, Random Forest, k-Nearest Neighbor. Rakesh Kumar Mahendran et al[18] demonstrated fuzzy based biometric authentication scheme for IoT enabled Body sensor networks. In order to Author utilizes fuzzy vault along with fuzzy extractor for biometric authentication. In this approach, fuzzy extractor is used to retrieve the features from the preprocessed ECG signal and those features were extracted in such a way that private key cannot be misused by the hacker in biometric authentication scheme. Hei et al [19] suggested a pattern based access control scheme for securing the wireless insulin pump system. To achieve this, author has employed supervised machine learning technique namely support vector machine (SVM) and the model is trained and tested using patient’s insulin record maintained for several months. Rathore et al [20] presented an idea to protect the insulin pump system from fake insulin dosages using deep learning techniques namely multilayer perceptron and the reliability of the designed model is evaluated using Bayesian network. Rathore et al[21] extended her previous work towards securing medical device using MLP along with bayesian network by implementing this model in FPGA chip

At the end of the literature survey, so far deep autoencoder is not utilized for IoMT security and it is so obvious there is no model in an unsupervised way. Recently many researchers has proved deep auto-encoder performs well for the purpose of intrusion detection in network environment. So in this deep auto encoders has been used to detect the fake dosage command which is issued by the attacker to the implantable medical device (insulin pump)

This section describes about the methods and materials used for the proposed model.

3.1. Wireless Insulin pump System

Wireless insulin pump system is one of the most widely implantable medical device which is used to deliver/inject required amount insulin on the basis of regular intervals of time. This insulin pump system mainly consists of internet enabled devices such as insulin pump, continuous glucose monitor and remote control. The insulin pump is the main component which is directly connected to the patient and it will supply the instructed amount of insulin to the patient. The dosage supply can be divided into two types namely basal and bolus dosage. The basal dosage specifies the dosage which is given to the patient in a distributive manner over a specific period of time i.e sequence of dosage given between meals and at night. The bolus dosage is a kind of single shot dosage given to overcome the high level of glucose exist in the blood. The next component continuous glucose monitor is a device which is used to keep track of blood glucose level for the patient and to feed the required insulin to them. The remote control is capable of communicating with insulin pump as well as glucose monitor. The above mentioned components are always connected through the wireless medium, its communication can be easily tampered by attackers.

3.2. Deep Auto-encoder

A Deep auto encoder is an unsupervised technique which is primarily used for compression purposes but recently it gain focus for the anomaly detection techniques. Basically deep auto-encoder comprises of three major components namely encoder, code and decoder. The encoder is part of neural network which is used to compress the input data based on the number of the neurons given in the network layers. The component “code” just transfer the compressed input produced by the encoder to the decoder part. The component “part” is used to choose the most significant features which can be transferred to the decoder step. The decoder is another part of neural network and it will reconstruct the original input data. The performance of this technique is depends on the value assigned for the hyperparameters which is given below:

Loss Function

This function is used to calculate the loss for each iteration at the time of execution. Most probably mean square error is considered to be well suited for auto-encoder.

Number of layers

This denotes the number of network layers used for encoder as well as decoder. Always the number of layers used for encoder will be equal to decoder.

Number of nodes in each layer

The number of nodes per layer decreases with each subsequent layer of the encoder, and increases back in the decoder. The decoder is symmetric to the encoder in terms of the layer structure.

Activation Function

This function is used to compute the corresponding output for each node in the given layers. Most probably it can be “relu” or “tanh”

This section elaborates the work flow of the proposed intrusion detection system designed insulin pump system. The overall view of the architecture is given in Fig. 1 and the security model designed using deep learning techniques is detailed in Fig. 2.

4.1. Data Collection and Preprocessing

This model is evaluated and tested using the previous logs of 70 diabetes patients obtained from UCI repository [22]. All the patient records were combined and loaded from the text file to .csv file. The input attributes are Patient id, Date, Time, Code, Value and the output attribute is Label. Since all the patient records were combined, additionally ‘patient id’ is included as one of the attributes in the existing dataset.

4.2. Classification

The classification process involves identifying the fake dosage which comes along with the genuine dosage. In this regard, a deep auto encoder is implemented to categorize the dosage correspondingly with high accuracy rate with low false alarm rate. The implementation of deep auto-encoder is carried out in two ways for each kind of classification.

4.2.1 Binary Classification

In this implementation, autoencoder is trained only with one class among two available class i.e fake and genuine. This autoencoder is pre trained only with genuine instances, so that this model is capable of reconstruct the genuine dosage alone at low reconstruction error. For fake dosage instance, the reconstruction error will be high and it is easily detected using threshold value and its architecture is explained in Fig. 3.

4.2.2 Multi label Classification

For multi label classification, the decoder part is removed from the autoencoder model and then the last layer is connected with softmax function to perform classification and its architecture is given in Fig. 4.

In the proposed model, unsupervised algorithm deep auto encoder was implemented on a Core i3 Laptop with 2.30 GHz CPU and 4 GB RAM using keras library with tensorflow as backend in python version 3.7 software environment. The performance analysis of the designed model is measured using the following performance monitors

True Positive (TP) denotes that fake dosages are correctly predicted as fake.

True Negative (TN) denotes that genuine dosages are correctly predicted as genuine.

False Positive (FP) denotes that genuine dosages is wrongly detected as an fake.

False Negative (FN) denotes that fake dosages is wrongly detected as genuine.

1) Accuracy:

Accuracy can be defined as the ratio between the number of correctly predicted samples to the total number of samples and it is calculated using Eq. (1)

$$Accuracy=\frac{TP+TN}{TP+TN+FP+FN}$$

2) Precision

Precision can be termed as the ability of the classifier to correctly label fake dosage as attacks. Eq. (2s) is used to calculate the precision of the classifier.

$$Precision=\frac{TP}{TP+FP}$$

2

3) Recall or Detection rate

Recall or detection rate can be defined as the number of correctly detected fake dosaages. Eq. (3) is used to calculate the recall of the classifier.

$$Recall=\frac{TP}{TP+FN}$$

3

4) F-measure

F-measure can be defined as the weighted harmonic mean of precision and recall. Eq. (4) is used to calculate the F-measure of the classifier.

$$F-measure=2*\frac{Precision*Recall}{Precision+Recall}$$

4

5.1. Dataset Description

The Dataset has several logs of insulin pump values for 70 different patients and this dataset is publicly available in the UCI repository [23]. Each patient record possess nearly 1000 recorded samples either by system or manually. As mentioned earlier in section (),the dataset has only four attributes those are considered as inputs and the output attribute ‘Label’ has to include manually by assigning values either ‘0’ or ‘1’.

For binary classification, the assignment of values is done by referring the values given in the attribute ‘code’. That means if the value of code is equal to 48, 57, 72 is belongs to unspecified category so it has the label value of ‘1’ (fake dosage) and the remaining code values is considered to be genuine dosages which has label value of ‘0’.

For multilabel classification, the adversarial samples were introduced to simulate the four types of attack namely long resume, Single acute overdose, single acute underdose and chronic overdose. In this regard, the input attribute ‘value’ is changed according to the behaviour of below mentioned attack. The Table 1 explores the sample distribution of the generated dataset

Long Resume

Long resume is a kind of attack in which attacker tends to send same insulin value to the insulin pump for over a period of month or week. This phenomenon leads the patient’s life to the serious illness.

Single acute overdose/underdose

This attack sends the manipulated insulin value to the insulin pump, that value will be either underdose or overdose. This dosage will be injected to the patient once in a while but not continuously for particular duration.

Chronic overdose

This attack is carried out by the injecting overdose to the patient for over the period of one month or one week. This attack seems to be very serious than the above mentioned attacks and it will bring life threat to patient.

Table 1

Sample Distribution of Dataset
Type	Number of Samples
Type	Training	Testing
Fake	2318	579
Genuine	20757	5189

5.2. Evaluation of Binary Classification

The above Fig. 5 illustrates about the model loss occur at training and testing phase. The blue line indicates the training loss whereas the red line indicates the validation loss accordingly. These two lines convergence nicely when the number of epochs keeps on increasing. The values of both loss and validation loss are very close to each other. The Fig. 6a demonstrates about the distribution of reconstruction error calculated for fake dosage alone whereas Fig. 6b describes about both genuine as well as fake. These two diagrams shows that reconstruction error for fake dosage is higher than the genuine dosages.

The Fig. 7 explores the ROC curve plots between true positive rate and false positive rate. For this binary classification using deep auto encoder, around 0.646 is obtained as Area under curve (AUC) value. This shows how perfectly this model predicted the normal instances as normal and attack instances as attack in an unsupervised manner.

Figure 8 shows the precision-recall curve of deep auto encoder for the task of binary classification. This curve explores the relationship between recall and precision generated for various threshold values correspondingly. The larger area under the precision-recall curve denotes the highest precision and recall value achieved by the classifier. This curve shows the value of average precision is equal to 0.887.

Table 2

Comparative analysis for various parameters (Binary Classification)
1-layer
No of Neurons	Accuracy	Loss	Val.Acc	Val.loss
16	26.959	4375.794	26.116	4625.173
24	26.959	4375.794	26.116	4625.173
32	26.959	4375.794	26.116	4625.173
64	26.959	4375.794	26.116	4625.174
2-layer
16	99.97	2.984	99.97	2.967
24	99.97	2.984	99.97	2.967
32	99.97	2.485	99.98	2.465
64	99.97	9.648	99.97	9.411
3-layer
16	99.98	3.536	999.8	3.524
24	99.97	2.981	99.97	2.965
32	99.97	2.476	99.97	2.456
64	99.97	9.640	99.97	9.403
4-layers
16	9997	3.528	99.97	3.516
24	99.97	2.978	99.97	2.962
32	99.98	2.477	99.98	2.457
64	99.98	9.647	99.98	9.419

In Table 2, the performance of autoencoder is evaluated with various number of hidden layers as well as number of neurons for binary classification. At the end of the evaluation, the most optimized number of layers is 3 with 16 neurons.

5.3 Evaluation for multilabel classification

This Table 3 shows comparative analysis for multi-label classification using various values assigned for the parameters, the optimized number of encoding layer is 64 with neurons and it is highlighted in table. From this observation it proves that for both kind of classification, there is only slight variation in accuracy and loss is exist, even if the layers and neurons got changed.

Table 3

Comparative analysis for various parameters (Multilabel Classification)
3-layers
No of Neurons	Accuracy	Loss	Val.Acc	Val.loss
32	95.349	0.699	94.913	0.765
64	95.349	0.146	94.913	0.171
128	95.304	0.581	94.913	0.671
4-layers
32	95.349	0.229	94.913	0.264
64	95.349	0.702	94.913	0.766
128	95.278	0.160	94.720	0.189
5-layers
64	95.323	0.210	94.913	0.241
128	95.053	0.184	94.354	0.215
256	95.284	0.180	94.913	0.208
6-layers
128	95.349	0.197	94.913	0.230
256	95.349	0.200	94.913	0.231
512	95.362	0.167	94.913	0.191

The above graph (Fig. 10) explores the performance comparison between existing solutions designed for insulin pump system using machine/deep learning techniques with the proposed solution. This shows that the proposed model outperforms the other existing solution at high rate. In Table 4, the performance metrics of proposed DL model is compared with the existing machine learning model. This analysis shows that the proposed DL model outperforms the existing machine learning classifiers.

Table 4

Comparative analysis of Proposed model with ML models
Methodology		Accuracy	Precision	Recall	F1-measure
Autoencoder (Binary classification)		99.98	89.85	91.05	93.45
Autoencoder(Multi label Classification)	Normal	95.701	0.95	1.000	0.97
	Long resume	94.223	0.95	1.000	0.96
	Single acute overdose	95.113	0.95	1.000	0.98
	Single acute underdose	95.212	0.94	99.99	0.97
	Chronic overdose	94.123	0.94	99.99	0.96
Support Vector Machine(SVM)		89.851	0.898	1.000	0.946
Decision Tree		95.052	0.972	0.972	0.972
Naïve Bayes		43.019	0.926	0.397	0.556

This work focuses on providing security framework for implantable medical devices like wireless insulin pump system. Unlike existing security solution proposed for insulin pump system, this model is completely based on unsupervised manner to enhance its reliability and performance. Thus the proposed methodology outperforms the existing solution interms of quality metrics such as accuracy, precision, recall and F1-measure. In the same way, the proposed model was also evaluated against some existing traditional machine learning to prove the capability of the designed solution.

This methodology is carefully designed with minimum computational workload along with minimum hardware requirement in order to deploy this model in resource constrained medical device. In future, this methodology is adopted to deploy on fog nodes which is connected with the particular medical device and this technique can be further extend to implement on other implantable medical devices like deep brain implants, implantable cardioverter defibrillators.

Funding Declaration

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author Contribution

M.Shobana make contributes to write the research article and implement the concept to retrieve the result under the guidance of Dr. S. Poonkuzhali. Dr. S. Poonkuzhali completely evaluates the obtained result of this work and She reviewed the entire article.

Conflict of Interest

The authors have no conflicts of interest to declare. All co-authors have seen and agree with the contents of the manuscript and there is no financial interest to report. We certify that the submission is original work and is not under review at any other publication.

Dawoud, A., Shahristani, S., & Raun, C. (2018). Deep learning and software-defined networks: Towards secure IoT architecture. Internet of Things, 3, 82-89.
Yang, F., Wang, S., Li, J., Liu, Z., & Sun, Q. (2014). An overview of internet of vehicles. China communications, 11(10), 1-15.
“The Internet of Things For Defense,” Wind River Systems, 2015.
Shobana, M., & Poonkuzhali, S. (2020, February). A novel approach to detect IoT malware by system calls using Deep learning techniques. In 2020 International Conference on Innovative Trends in Information Technology (ICITIIT) (pp. 1-5). IEEE.
A Guide to the Internet of Things Infographic, Intel. (n.d.) https:// www.intel.com/content/www/us/en/internet-ofthings/ infographics/guide-to-iot.html (accessed October 13, 2017).
Shivaji Kulkarni, Shrihari Durg, Nalini Iyer,”Internet of Things (IoT) Security,” IEEE, 2016, pp. 821-824.
Alsubaei, F., Abuhussein, A., Shandilya, V., & Shiva, S. (2019). IoMT-SAF: Internet of medical things security assessment framework. Internet of Things, 8, 100123.
A. O. Putri, M. A. M. Ali, M. Saad, and S. S. Hidayat, “Wearable sensor and internet of things technology for better medical science: A review,” International Journal of Engineering and Technology(UAE),vol. 7, no. 4, pp. 1–4, 2018
Paul, N., Kohno, T., & Klonoff, D. C. (2011). A review of the security of insulin pump infusion systems. Journal of diabetes science and technology, 5(6), 1557-1562
Malamas, V., Dasaklis, T., Kotzanikolaou, P., Burmester, M., & Katsikas, S. (2019, July). A forensics-by-design management framework for medical devices based on blockchain. In 2019 IEEE World Congress on Services (SERVICES) (Vol. 2642, pp. 35-40). IEEE.
Palve, A., & Patel, H. (2018, November). Towards securing real time data in IoMT environment. In 2018 8th international conference on communication systems and network technologies (CSNT) (pp. 113-119). IEEE.
Pirbhulal, S., Wu, W., & Li, G. (2018, November). A biometric security model for wearable healthcare. In 2018 IEEE International Conference on Data Mining Workshops (ICDMW) (pp. 136-143). IEEE.
Mawgoud, A. A., Karadawy, A. I., & Tawfik, B. S. (2019). A Secure Authentication Technique in Internet of Medical Things through Machine Learning. arXiv preprint arXiv:1912.12143.
Gao, S., & Thamilarasu, G. (2017, July). Machine-learning classifiers for security in connected medical devices. In 2017 26th International Conference on Computer Communication and Networks (ICCCN) (pp. 1-5). IEEE.
Shakeel, P. M., Baskar, S., Dhulipala, V. S., Mishra, S., & Jaber, M. M. (2018). Maintaining security and privacy in health care system using learning based deep-Q-networks. Journal of medical systems, 42(10), 186.
Ahmad, U., Song, H., Bilal, A., Saleem, S., & Ullah, A. (2018, August). Securing insulin pump system using deep learning and gesture recognition. In 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE) (pp. 1716-1719). IEEE.
Rathore, H., Al-Ali, A. K., Mohamed, A., Du, X., & Guizani, M. (2019). A novel deep learning strategy for classifying different attack patterns for deep brain implants. IEEE Access, 7, 24154-24164.
Newaz, A. I., Sikder, A. K., Rahman, M. A., & Uluagac, A. S. (2019, October). Healthguard: A machine learning-based security framework for smart healthcare systems. In 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) (pp. 389-396). IEEE.
Mahendran, R. K., & Velusamy, P. (2020). A secure fuzzy extractor based biometric key authentication scheme for body sensor network in Internet of Medical Things. Computer Communications, 153, 545-552.
Hei, X., Du, X., Lin, S., & Lee, I. (2013, April). PIPAC: Patient infusion pattern based access control scheme for wireless insulin pump system. In 2013 Proceedings IEEE INFOCOM (pp. 3030-3038). IEEE.
Rathore, H., Al-Ali, A., Mohamed, A., Du, X., & Guizani, M. (2017, December). DLRT: Deep learning approach for reliable diabetic treatment. In GLOBECOM 2017-2017 IEEE Global Communications Conference (pp. 1-6). IEEE.
Rathore, H., Wenzel, L., Al-Ali, A. K., Mohamed, A., Du, X., & Guizani, M. (2018). Multi-layer perceptron model on chip for secure diabetic treatment. IEEE Access, 6, 44718-44730.
https://archive.ics.uci.edu/ml/datasets/diabetes

No competing interests reported.

Towards securing Wireless insulin pump system using unsupervised deep learning technique

Status:

Version 1

Abstract

Figures

1. Introduction

2. Related Works

3. Background

3.1. Wireless Insulin pump System

3.2. Deep Auto-encoder

4. Proposed Model

4.1. Data Collection and Preprocessing

4.2. Classification

4.2.1 Binary Classification

4.2.2 Multi label Classification

5. Result And Discussion

5.1. Dataset Description

5.2. Evaluation of Binary Classification

5.3 Evaluation for multilabel classification

Conclusion And Future Work

Declarations

References

Additional Declarations

Status:

Version 1