Deeper Fine-Tuned Autoencoder for User Datagram Protocol Flooding Network Traffic Detection in Internet of Things

doi:10.21203/rs.3.rs-2442056/v1

Download PDF

Research Article

Deeper Fine-Tuned Autoencoder for User Datagram Protocol Flooding Network Traffic Detection in Internet of Things

https://doi.org/10.21203/rs.3.rs-2442056/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

The increase in the use of Internet of Things (IOT) devices operating online has led to an increase in cyber-attacks with these devices. One of the uncontrolled attacks carried out with a botnet is User Datagram Protocol (UDP) flooding. It is necessary to develop an effective method to detect abnormal UDP flooding traffic IOT devices that are compromised the IOT devices. Detection of anomalies in network traffic is the most effective method. Although machine learning, shallow neural networks and deep learning methods are used to detect abnormal traffic, in this study, it is suggested that the effective measurement metrics should be selected and applied to a fine-tuned auto-coder architecture. The main contribution of the proposed method is that a classification with high accuracy and performance can be performed by encoding the selected features deeper. The proposed method is verified with UDP-flood data in the N-BaIoT and NSL-KDD test datasets. The proposed method proved to be successful in terms of Cohen kappa, f₁ score, sensitivity and accuracy metrics obtained in the experimental results. Experiments in the study showed that the number of optimally selected features was significantly reduced, resulting in the lowest detection time. This enabled a more optimized and feasible design.

Fine-tuned Stacked Sparse Autoencoder

Network Traffic of the Internet of Things

User Datagram Protocol Flooding

Abnormal Traffic Detection

Deep Feature Extraction

Internet of Things (IOT) are devices that are connected to the internet in order to facilitate daily life. These devices generally include collect, communicate, analyze and act processes. The collection, transfer and processing of the data in the environment and the possibility of remote control and monitoring are the flexibility of these devices [1]. The lack of a large transaction volume causes some compromises. One of them is that they do not have versatile features for any protection related to cyber security. This makes these devices suitable for zombie attacks [2]. These devices are generally used by attackers in Transmission Control Protocol (TCP) and User Datagram Protocol (UDP) overflow attacks because of the seizure of the device. While there is a mechanism that contributes to security, such as 3 way handshake in TCP, there is no such situation in UDP. When the IOT device starts broadcasting over UDP, UDP causes flooding [3]. The purpose of this type of attack is to affect the victim's reachability. The flooding can be performed by overflowing with large numbers of requests or by consuming the resources of the central processing unit and memory [4]. In the attack detection, it is necessary to describe the current behavior of the attacker or users trying to invade [5]. UDP flooding detection from an IOT device is important for the performance of the system. Therefore, it is vital to identify these types of attacks.

Various approaches were proposed in the literature to detect these attacks. Celil et al. used supervised and unsupervised learning models and the dataset created with the Provision 737E security camera in the N-BaIoT dataset in their study. The authors reduced the parameters related to traffic metrics measured with 115 different features to 10 in their experiments. The J48 algorithm and EM algorithm trained with 10 features have achieved an accuracy of 99% and 76.73%, respectively. The performance obtained by cross validation. [6]. Kponyo et al. examined Denial of Service (DoS) attacks against IOT devices in their studies. The authors used machine learning and data including Synchronize (SYN), Internet Control Message Protocol (ICMP) and UDP flood attacks. In the study, in which all three attack models were classified as binary, an experiment was not performed only to recognize UDP flood attacks [2]. Guang et al. used the LeNet deep learning algorithm to identify abnormal traffic. They achieved 89.6% accuracy with the 20-dimensional feature vector [7]. Liu et al. used a CNN-based deep learning architecture in their study. The authors stated that the detection of UDP flood attacks is more difficult than acknowledgement (ACK), scan and SYN flood attacks [8]. Rizal et al. used a network forensic approach to detect UDP flood attacks with IOT devices according to traffic flow. The authors detected anomalies in traffic with the network model. [9]. Campus et al. investigated the effects of flood attacks on IOT routing. In particular, they explained how anomalies in network traffic caused by DOS attacks and flood attacks affect routing in IOT [10]. Meidan et al. validated their method with the N-BaIoT dataset. The authors used a deep auto-encoder model for detecting the flooding attacks. The anomalies were gathered from Bashlite and Mirai. A threshold was used for training their model [11]. Shorman et al. proposed a botnet detection model for the IOT network. Before classification, the traffic data in the N-BaIoT dataset were cleaned and rescaled by the authors. With these processed data, the properties are optimized with the Gray Wolf optimization (GWO) algorithm. Problematic traffic was classified using the support vector machines (SVM) algorithm with optimized data [12]. Allotaibi et al. tried to detect IOT cyber-attacks with the deep learning architecture in their stacked structure. The authors stated that the stacked architecture increases the effective accuracy of the study [13]. Wang et al. performed syntactic analysis of protocols affecting network traffic. The authors determined the limits of the traffic on the network of IOT devices with the deep learning architecture. [14]. Su et al. demonstrated in their study that feature selection algorithms increase the classification success of IOT abnormal traffic data if appropriate conditions are met [15]. Shafiq et al. examined feature selection that enables machine learning methods to work more efficiently. They demonstrated that with a Wrapper-based feature selection algorithm, the success of the limiters increased. They achieved an accuracy of 95% in the verification using the N-BaIoT dataset [16]. On the same dataset, Alharbi et al. achieved a precision of 99.82% [17]. On the other hand, Palla and Tayeb detected abnormal traffic on different IOT devices in the N-BaIoT dataset. When the authors ran the ANN and RF algorithms in their study, they were able to detect anomalous traffic on the security camera with an accuracy of only 83.9% with ANN and 75.6% with RF [18]. Nõmm and Bahşi examined the effect of feature selection on the classifier in the N-BaIoT dataset in their study. The authors stated that they achieved over 90% accuracy in SVM with 10 features [19].Wang et al. stated that stacked AEs shorten the data processing time and contribute positively to accuracy [20].

Given the heterogeneity and unexpected traffic of IOT devices, some instinctive constraints of abnormal traffic detection can significantly affect this kind of approach. Flooding will not be detected when attackers often generate traffic similar to normal traffic, but containing malicious activity. Another issue is that the traffic of IOT devices with very heavy streams can be perceived as heavy traffic. For this reason, unsupervised learning can provide a lower false positive. Mirsky et al. attempted to identify abnormal patterns and abnormal traffic in network traffic. The authors use an unsupervised learning method based on an autoencoder [21]. In the proposed study, only the attacks related to the UDP flood were focused on. We can find many articles in the literature using features extracted from traffic streams, but none with a particular focus on UDP flood attacks alone.

In this study, UDP flood attacks were detected with the proposed Minimum redundancy maximum relevance (MRMR) algorithm and fine-tune stacked Autoencoder. 115 features of data traffic were reduced to 57 with the MRMR algorithm. In the Stacked Sparse Autoencoder (SSAE) model, the features were first reduced to 30, then to 15. Afterwards, an accuracy of 99.99% was obtained when 51,773 part of the dataset, 30% of which was reserved for hold-out validation, was applied to the proposed method. When the trained model was validated with the NSL-KDD test dataset prepared from online traffic data and used for testing, 97.14% success was achieved. In experiments, only 12 of the 1983 flooding attacks in the dataset were classified as benign. This success shows that the proposed method for detecting UDP flood attacks works effectively.

The main contribution of this study lies in the model development phase where deeper non-linear features are obtained and fine-tuning is encoded and classified with SSAE. The proposed model is built on the unique hierarchical structure of the SSAE method encoding properties. Increasing the accuracy of the classification by fine-tuning and allowing classification with fewer features decreases the identification time of the anomalies, which is critical for IOT.

The rest of this article is organized into 4 sections. The preliminary information on MRMR and Fine-tuned SSAE algorithms is investigated in Section 2. The experiments performed to show the effectiveness of these methods used to detect UDP-flood attacks in the network with IOT devices and the results of these experiments are presented in Section 3. Finally, the results and future directions of this study are summarized in Section 4.

In the model proposed in the study, the measurements of traffic flow in the network to which the IOT device is connected were classified. First, the optimum selection and integration of the data in the N-BaIoT dataset is performed. In integration, raw data is prepared numerically by encoding each feature column. Training and validation data were then generated to validate the proposed method. Here, the dataset, which was arranged as 70% training 30% test, was used in the experiments to verify the method with the hold-out validation method. N-BaIoT dataset, which is the network traffic measurements performed on Simple home XCS7 1002 WHT model Security Camera, was preferred as an IOT device. Softmax layer was used in classification, which is based on Autoencoder architecture, which was trained in two stages. In the binary classification system, primarily 115 different measurement feature vectors were encoded. The length of this vector was reduced to 57 with the MRMR algorithm. Then, these 57 features were reduced to 30 with the first AE and 20 with the second AE. The UDP flood status was determined by classifying the results obtained from the softmax layer with 20 encoded features. The training was repeated by merging the AE1, AE2 and softmax layers and the classification was fine tuned. Finally, the training data is applied for obtaining the active model. The active model is used for anomaly detection, which is reserved for validation from the NSL-KDD test set and the N-BaIoT dataset. This is used to evaluate SSAE fine-tuned with validation criteria. The flow chart of the proposed method is presented in Fig. 1.

2.1. The Validation Dataset of the Proposed Method

The experiments were performed with the N-BaIoT dataset. This dataset contains the records describing the transmitted network traffic [22]. The records in the dataset consist of 115 features separated by commas. There are 11 classes in the dataset, 10 classes of attack and 1 class of benign [11]. The legacy UDP data of the Simple home XCS7 1002 WHT model Security Cameras traffic data according to the purpose of the study were used. This device is connected to the IOT network via Wi-Fi. Data was obtained both under normal operating conditions and under UDP-flood attack by BASHLITE and Mirai botnet [21]. The dataset contains 151779 lines of vulnerable traffic measurement results with UDP flood. There are 46585 measurement results that belong to normal traffic flow. In addition, NSL-KDD dataset was used in the study to validate the trained model. NSL-KDD dataset is in two parts as train and test [23]. The test dataset was used in the study in order to compare it with other studies in the literature. The dataset contains a total of 41 measurement and feature information [24]. The lines from the test set containing UDP flooding attacks and labeled as benign were used for validation to fit the purpose of the study. All 1983 flooding and 11054 benign data were applied directly to the trained model.

2.2. Problem Formulation

Feature vectors of 198364x115 size in the dataset consist of 115 different measurement data belonging to the network traffic to which the IOT device is connected. 70% of these data were reserved for training and 30% for testing. Each 1x115 size vector in the training set is encoded for each measurement value. The situation for this labeling is shown in Eq. 1.

$$d(n,m)=\sum _{n=1}^{146591}\left({x}_{n}\right[{d}_{1}, {d}_{2},\dots {,d}_{115}],{y}_{m})$$

The expression x in Eq. 1 denotes the vector containing 115 feature data. The inclusion of each row of data in a class as labeled is represented by y. The data used for verification, which is an attack on the metrics from traffic, are shown in Eq. 2.

$$du(n,m)=\sum _{n=1}^{40658}\left({x}_{u}\right[{du}_{1}, {du}_{2},\dots {,du}_{115}],{y}_{u})$$

The x_u parameter in Eq. 2 refers to the rows of data that are the UDP flood attack and are shown to the trained model for the first time. The y_u is defined as flooding attacks as these metrics are known to be flooding attacks.

When 57 features obtained by selecting from the MRMR model are applied to the SSAE model, the 115 features are reduced to the number of hidden layer neurons obtained during the training. In this study, 57 valued feature vectors were coded in the trained SSAE model in order to obtain deep features in a chain, and the number of features was encoded to 15. The vector that is reduced by coding is calculated by Eq. 3.

$$de(n,m)=\sum _{n=1}^{51773}\left({x}_{e}\right[{de}_{1}, {de}_{2},\dots {,de}_{25}],{y}_{m},{y}_{u})$$

The expression x_e in Eq. 3 represents the encoded size of the 57-element vector obtained from the MRMR algorithm applied as SSAE input. The test data (y_m) and the flooding data (y_u) is used for calculating the d_e.

Finally, predictions of UDP flooding attacks against y values are obtained. The predictions made with the Softmax layer are included in the class closer to which class they are in the classification stage. Finally, the classification of UDP flood attacks is made.

2.3. Minimum-redundancy maximum-relevance Feature Selection Algorithm

The most effective feature subset which is of maximal relevance to the target class and minimal redundancy among a feature set is obtained by the MRMR algorithm [25]. These redundancy and relevance measurements are performed with mutual information. The parameter I represents the mutual information in the MRMR algorithm in Eq. 4.

$${V}_{x}=I(x,c)$$

The relevance V_x is obtained by using mutual information for each input feature vector with its labeled class c.

$${W}_{x}=\frac{1}{\left|S\right|}{\sum }_{yϵS}I(x,y)$$

S is a feature subset of a dataset with its class label in Eq. 5. The redundancy of between x and y vectors are represented with W_x. The mutual information quotient is calculated by the ratio of V_x to W_x in Eq. 6.

$$\text{max}MI{Q}_{x}=\text{max}\left(\frac{{V}_{x}}{{W}_{x}}\right)$$

The 115 features were ranked according to their priorities and a score was obtained for each feature with the MRMR algorithm [26]. The feature vector was optimized by selecting features larger than the threshold value obtained with the mean of the scores of 115 features. As a result, feature vectors for artificial intelligence were obtained by selecting the best 57 features.

2.4. Fine Tuned Stacked Autoencoder

An AE is similar to a multi-layer perceptron. AE is generally used to reconstruct the vector applied to the input. The elements of the feature vector applied to the entrance are encoded to the number of neurons in the hidden layer, in training. Then, the decoder reconstructs the input vector [27]. In the encoder, the mapping of the input to the output is obtained by Eq. 7.

$$c=f({W}^{T}.x+b)$$

In Eq. 7, x is the input vector and W is the weight matrix. The output parameter c is obtained by the activation function f with each bias value added to the product of these two expressions. Similarly, Eq. 8 is used for decode process.

$${x}^{{\prime }}=f({W}^{{\prime }T}.c+{b}^{{\prime }})$$

In Eq. 8, W’ represents the weight coefficients between the exit neurons and the middle layer neurons. The reconstructed input statements x' are obtained by passing through the activation function of the f. In the training of AE, W weight matrices are calculated according to the number of neurons in the hidden layer [28]. The error at the end of the training is calculated with Eq. 9.

$${e}_{i}=\left|x+x{\prime }\right|$$

In Eq. 9, x represents the input vector of AE and x represents the decoded input vector. The error value is obtained with these two vectors. Regularization factor is used in the solution of overfitting problem in AE model [29]. This factor is calculated by Eq. 10.

$$E=\frac{\lambda }{2}(\sum W+\sum W{\prime })$$

In Eq. 10, E parameter denotes the regularization factor. With the relation of W and W 'weight matrices used in the encoding and decoding, the term used for the regulation of the model is obtained by λ. The sparseness limitation contributes to the model to learning feature vector. E₂ is calculated by Eq. 11 using the sparseness factor.

$${E}_{2}=\beta .\rho .\text{log}\frac{\rho }{{\rho }^{{\prime }}}+\left(1-\rho \right).\frac{1-\rho }{1-\rho {\prime }}$$

In Eq. 11, β is the sparsity weight term. The E₂ is obtained by multiplying this term by the Kullback Leibler (KL) convergence. Ρ and ρ' in KL represent the average activation value of the related neuron. The ρ' parameter represents the average value of inputs passed through the E₂ of the related neuron.

Table 1

The Training Parameters of the Fine-tuned Stacked Sparse Autoencoder Model
Model Parameters	Value
Input Neurons (AE1)	57
Output Neurons (AE1)	57
Hidden Neurons (AE1)	30
Epochs (AE1)	100
L2 Weight Regularization (AE1)	0.004
Sparsity Regularization (AE1)	4
Sparsity Proportion (AE1)	0.15
Input Neurons (AE2)	30
Output Neurons (AE2)	30
Hidden Neurons (AE2)	15
Epochs (AE2)	100
L2 Weight Regularization (AE2)	0.002
Sparsity Regularization (AE2)	4
Sparsity Proportion (AE2)	0.1
Transfer function for the encoder (AE1, AE2)	Logistic sigmoid
Loss function for training (AE1, AE2)	mean squared error
Training Algorithm (AE1, AE2)	scaled conjugate gradient descent
Epoch (Softmax)	100
Loss Function (Softmax)	Cross Entropy
Training Algorithm (Softmax)	scaled conjugate gradient

Stacked AE is obtained by cascading AEs to each other [30]. After the first AE is encoded, deep feature learning is provided by connecting to the input of the next AE. The features encoded with AE at the end are connected to the softmax layer to contribute to the solution of classification problems. Softmax function performs normalization and exponentiation to find class probabilities. In this type of architecture, each AE and softmax are trained individually in the stacked structure created first. The retraining of the combined architecture as a whole is called fine-tune. Classification error is minimized with this process. The stacked AE architecture created in this study is shown in Fig. 2. The first AE encodes the input vector consisting of 115 features into 60 features. The second AE encodes 60 features from the first AE into 20 features. Then the properties are divided into 2 separate classes with the softmax layer.

The parameters of each of the 3 layers used in the most successful Fine-tuned SSAE model in the experiments in the study are presented in Table 1. In the study, 2 AE and softmax layers were connected as a cascade. This connected structure was retrained with the parameters in Table 1 and fine-tuned.

The Python framework was preferred to apply the proposed method to detect UDP flood anomalies in the N-BaIoT dataset. The experiments were performed on a 64 bit Windows Operating System with 8 GB RAM memory and Intel I7 CPU 2730 processor.

Evaluation metrics are calculated to demonstrate the success of the approach that was proposed in this study. The proposed method includes MRMR and Fine tune SSAE methodologies trained with N-BaIoT dataset data to detect UDP-flood activity in network traffic. True positive (TP), True negative (TN), False negative (FN), and False positive (FP) are used for the verification parameters [31]. The Accuracy, Precision, sensitivity, specificity, f₁ score, and Cohen kappa metrics are calculated by using TP, TN, FN and FP. The accurate classification rate of feature vectors in the test set is determined by Accuracy metric. Eq. 12 is used for calculating the accuracy.

$$Accuracy=\frac{TP+TN}{TP+TN+FP+FN}$$

The precision metric shows how many of the values decided by the model proposed as the UDP flood are actually true-positive. It is calculated by Eq. 13.

$$Precision =\frac{TP}{TP+FP}$$

The sensitivity parameter indicates the rate at which the UDP flood attack can be detected. It is calculated by Eq. 14.

$$Sensitivity =\frac{TP}{TP+FN}$$

The specificity metric specifies the rate of benign detection to non-attack data when detecting an attack. It is calculated by Eq. 15.

$$Specificity =\frac{TN}{TN+FP}$$

The f₁ score metric is used for classification success in unbalanced datasets. The closer the value obtained by a calculation based on the harmonic mean of precision and sensitivity metrics to one, the more successful the model allows us to obtain the result. It is calculated by Eq. 16.

$${f}_{1} score=\frac{2.Prc.Sn}{Prc+Sn}$$

Cohen's Kappa (К) coefficient is a coefficient used in unbalanced classification problems and expresses the efficiency of the classification. The success of the classifier depends on the unbalanced dataset. The unbalanced dataset causes robustness and high stability problems. The value of the K coefficient measures the unbalanced dataset classification success. This coefficient is calculated by Eq. 17.

$$Kappa=\frac{Acc-Exp}{1-Exp}$$

It expresses the relationship between accuracy (Acc) and expected classification accuracy ($Exp=\frac{A+B}{TP+TN+FP+FN}$) in Eq. 4. While A in the Exp expression is obtained by ($\frac{\left(TP+FN\right)\left(TP+FP\right)}{TP+TN+FP+FN}$), The B parameter is calculated by using the expression ($\frac{\left(FP+TN\right)\left(FN+TN\right)}{TP+TN+FP+FN}$). К values are obtained between 0 and 1. The approach of this value to zero indicates that the classification is unsuccessful, while its approach to one proves that the classification is successful [32].

In the proposed method, an experiment was conducted using samples from the N-BaIoT dataset to detect UDP-flood attacks. 70% of the 198465 data in the dataset was allocated for training and 30% for verification. In the training, the two AEs and softmax were trained separately and combined. Classification success was achieved when the data allocated for testing the resulting Stacked AE was applied. However, when the created stacked AE was retrained as a single piece, it was fine-tuned. This increased the success positively. The Confusion Matrix obtained from the binary classification results are shown in Table 2.

Table 2

Confusion Matrix of experiments
	MRMR- FTSSAE with normalization (Optimum Accuracy)		MRMR- FTSSAE with 15 features (Optimum Accuracy)		MRMR- SSAE with 15 features		MRMR-AE with 15 features		FTSSAE with 15 features (Optimum Accuracy)		SVM		Active Model with NSL-KDD Test set
	Benign	UDP Flood	Benign	UDP Flood	Benign	UDP Flood	Benign	UDP Flood	Benign	UDP Flood	Benign	UDP Flood	Benign	UDP Flood
Benign	11115	0	11115	0	11113	2	11100	15	11110	5	11091	24	10781	273
UDP Flood	3	40655	976	39682	1120	39538	1575	39083	2259	38399	12202	28456	12	1971

The data of 11115 benign and 40658 UDP-flood traffic are included in the 51773 feature vector in the dataset and reserved for testing. These data were used to verify the method proposed in the study with the hold-out validation method. The remaining 11115 data are benign traffic measurements. The proposed method was able to detect 97.6% of problematic traffic by detecting 39682 of 40658 data of UDP flood traffic measurements. False-positive was calculated as 976. Benign traffic was detected at a rate of 100%. Total success was achieved as 98.11%. In the model without fine tuning, the success was 97.83%. The success of the system increased a little when the model, which was obtained as FP amount of 1120, was tuned to fine. Other verification parameters are presented in Table 3.

UDP measurements in the N-BaIoT dataset used in the study were performed with only 60 features instead of 115. When these 60 features were applied to the MRMR algorithm, 57 features were selected. An FPR as high as 8% was achieved in the deep SSAE that was trained and fine-tuned with the selected features. In experiments with Batch normalization added to the SSAE input before fine-tuning to reduce FPR, FPR was obtained as a value very close to zero. The z-score was calculated by using the mean and standard deviation of the selected features in the batch normalization process. These calculated values are considered as distances. Standard deviation was used in this evaluation. This method has the feature of preserving the shape properties of the original dataset. For this reason, the success of flooding detection was almost 100% in the experiments performed by evaluating the training data in one batch and the test data in the other batch with batch normalization.

Table 3

Performance of the experiments of UDP Flood
Performance Metrics	MRMR- FTSSAE with normalization (Optimum Accuracy)	MRMR- FTSSAE with 15 features (Optimum Accuracy)	MRMR- SSAE with 15 features	MRMR-AE with 15 features	FTSSAE with 15 features (Optimum Accuracy)	SVM	Active model with NSL-KDD Test set
Sensitivity	100	100	99.99	99.96	99.99	99.92	99.89
Precision	99.99	97.60	97.25	96.19	94.62	69.99	97.53
f₁ Score	100	98.79	98.60	98.04	97.23	82.32	98.70
Specificity	99.97	91.93	90.79	86.89	81.21	47.62	87.83
Kappa	100	95.31	93.8	91	86.9	76.41	92

A significant increase in kappa value was obtained in experiments with the unbalanced dataset. The achievement of the experiment in which only AE was used by selecting effective properties with the MRMR algorithm is calculated as 96.9%. Although the SSAE algorithm was fine-tuned without the MRMR algorithm, the accuracy remained at 95.62%. Therefore, the MRMR algorithm has become a very effective solution to the accuracy of the system. Although the sensitivity, precision and F1-score were close to each other in 4 different experiments, Specificity and kappa values were obtained more successfully in the proposed model compared to other experimental models due to the uneven data distribution. As shown in Table 3, performance metrics of the proposed method were obtained as sensitivity 100%, precision 99.99%, specificity 99.97%, f₁ score 100% and kappa 100%, respectively. Receiver operating characteristic of the proposed method is gathered as shown in Fig. 4. The area under the curve of proposed method is calculated as 99.99%.

Experiments were also performed with ACK flood, scan flood, sys flood and UPD-plain flood. In the experiments conducted under the same conditions as the UDP flood, each flood traffic was achieved with very high success. Experiments with each flooding situation within the Mirai attack set yielded almost 100% accuracy with the proposed method. In the ACK flood attacks, only 1 false positive was seen in the 23121 test set. In the experiment with scan flood attack data, 12141 were identified in 12149 attack cases. Only 4 false positives were detected. All 32987 attacks were detected in the sys flood attack data. Although no FP was seen, only 3 false negatives were seen. In udpplain flood attacks, 20447 attacks out of 20449 were determined, and the number of FPs was obtained as only 2. After that, the model trained with the N-BaIoT dataset was validated with the NSL test dataset, which was recorded in real time and widely used by different researchers. Of the network traffic records of the 1983 UDP attack in the dataset, 1971 were detected with the proposed method. Only 12 UDP flood data were placed in the benign class. In addition, 10781 of the 11054 benign traffic data were placed in the benign class. Only 273 of the benign-labeled data were falsely detected as flooding. Especially in the dataset with an unbalanced data distribution, the detection of flooding attacks was performed with high accuracy (97.81%). As shown in Table 3, performance metrics of the proposed method with NSL-KDD test set were obtained as sensitivity 99.89%, precision 97.53%, specificity 87.83%, f₁ score 98.70% and kappa 92%, respectively.

When evaluated as the working time, the classification performed with SVM took 0.33 seconds with the data reduced by MRMR. In the proposed method, all 51773 data were evaluated in 0.2 seconds. While 57 features applied to SVM as a single data were classified in 0.0521 seconds, this time was measured as 0.0133 in the proposed method. This result has shown that the proposed method works better than the SVM method.

In similar studies in the literature, problematic traffic detection studies related to UDP in the N-BaIoT dataset were performed and presented in Table 4. Aminanto et al. in their study, verified the traffic-based attack recognition in WiFi networks with the AWID dataset. The authors stated that the deeper AE architecture was effective in determining attack. They compared the proposed deep AE model with SVM, Decision Tree (DT) and ANN. The most successful model was presented as the use of AE in feature selection and SVM classification with these selected features. However, training of SVM with too much data is the disadvantage of the study [33]. In their study, Aldweesh et al. conducted research on models that detect anomalies in network traffic. The authors examined recurrent neural networks, convolutional neural networks, Boltzmann machine deep learning and autoencoder approaches. With these approaches, they evaluated their effectiveness in solving the problem of classifying the abnormal traffic in the network. In this evaluation, they stated that the security in the scada and IOT platform was determined by shallow neural network and machine learning algorithms. The authors stated that deep learning algorithms can be adapted to this field and that this algorithm should be tested on a dataset only in the field of IOT. [34]. Ferrag et al. examined the effectiveness of different deep learning models in detecting anomalies in network traffic. Training and test data generated from Bot-IoT and CSE-CIC-IDS2018 datasets were divided into 80% and 20%, respectively. DBN, RNN and CNN models were verified with these data. UDP flood attacks were successfully detected with DBN, RNN and CNN models on average 96.66%, 96.85%, 97.34%, respectively [35]. In the N-BaIoT dataset, 96.118% success was achieved with DBN, 96.666% with RNN and 97.006% with CNN. In addition, the authors obtained the following results with a 20% test result in the experiments they performed with RBM, DBN, DBM and deep autoencoder (DA) models to detect UDP flood attacks in the N-BaIoT dataset. They achieved an accuracy of 96.522% with RBM, 96.623% with DBN, 96.111% with DBM and 97.991% with DA algorithm. Alharbi et al. analyzed the Mirai attacks in the N-BaIoT dataset in their study. The authors stated that the classification success increased by optimizing the features with PSO and Local-Global best Bat Algorithm (LGBA). They achieved a significant increase in success as a result of optimization with the Neural network architecture used as a classifier. UDP flood attacks in the PSO optimized N-BaIoT dataset were successful, with 0.997233 Precision, 0.923866 Recall and 0.959148 F1-Score, respectively. In the neural network architecture trained by optimization with LGBA, success was achieved as 0.9982 Precision, 0.9987 Recall and 0.9985 F1-Score, respectively. Palla and Tayeb detected abnormal traffic on different IOT devices in the N-BaIoT dataset. When the authors ran the ANN and RF algorithms in their study, they were able to detect anomalous traffic on the security camera with an accuracy of only 83.9% with ANN and 75.6% with RF [18].

Table 4

Comparison of the metrics between proposed method and state of art studies
Study	Description	Metric (%)
Aminanto et al. [33]	Autoencoder with SVM	99.91 Acc 0.012 FPR
Ferrag et al. [35]	DBN RNN CNN DA	96.118 Acc 96.666 Acc 97.006 Acc 97.991 Acc
Shafiq et al. [16]	Wrapper-based feature selection algorithm	95 Acc
Alharbi et al. [17]	PSO-NN	99.72 Pr 92.38 Sn 95.91 F1
Alharbi et al. [17]	Local-Global best Bat Algorithm NN	99.82 Pr 0.9987 Sn 99.85 F1
Palla and Tayeb [18]	ANN (Security Camera)	89.3 Acc 84 Pr 99 Sn 92 F1
Palla and Tayeb [18]	RF (Security Camera)	75.6 Acc 68 Pr 92 Sn 78 F1
Kushwah and Ranga [36]	ANN and Imperialistic Competitive Algorithm with NSL-KDD test set	83.5
Al-Qatf et al. [37]	SAE-SVM with NSL-KDD test set	84.96
Kushwah and Ranga [38]	extreme learning machine with NSL-KDD test set	86.80
Yusof et al. [39]	MLP with NSL-KDD test set	91.7
Ma et al. [40]	Deep learning	92.99
Proposed Method	Deeper hybrid model	99.99 Acc 99.99 Pr 100 F1 99.99 Sp 100 Kappa
Proposed Method	Deeper hybrid model with NSL-KDD test set	97.81 Acc

When the studies in the literature was examined, which were recently confirmed by the NSL-KDD test set, flooding was determined by different methods. Kuswah and Ranga achieved 83.5% success rate from the NSL-KDD test set with ANN and Imperialistic Competitive Algorithm [36]. Al-Qatf et al achieved 84.96% success in validating the NSL test set using the SAE and SVM methods [37]. In another study, Kuswah and Ranga achieved a success rate of 86.80 in the NSL-KDD test set in the detection of flooding they proposed with an extreme learning machine [38]. Yusof et al achieved 91.7% success with the MLP [39]. Ma et al, on the other hand, achieved 92.99% success when they verified the flooding detection method they designed with deep learning with the NSL-KDD dataset [40]. The method proposed in this study and trained with the N-BaIoT dataset is in the NSL-KDD dataset.

In addition, unsupervised learning methods have a high rate of FP in recognizing abnormal traffic [41]. An Auto Encoder is an unsupervised neural network that is used to efficiently learn the encoding of the input data. Typically, an auto-encoder is used for the size reduction performed by encoding the inputs. Stacked automatic encoders were used in this study to discover the nonlinear representations of the data. With this architecture, UDP flood attacks were effectively detected. Only 3 of abnormal traffic was detected as false positives. The total accuracy was 99.99%. This result shows that UDP-flood attacks are detected more effectively than other studies.

In the online IOT environment, flooding attacks can appear in various protocols. For this reason, direct broadcast UDP should be considered in flood analysis in networks to which IOT devices are connected. The proposed model contributes to providing a fine-tuned SSAE that can easily analyze the UDP-flood detection. In the SSAE architecture, which was designed with deep architecture and fine-tuned, was developed in order to detect UDP-flood attacks originating from IOT devices using the legacy UDP data of Simple home XCS7 1002 WHT model Security Camera, one of the IOT devices. In this study, UDP-flood attacks with IOT cameras were detected. The proposed framework consists of the training model and the detection model, including the UDP-flood dataset. As the UDP-flood dataset, the N-BaIoT which is composed of the injected Mirai UDP flooding was used. The dataset included data labeled as UDP-flood and benign. In the experiments made with the fine-tuned stacked AE model, UDP flooding traffic and harmless traffic data were distinguished. The accuracy, sensitivity, Cohen kappa, specificity and f₁ score of the proposed model were quite high. The SVM algorithm, which was trained and tested with the same data, achieved a very low accuracy rate and a longer processing time. The experimental results proved that UDP-flood detection performance depends on the artificial intelligence model rather than whatever IOT devices. The error detection time of the proposed model was compared with the SVM method. It was seen in the experiments that the classification time decreased significantly.

In the study, a fine tuned deep architecture is proposed that can identify the attack patterns collected from different IOT devices in the N-BaIoT dataset. If patterns in overflow attacks on different devices can be detected, the continuity of IOT services can be ensured. However, if they can create attack patterns specific to the IOT service provider in future studies, they will be able to develop service-specific attack identification systems.

Compliance with Ethical Standards

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Funding

This work was not funded by any organization.

Data Availability

The data set “detection_of_IoT_botnet_attacks_N_BaIoT” comes from the UCI Machine Learning Repository. This dataset addresses the lack of public botnet datasets, especially for the IoT. It suggests *real* traffic data, gathered from 9 commercial IoT devices authentically infected by Mirai and BASHLITE.

Salman, O., Elhajj, I. H., Chehab, A., & Kayssi, A. (2019). A machine learning based framework for IoT device identification and abnormal traffic detection. Transactions on Emerging Telecommunications Technologies, e3743.
Kponyo, J. J., Agyemang, J. O., Klogo, G. S., & Boateng, J. O. (2020). Lightweight and host-based denial of service (DoS) detection and defense mechanism for resource-constrained IoT devices. Internet of Things, 12, 100319.
Alzahrani, M. Y., & Bamhdi, A. M. (2022). Hybrid deep-learning model to detect botnet attacks over internet of things environments. Soft Computing, 1-15.
de Souza, C. A., Westphall, C. B., Machado, R. B., Sobral, J. B. M., & dos Santos Vieira, G. (2020). Hybrid approach to intrusion detection in fog-based IoT environments. Computer Networks, 180, 107417.
Tidjon, L. N., Frappier, M., & Mammar, A. (2019). Intrusion detection systems: A cross-domain overview. IEEE Communications Surveys & Tutorials, 21(4), 3639-3681.
Ghobaei-Arani, M., & Shahidinejad, A. (2022). A cost-efficient IoT service placement approach using whale optimization algorithm in fog computing environment. Expert Systems with Applications, 200, 117012.
Guang, K. O. U., TANG, G. M., Shuo, W. A. N. G., SONG, H. T., & Yuan, B. I. A. N. (2016). Using deep learning for detecting BotCloud. Journal on Communications, 37(11), 114.
Ahmad, R., Alsmadi, I., Alhamdani, W., & Tawalbeh, L. A. (2022). A comprehensive deep learning benchmark for IoT IDS. Computers & Security, 114, 102588.
Rizal, R., Riadi, I., & Prayudi, Y. (2018). Network forensics for detecting flooding attack on internet of things (IoT) device. Int. J. Cyber-Security Digit. Forensics, 7(4), 382-390.
Campus, N. M. I. T., Govindapura, G., & Yelahanka, B. (2018). Denial-of-service or flooding attack in IoT routing. Int. J. Pure Appl. Math, 118, 29-42.
Meidan, Y., Bohadana, M., Mathov, Y., Mirsky, Y., Shabtai, A., Breitenbacher, D., & Elovici, Y. (2018). N-baiot—network-based detection of iot botnet attacks using deep autoencoders. IEEE Pervasive Computing, 17(3), 12-22.
Al Shorman, A., Faris, H., & Aljarah, I. (2020). Unsupervised intelligent system based on one class support vector machine and Grey Wolf optimization for IoT botnet detection. Journal of Ambient Intelligence and Humanized Computing, 11(7), 2809-2825.
Alotaibi, B., & Alotaibi, M. (2020). A Stacked Deep Learning Approach for IoT Cyberattack Detection. Journal of Sensors, 2020.
Wang, Y., Bai, B., Hei, X., Zhu, L., & Ji, W. (2020). An unknown protocol syntax analysis method based on convolutional neural network. Transactions on Emerging Telecommunications Technologies, e3922.
Su, S., Sun, Y., Gao, X., Qiu, J., & Tian, Z. (2019). A correlation-change based feature selection method for IoT equipment anomaly detection. Applied Sciences, 9(3), 437.
Shafiq, M., Tian, Z., Bashir, A. K., Du, X., & Guizani, M. (2020). IoT malicious traffic identification using wrapper-based feature selection mechanisms. Computers & Security, 94, 101863.
Alharbi, A., Alosaimi, W., Alyami, H., Rauf, H. T., & Damaševičius, R. (2021). Botnet Attack Detection Using Local Global Best Bat Algorithm for Industrial Internet of Things. Electronics, 10(11), 1341.
Palla, T. G., and Tayeb, S. (2021). Intelligent Mirai Malware Detection for IoT Nodes. Electronics, 10(11), 1241.
Nõmm, S.,and Bahsi, H.: Unsupervised anomaly based botnet detection in IOT networks. In: 2018 17th IEEE international conference on machine learning and applications (ICMLA), pp. 1048–1053 (2018)
Wang, Z., Liu, Y., He, D., and Chan, S. (2021). Intrusion detection methods based on integrated deep learning model. Computers & Security, 103, 102177.
Mirsky, Y., Doitshman, T., Elovici, Y., & Shabtai, A. (2018). Kitsune: an ensemble of autoencoders for online network intrusion detection. arXiv preprint arXiv:1802.09089.
Machine Learning Repository. Accessed: Aug. 14, 2018. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/detection_of_IoT_botnet_attacks_N_BaIoT
M. Tavallaee, E. Bagheri, W. Lu, and A. A. Ghorbani, ‘‘A detailed analysis of the KDD CUP 99 dataset,’’ In Proc. IEEE Symp. Comput. Intell. Secur. Defense Appl., Ottawa, ON, Canada, Jul. 2009, pp. 1–6.
UNB. NSL-KDD Dataset. Accessed: September 01, 2020. [Online]. Available: https://www.unb.ca/cic/datasets/nsl.html
Özyurt, F. (2020). A fused CNN model for WBC detection with MRMR feature selection and extreme learning machine. Soft Computing, 24(11), 8163-8172.
Tsapparellas, G., Jin, N., Dai, X., & Fehringer, G. (2020). Laplacian Scores-Based Feature Reduction in IoT Systems for Agricultural Monitoring and Decision-Making Support. Sensors, 20(18), 5107.
Kannadasan, K., Edla, D. R., & Kuppili, V. (2019). Type 2 diabetes data classification using stacked autoencoders in deep neural networks. Clinical Epidemiology and Global Health, 7(4), 530-535.
Simon, J., Kapileswar, N., Polasi, P. K., & Elaveini, M. A. (2022). Hybrid intrusion detection system for wireless IoT networks using deep learning algorithm. Computers and Electrical Engineering, 102, 108190.
Wang, Y., Yang, H., Yuan, X., Schardt, Y., Yang, C., & Gui, W. (2020). Deep learning for fault-relevant feature extraction and fault classification with stacked supervised auto-encoder. Journal of Process Control, 92, 79-89.
Wang, H., Wu, N., Cai, Y., Ren, L., Zhao, Z., Han, G., & Wang, J. (2019). Optimization of reconstruction accuracy of anomaly position based on stacked auto-encoder neural networks. IEEE Access, 7, 116578-116584.
Roseline, J. F., Naidu, G. B. S. R., Pandi, V. S., alias Rajasree, S. A., & Mageswari, N. (2022). Autonomous credit card fraud detection using machine learning approach☆. Computers and Electrical Engineering, 102, 108132.
Almiani, M., AbuGhazleh, A., Al-Rahayfeh, A., Atiewi, S., & Razaque, A. (2020). Deep recurrent neural network for IoT intrusion detection system. Simulation Modelling Practice and Theory, 101, 102031.
Aminanto, M. E., Choi, R., Tanuwidjaja, H. C., Yoo, P. D., & Kim, K. (2017). Deep abstraction and weighted feature selection for Wi-Fi impersonation detection. IEEE Transactions on Information Forensics and Security, 13(3), 621-636.
Aldweesh, A., Derhab, A., & Emam, A. Z. (2020). Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues. Knowledge-Based Systems, 189, 105124.
Ferrag, M. A., Maglaras, L., Moschoyiannis, S., & Janicke, H. (2020). Deep learning for cyber security intrusion detection: Approaches, datasets, and comparative study. Journal of Information Security and Applications, 50, 102419.
Kushwah, G. S., & Ranga, V. (2022). DDoS Attacks Detection in Cloud Computing Using ANN and Imperialistic Competitive Algorithm. In Artificial Intelligence and Sustainable Computing (pp. 253-263). Springer, Singapore.
Al-Qatf, M., Lasheng, Y., Al-Habib, M., & Al-Sabahi, K. (2018). Deep learning approach combining sparse autoencoder with SVM for network intrusion detection. IEEE Access, 6, 52843-52856.
Kushwah, G. S., & Ranga, V. (2021). Optimized extreme learning machine for detecting DDoS attacks in cloud computing. Computers & Security, 105, 102260.
Yusof, A. R. A., Udzir, N. I., Selamat, A., Hamdan, H., & Abdullah, M. T. (2017, November). Adaptive feature selection for denial of services (DoS) attack. In 2017 IEEE Conference on Application, Information and Network Security (AINS) (pp. 81-84). IEEE.
Ma, L., Chai, Y., Cui, L., Ma, D., Fu, Y., & Xiao, A. (2020, June). A deep learning-based DDoS detection framework for Internet of Things. In ICC 2020-2020 IEEE International Conference on Communications (ICC) (pp. 1-6). IEEE.
Gu, Y., Li, K., Guo, Z., & Wang, Y. (2019). Semi-supervised k-means ddos detection method using hybrid feature selection algorithm. IEEE Access, 7, 64351-64365.

Download PDF

Editorial decision: Major Revision
06 Oct, 2023
Reviewers agreed at journal
03 Feb, 2023
Reviewers invited by journal
02 Feb, 2023
Editor assigned by journal
05 Jan, 2023
First submitted to journal
04 Jan, 2023

You are reading this latest preprint version

Deeper Fine-Tuned Autoencoder for User Datagram Protocol Flooding Network Traffic Detection in Internet of Things

Status:

Version 1

Abstract

Figures

1. Introduction

2. Material And Methods

2.1. The Validation Dataset of the Proposed Method

2.2. Problem Formulation

2.3. Minimum-redundancy maximum-relevance Feature Selection Algorithm

2.4. Fine Tuned Stacked Autoencoder

3. Experimental Results And Discussion

4. Conclusion

Declarations

References

Status:

Version 1