A Network Security Situation Assessment Method Based on Fusion Model

doi:10.21203/rs.3.rs-3013097/v1

Download PDF

Research Article

A Network Security Situation Assessment Method Based on Fusion Model

https://doi.org/10.21203/rs.3.rs-3013097/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

This paper proposes a new model for network security situation assessment (NSSA) that addresses the limitations of current methods in terms of feature extraction quality and efficiency. The proposed NSSA model is based on a fusion model (FM) that incorporates an attention mechanism and bi-directional gated recurrent unit (BiGRU). The FM model is used to extract key information about different cyber threats, and the attention mechanism is applied to weight these key features, thereby improving the model's accuracy. Finally, the evaluation results are output by BiGRU, in combination with proposed quantitative indicators of cybersecurity posture. The results of threat detection experiments show that the proposed FM model outperforms other models based on several judging metrics.

Attention

BiGRU

Network security situation assessment

Fusion model

Feature extraction

With the lightning-fast advancement of information technology, information networks' openness, diversity, and uncertainty have led to huge challenges to information network security, and network security problems have become increasingly serious. It is of great significance to ensure the safe and stable operation of the network [1–3].

NSSA can perceive the security risks of the network in real-time in a complex network environment, and security analysts can make quick and accurate judgments based on the network security environment to minimize risks and losses [4]. The most common NSSA method is a mathematical model. Some scholars have researched NSSA based on the AHP method [5]. Some scholars have introduced the Markov chain and information inheritance when conducting NSSA, which provides an effective way for NSSA [6]. However, the above methods are easily affected by human factors in the assessment process, which will greatly interfere with the situation assessment results. Therefore, some methods focus on the network's vulnerability and fuse to generate the Bayes network attack graph model. However, such methods based on probability and knowledge reasoning cannot provide timely feedback when faced with a large amount of cyber threat data [7].

The particle swarm algorithm has a simple structure, fast convergence speed, and meets the requirements of NSSA for timeliness [8]. It is the most widely used situational element identification algorithm at present. The particle swarm algorithm shares the information between particles when searching for the optimal solution, so the particles always update to the current optimal solution. The convergence speed is very fast at the beginning of the algorithm until the state of all particles is similar, and the convergence speed slows down. This will make it difficult for the particles to recover from the optimal local solution when the algorithm finds it and converges to this position. It jumps out of the optimal solution, which reduces the accuracy of situational element identification.

Some scholars have introduced machine learning methods into NSSA. In [9], scholars propose a BP neural network (BPNN) method, which has high assessment accuracy [9]. However, this method is difficult to extract the effective features in the network situation. The calculation is large, the process is complicated, and it is not easy to understand.

In recent years, deep learning-based models and methods have provided new ideas for research in NSSA. Some scholars have replaced the traditional BPNN model with a long short-term memory network (LSTM) and a recurrent gated unit (GRU), which improved the accuracy of NSSA to a certain extent, but the evaluation indicators of these models are relatively single and cannot accurately evaluate the overall situation of the network. Compared with LSTM and GRU, BiGRU greatly improves the feature extraction ability and reduces the amount of calculation, and it has been applied in many fields. Therefore, this paper introduces BiGRU, fuses it with the attention mechanism, and proposes a fusion model FM, which improves the accuracy of NSSA. The contributions of this paper are as follows:

This paper proposes a fusion model FM that combines the attention mechanism and BiGRU to solve the problems of the traditional model being easily affected by human factors and slow in the evaluation process.
This paper introduces NSSA quantitative indicators to evaluate the network situation from multiple dimensions comprehensively.
This paper demonstrates the superiority of the proposed FM model on NSSA on multiple experimental datasets.

The structure of this paper is as follows. Chapter 1 is an introduction which introduces the motivation, contribution, and structure of this paper. Chapter 2 presents the work related to NSSA. The third chapter presents the fusion model FM. Chapter 4 presents the experimental results, and the last chapter summarizes the contributions of this paper and presents future work

2.1 Traditional methods

Since the idea of NSSA was first put forward in 1999, researchers have done a lot of work and made a lot of different security situation information assessment models. Initially, some scholars proposed a situational awareness model based on Ontology and combined the various modules to propose the idea of abstract entities, which is of great significance to the development of NSSA [1, 4, 7, 8]. Initially, NSSA was widely used in the military field. Some scholars proposed a concept-based NSSA technology for NSSA in the military field, but the data source extracted by this method was single and could not deal with multi-source attacks. Other scholars started from high dimensions. An NSSA method based on the analysis of the spatiotemporal dimension is proposed, which predicts the security situation element set in the future period in the time dimension and analyzes the influence of each security situation element set on the NSSA in the space dimension.

2.2 Particle swarm-based algorithms

With the introduction of particle swarm optimization (PSO), some scholars have studied how to apply PSO to the field of NSSA [11]. PSO belongs to the swarm intelligence algorithm, which has the characteristics of simple structure and strong robustness and performs well in solving combinatorial optimization problems [12–14]. In recent years, the research on the PSO algorithm mainly focuses on the optimization of the PSO algorithm and the comparison with other algorithms. In terms of fusion, some scholars have proposed a scheme of linearly decreasing inertia weight, which can quickly find the range of the optimal solution in the initial stage of the algorithm. With the increase in iterations, the algorithm performs a more precise search and finally obtains the optimal solution. The particle swarm optimization of simulated annealing cooling speed is too slow and obtains good results in actual scenarios. Simulated annealing is helpful in solving the problem that particle swarms tend to fall into local extreme values, and a new global path planning algorithm is designed [15]. The extreme global value can be found more accurately while considering the convergence speed. But the most difficult thing about the PSO method is that it's hard to find the best solution, which makes NSSA less accurate [16–19].

2.3 Machine learning-based methods

In recent years, with the development and maturity of machine learning theory, technologies such as radial basis function (RBF), BPNN, support vector machine (SVM), and deep learning have been gradually applied in the field of NSSA [19–25]. Some scholars use the grey relational analysis method to determine the security situation indicators' weight, use the SVM to build the NSSA model and obtain high-precision evaluation results [26–28]. Some scholars have built an NSSA model based on BPNN, and achieved an effective assessment of the security situation of the power system by establishing a complete evaluation index system [28–30]. Some scholars have introduced deep learning technologies such as LSTM based on BPNN, which has improved the accuracy of NSSA to a certain extent [31–35].

First, learning and extracting key features from various attack types is necessary. This paper uses multiple autoencoders to extract features to improve extraction efficiency and classification accuracy. Moreover, multiple self-encoders can mine the potential distribution of different attack types based on the characteristics of each attack type to obtain more representative features. The structure of our feature extraction process is shown in Fig. 1.

Additionally, the BiGRU module has been included in the FM model. Learning the time series relationship between the previous moment and the next moment and the current state, mining the potential representation rules between network threat traffic, and successfully boosting the network's ability to learn are all things that BiGRU is capable of doing. On the other hand, inefficiency and sluggishness would emerge whenever BiGRU learns sequence data that is excessively lengthy. The solution to this problem can be found in attention models. To begin, it can concentrate on data at various positions in the sequence, reducing the total length of the input data. Second, because the feature information in the threat data at various times contributes differently to the classification and detection of the current attack type, the attention mechanism can be used to assign weights to the features that affect the detection results. This enables the model to learn the potential features more effectively, improving the model's ability to detect attacks. The following are the components that make up the FM model:

First, the extracted feature data ${x_{i,j}}$ is input into the network to get the output ${y_i}$

$${y_{i,j}}=o\left( {{x_{i,j}}} \right)$$

Further, assign weights to features via the attention layer

$${h_{i,j}}=\tanh \left( {{\mathbf{W}}{y_{i,j}}+b} \right)$$

$${w_{i,j}}={\text{softmax}}\left( {{h_{i,j}},{\mathbf{w^{\prime}}}} \right)$$

where ${h_{i,j}}$ is the state of the hidden layer, ${\mathbf{W}}$ is the weight matrix, is the bias term, ${\mathbf{w^{\prime}}}$ is the initial weight matrix of the attention layer. Then we have

$${w_i}=\sum\limits_{j} {{w_{i,j}}} {h_{i,j}}$$

Input the calculated local weights into BiGRU to get the global distribution weights

$${y_i}=o\left( {{w_i}} \right)$$

$${h_i}=\tanh \left( {{\mathbf{W^{\prime}}}{y_i}+b} \right)$$

$${w_i}={\text{softmax}}\left( {{h_i},{\mathbf{w^{\prime\prime}}}} \right)$$

where ${\mathbf{W^{\prime}}}$ and ${\mathbf{w^{\prime\prime}}}$ are the weight matrix and initial weight matrix in the attention layer, respectively, we then can get

$$w=\sum\limits_{i} {{w_i}} {h_i}$$

Input the global feature weights into the softmax layer. Then we can get the final prediction results. The overall framework of the FM method is shown in Fig. 2.

Further, this paper quantifies various indicators in NSSA. First, quantify the severity of the attack. For attack detection, thousands of data sets are chosen at random from the data collection, and then those random selections are entered into the trained threat detection model. Suppose the number of detected attacks is ${N_i}$the actual number of occurrences of each attack type ${N^{\prime}_i}$then, the error probability is

$${p_{i,j}}=\frac{{{{N^{\prime}}_j}}}{{{{N^{\prime}}_i}}}$$

Further, different types of attacks are classified into three categories according to the attack severity level, and the calculation methods of the attack severity operator $a{o_i}$ are as follows

when attack level $1 \leqslant {l_i} \leqslant 0.5n$

$$a{o_i}=\frac{{3+\sqrt { - 2\ln 2{l_i}+2\ln n} }}{6}$$

when attack level ${l_i}=0.5n$

$$a{o_i}=0.5$$

when attack level $0.5n \leqslant {l_i} \leqslant n$

$$a{o_i}=\frac{{3 - \sqrt { - 2\ln 2{l_i}+2\ln n} }}{6}$$

The higher the attack severity level, the larger the attack severity operator, and the more serious the threat caused by the attack. Further, according to works of literature 7 and 11, this paper formulates NSSA levels, as shown in Table 1.

Table 1

Level table of NSSA
NSSA Level	Value Range
Safety	[0, 0.3]
Low risk	[0.3, 0.6]
Medium risk	[0.6, 0.9]
High risk	[0.9, 1.2]
Danger	[1.2, +∞]

This section evaluates the overall performance of the FM model. First, this article introduces the composition of the NSSA method after the fusion of the FM method. The NSSA method proposed in this paper consists of an extraction module (EXM), an analysis module (ANM) and an assessment module (ASM). EXM is responsible for preprocessing the training set traffic data and feeding the preprocessed data into the FM training model. ANM is accountable for feeding the test set into the trained FM network for threat detection and recording the model's output results. Based on the detection results obtained by ANM, ASM quantifies the attack severity and attack impact degree, calculates the network security posture value, and analyzes and evaluates the network security posture based on the interval in which the posture value is located, the overall framework of NSSA is as shown in Fig. 3.

This paper uses the deep learning framework Tensorflow to build the model and GPU to improve the training efficiency. The detailed hardware parameters are an Intel I9-10900K processor, 128GB RAM, and four RTX3090 graphics cards. In this paper, we use open datasets KDD99, NSL-KDD, and UNSW-NB, and the comparison algorithms are SVM, BPNN, RNN, LSTM, GRU and the comparison results are shown in Table 2.

Table 2

Performance comparison of different algorithms on three datasets
Dataset	Method	Training accuracy	Testing accuracy
KDD99	SVM	0.993	0.897
	BPNN	0.994	0.902
	RNN	0.995	0.912
	LSTM	0.996	0.921
	GRU	0.995	0.918
	FM	0.998	0.969
NSL-KDD	SVM	0.994	0.797
	BPNN	0.993	0.812
	RNN	0.995	0.843
	LSTM	0.996	0.898
	GRU	0.996	0.901
	FM	0.997	0.932
UNSW-NB	SVM	0.939	0.781
	BPNN	0.941	0.793
	RNN	0.943	0.810
	LSTM	0.944	0.831
	GRU	0.942	0.836
	FM	0.951	0.905

As shown in Table 2, FM has the highest training and testing accuracy when compared to SVM, BPNN, RNN, LSTM, and GRU in the experiments on the three open network security datasets. FM has the highest training and testing accuracy on the three datasets, followed by LSTM and GRU, which are not very different. During training, the performances of the six algorithms on the three datasets are not very different, while the accuracy rates during testing are quite different, but FM shows good performance on all three test sets. On KDD99, the test is accurate. The accuracy rate can reach 0.969, and the accuracy rate on UNSW-NB is lower at 0.905, but it is also significantly better than the other five comparison algorithms. Compared with BPNN, RNN has some advantages, and the worst performance is SVM

Further, other metrics are selected in this paper to test the performance of the algorithms in this paper, and the metrics are calculated as follows

$${\text{Precision}}=\frac{{TP}}{{FP+TP}}$$

$${\text{Recall}}=\frac{{TP}}{{FN+TP}}$$

$${\text{F}}=\frac{{{\text{2Recall}} \cdot {\text{Precision}}}}{{{\text{Recall+Precision}}}}$$

In this paper, the NSL-KDD dataset is selected to test the performance of different methods in the above three indicators, as shown in Fig. 4, it is obvious that the FM model outperforms the comparison algorithm, and the values of the three indicators are more than 0.845. Specifically, the Precision of FM reaches more than 0.85, while LSTM and GRU are only 0.836 and 0.831, respectively, RNN is 82.1, and the other two methods are lower than 0.8. The Recall of FM is 0.867, which is higher than that of LSTM, GRU, and RNN, respectively. The F values of 0.032, 0.042, and 0.022 are higher than SVM, which is close to 0.1. FM is also significantly higher than other species methods.

Based on the preceding analysis, LSTM and GRU are the optimal models for comparison. In addition, LSTM, GRU, and FM models are utilized for threat detection on ten sets of test data with the same number of entries. Figure 5 compares network posture values for the ten experimental groups. As shown in Fig. 5, the network security posture values calculated by the three models have a similar trend to the actual network security posture values, but the posture situation evaluated by the method presented in this paper is more consistent with the actual posture situation.

Lastly, the impact of situation-level prediction is examined. The results of eight time-period evaluations of the situation are displayed in Table 3. Table 3 demonstrates that the results of the method presented in this paper are comparable to the actual results for each sub-period and that the obtained situation level is consistent with the actual situation level, accurately reflecting the network security situation.

Table 3

Situation level prediction results
Number	Predicted situational value	Predicted situational level	True situational value	True situational level
1	0.58	Low risk	0.56	Low risk
2	0.76	Medium risk	0.77	Medium risk
3	0.81	Medium risk	0.81	Medium risk
4	0.84	Medium risk	0.82	Medium risk
5	0.95	High risk	0.94	High risk
6	1.13	High risk	1.15	High risk
7	0.25	Safety	0.19	Safety
8	0.58	Low risk	0.49	Low risk

This paper proposes an NSSA model FM based on a fusion model to solve the problems of low feature extraction quality and poor extraction efficiency of current NSSA methods. The evaluation model incorporates an attention mechanism and a Bidirectional Gated Recurrent Unit (BiGRU). Firstly, the FM model extracts the key information of different network threats. Secondly, the key features are weighted by the attention mechanism to improve the accuracy of the model, and finally, combined with the proposed quantitative indicators of NSSA, the evaluation results are output through BiGRU. The threat detection results show that the fusion model presented in this paper is superior to other compared models in terms of precision, recall, and other indicators.

Future research will mainly focus on three aspects. The first aspect is optimising the NSSA index and introducing more factors to evaluate the network situation comprehensively. The second aspect is the introduction of more advanced neural network models. The third aspect is to optimize the feature extraction module and study how to extract features in parallel to improve the efficiency of early model training. Also, we will continue to improve the FM model so that it can be applied to NSSA in different situations

Competing Interests: The authors declare that there is no conflict of interest regarding the publication of this paper.

Data Availability: The labeled dataset used to support the findings of this study are available from the corresponding author upon request.

Acknowledgments: This work was supported by the China Southern Power Grid.

Funding: This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Clark K, Tyree S, Dawkins J, et al. Qualitative and quantitative analytical techniques for network security assessment[C]//Proceedings from the Fifth Annual IEEE SMC Information Assurance Workshop, 2004. IEEE, 2004: 321–328.
Wu S, Zhang Y, Cao W. Network security assessment using a semantic reasoning and graph based approach[J]. Computers & Electrical Engineering, 2017, 64: 96–109.
Ghosh N, Ghosh S K. An approach for security assessment of network configurations using attack graph[C]//2009 First International Conference on Networks & Communications. IEEE, 2009: 283–288.
Kaluri R, Pradeep Reddy C. A framework for sign gesture recognition using improved genetic algorithm and adaptive filter[J]. Cogent Engineering, 2016, 3(1): 1251730.
Gonzales D, Kaplan J M, Saltzman E, et al. Cloud-trust—A security assessment model for infrastructure as a service (IaaS) clouds[J]. IEEE Transactions on Cloud Computing, 2015, 5(3): 523–536.
Wu F F. Real-time network security monitoring, assessment and optimization[J]. International Journal of Electrical Power & Energy Systems, 1988, 10(2): 83–100.
Kotenko I V, Doynikova E. Evaluation of Computer Network Security based on Attack Graphs and Security Event Processing[J]. J. Wirel. Mob. Networks Ubiquitous Comput. Dependable Appl., 2014, 5(3): 14–29.
Viduto V, Maple C, Huang W, et al. A novel risk assessment and optimization model for a multi-objective network security countermeasure selection problem[J]. Decision Support Systems, 2012, 53(3): 599–610.
Saeh I S, Khairuddin A. Static security assessment using artificial neural network[C]//2008 IEEE 2nd International Power and Energy Conference. IEEE, 2008: 1172–1178.
Dong C, Zhao L. Sensor network security defense strategy based on attack graph and improved binary PSO[J]. Safety science, 2019, 117: 81–87.
Lin Z, Chen G, Guo W, et al. PSO-BPNN-based prediction of network security situation[C]//2008 3rd International Conference on Innovative Computing Information and Control. IEEE, 2008: 37–37.
Zheng Q. Information System Security Evaluation Algorithm Based on PSO-BP Neural Network[J]. Computational Intelligence and Neuroscience, 2021, 2021.
Kalyani S, Swarup K S. Particle swarm optimization based K-means clustering approach for security assessment in power systems[J]. Expert systems with applications, 2011, 38(9): 10839–10846.
Cruz L M, Alvarez D L, Al-Sumaiti A S, et al. Load curtailment optimization using the PSO algorithm for enhancing the reliability of distribution networks[J]. Energies, 2020, 13(12): 3236.
Yi B, Cao Y P, Song Y. Network security risk assessment model based on fuzzy theory[J]. Journal of Intelligent & Fuzzy Systems, 2020, 38(4): 3921–3928.
Keserwani P K, Govil M C, Pilli E S, et al. A smart anomaly-based intrusion detection system for the Internet of Things (IoT) network using GWO–PSO–RF model[J]. Journal of Reliable Intelligent Environments, 2021, 7(1): 3–21.
Yong Q, Zhenyu Z, Bo C, et al. Research on the prediction model for the security situation of metro station based on PSO/SVM[J]. Journal of Intelligent Learning Systems and Applications, 2013, 2013.
Yoshida H, Kawata K, Fukuyama Y, et al. A particle swarm optimization for reactive power and voltage control considering voltage security assessment[J]. IEEE Transactions on power systems, 2000, 15(4): 1232–1239.
Tao X, Liu Z, Yang C. An efficient network security situation assessment method based on AE and PMU[J]. Wireless Communications and Mobile Computing, 2021, 2021.
Ansari M S, Bartoš V, Lee B. GRU-based deep learning approach for network intrusion alert prediction[J]. Future Generation Computer Systems, 2022, 128: 235–247.
Fan J, Mu D, Liu Y. Research on network traffic prediction model based on neural network[C]//2019 2nd International Conference on Information Systems and Computer Aided Education (ICISCAE). IEEE, 2019: 554–557.
Dong R H, Shu C, Zhang Q Y, et al. Security Situation Prediction Method for Industrial Control Network Based on Adaptive Grey Verhulst Model and GRU Network[J]. International Journal of Network Security, 2022, 24(1): 49–61.
Wahab O A, Bentahar J, Otrok H, et al. Towards trustworthy multi-cloud services communities: A trust-based hedonic coalitional game[J]. IEEE Transactions on Services Computing, 2016, 11(1): 184–201.
Xiao-ling T, Zi-yi L, Chang-song Y. An Efficient Network Security Situation Assessment Method Based on AE and PMU[J]. Wireless Communications & Mobile Computing (Online), 2021, 2021.
Chen Q, Wang H. Time-adaptive transient stability assessment based on gated recurrent unit[J]. International Journal of Electrical Power & Energy Systems, 2021, 133: 107156.
Kasongo M S, Sun Y. A Gated Recurrent Unit based Intrusion Detection for SCADA Networks[C]//2021 6th International Conference on Computing, Communication and Security (ICCCS). IEEE, 2021: 1–6.
Zhang H, Kang C, Xiao Y. Research on network security situation awareness based on the LSTM-DT model[J]. Sensors, 2021, 21(14): 4788.
Althubiti S A, Jones E M, Roy K. LSTM for anomaly-based network intrusion detection[C]//2018 28th International telecommunication networks and applications conference (ITNAC). IEEE, 2018: 1–3.
Lv Y, Ren H, Gao X, et al. Multi-scale Risk Assessment Model of Network Security Based on LSTM[C]//International Conference on Verification and Evaluation of Computer and Communication Systems. Springer, Cham, 2020: 257–267.
Muhammad K, Ullah A, Imran A S, et al. Human action recognition using attention based LSTM network with dilated CNN features[J]. Future Generation Computer Systems, 2021, 125: 820–830.
Dong Z, Su X, Sun L, et al. Network security situation prediction method based on strengthened LSTM neural network[C]//Journal of Physics: Conference Series. IOP Publishing, 2021, 1856(1): 012056.
Rasheed I, Hu F, Zhang L. Deep reinforcement learning approach for autonomous vehicle systems for maintaining security and safety using LSTM-GAN[J]. Vehicular Communications, 2020, 26: 100266.
Meng F, Fu Y, Lou F, et al. An effective network attack detection method based on kernel PCA and LSTM-RNN[C]//2017 International Conference on Computer Systems, Electronics and Control (ICCSEC). IEEE, 2017: 568–572.
Boukhalfa A, Abdellaoui A, Hmina N, et al. LSTM deep learning method for network intrusion detection system[J]. International Journal of Electrical and Computer Engineering, 2020, 10(3): 3315.
Wang Q, Bu S, He Z, et al. Toward the prediction level of situation awareness for electric power systems using CNN-LSTM network[J]. IEEE Transactions on Industrial Informatics, 2020, 17(10): 6951–6961.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

A Network Security Situation Assessment Method Based on Fusion Model

Status:

Version 1

Abstract

Figures

1. Introduction

2. Related Work

2.1 Traditional methods

2.2 Particle swarm-based algorithms

2.3 Machine learning-based methods

3. FM model

4. Results

5. Conclusion

Declarations

References

Additional Declarations

Status:

Version 1