A New Approach to Network Traffic Efficiency and DDOS Attack Detection on Software-defined Networks

doi:10.21203/rs.3.rs-2326340/v1

Download PDF

Research Article

A New Approach to Network Traffic Efficiency and DDOS Attack Detection on Software-defined Networks

https://doi.org/10.21203/rs.3.rs-2326340/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 24 Oct, 2023

Read the published version in Tehnicki vjesnik - Technical Gazette →

Version 1

posted

You are reading this latest preprint version

Many devices have been connected to each other and a wide platform has been formed with the development of internet technologies. The continuous expansion of this platform has revealed requirements such as single-point management, accessibility, bandwidth management, and efficient use of the network. Considering that software-defined networks are systematically managed with software, it is predicted that they will meet the specified network requirements more easily.

In this study, a model that detects both efficient use of the network and Distributed Denial of Service Attack (DDOS) attacks by clustering the bandwidths according to the network traffic history collected over the software-defined network is proposed. A dataset specific to the study was created to determine possible attacks and usages according to bandwidth. From the data in the dataset, 3 different datasets were created with the Kmeans clustering algorithm. A virtual network was created for the implementation of the model and tests were carried out on this network. Efficient use of the network is ensured by allocating bandwidth according to clusters created especially in multi-user, heavy-traffic networks. In addition, while the data is being collected, DDOS scanning is also performed to prevent possible attacks on the network.

SDN

Intrusion Detection

Intrusion Clustering

K-Means

Thanks to the developments in information technologies, internet technology has created a large digital platform where every device is connected to each other. This platform has revealed control and management requirements such as accessibility, smart management, and bandwidth management. Traditionally, the manual configuration of Internet Protocol networks (IP-Internet Protocol) and devices used today is a difficult and complex process[1]. Performance delays may occur during the manual configuration process. In addition, there is a risk of human error during configuration. When predetermined rules need to be updated, programming the working network and reconfiguring the network depending on the dynamic changes that may occur on the existing network require a series of difficult operations that negatively affect the time cost.

The data produced by IT assets are stored according to a certain writing standard and these data are stored in different file extensions. Log records of operating systems, network assets, and firewalls are critical records to be obtained for the security of the system. Especially in the development of software-defined network solutions, data generated in the operating system or network is needed. In order to reveal artificial intelligence-supported solutions in possible anomaly detection or configuration processes, data should be included in a single dataset and data-cleaning processes should be applied [2].

A Software Defined Network (SDN) can be defined as a software-managed network programmatically. SDN networking, network performance boosting, troubleshooting, etc. It is configured using the Open-Flow protocol with software written for various purposes. The slow, expensive and limited nature of changes in traditional networks hinders investments and network switching in many organizations [3].

Programming switchgear has become a challenge as requirements grow with ever-growing network sizes. Setting up individual network switches manually is time-consuming in large networks and businesses running multiple virtual systems. Since SDN is an approach that provides software changes on the network, monitoring and controlling the network, it facilitates the management and configuration of the network. Compared to traditional networks, the data and control plane of the switching device are separated in the SDN network [4].

In SDN networks, the control and data planes are separate from each other and the decision power is given to the SDN controller in network packet transmission. This is why switching devices behave like a dumb device. This centralized control in SDN networks makes the network vulnerable to attacks such as Denial of Service (DoS), spoofing, and flooding. Such attacks reduce SDN performance by disabling different units such as switching devices and network controllers [5].

In this study, a new model proposal for clustering home network efficiency is presented in an SDN network based on historical network traffic collected over the network. In the study, the Isparta University of Applied Sciences Information Processing Department backbone switching device was used to obtain the data. Clustering was performed in the K-means algorithm by analyzing the data, and the IP addresses in the clusters obtained in the last step were assigned to different bandwidths with pre-defined queues to ensure effective use of the network. While collecting data, DDOS attacks were detected on the network at the same time, and the network was protected against attacks. As a result, with this SDN-based approach developed, the bandwidth of the networks is adjusted and the network is protected against attacks. The original contributions of the proposed study model and its contributions to the literature are given below.

Use of a large study-specific dataset
Ensuring network efficiency in dense and high bandwidth institutions
Detection of DDOS attacks in networks with servers for the enterprise

2.1 Software Defined Network-SDN

Software Defined Networks (SDN) technology offers methods capable of solving these problems. While it is necessary to program each network device (router-router, switch-switching device) separately in generally used networks, with the management software used in software-defined networks, the devices in the network are programmed from a single point and the devices are automatically configured according to the determined rules. Managing the network from a single point; It offers a flexible environment that increases the traceability of the network, facilitates the detection of errors that may occur on the network, and simplifies network management. Figure 1 shows the working diagrams of the traditional network structure and the software-based network structure.

Software Defined Networks (SDN) offer the opportunity to easily manage all network infrastructures in the system together, thanks to its control panel and user interface that can be managed from a single point. In this way, the operations desired to be performed on the system can be performed quickly and easily. New technology applications planned to be created in network systems will be profitable in terms of service performance and resource usage [7].

It has been shown experimentally that conventional techniques do not provide correct network behavior in case of failure. This is because these techniques only address part of the problem. Ignoring the switch state can result in inconsistencies, potentially resulting in severe network anomalies. This leads to the need to design fault-tolerant SDN solutions [8].

Software Defined Networks emerged as a challenge to the limitations of traditional network architectures. Its main advantages can be listed as programmability, centralized network view, and separate operation of the data plane and control plane. With these features, the implementation of QoS in various network applications attracts the attention of developers and researchers [9].

2.2. ONOS Software Defined Network Controller

The Open Network Operating System (ONOS) is an operating system designed to help network service providers create carrier-level software-defined networks designed for high scalability, availability, and performance. While specifically designed to meet the needs of service providers, ONOS can also function as a software-defined network (SDN) control plane for enterprise campus local area networks (LAN) and data center networks.

ONOS software-defined network controller will be installed on the computer at least; It should have 2 core processors, 2 GB RAM, 10 GB hard disk, 1 network controller, and Linux operating system. It is recommended not to run ONOS services with root users on Linux operating system. For this reason, it is recommended to create a user named "SDN" for the ONOS application. Since ONOS is a java-based application, java and its plugins must be installed before starting the installation [10].

SDN control software used today has been examined in terms of programming language, interface support, documentation, modularity, platform support, southern part API, and northern part API criteria and shown in Table 1 comparatively.

Table 1

Feature-based comparison of popular open source SDN control software [11]
SDN Software	ONOS	OPEN DAYLIGHT	NOX	RYU
Programming Language	Java	Java	C++	Python
GUI	Web	Web	Python	Python
Documentation	Good	Very Good	Poor	Fair
Modularity	High	High	Low	Low
Platform Support	Linux, Mac OS, Windows	Linux, Mac OS, Windows	Linux	Linux
Southbound API	OF1.0, 1.3, Netconf	OF1.0, 1.3, 1.4, NETCONF/YANG, OVSDB, PCEP, BGP/LS, LISP, SNMP	OF 1.0	OF 1.0, 1.2, 1.3, 1.4, NETCONF, OFCONFIG
Northbound API	Rest API	Rest API	Rest API	Rest API

2.3. Mininet Network Simulation Application

Mininet is a network emulator that can network with virtual computers, virtual switches, virtual controllers, and virtual connections. Mininet allows us to build experimental networking on a computer such as research, network development, prototyping, learning, debugging, and testing [12].

The benefits of the Mininet application can be listed as follows [13];

Having tools to create complex network topologies without establishing a physical network structure,
Having networking tools to develop OpenFlow applications,
Having a command line interface to run tests on the network,
It has a simple Python API for creating and managing networks.

In order to install the Mininet virtual networking application, firstly, after downloading the mininet project to the local repository via GitHub, it is necessary to select the desired version and start the installation. Installation is done with the "install.sh -a" command in the downloaded mininet project [14].

Example network commands created with python programming language on Mininet are given in Fig. 2. Figure 3 shows the s1, s2, s3, s4, s5, and s6 switching devices, h1, h2, h3, h4, h5, and h6 host computers connected to switching devices, definitions made with net.addLink command show connections between switching devices and hosts.

2.4. SFLOW protocol

NetFlow is a traffic monitoring technology developed by Darren and Barry Bruins at Cisco in 1996. Defines how a router exports information and statistics of routed sockets. As a de facto industry standard, it is a built-in feature of most routers and switches from Cisco, Juniper, and other vendors. There are several versions of NetFlow from v1 to v9. v5 and v9 are the most used versions [15].

Packet sampling (sFlow) of traffic flow has a long history before NetFlow was developed. Sflow is a technology supported by manufacturers such as Alcatel, Extreme, Force10, HP, and Hitachi, which uses simple random sampling and produces switches and routers that include the sFlow tool. sFlow is software that combines interface counters and flow samples into sFlow datagrams and sends these data to sFlow collectors via UPD [15].

sFlow-RT is an application with InMon's asynchronous analytics technology, providing real-time visibility into Software-Defined Networking (SDN) and DevOps stacks, and introducing new classes of performance-sensitive applications such as load balancing, DDoS mitigation, and workload placement [16].

This SDN-based approach, which works to improve the resource consumption of network assets by limiting user speeds in complex networks according to their average usage, consists of four stages. The first of these stages is to create a virtual network in the mininet environment and communicate with the ONOS software-defined network controller for attack detection. In the second step, data were collected from both the created network and a real network device with the sflow protocol, possible DDOS attacks were detected and the data were examined. The traffic data collected in the third stage were analyzed with the help of the K-means algorithm and divided into groups. In the fourth and last step, the IP addresses in the separated groups were assigned to the predefined Qos on the network device and bandwidth selection was made.

3.1. Creating the Dataset for Clustering

The data to be used by the K-Means clustering for speed limitation were collected with Sflow from the Isparta University of Applied Sciences network. In Fig. 3, the screen output of the interface used in the data collection process with Sflow is given. More than 7.500.000 records in total are stored in PostgreSQL after data cleaning.

3.2. Designing the Virtual Network

In our study, a network consisting of 2 switches and 4 hosts was designed in the mininet virtual network emulator. The view of the designed network structure on the ONOS software-based network controller is given in Fig. 4. In the figure, H1, H2, H3, and H4 represent the hosts used to perform a DDOS attack, while S1 and S2 are the switching devices over which the data is collected.

“ovs-vsctl” command from Openflow protocol commands is used for sflow protocol setting of switching devices in the created network environment. With the help of this command, it is ensured that the traffic passing through the switching devices is directed to the sflow-rt application.

There are some parameters used when configuring sflow in switching devices. With the "target" parameter, the IP address of the sflow aggregator server is entered. With the "header" parameter, the header size of the sflow package is determined. The "sampling" parameter is the ratio of the number of packets arriving at a port to the number of samples received from these packets. Port speed should be taken into account when selecting the sflow sample rate. Sampling selection ranges according to port speed are given in Fig. 5 [17]. The "polling" parameter determines how often sflow packets will be sent to the sflow collector.

“target = onos-ip-address” header = 128 sampling = 400 polling = 30” parameters and values were entered to the switching devices used in the study.

Data from Sflow configured network switching devices were transferred to the database with a generated python script. Meanwhile, DDOS attacks that may occur with the DDOS prevention application of the Sflow-rt software were detected and information about stopping the traffic was sent to the ONOS SDB controller. Source IP, destination IP, packet size information from the data received with the written script are recorded in the database with time data. With the Sflow protocol, the header information given in Fig. 6 can be obtained.

The collected data were processed and counted how many times each destination IP address was used as the destination IP, and during this process, the packet sizes for the relevant destination IP address were collected and recorded. An example image of the recorded data is given in Fig. 7.

While data is being collected with the Sflow protocol, DDos attacks that may occur on the network are also detected and the attacker IP addresses are automatically blocked for a specified period of time. IP addresses whose block has expired are allowed to generate traffic again.

3.3. Clustering Data with K-means Algorithm

K-means algorithm is one of the most known and used methods among clustering methods [19]. Clustering algorithms are useful tools for clustering and analysis of network traffic usage, data mining, compression, probability density estimation [20].

With K-means, it will recursively assign data points to one of its determined clusters, depending on how close the point is to the cluster center. With the K-means algorithm, it is aimed to determine the number of K cluster centroids and data points classified as clusters.

Assuming we have x ₁, x ₂, x ₃, …, x _n data points and K required number of clusters, basically the procedure is followed as follows.

The first centers from the dataset are randomly chosen as K points or the first K points.
Find the Euclidean distance of each point in the data set with the determined K cluster centers.
Assign each data point to its nearest center point using the distance found in the previous step.
Find the new center of gravity by averaging the points in each cluster group.
Reassignment to the group is repeated until the centers do not change or by finding the distance for a fixed number of iterations.

The relationship between the two values in the dataset is calculated over the euclidean distances and the distance between the two points is calculated as shown in Eq. 1.

\({d\left(p,q\right)}^{}=\sqrt{{\left({q}_{1}-{p}_{1}\right)}^{2}{+\left({q}_{2}-{p}_{2}\right)}^{2}}\) (Eq. 1)

If p = ( p ₁, p ₂ ) and q = ( q ₁, q ₂ ) the distance is given as:

[Pyhton Code]

def euclidean_distance(point1, point2):

return math.sqrt((point1[0]-point2[0])**2 + (point1[1]-point2[1])**2)

The number of clusters to be created in the K-means algorithm is given to the algorithm as a parameter. How many clusters the data should be divided into was determined using the elbow method. In Fig. 8, elbow method graphs created according to daily, weekly and monthly data are given.

Based on the daily, weekly and monthly graphs created by the elbow method, it was determined that the most suitable number of clusters for the k-means algorithm was '3'. After determining the number of clusters, the process of assigning each data to the nearest cluster is started. If each cluster center is denoted by ci, each x data point is assigned to a cluster based on Eq. 2. Here dist() is the euclidean distance.

Equation 3 is applied to find the new center of the clustered data from the point group. S_i is the set of all points assigned to the İ set.

\({c}_{i}=\frac{1}{{S}_{i}}\sum _{{x}_{i}\in {S}_{i}}{e}_{i}\) Eq. 3

The collected and separated data were divided into 3 different clusters with the K-means algorithm. Clusters created with the K-means algorithm are given in Fig. 9.

3.4. Processing of clustered data in Onos

IP addresses in clusters divided by K-means are assigned to pre-created Qos queues in network switching devices, taking into account the end link speed. The user port speed of the generally used switching devices is taken as 1 Gbps. The first queue bandwidth is 500 Mbps, the second queue bandwidth is 350 Mbps, and the third queue bandwidth is 150 Mbps. The bandwidth controller required for the queues to work is limited as in Eq. 4. Here, q_data is the data controller for limiting, and q₁₅₀, q_300, q₅₀₀ are the queues created.

For the IP address cluster with the highest total packet size and hit count among the data on which the K-Means algorithm is applied, a flow record is entered to the ONOS controller to assign these traffics to the first queue with 500 Mbps bandwidth. IP addresses in the cluster with the lowest total packet size and hit count are assigned to the queue with 150 Mbps bandwidth. For the remaining cluster, a queue with 300 Mbps bandwidth was used.

Onos SDN controller defaults to 'forward' module priority value is "10". The priority value for the entered flow records is selected to be more than “10”. Figure 10 shows examples of logged flow records for each queue.

Speed tests were carried out with the iperf application on the virtual network created on mininet to test the applied flow records and queues. In the tests, queues with bandwidths of q₁₅₀for the h1 host computer, q₃₀₀ for the h2 host computer, and q₅₀₀ for the h3 host computer were selected. In Fig. 11, the iperf speed test results from the h4 computer to the h3, h2, and h1 host computers are given, respectively.

According to the speed test results, the queue selection process was performed according to the flow records entered into the ONOS SDN controller, and the bandwidth selection process was carried out correctly depending on the different queues for the IP addresses of different clusters.

In order to test the DDOS attacks that may occur on the network, the intruder IP addresses are blocked with the codes added to the sflow application by continuously sending packets to the 5001 port with iperf on the virtual network. In Fig. 12, a screenshot of a blocked traffic and the flow record on the onos controller are given.

The rapid development of internet technologies and the use of the internet in all areas have revealed the need for increased bandwidth. Software-defined networks are currently a research area used for network management. Bandwidth needs of users can be adjusted on network devices with Qos.

In this study, internet traffic created by users on the software-defined network was collected and a possible DDOS attack was detected. Different clusters were created by processing the collected traffic data with the K-means algorithm. IP addresses in the created clusters are assigned to different Qos defined in network switching devices via onos software-defined network controller. In this way, users are provided with higher bandwidth to connect to the most frequently visited IP addresses on the internet.

In the next phase of this work, data packets within the network will be included in the continuous dataset and dynamic expansion of the 3 active clusters will be provided if more clusters are needed in different networks.

Ethical Approval

Not applicable.

Competing interests

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Authors' contributions

Not applicable.

Funding

Not applicable.

Availability of data and materials

The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.

W. Xia, Y. Wen, C. H. Foh, D. Niyato, ve H. Xie, “A Survey on Software-Defined Networking”, IEEE Commun. Surv. Tutorials, c. 17, sayı 1, ss. 27–51, Oca. 2015, doi: 10.1109/COMST.2014.2330903.
A. A. Süzen, “Developing a multi-level intrusion detection system using hybrid-DBN”, J. Ambient Intell. Humaniz. Comput., c. 12, sayı 2, ss. 1913–1923, Şub. 2021, doi: 10.1007/S12652-020-02271-W/TABLES/8.
D. Singh Rana, S. Kumar Chamoli, ve S. Ashish Dhondiyal, “Software Defined Networking (SDN) Challenges, issues and Solution Network Security View project Sleeping Mode MODLEACH Protocol for WSN View project Software Defined Networking (SDN) Challenges, issues and Solution”, Int. J. Comput. Sci. Eng. Open Access Res. Pap., sayı 7, 2019, doi: 10.26438/ijcse/v7i1.884889.
S. Hikmat Haji vd., “Editor(s): (1) Dr. Xiao-Guang Lyu, Huaihai Institute of Technology”, Asian J. Res. Comput. Sci., c. 9, sayı 2, ss. 1–18, 2021, doi: 10.9734/AJRCOS/2021/v9i230216.
M. Imran, M. H. Durad, F. A. Khan, ve A. Derhab, “Toward an optimal solution against Denial of Service attacks in Software Defined Networks”, Futur. Gener. Comput. Syst., c. 92, ss. 444–453, Mar. 2019, doi: 10.1016/J.FUTURE.2018.09.022.
T. A. Tang, L. Mhamdi, D. McLernon, S. A. R. Zaidi, ve M. Ghogho, “Deep learning approach for Network Intrusion Detection in Software Defined Networking”, Proc. - 2016 Int. Conf. Wirel. Networks Mob. Commun. WINCOM 2016 Green Commun. Netw., ss. 258–263, Ara. 2016, doi: 10.1109/WINCOM.2016.7777224.
M. Cicioğlu ve A. Çalhan, “Yazılım Tanımlı Ağlar – YTA”, Karaelmas Sci. Eng. J., c. 7, sayı 2, ss. 684–695, Haz. 2017, Erişim: 06 Eylül 2022. [Çevrimiçi]. Available at: https://dergipark.org.tr/en/pub/karaelmasfen/issue/57121/805900
A. Mantas ve F. M. V. Ramos, “Rama: Controller Fault Tolerance in Software-Defined Networking Made Practical”, Şub. 2019, doi: 10.48550/arxiv.1902.01669.
M. Karakus ve A. Durresi, “Quality of Service (QoS) in Software Defined Networking (SDN): A survey”, J. Netw. Comput. Appl., c. 80, ss. 200–218, Şub. 2017, doi: 10.1016/J.JNCA.2016.12.019.
Onos, “Onos Project”, 2020. https://wiki.onosproject.org3
O. Salman, I. H. Elhajj, A. Kayssi, ve A. Chehab, “SDN controllers: A comparative study”, Proc. 18th Mediterr. Electrotech. Conf. Intell. Effic. Technol. Serv. Citizen, MELECON 2016, Haz. 2016, doi: 10.1109/MELCON.2016.7495430.
Mininet, “Mininet”.
X. Hesselbach Serra, “Implementació bàsica i proves de funcionament de la plataforma ONOS”, 2019, Erişim: 28 Eylül 2022. [Çevrimiçi]. Available at: https://upcommons.upc.edu/handle/2117/132166
Mininet, “No Title”, 2020. http://mininet.org
B. Li, J. Springer, G. Bebis, ve M. Hadi Gunes, “A survey of network flow applications”, J. Netw. Comput. Appl., c. 36, sayı 2, ss. 567–581, Mar. 2013, doi: 10.1016/J.JNCA.2012.12.020.
SFlow-RT, “sFlow-RT”, 2021. https://sflow-rt.com/ (erişim 22 Eylül 2022).
sFlow, “sFlow sampling rates”, 2009. https://blog.sflow.com/2009/06/sampling-rates.html (erişim 08 Eylül 2022).
R. M. A. Ujjan, Z. Pervez, K. Dahal, A. K. Bashir, R. Mumtaz, ve J. González, “Towards sFlow and adaptive polling sampling for deep learning based DDoS detection in SDN”, Futur. Gener. Comput. Syst., c. 111, ss. 763–779, Eki. 2020, doi: 10.1016/J.FUTURE.2019.10.015.
K. P. Sinaga ve M. S. Yang, “Unsupervised K-means clustering algorithm”, IEEE Access, c. 8, ss. 80716–80727, 2020, doi: 10.1109/ACCESS.2020.2988796.
G. Hamerly ve C. Elkan, “Learning the k in k-means”, içinde Advances in Neural Information Processing Systems, 2003, c. 16. [Çevrimiçi]. Available at: https://proceedings.neurips.cc/paper/2003/file/234833147b97bb6aed53a8f4f1c7a7d8-Paper.pdf

No competing interests reported.

Download PDF

Journal Publication

published 24 Oct, 2023

Read the published version in Tehnicki vjesnik - Technical Gazette →

Version 1

posted

You are reading this latest preprint version

A New Approach to Network Traffic Efficiency and DDOS Attack Detection on Software-defined Networks

Status:

Journal Publication

Version 1

Abstract

Figures

1. Introduction

2. Examination Of Methods

2.1 Software Defined Network-SDN

2.2. ONOS Software Defined Network Controller

2.3. Mininet Network Simulation Application

2.4. SFLOW protocol

3. Design Of Attack Detection And Clustering System

3.1. Creating the Dataset for Clustering

3.2. Designing the Virtual Network

3.3. Clustering Data with K-means Algorithm

3.4. Processing of clustered data in Onos

4. Research Findings

5. Conclusion

Declarations

References

Additional Declarations

Status:

Journal Publication

Version 1