Dynamic nearest neighbor resources classification algorithm for multiple cloud data center based on natural clustering rule

doi:10.21203/rs.3.rs-2086856/v1

Download PDF

Research Article

Dynamic nearest neighbor resources classification algorithm for multiple cloud data center based on natural clustering rule

https://doi.org/10.21203/rs.3.rs-2086856/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Cloud service providers need to reduce the operating costs and energy consumption of cloud data center (CDC) by optimizing scheduling algorithms, and ultimately reduce the cost of cloud users. However, the existing scheduling algorithms are less effective in dealing with the scheduling problems of multi-cloud data center (MDC). This paper systematically analyzes the MDC model and physical machine (PM) utilization. Secondly, using the idea of K-means clustering algorithm in machine learning, natural clustering rules are proposed to complete automatic clustering of PMs. Then, the supervised learning KNN classification algorithm is extended and the dynamic KNN classification rules are established accordingly. Finally, a dynamic nearest neighbor resources classification algorithm for multiple CDC based on natural clustering rule (DNSC) is proposed. Comparing the algorithm with the comparison algorithm shows that the algorithm comprehensively considers the resource parameters of the MDC and ultimately reduces the energy consumption of the MDC.

Clustering Algorithm

Supervised Learning

Cloud Data Center

Machine Learning

Energy Consumption Optimization

Large-scale cloud service providers will deploy different CDCs in many regions of the world according to national policies, climates, and costs. This not only can more effectively meet the needs of users in the region, but also can choose different energy supply methods and scheduling methods according to the climate environment. However, the development of MDCs has also brought about an increase in the complexity of scheduling algorithms.

After the CDC is deployed, it will enter a continuous operation stage, and the cost of the operation stage is mainly composed of its energy consumption. Existing scheduling algorithms have conducted certain research on energy consumption optimization, but these algorithms have certain limitations[1, 2]. First, a MDC may lead to different energy consumption and maintenance costs due to different climate and policy conditions in the region. Secondly, different PM models for building a CDC will show different PM parameters. This situation will increase the complexity of virtual machine (VM) deployment. Third, the VM capacity leased by cloud service providers has certain specifications, and the number of CPU cores and memory size also meet a certain ratio. Finally, different CDCs will cause differences in VM rental fees due to their different costs. Cloud users can choose VMs provided by CDCs in different regions according to their own budget and performance requirements. These factors will increase the complexity of the scheduling algorithm, and different algorithms will cause great differences in energy consumption and costs.

Considering these practical issues, this article first analyzes the impact of MDC model and PM utilization on energy consumption. Secondly, the supervised learning KNN classification algorithm is expanded and the dynamic KNN classification rules are established accordingly. Third, a dynamic nearest neighbor resources classification algorithm for multiple CDC based on natural clustering rule (DNSC) is proposed.

The main contributions of this paper are as follows:

(1) We determined the utilization of PMs.

(2) We propose natural clustering rule. At the same time, we establish dynamic KNN(DKNN) classification rules accordingly.

(3) We propose a dynamic nearest neighbor resources classification algorithm for multiple CDC based on natural clustering rule.

(4) The algorithm is verified by using the real data of Amazon cloud service.

The following content mainly contains these contents. Section 2 describes the related work of the scheduling algorithm. Section 3 establishes a MDC model, natural clustering rule and dynamic KNN classification rule. Section 4 introduce the DNSC algorithm. Section 5 simulates the real effect of the algorithm. The algorithm is summarized in Section 6.

Mapping and scheduling algorithms are the core technologies of CDCs. The previous research mainly included the optimization of efficiency and the reduction of energy consumption.

2.1 Research based on optimization efficiency

Li, et al. [3] proposes a resource management strategy, which can meet the workload of edge cloud and reduce the financial cost of renting nodes to the greatest extent. Wang, et al. [4] maximizes the profit of MSP by jointly scheduling network resources in C-RAN and computing resources in MEC, and proposes a unified framework of power performance tradeoff for MSP. Tong, et al. [5] proposes a novel artificial intelligence algorithm called deep Q learning task scheduling (DQTS), which combines the advantages of Q learning algorithm and deep neural network. Bi, et al. [6] proposes an integrated prediction method, which has the functions of noise filtering and data frequency representation, called Svitzky Golay and wavelet supported random assignment network (SGW-SCN) to predict the workload in future slots. Zhang, et al. [7] proposes a privacy preserving collaborative personalized search (CPPS) scheme in cloud environment, which uses matrix encryption to ensure the privacy of users. Khabbaz and Assi [8] focuses on the proposal of a novel deadline aware ur scheduling scheme (DASS), which aims to improve the QoS performance of data centers according to the above indicators. Yan, et al. [9] introduces uncertainty into task runtime estimation model and proposes a fault-tolerant task allocation mechanism. Liu, et al. [10] considers a multiple mapping mechanism, which allows VMs assigned to a user to be mapped to PMs. This can improve social welfare and reduce resource fragmentation. Sahni and Vidyarthi [11] aims to consider the performance variability of VM and instance acquisition delay, and take advantage of the advantages of cloud computing to determine the timely planning of scientific workflow with limited deadline at lower cost. The performance evaluation on some famous scientific workflow shows that the algorithm has better performance than the latest heuristic algorithm. Fahmideh, et al. [12] proposes an interactive target reasoning method, which is supported by probability layer, and is used to analyze cloud migration risk accurately to improve the reliability of risk control. Cao, et al. [13] proposes an automatic pre-allocation strategy to solve the problem of bandwidth over subscription in CDC. Alsarhan, et al. [14] is the first attempt to promote this integration to improve the profits of cloud service providers and avoid SLA violations. Numerical analysis emphasizes that this method can avoid violating the SLA requirements and maximize the profit of CP in the changing cloud environment. This study is the first attempt to promote such integration to improve the profits of cloud service providers and avoid SLA violations.

2.2 Research based on reducing energy consumption

Ismayilov and Topcuoglu [15] takes into account the minimum manufacturing time, cost, energy and imbalance, so as to maximize the reliability and utilization. Zhang and Wen [16] studies energy-saving collaborative task execution to reduce energy consumption on mobile devices. Wang, et al. [17] aims to achieve a network of green data centers and save as much energy as possible. Wei, et al. [18] presents a heterogeneous resource allocation method called avoid skewness avoid Multiple Resource Allocation (SAMR), which is used to allocate resources based on diverse requirements for different types of resources. Xie, et al. [19] is designed to reduce the energy consumption of multiple real-time workflows on CPCS, which maximizes the number of workflows completed within its time limit. Liang, et al. [20] maps cloud tasks and deploys VMs by prioritizing memory utilization. Finally, the algorithm improves the utilization of PMs and reduces the energy consumption of CDCs. Li, et al. [21] reduces the energy consumption of CDCs by proposing detailed thermal models to analyze the temperature distribution of airflow and server CPUs. Nir, et al. [22] presents a mathematical task scheduler model that can save energy and money. Compared with previous models, this model allows mobile devices to offload multiple tasks to cloud resources. Sahoo, et al. [23] designs a link-based virtual resource management (LVRM) algorithm to map VMs to PM based on the available and required resources of PM and VM, respectively. Dabbagh, et al. [24] presents an integrated, energy-efficient resource allocation framework for overused clouds. Liang, et al. [25] can reduce the energy consumption of cloud computing clusters by using the wear and tear comparison rule. As the size of the cluster and the number of tasks increase, the effect of this method is more obvious. Wei, et al. [26] expresses the problem as a three-dimensional packing optimization to minimize the energy costs of the worker and idle machine. Yang, et al. [27] establishes a simplified model of cloud computing task scheduling system. Unlike previous studies on cloud computing task scheduling algorithms, this simplified model is based on game theory as a mathematical tool.

In summary, performance improvements often result in increased energy consumption. To solve these problems, we take into account the diversity of cloud user needs and the heterogeneity of CDCs. In view of the relationship between CDC utilization and energy consumption, the KNN classification algorithm for supervised learning are used for reference, and the natural clustering rule and dynamic KNN classification rule are established. Finally, a dynamic nearest neighbor resources classification algorithm for multiple CDC based on natural clustering rule is presented.

3.1 MDC

This article is mainly to solve the problem of resource scheduling and allocation in MDCs. Therefore, we first conduct research on MDCs and establish corresponding models. In the research, the model assumes that the set of CDCs DC=[dc₁,dc₂,...,dc_n], where n represents the number of CDCs available to cloud service providers. At the same time, each CDC will be expressed and the specific parameter meanings are shown in Table 1.

Table 1

model parameters of CDC
parameter	meaning	unit
dc_i.np	Number of PMs	/
dc_i.ap	Number of activated PMs	/
dc_i.au	Average utilization of PMs	/
dc_i.te	Energy consumption of CDC	kW·h

Similarly, the PMs in a CDC can also be represented as a collection. The specific meanings and units of the parameters are shown in Table 2.

Table 2

model parameters of PM
parameter	meaning	unit
pm_j.tc	Number of CPU cores	/
pm_j.tm	Memory size	GB
pm_j.uc	CPU core utilization	/
pm_j.um	Memory utilization	/
pm_j.ft	Complete time	/
pm_j.dn	CDC number	/

The VM serves as the basis for cloud service providers to provide cloud services externally. The specific meanings and values of the parameters are shown in Table 3.

Table 3

model parameters of VM
parameter	meaning	unit
vm_k.tc	Number of CPU cores	/
vm_k.tm	Memory size	GB
vm_k.ft	Complete time	/
vm_k.pn	PM number	/

3.2 Natural clustering rule

Once a cloud user submits an application for VM rental, it will selects a suitable PM to deploy it according to the current PM operation. We take 9 PMs in three CDC as an example. They are all homogeneous PMs based on Intel Xeon E5-2686 v4. Therefore, the number of CPU cores is 18, and the corresponding memory size is 72GB. Their parameters are shown in Table 4.

Table 4

parameters before PM deployment
PM	tc	tm	uc	um	ft	dn
pm₁	18	72 GB	60%	60%	4000	1
pm₂	18	72 GB	50%	50%	3600	2
pm₃	18	72 GB	10%	10%	3700	2
pm₄	18	72 GB	40%	40%	7000	3
pm₅	18	72 GB	60%	60%	7200	3
pm₆	18	72 GB	45%	45%	7300	2
pm₇	18	72 GB	60%	60%	7500	1
pm₈	18	72 GB	40%	40%	8000	1
pm₉	18	72 GB	45%	45%	10800	3

The entire scheduling process is very fast and can be dynamically changed as cloud user needs change. This article uses the VM parameters in Table 5, and comprehensively considers the utilization and energy consumption of the PM when deploying it. The FF algorithm only considers the utilization rate of the PM. Therefore, the PM that can meet its parameter requirements is selected according to the PM number. Because the remaining space of pm₁ is not enough to deploy vm₁, pm₂ is selected to deploy vm₁. If vm₁ is deployed in this way, the CPU utilization of pm₂ is 94%. In order to pursue higher efficiency, the scheduling algorithm usually increases the utilization of the PM as much as possible. However, a fully loaded physical opportunity results in a decrease in VM performance or PM downtime due to the competition of VMs for public resources. Therefore, the upper limit of the utilization of the PM is usually set to 90%, and some redundancy is set aside to improve the performance of the VM. Therefore, vm₁ cannot be deployed on pm₂. At the same time, studies have shown that even when the PM is empty, the energy consumption is 50%-70% of the full load. Therefore, this article sets a lower limit of 60% for PM utilization. According to this setting, pm₃ cannot complete the deployment of vm₁. The scheduling algorithm continues to find that pm₄ can complete the deployment of vm₁, and the upper and lower limits of pm₄ utilization after deployment meet the conditions. Finally, the deployment of vm₁ is completed on pm₄. The process of FF algorithm deployment of VMs is shown in Table 6. Similarly, in order to ensure the performance of the VM and the energy consumption of the PM, the deployed PM should meet the upper and lower limits of PM utilization. According to this demand, we found that because pm₄, pm_6, pm₈ and pm₉ can all meet the needs of PM utilization, but deploying vm₁ on pm₆ will maximize the utilization of the PM. Therefore, the BF algorithm will choose pm₆ to complete the deployment of VM. The selection process of BF algorithm deployment VM is shown in Table 7. As this article studies the resource scheduling problem of MDCs, the clustering process can be naturally completed according to the CDC.

Rule 1. Natural clustering rule. Since the deployment of VMs is the first selection of PM clustering, PMs are selected when the clustering is confirmed. Therefore, it is necessary to cluster the PMs of the MDC. However, these PMs can be divided into different CDCs due to geographical conditions, so as to complete the natural clustering of the PMs. In the end, each CDC will form an independent cluster. Next, expand the KNN algorithm to complete the classification of VMs and deploy the VMs.

Table 5

parameters of VM
VM	tc	tm	ft	pn
vm₁	8	32GB	9000	/
vm₂	4	16GB	7800	/

Table 6

the selection process of FF algorithm deployment VM
PM	tc	tm	uc	um	ft	dn
pm₁	18	72 GB	104%	104%	9000	1
pm₂	18	72 GB	94%	94%	9000	2
pm₃	18	72 GB	54%	54%	9000	2
pm₄	18	72 GB	84%	84%	9000	3

Table 7

the selection process of BF algorithm deployment VM
PM	tc	tm	uc	um	ft	dn
pm₄	18	72 GB	84%	84%	9000	3
pm₆	18	72 GB	89%	89%	9000	2
pm₈	18	72 GB	84%	84%	9000	1
pm₉	18	72 GB	89%	89%	10800	3

3.3 Dynamic KNN classification rule

Supervised learning is an important part of machine learning, and the classification problem is the core problem of supervised learning. In the classification problem, the KNN algorithm is widely used in various fields. Its basic idea is to select K objects that meet the conditions and come from different categories when classifying the target object. Then statistically select the category containing the most objects as the target category, and classify the target object into this category. This can ensure that the target object is classified into the appropriate category as much as possible. At the same time, the K value can adjust the classification result, and the classification result becomes more reasonable as the K value increases.

We still use the PMs in Table 4 and the VM parameters in Table 5, and the clustering of PMs has been completed through natural clustering rules. Through observation, it can be found that pm₄, pm₆, pm₈, and pm₉ can all meet the upper and lower limits of PM utilization. Therefore, at this time, we take the value of K as 4. According to the KNN algorithm, the number of PMs meeting the conditions is shown in Table 8. The third cluster contains 2 PMs that meet the conditions, and vm₁ should be classified into this cluster. After considering that the PM completion time cannot be extended, pm₉ will be selected. The selection process of the VM of this deployment method is shown in Table 9.

Table 8

the vm₁ process of KNN
cluster	number of PM
1	1
2	1
3	2

Table 9

the selection process of comprehensive of utilization and completion time deployment vm₁
PM	tc	tm	uc	um	ft	dn
pm₄	18	72 GB	40%	40%	7000	3
pm₆	18	72 GB	45%	45%	7300	2
pm₈	18	72 GB	40%	40%	8000	1
pm₉	18	72 GB	89%	89%	10800	3

After completing the classification and deployment of vm₁, the parameters of the PM have changed. The PMs are shown in Table 10. At this time we will classify and deploy vm₂. Observation at this time shows that pm₁, pm₂, pm₄, pm₅, pm₆, pm₇, and pm₈ can all meet the upper and lower limits of PM utilization. The number is different from when vm₁ was deployed. Therefore, we can dynamically adjust the K value in KNN by meeting the upper and lower limits of the PM. This can further improve the accuracy of classification, according to which we propose dynamic KNN clustering rule.

Rule 2. Dynamic KNN classification rule. When deploying a new VM application, the MDC performs KNN classification on the VM according to the result of the natural clustering rule. The scheduling system will use the number of VMs that meet the upper and lower limits of PM utilization as the K value of the KNN algorithm, which will cause the deployment of each VM to produce a different K value. The dynamic change of K value ultimately leads to more accurate classification results.

According to the dynamic KNN classification rule, we take the value of K as 7. The number of PMs meeting the conditions is shown in Table 11. The first cluster contains 3 PMs that meet the conditions, and vm₂ should be classified into this cluster. After considering that the PM completion time cannot be extended, pm₈ will be selected. The selection process of the VM of this deployment method is shown in Table 12.

Table 10

parameters after vm₁ deployment
PM	tc	tm	uc	um	ft	dn
pm₁	18	72 GB	60%	60%	4000	1
pm₂	18	72 GB	50%	50%	3600	2
pm₃	18	72 GB	10%	10%	3700	2
pm₄	18	72 GB	40%	40%	7000	3
pm₅	18	72 GB	60%	60%	7200	3
pm₆	18	72 GB	45%	45%	7300	2
pm₇	18	72 GB	60%	60%	7500	1
pm₈	18	72 GB	40%	40%	8000	1
pm₉	18	72 GB	89%	89%	10800	3

Table 11

the vm₂ process of KNN
cluster	number of PM
1	3
2	2
3	2

Table 12

the selection process of comprehensive of utilization and completion time deployment vm₂
PM	tc	tm	uc	um	ft	dn
pm₁	18	72 GB	60%	60%	4000	1
pm₂	18	72 GB	50%	50%	3600	2
pm₄	18	72 GB	40%	40%	7000	3
pm₅	18	72 GB	60%	60%	7200	3
pm₆	18	72 GB	45%	45%	7300	2
pm₇	18	72 GB	60%	60%	7500	1
pm₈	18	72 GB	62%	62%	8000	1

From the above example, the utilization rate of the PM can be improved without prolonging the running time of the PM, thereby ensuring that efficiency is improved and energy consumption is reduced at the same time. The DNSC algorithm proposed accordingly will be described in the next section.

According to the previous conclusions, the DNSC algorithm as shown in Algorithm 1.

Algorithm 1: (DNSC)

Input: DC collection: DC={dc₁,dc₂,…,dc_n},

PM collection: PM={pm₁,pm₂,…,pm_m},

VM collection: VM={vm₁,vm₂,…,vm_p},

Output: PM collection: PM={pm₁,pm₂,…,pm_m}

1 SCA(DC,PM)

2 while VM not null do

3 NoDC = DKNN(vm_i,DC,PM)

4 if(NoDC = = null)

5 BF(vm_i,DC,PM)

6 end if

7 if(NoTPM!=null)

8 if(FIND(vm_i,NoDC,PM) = 1)

9 DD(vm_i,NoDC,PM)

10 end if

11 if(FIND(vm_i,NoDC,PM) > 1)

12 NDD(vm_i,NoDC,PM)

13 end if

14 end if

15 end while

In the algorithm, the PMs are first clustered according to the natural clustering rule (line 1). Second, classify VMs according to dynamic KNN classification rule (line 3). Finally, according to the classification results, different algorithms are called on the VM to complete the deployment (line 4–14).

The 1st line of Algorithm 1 clusters the PM according to the natural clustering rule, which belongs to the idea of natural clustering. The third line of Algorithm 1 classifies the PMs according to the dynamic KNN classification rule to classify the VMs. As shown in Algorithm 2.

Algorithm 2: (DKNN)

Input: VM to be deployed: vm_i,

DC collection: DC={dc₁,dc₂,…,dc_n},

PM collection: PM={pm₁,pm₂,…,pm_m}

Output: Number of DC: NoDC

1 while PM not null do

2 if(((pm_j.uc + vm_i.tc/pm_j.tc) > = 0.6)&&((pm_j.uc + vm_i.tc/pm_j.tc) < = 0.9))

3 if((pm_j.um + vm_i.tm/pm_j.tm) < = 1.0))

4 A[pm_j.dn]++

5 end if

6 end if

7 end while

8 NoDC = MAX(A[pmj.dn])

The fifth line of Algorithm 1 is that when the VM does not find a suitable classification, the BF algorithm is called to deploy the VM. The 12th line of Algorithm 1 is that there are multiple activated PMs to meet the utilization requirement. In this case, the PM that will not cause the execution time of the VM to be extended is selected for deployment.

Algorithm 3: (NDD)

Input: VM to be deployed: vm_i,

Number of DC: NoDC,

PM collection: PM={pm₁,pm₂,…,pm_m}

Output: PM collection: PM={pm₁,pm₂,…,pm_m}

1 while PM not null do

2 if(pm_j.dn = = NoDC)

3 if(((pm_j.uc + vm_i.tc/pm_j.tc) > = 0.6)&&((pm_j.uc + vm_i.tc/pm_j.tc) < = 0.9))

4 if((pm_j.um + vm_i.tm/pm_j.tm) < = 1.0))

5 if((pm_j.ft-vm_i.ft) < MinDifference)

6 NoPM = j

7 MinDifference = pm_j.ft-vm_i.ft

8 end if

9 end if

10 end if

11 end if

12 end while

13 pm_NoPM.uc + = vm_i.tc/pm_NoPM.tc

14 pm_NoPM.um + = vm_i.tm/pm_NoPM.tm

15 vm_i.pn = NoPM

This section implements the DNSC algorithm, the FF algorithm, BF algorithm and RAE algorithm for comparison based on CloudSim[22]. The parameters are shown in Table 13.

Table 13

experimental parameters
parameters	value
Number of VMs	{10000, 20000, 30000, 40000, 50000}
Number of CDCs	{1, 2, 3, 4, 5}
CPU of the VM	{1, 2, 4}
CPU of the PM	{16, 18, 64}
Lower limit of PM utilization	{0.60, 0.65, 0.70}

5.1 The effect with the number of VMs

Figure 1 shows that the average utilization of the DNSC algorithm is 74%. Compared with the other three algorithms 18%, 17% and 12%, respectively. The precise classification of DKNN classification rules greatly improves the utilization of VMs after deployment, thus making the DNSC algorithm more effective. The results in Fig. 2 show that the total power consumption of the four algorithms increases as the number of VMS increases. However, DNSC algorithm still has a certain reduction compared with the other three. This is mainly because the DNSC algorithm selects a PM for deployment that does not extend execution time.

5.2 The effect with the number of CDCs

The average utilization of the four algorithms remained stable as the number of CDCs increased. However, DNSC algorithm still improved by 20%, 19% and 12% respectively compared with the other three algorithms. The natural clustering rule is very helpful to the PM clustering process of DNSC algorithm. The advantage in Fig. 4 depends on the fact that the execution time is not extended when the vm is deployed to meet the PM utilization.

5.3 The effect with the capacity of the VM

In Fig. 5 and Fig. 6, the average utilization of DNSC algorithm is also greatly improved, and the energy consumption is also reduced to a certain extent. These are all due to the DNSC algorithm, which classifies VMS into clusters that meet the maximum number of PMs.

5.4 The effect with the capacity of the PM

The results in Fig. 7 show that the DNSC algorithm can be applied to any CDC built in a physical structure. The power of the PM shown in Fig. 8 is given in Table 13. The PM based on the AWS Graviton2 CPU has large capacity, low power consumption, and low power consumption. However, compared with FF algorithm, BF algorithm and RAE algorithm, DNSC algorithm is also greatly optimized.

5.5 The effect with the lower limit of the PM utilization

With the change of the lower limit of PM utilization, the utilization of DNSC algorithm is also due to the other three algorithms. The energy consumption of the four algorithms in the CDC is basically stable. Because the DNSC algorithm adopts the natural clustering rule when deploying VMS, the rule first completes the clustering of PMs according to the geographical location of the CDC.

In this paper, we first analyze the impact of multi-cloud data center model and physical machine utilization on energy consumption. Secondly, the K-means clustering algorithm borrowed from unsupervised learning completes automatic clustering of physical machines in multi-cloud data centers. Thirdly, the KNN classification algorithm of supervised learning is extended and dynamic KNN classification rules are established accordingly. Then, a dynamic proximity classification algorithm (DNSC) for multi-cloud data center resources based on natural clustering rules is proposed. Finally, referring to the real data of Amazon cloud data center and verifying the effect of the DNSC algorithm proposed in this paper in CloudSim. It can be seen that the DNSC algorithm improves the average utilization rate of physical machines and reduces the total energy consumption of the multi-cloud data center compared with the FF algorithm, BF algorithm and RAE algorithm.

There are two main directions for future work. First, combine the real data of Amazon's cloud data center and formulate reasonable virtual machine prices, so as to maximize the cost of cloud users and improve the benefits of cloud service providers. Secondly, combining a variety of new clean energy supply methods will further reduce the carbon emissions of cloud data centers, ultimately reducing the cost of cloud service providers and improving the operating environment.

ACKNOWLEDGMENTS

This work was supported by the Key R & D Plan of Shaanxi Province (General Project)[ No. 2019GY-033], Shaanxi Province Science and Technology Achievements Transfer and Promotion Plan Project[No. 2020CGXNG-041], Special Scientific Research Plan of Shaanxi Provincial Department of Education[No. 20JC027], Xi'an Science and Technology Plan Project[2020KJRC0085] and the Science and Technology Program of Xi'an[No. 2020KJRC0101].

Lebre A, Pastor J, Simonet A, Südholt M (2019) Putting the Next 500 VM Placement Algorithms to the Acid Test: The Infrastructure Provider Viewpoint. IEEE Trans Parallel Distrib Syst 30:204–217
Cheng D, Zhou X, Ding Z, Wang Y, Ji M (2019) Heterogeneity Aware Workload Management in Distributed Sustainable Datacenters. IEEE Trans Parallel Distrib Syst 30:375–387
Li C, Bai J, Chen Y, Luo Y (2020) Resource and replica management strategy for optimizing financial cost and user experience in edge cloud computing system. Inf Sci 516:33–55
Wang X, Wang K, Wu S, Di S, Jin H, Yang K, Ou S (2018) Dynamic Resource Scheduling in Mobile Edge Cloud with Cloud Radio Access Network. IEEE Trans Parallel Distrib Syst 29:2429–2445
Tong Z, Chen H, Deng X, Li K, Li K (2020) A scheduling scheme in the cloud computing environment using deep Q-learning. Inf Sci 512:1170–1191
Bi J, Yuan H, Zhang L, Zhang J (2019) An integrated machine learning approach for workload forecasting in geo-distributed cloud data centers⁎⁎This paper belongs to the special issue special issue name edited by “Prof. W. Pedrycz”. Inf Sci 481:57–68
Zhang Q, Wang G, Liu Q (2019) Enabling Cooperative Privacy-preserving Personalized search in cloud environments. Inf Sci 480:1–13
Khabbaz M, Assi CM (2018) Modelling and Analysis of A Novel Deadline-Aware Scheduling Scheme for Cloud Computing Data Centers. IEEE Trans Cloud Comput 6:141–155
Yan H, Zhu X, Chen H, Guo H, Zhou W, Bao W (2019) Dynamic Fault-Tolerant Elastic scheduling for tasks with uncertain runtime in cloud. Inf Sci 477:30–46
Liu X, Li W, Zhang X (2018) Strategy-Proof Mechanism for Provisioning and Allocation Virtual Machines in Heterogeneous Clouds. IEEE Trans Parallel Distrib Syst 29:1650–1663
Sahni J, Vidyarthi DP, Cost-Effective A (2018) Deadline-Constrained Dynamic Scheduling Algorithm for Scientific Workflows in a Cloud Environment. IEEE Trans Cloud Comput 6:2–18
Fahmideh M, Beydoun G, Low G (2019) Experiential probabilistic assessment of cloud services. Inf Sci 502:510–524
Cao J, Ma Z, Xie J, Zhu X, Dong F, Liu B (2020) Towards tenant demand-aware bandwidth allocation strategy in cloud datacenter. Future Generation Computer Systems 105:904–915
Alsarhan A, Itradat A, Al-Dubai AY, Zomaya AY, Min G (2018) Adaptive Resource Allocation and Provisioning in Multi-Service Cloud Environments. IEEE Trans Parallel Distrib Syst 29:31–42
Ismayilov G, Topcuoglu HR (2020) Neural network based multi-objective evolutionary algorithm for dynamic workflow scheduling in cloud computing. Future Generation Computer Systems 102:307–322
Zhang W, Wen Y (2018) Energy-Efficient Task Execution for Application as a General Topology in Mobile Cloud Computing. IEEE Trans Cloud Comput 6:708–719
Wang T, Xia Y, Muppala J, Hamdi M (2018) Achieving Energy Efficiency in Data Centers Using an Artificial Intelligence Abstraction Model. IEEE Trans Cloud Comput 6:612–624
Wei L, Foh CH, He B, Cai J (2018) Towards Efficient Resource Allocation for Heterogeneous Workloads in IaaS Clouds. IEEE Trans Cloud Comput 6:264–275
Xie G, Zeng G, Jiang J, Fan C, Li R, Li K (2020) Energy management for multiple real-time workflows on cyber–physical cloud systems. Future Generation Computer Systems 105:916–931
Liang B, Dong X, Wang Y, Zhang X (2020) Memory-aware resource management algorithm for low-energy cloud data centers. Future Generation Computer Systems 113:329–342
Li X, Garraghan P, Jiang X, Wu Z, Xu J (2018) Holistic Virtual Machine Scheduling in Cloud Datacenters towards Minimizing Total Energy. IEEE Trans Parallel Distrib Syst 29:1317–1331
Nir M, Matrawy A, St-Hilaire M (2018) Economic and Energy Considerations for Resource Augmentation in Mobile Cloud Computing. IEEE Trans Cloud Comput 6:99–113
Sahoo PK, Dehury CK, Veeravalli B (2018) On the Design of Efficient Link Based Virtual Resource Management Algorithm for Cloud Platforms. IEEE Trans Parallel Distrib Syst 29:887–900
Dabbagh M, Hamdaoui B, Guizani M, Rayes A (2018) An Energy-Efficient VM Prediction and Migration Framework for Overcommitted Clouds. IEEE Trans Cloud Comput 6:955–966
Liang B, Dong X, Wang Y, Zhang X (2020) A low-power task scheduling algorithm for heterogeneous cloud computing. J Supercomputing 76:7290–7314
Wei C, Hu Z-H, Wang Y-G (2020) Exact algorithms for energy-efficient virtual machine placement in data centers. Future Generation Computer Systems 106:77–91
Yang J, Jiang B, Lv Z, Choo K-KR (2020) A task scheduling algorithm considering game theory designed for energy management in cloud computing. Future Generation Computer Systems 105:985–992

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Dynamic nearest neighbor resources classification algorithm for multiple cloud data center based on natural clustering rule

Status:

Version 1

Abstract

Figures

1 INTRODUCTION

2 RELATED WORK

2.1 Research based on optimization efficiency

2.2 Research based on reducing energy consumption

3 Natural clustering rule and dynamic KNN classification rule

3.1 MDC

3.2 Natural clustering rule

3.3 Dynamic KNN classification rule

4 DNSC algorithm

5 TEST

5.1 The effect with the number of VMs

5.2 The effect with the number of CDCs

5.3 The effect with the capacity of the VM

5.4 The effect with the capacity of the PM

5.5 The effect with the lower limit of the PM utilization

6 CONCLUSIONS AND FUTURE WORK

Declarations

ACKNOWLEDGMENTS

References

Additional Declarations

Status:

Version 1