CN105630604A

CN105630604A - SLA based multi-tenant virtual machine resource allocation method

Info

Publication number: CN105630604A
Application number: CN201510963639.4A
Authority: CN
Inventors: 莫展鹏; 杨松; 季统凯
Original assignee: G Cloud Technology Co Ltd
Current assignee: G Cloud Technology Co Ltd
Priority date: 2015-12-18
Filing date: 2015-12-18
Publication date: 2016-06-01

Abstract

The present invention relates to the technical field of cloud computing, and in particular, to an SLA based multi-tenant virtual machine resource allocation method. The method comprises: firstly, after an application initiates a resource request, computing a use amount of each resource type of the whole application according to a quantity and a configuration of a virtual machine in the request; then comparing the resource use amount of the application and an idle resource amount of a physical machine according to each resource type; and finally, if any one resource requested by the application cannot meet the request, putting the request into a waiting queue, wherein a scheduling mechanism of the waiting queue performs resource allocation, or otherwise creating a virtual machine for requesting resource allocation. The method provided by the present invention solves the problems of reasonable resource allocation and rapid response of the virtual machine and can be used for multi-tenant virtual machine resource allocation.

Description

A kind of many tenants resources of virtual machine distribution method based on SLA

Technical field

The present invention relates to field of cloud computer technology, particularly a kind of many tenants resources of virtual machine distribution method based on SLA.

Background technology

Under privately owned cloud environment, due to the finiteness of resource and need to meet the Compulsory Feature of the SLA signed with many tenants simultaneously, before a new application request is deployed to cloud platform, it should the idling-resource of privately owned cloud environment is checked. If idling-resource cannot meet the demand of application, so should postpone to this allocated resources, simultaneously waiting list is put in request, wait until when privately owned cloud platform can provide enough resources for it this please to be sought out and distribute corresponding resources of virtual machine by waiting list from asking again.

Traditional method adopts the method for first in first out that request queue is managed, and there is following deficiency:

1, the deployment request of some application is likely to be due to resources requirement relatively greatly, causes that the waiting time is long, and application slowly cannot be disposed and affect Consumer's Experience.

2, when there is contradiction in the extensibility of the finiteness of resources of virtual machine and application scale, it is impossible to not only ensured service level but also reasonable distribution resources of virtual machine according to the SLA signed with user.

Summary of the invention

Present invention solves the technical problem that and be in that a kind of many tenants resources of virtual machine distribution method based on SLA; Solve aforementioned problem of the prior art.

This invention address that the technical scheme of above-mentioned technical problem is:

Described method comprises the following steps:

Step 1: after application sends resource request, according to the quantity of virtual machine and configuration in request, calculates whole application and every kind of resource class is made consumption;

Step 2: make the idling-resource amount of consumption and physical machine compare one by one the resource of application according to the classification of every kind of resource;

Step 3: if any one resource of application request cannot meet, request being put in waiting list, the scheduling mechanism of waiting list carries out resource distribution;

Step 4: otherwise create virtual machine for request Resources allocation.

Described resource class refers to for describing the parameter of virtual machine configuration in tenant SLA, including CPU core number, memory size and hard disk size; Physical machine idling-resource refers to residue in physical machine and is available for the physical resource that virtual machine uses, and also includes CPU core number, memory size and hard disk size; Parameter in resource class and physical machine idling-resource are one-to-one relationship.

The step of described queuing scheduling mechanism includes:

Step 1: when having application revocation to delete virtual machine, the idling-resource of Computational Physics machine;

Step 2: judge whether the request of waiting list team head waits that round exceeds standard; If it is, perform step 3, otherwise perform step 4;

Step 3: take out the request of team's head, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 6, otherwise, performs step 7;

Step 4: take out the later request of team's head request, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 5; Otherwise, check that in queue, whether later request has reached tail of the queue, if it is then perform step 7, otherwise, performs step 4;

Step 5: the wait round of all requests before this request in queue is added 1;

Step 6: be used for creating virtual machine for request Resources allocation;

Step 7: continue waiting for application revocation releasing idling-resource.

Described wait round refers to request in waiting list and waits the number of times of idling-resource, waits until that round exceeds standard to refer to and waits that round exceedes some threshold value, and this threshold value is specified by user, exceedes this threshold value and represents that application waits the overlong time disposed.

The method of the present invention can produce following beneficial effect:

1, the inventive method can promise to undertake acquisition balance between reasonable distribution resource meeting SLA;

2, the inventive method is have good Consumer's Experience at inadequate resource, both can respond the request of little demand user at short notice, and the request waiting time that will not make again big demand user is long.

Accompanying drawing explanation

Below in conjunction with accompanying drawing, the present invention is further described:

Fig. 1 is the flow chart of the present invention;

Fig. 2 is the queue scheduling flow chart of the present invention.

Detailed description of the invention

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete description, it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments. Based on the embodiment in the present invention, the every other embodiment that those of ordinary skill in the art obtain under not making creative work premise, broadly fall into the scope of protection of the invention.

See Fig. 1, shown in 2, first the present invention realizes the first process to request:

Input parameter: application resource request AppRequest, application request includes two contents respectively, applies owning user user, resource requirement array ResArray.

Output valve: true is to be its Resources allocation, and false is can not.

Host monitor function obtains data. If able to meet and be returned to true after each resource metrics of application request and each resource data of idling-resource are compared, ask waiting list accordingly if can not meet to put into according to type.

Then the scheduling of request in queue is realized:

Input parameter: type is for discharging resource comes from what type of resource pool, and flag waits in queue

The threshold value of the longest round

Output valve: true is assigned with a new resource request, and false is for continuing waiting for.

Flag is the threshold value waiting round. This algorithm will be used when response application exits event time. First at waiting list

Queue travels through, AppRequest.wait is the wait round of certain request, if waiting that round has exceeded flag, now have to for this request Resources allocation, to this request, whether the resource run needed for request identification algorithm can calculate this request can be satisfied, if can not meet, algorithm terminates, and continues waiting for new idling-resource. Without the wait round applied more than flag, one by one the AppRequest in Queue is run request identification algorithm, if can Resources allocation; distribute corresponding resources of virtual machine, if could not; current AppRequest.wait would be added one, then traversal the next one request.

Claims

1. the many tenants resources of virtual machine distribution method based on SLA, it is characterised in that described method comprises the following steps:

Step 4: otherwise create virtual machine for request Resources allocation.

2. method according to claim 1, it is characterised in that described resource class refers to for describing the parameter of virtual machine configuration in tenant SLA, including CPU core number, memory size and hard disk size; Physical machine idling-resource refers to residue in physical machine and is available for the physical resource that virtual machine uses, and also includes CPU core number, memory size and hard disk size; Parameter in resource class and physical machine idling-resource are one-to-one relationship.

3. method according to claim 1 and 2, it is characterised in that the step of described queuing scheduling mechanism includes:

Step 5: the wait round of all requests before this request in queue is added 1;

Step 6: be used for creating virtual machine for request Resources allocation;

Step 7: continue waiting for application revocation releasing idling-resource.

4. method according to claim 3, it is characterized in that, described wait round refers to request in waiting list and waits the number of times of idling-resource, by the time round exceeds standard to refer to and waits that round exceedes some threshold value, this threshold value is specified by user, exceedes this threshold value and represents that application waits the overlong time disposed.