CN105630604A - SLA based multi-tenant virtual machine resource allocation method - Google Patents

SLA based multi-tenant virtual machine resource allocation method Download PDF

Info

Publication number
CN105630604A
CN105630604A CN201510963639.4A CN201510963639A CN105630604A CN 105630604 A CN105630604 A CN 105630604A CN 201510963639 A CN201510963639 A CN 201510963639A CN 105630604 A CN105630604 A CN 105630604A
Authority
CN
China
Prior art keywords
resource
request
virtual machine
idling
application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510963639.4A
Other languages
Chinese (zh)
Inventor
莫展鹏
杨松
季统凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
G Cloud Technology Co Ltd
Original Assignee
G Cloud Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by G Cloud Technology Co Ltd filed Critical G Cloud Technology Co Ltd
Priority to CN201510963639.4A priority Critical patent/CN105630604A/en
Publication of CN105630604A publication Critical patent/CN105630604A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5044Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering hardware capabilities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/485Task life-cycle, e.g. stopping, restarting, resuming execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances

Abstract

The present invention relates to the technical field of cloud computing, and in particular, to an SLA based multi-tenant virtual machine resource allocation method. The method comprises: firstly, after an application initiates a resource request, computing a use amount of each resource type of the whole application according to a quantity and a configuration of a virtual machine in the request; then comparing the resource use amount of the application and an idle resource amount of a physical machine according to each resource type; and finally, if any one resource requested by the application cannot meet the request, putting the request into a waiting queue, wherein a scheduling mechanism of the waiting queue performs resource allocation, or otherwise creating a virtual machine for requesting resource allocation. The method provided by the present invention solves the problems of reasonable resource allocation and rapid response of the virtual machine and can be used for multi-tenant virtual machine resource allocation.

Description

A kind of many tenants resources of virtual machine distribution method based on SLA
Technical field
The present invention relates to field of cloud computer technology, particularly a kind of many tenants resources of virtual machine distribution method based on SLA.
Background technology
Under privately owned cloud environment, due to the finiteness of resource and need to meet the Compulsory Feature of the SLA signed with many tenants simultaneously, before a new application request is deployed to cloud platform, it should the idling-resource of privately owned cloud environment is checked. If idling-resource cannot meet the demand of application, so should postpone to this allocated resources, simultaneously waiting list is put in request, wait until when privately owned cloud platform can provide enough resources for it this please to be sought out and distribute corresponding resources of virtual machine by waiting list from asking again.
Traditional method adopts the method for first in first out that request queue is managed, and there is following deficiency:
1, the deployment request of some application is likely to be due to resources requirement relatively greatly, causes that the waiting time is long, and application slowly cannot be disposed and affect Consumer's Experience.
2, when there is contradiction in the extensibility of the finiteness of resources of virtual machine and application scale, it is impossible to not only ensured service level but also reasonable distribution resources of virtual machine according to the SLA signed with user.
Summary of the invention
Present invention solves the technical problem that and be in that a kind of many tenants resources of virtual machine distribution method based on SLA; Solve aforementioned problem of the prior art.
This invention address that the technical scheme of above-mentioned technical problem is:
Described method comprises the following steps:
Step 1: after application sends resource request, according to the quantity of virtual machine and configuration in request, calculates whole application and every kind of resource class is made consumption;
Step 2: make the idling-resource amount of consumption and physical machine compare one by one the resource of application according to the classification of every kind of resource;
Step 3: if any one resource of application request cannot meet, request being put in waiting list, the scheduling mechanism of waiting list carries out resource distribution;
Step 4: otherwise create virtual machine for request Resources allocation.
Described resource class refers to for describing the parameter of virtual machine configuration in tenant SLA, including CPU core number, memory size and hard disk size; Physical machine idling-resource refers to residue in physical machine and is available for the physical resource that virtual machine uses, and also includes CPU core number, memory size and hard disk size; Parameter in resource class and physical machine idling-resource are one-to-one relationship.
The step of described queuing scheduling mechanism includes:
Step 1: when having application revocation to delete virtual machine, the idling-resource of Computational Physics machine;
Step 2: judge whether the request of waiting list team head waits that round exceeds standard; If it is, perform step 3, otherwise perform step 4;
Step 3: take out the request of team's head, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 6, otherwise, performs step 7;
Step 4: take out the later request of team's head request, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 5; Otherwise, check that in queue, whether later request has reached tail of the queue, if it is then perform step 7, otherwise, performs step 4;
Step 5: the wait round of all requests before this request in queue is added 1;
Step 6: be used for creating virtual machine for request Resources allocation;
Step 7: continue waiting for application revocation releasing idling-resource.
Described wait round refers to request in waiting list and waits the number of times of idling-resource, waits until that round exceeds standard to refer to and waits that round exceedes some threshold value, and this threshold value is specified by user, exceedes this threshold value and represents that application waits the overlong time disposed.
The method of the present invention can produce following beneficial effect:
1, the inventive method can promise to undertake acquisition balance between reasonable distribution resource meeting SLA;
2, the inventive method is have good Consumer's Experience at inadequate resource, both can respond the request of little demand user at short notice, and the request waiting time that will not make again big demand user is long.
Accompanying drawing explanation
Below in conjunction with accompanying drawing, the present invention is further described:
Fig. 1 is the flow chart of the present invention;
Fig. 2 is the queue scheduling flow chart of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete description, it is clear that described embodiment is only a part of embodiment of the present invention, rather than whole embodiments. Based on the embodiment in the present invention, the every other embodiment that those of ordinary skill in the art obtain under not making creative work premise, broadly fall into the scope of protection of the invention.
See Fig. 1, shown in 2, first the present invention realizes the first process to request:
Input parameter: application resource request AppRequest, application request includes two contents respectively, applies owning user user, resource requirement array ResArray.
Output valve: true is to be its Resources allocation, and false is can not.
Host monitor function obtains data. If able to meet and be returned to true after each resource metrics of application request and each resource data of idling-resource are compared, ask waiting list accordingly if can not meet to put into according to type.
Then the scheduling of request in queue is realized:
Input parameter: type is for discharging resource comes from what type of resource pool, and flag waits in queue
The threshold value of the longest round
Output valve: true is assigned with a new resource request, and false is for continuing waiting for.
Flag is the threshold value waiting round. This algorithm will be used when response application exits event time. First at waiting list
Queue travels through, AppRequest.wait is the wait round of certain request, if waiting that round has exceeded flag, now have to for this request Resources allocation, to this request, whether the resource run needed for request identification algorithm can calculate this request can be satisfied, if can not meet, algorithm terminates, and continues waiting for new idling-resource. Without the wait round applied more than flag, one by one the AppRequest in Queue is run request identification algorithm, if can Resources allocation; distribute corresponding resources of virtual machine, if could not; current AppRequest.wait would be added one, then traversal the next one request.

Claims (4)

1. the many tenants resources of virtual machine distribution method based on SLA, it is characterised in that described method comprises the following steps:
Step 1: after application sends resource request, according to the quantity of virtual machine and configuration in request, calculates whole application and every kind of resource class is made consumption;
Step 2: make the idling-resource amount of consumption and physical machine compare one by one the resource of application according to the classification of every kind of resource;
Step 3: if any one resource of application request cannot meet, request being put in waiting list, the scheduling mechanism of waiting list carries out resource distribution;
Step 4: otherwise create virtual machine for request Resources allocation.
2. method according to claim 1, it is characterised in that described resource class refers to for describing the parameter of virtual machine configuration in tenant SLA, including CPU core number, memory size and hard disk size; Physical machine idling-resource refers to residue in physical machine and is available for the physical resource that virtual machine uses, and also includes CPU core number, memory size and hard disk size; Parameter in resource class and physical machine idling-resource are one-to-one relationship.
3. method according to claim 1 and 2, it is characterised in that the step of described queuing scheduling mechanism includes:
Step 1: when having application revocation to delete virtual machine, the idling-resource of Computational Physics machine;
Step 2: judge whether the request of waiting list team head waits that round exceeds standard; If it is, perform step 3, otherwise perform step 4;
Step 3: take out the request of team's head, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 6, otherwise, performs step 7;
Step 4: take out the later request of team's head request, it is judged that whether this request mates with the idling-resource of physical machine, if it is, perform step 5; Otherwise, check that in queue, whether later request has reached tail of the queue, if it is then perform step 7, otherwise, performs step 4;
Step 5: the wait round of all requests before this request in queue is added 1;
Step 6: be used for creating virtual machine for request Resources allocation;
Step 7: continue waiting for application revocation releasing idling-resource.
4. method according to claim 3, it is characterized in that, described wait round refers to request in waiting list and waits the number of times of idling-resource, by the time round exceeds standard to refer to and waits that round exceedes some threshold value, this threshold value is specified by user, exceedes this threshold value and represents that application waits the overlong time disposed.
CN201510963639.4A 2015-12-18 2015-12-18 SLA based multi-tenant virtual machine resource allocation method Pending CN105630604A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510963639.4A CN105630604A (en) 2015-12-18 2015-12-18 SLA based multi-tenant virtual machine resource allocation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510963639.4A CN105630604A (en) 2015-12-18 2015-12-18 SLA based multi-tenant virtual machine resource allocation method

Publications (1)

Publication Number Publication Date
CN105630604A true CN105630604A (en) 2016-06-01

Family

ID=56045581

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510963639.4A Pending CN105630604A (en) 2015-12-18 2015-12-18 SLA based multi-tenant virtual machine resource allocation method

Country Status (1)

Country Link
CN (1) CN105630604A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201655A (en) * 2016-06-30 2016-12-07 中国联合网络通信集团有限公司 Virtual machine distribution method and virtual machine distribution system
CN107818013A (en) * 2016-09-13 2018-03-20 华为技术有限公司 A kind of application scheduling method thereof and device
CN108958934A (en) * 2018-06-28 2018-12-07 郑州云海信息技术有限公司 A kind of cpu resource method for obligating and device
CN109992418A (en) * 2019-03-25 2019-07-09 华南理工大学 The multi-tenant big data platform resource priority level scheduling method and system of SLA perception
CN110377415A (en) * 2018-04-12 2019-10-25 华为技术有限公司 A kind of request processing method and server
WO2020211652A1 (en) * 2019-04-18 2020-10-22 华为技术有限公司 Tenant resource management method and device in multi-tenant scenario
CN113923130A (en) * 2021-09-06 2022-01-11 特赞(上海)信息科技有限公司 Multi-tenant open interface resource configuration method, device and terminal

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110004875A1 (en) * 2009-07-01 2011-01-06 International Business Machines Corporation Method and System for Performance Isolation in Virtualized Environments
CN102739803A (en) * 2012-07-09 2012-10-17 上海达希能源科技有限公司 Cloud computing resource allocation method
CN102761469A (en) * 2011-04-27 2012-10-31 阿里巴巴集团控股有限公司 Allocation method and device for resource pool
CN102958166A (en) * 2011-08-29 2013-03-06 华为技术有限公司 Resource allocation method and resource management platform
CN104281496A (en) * 2014-10-27 2015-01-14 深圳中兴网信科技有限公司 Virtual resource distribution method and virtual resource distribution device
CN105159782A (en) * 2015-08-28 2015-12-16 北京百度网讯科技有限公司 Cloud host based method and apparatus for allocating resources to orders

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110004875A1 (en) * 2009-07-01 2011-01-06 International Business Machines Corporation Method and System for Performance Isolation in Virtualized Environments
CN102761469A (en) * 2011-04-27 2012-10-31 阿里巴巴集团控股有限公司 Allocation method and device for resource pool
CN102958166A (en) * 2011-08-29 2013-03-06 华为技术有限公司 Resource allocation method and resource management platform
CN102739803A (en) * 2012-07-09 2012-10-17 上海达希能源科技有限公司 Cloud computing resource allocation method
CN104281496A (en) * 2014-10-27 2015-01-14 深圳中兴网信科技有限公司 Virtual resource distribution method and virtual resource distribution device
CN105159782A (en) * 2015-08-28 2015-12-16 北京百度网讯科技有限公司 Cloud host based method and apparatus for allocating resources to orders

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106201655A (en) * 2016-06-30 2016-12-07 中国联合网络通信集团有限公司 Virtual machine distribution method and virtual machine distribution system
CN106201655B (en) * 2016-06-30 2019-07-02 中国联合网络通信集团有限公司 Virtual machine distribution method and virtual machine distribution system
CN107818013A (en) * 2016-09-13 2018-03-20 华为技术有限公司 A kind of application scheduling method thereof and device
CN110377415A (en) * 2018-04-12 2019-10-25 华为技术有限公司 A kind of request processing method and server
CN108958934A (en) * 2018-06-28 2018-12-07 郑州云海信息技术有限公司 A kind of cpu resource method for obligating and device
CN109992418A (en) * 2019-03-25 2019-07-09 华南理工大学 The multi-tenant big data platform resource priority level scheduling method and system of SLA perception
CN109992418B (en) * 2019-03-25 2023-01-06 华南理工大学 SLA-aware resource priority scheduling method and system for multi-tenant big data platform
WO2020211652A1 (en) * 2019-04-18 2020-10-22 华为技术有限公司 Tenant resource management method and device in multi-tenant scenario
CN111835679A (en) * 2019-04-18 2020-10-27 华为技术有限公司 Tenant resource management method and device under multi-tenant scene
CN111835679B (en) * 2019-04-18 2022-03-25 华为技术有限公司 Tenant resource management method and device under multi-tenant scene
CN113923130A (en) * 2021-09-06 2022-01-11 特赞(上海)信息科技有限公司 Multi-tenant open interface resource configuration method, device and terminal
CN113923130B (en) * 2021-09-06 2024-03-08 特赞(上海)信息科技有限公司 Multi-tenant open interface resource configuration method, device and terminal

Similar Documents

Publication Publication Date Title
CN105630604A (en) SLA based multi-tenant virtual machine resource allocation method
US10325343B1 (en) Topology aware grouping and provisioning of GPU resources in GPU-as-a-Service platform
WO2020258920A1 (en) Network slice resource management method and apparatus
US8413158B2 (en) Processor thread load balancing manager
WO2017166643A1 (en) Method and device for quantifying task resources
WO2016078178A1 (en) Virtual cpu scheduling method
JP2015506526A5 (en)
TWI755415B (en) Method, device and system for application cluster resource allocation in cloud environment
US20140282540A1 (en) Performant host selection for virtualization centers
CN102981973B (en) Perform the method for request within the storage system
WO2016183799A1 (en) Hardware acceleration method and relevant device
CN114416352A (en) Computing resource allocation method and device, electronic equipment and storage medium
Khan et al. Clouds of small things: Provisioning infrastructure-as-a-service from within community networks
CN110187960A (en) A kind of distributed resource scheduling method and device
WO2019105379A1 (en) Resource management method and apparatus, electronic device, and storage medium
CN103425536A (en) Test resource management method oriented towards distributed system performance tests
CN104331659A (en) Design method for system resource application isolation of critical application host
CN104331332A (en) Virtual resource preallocation algorithm based on SLA (Service Level Agreement)
WO2020134133A1 (en) Resource allocation method, substation, and computer-readable storage medium
CN107343023A (en) Resource allocation methods, device and electronic equipment in a kind of Mesos management cluster
CN115686805A (en) GPU resource sharing method and device, and GPU resource sharing scheduling method and device
CN105335235A (en) IaaS mode based resource management system and method
KR20220025746A (en) Dynamic allocation of computing resources
CN104809026A (en) Method for borrowing CPU computing resources by using remote node
US20200133367A1 (en) Power management for workload offload engines

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160601

WD01 Invention patent application deemed withdrawn after publication