CN111506419A - GPU resource sharing method and device - Google Patents

GPU resource sharing method and device Download PDF

Info

Publication number
CN111506419A
CN111506419A CN202010164599.8A CN202010164599A CN111506419A CN 111506419 A CN111506419 A CN 111506419A CN 202010164599 A CN202010164599 A CN 202010164599A CN 111506419 A CN111506419 A CN 111506419A
Authority
CN
China
Prior art keywords
virtual machine
resources
gpu physical
gpu
resource
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010164599.8A
Other languages
Chinese (zh)
Other versions
CN111506419B (en
Inventor
郭彦辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Udsafe Technology Co ltd
Original Assignee
Beijing Udsafe Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Udsafe Technology Co ltd filed Critical Beijing Udsafe Technology Co ltd
Priority to CN202010164599.8A priority Critical patent/CN111506419B/en
Priority claimed from CN202010164599.8A external-priority patent/CN111506419B/en
Publication of CN111506419A publication Critical patent/CN111506419A/en
Application granted granted Critical
Publication of CN111506419B publication Critical patent/CN111506419B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/4557Distribution of virtual machine instances; Migration and load balancing

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention relates to a GPU resource sharing method and a device, wherein the method comprises the steps of comparing GPU physical resource requirements of a virtual machine with unallocated resources in a resource pool; if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool; otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine. The GPU resource sharing method and device provided by the invention can realize that a plurality of virtual machines directly use GPU resources, improve the computing capacity of the virtual machines and effectively improve the utilization rate of the GPU resources.

Description

GPU resource sharing method and device
Technical Field
The invention belongs to the technical field of resource sharing, and particularly relates to a GPU resource sharing method and device.
Background
With the rapid development of science and technology, in order to meet the increasing computing requirements, the computing requirements of various laboratories are more and more prominent, and a common scientific research laboratory cannot configure GPU equipment for everyone, so that the problem of GPU sharing and use is solved by the sGPU technology.
The current technologies for realizing GPU resource sharing include GPU virtualization technology and GPU direct technology. The GPU direct connection technology can only be bound to one virtual machine for use, one virtual machine completely monopolizes GPU equipment, other virtual machines cannot be used, the single virtual machine cannot completely use GPU resources, and great resource waste is caused. The GPU virtualization technology virtualizes a plurality of GPU devices from the GPU to be allocated to a plurality of virtual machines, GPU resource allocation occupies the same amount, and reasonable resource allocation and effective resource utilization cannot be achieved due to the limitation of IO virtualization performance problems.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides the GPU resource sharing method and the device, which directly allocate GPU physical resources to a virtual machine for use, can give full play to the physical performance of a display card, and simultaneously allocate the GPU resources to a plurality of people for common use, thereby effectively improving the utilization rate of the GPU resources.
The purpose of the invention is realized by adopting the following technical scheme:
in a method of sharing GPU resources, the improvement comprising:
comparing GPU physical resource requirements of the virtual machine with unallocated resources in the resource pool;
if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool;
otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
Preferably, the adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine includes:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
Preferably, the allocating GPU physical resources to the virtual machine according to unallocated resources in the adjusted resource pool includes:
if the resources are not allocated in the adjusted resource pool to meet the GPU physical resource requirement of the virtual machine, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirement of the virtual machine;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
Preferably, the allocating GPU physical resources to the virtual machine according to the GPU physical resource demand of the virtual machine includes:
determining a GPU physical resource allocation percentage for the virtual machine based on GPU physical resource requirements of the virtual machine;
allocating GPU physical resources with corresponding percentages in the resource pool to corresponding virtual machines;
marking allocated resources and unallocated resources in a resource pool;
the resource pool is formed by pooling GPU physical resources.
Preferably, the determining the GPU physical resource allocation percentage for the virtual machine based on the GPU physical resource demand of the virtual machine includes:
and taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources in the resource pool as the GPU physical resource allocation percentage of the virtual machine.
Based on the same inventive concept, the invention also provides a GPU resource sharing device, the improvement of which comprises:
the comparison unit is used for comparing the GPU physical resource requirement of the virtual machine with unallocated resources in the resource pool; when unallocated resources in the resource pool do not meet the GPU physical resource requirement of the virtual machine, jumping to an adjusting unit; otherwise, jumping to a second distribution unit;
the adjusting unit is used for adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine;
the first allocation unit allocates GPU physical resources to the virtual machine according to unallocated resources in the resource pool adjusted by the adjusting unit;
and the second allocation unit is used for allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
Preferably, the adjusting unit is specifically configured to:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
Preferably, the first allocation unit is further specifically configured to:
if the resources which are not allocated in the adjusted resource pool meet the GPU physical resource requirement of the virtual machine, jumping to a second allocation unit;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
Preferably, the second distribution unit includes:
the calculating subunit is used for determining the GPU physical resource allocation percentage of the virtual machine based on the GPU physical resource demand of the virtual machine;
the allocation subunit is used for allocating the GPU physical resources with the corresponding percentage in the resource pool to the corresponding virtual machine according to the GPU physical resource allocation percentage of the virtual machine;
the marking subunit is used for marking the allocated resources and the unallocated resources in the resource pool;
the resource pool is formed by pooling GPU physical resources.
Preferably, the calculating subunit is specifically configured to:
and taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources in the resource pool as the GPU physical resource allocation percentage of the virtual machine.
Compared with the closest prior art, the invention has the following beneficial effects:
the invention provides a GPU resource sharing method and a device, which comprises the steps of comparing GPU physical resource requirements of a virtual machine with unallocated resources in a resource pool; if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool; otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine. The GPU resource sharing method and device provided by the invention can realize that a plurality of virtual machines directly use GPU resources, improve the computing capacity of the virtual machines and effectively improve the utilization rate of the GPU resources.
Drawings
FIG. 1 is a flowchart of a GPU resource sharing method of the present invention;
FIG. 2 is a diagram illustrating percentage allocation of GPU resources in an embodiment;
FIG. 3 is a diagram of a GPU resource sharing device of the present invention.
Detailed Description
The following describes embodiments of the present invention in further detail with reference to the accompanying drawings.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to solve the problem of low utilization rate of GPU resources due to excessive or insufficient GPU resource allocation, the present invention provides a GPU resource sharing method, as shown in fig. 1, including:
comparing GPU physical resource requirements of the virtual machine with unallocated resources in the resource pool;
if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool;
otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
In order to more clearly illustrate the objects of the present invention, the process of the present invention is further explained below with reference to specific examples:
in an embodiment of the present invention, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine may include:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
In the embodiment of the present invention, allocating GPU physical resources to the virtual machine according to unallocated resources in the adjusted resource pool may include:
if the resources are not allocated in the adjusted resource pool to meet the GPU physical resource requirement of the virtual machine, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirement of the virtual machine;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
In the embodiment of the present invention, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirement of the virtual machine may include:
determining a GPU physical resource allocation percentage for the virtual machine based on GPU physical resource requirements of the virtual machine;
allocating GPU physical resources with corresponding percentages in the resource pool to corresponding virtual machines;
marking allocated resources and unallocated resources in a resource pool;
the resource pool is formed by pooling GPU physical resources in advance, and the proportion of all resources in the resource pool is marked by 100% in an initial state.
In the embodiment of the present invention, in order to enable the use of the allocated resources to be unaffected, the GPU physical resources of each virtual machine that have been allocated may also be isolated by an isolation technique.
In an embodiment of the present invention, determining the GPU physical resource allocation percentage for the virtual machine based on the GPU physical resource demand of the virtual machine may include:
taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources in the resource pool as the GPU physical resource allocation percentage of the virtual machine;
wherein, all resources in the resource pool comprise allocated resources and unallocated resources.
In the embodiment of the present invention, as shown in fig. 2, GPU physical resources may be divided into 20%, 30%, and 50% according to GPU physical resource requirements of each virtual machine, and then the divided percentage of GPU resources may be allocated to the virtual machines with corresponding percentage requirements.
Based on the same inventive concept, the present invention further provides a GPU resource sharing apparatus, as shown in fig. 3, including:
the comparison unit is used for comparing the GPU physical resource requirement of the virtual machine with unallocated resources in the resource pool; when unallocated resources in the resource pool do not meet the GPU physical resource requirement of the virtual machine, jumping to an adjusting unit; otherwise, jumping to a second distribution unit;
the adjusting unit is used for adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine;
the first allocation unit allocates GPU physical resources to the virtual machine according to unallocated resources in the resource pool adjusted by the adjusting unit;
and the second allocation unit is used for allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
In an embodiment of the present invention, the adjusting unit is specifically configured to:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
In an embodiment of the present invention, the first allocation unit is specifically configured to:
if the resources which are not allocated in the adjusted resource pool meet the GPU physical resource requirement of the virtual machine, jumping to a second allocation unit;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
In an embodiment of the present invention, the second allocating unit may include:
the calculating subunit is used for determining the GPU physical resource allocation percentage of the virtual machine based on the GPU physical resource demand of the virtual machine;
the allocation subunit is used for allocating the GPU physical resources with the corresponding percentage in the resource pool to the corresponding virtual machine according to the GPU physical resource allocation percentage of the virtual machine;
the marking subunit is used for marking the allocated resources and the unallocated resources in the resource pool;
the resource pool is formed by pooling GPU physical resources.
In an embodiment of the present invention, the calculating subunit is specifically configured to:
and taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources in the resource pool as the GPU physical resource allocation percentage of the virtual machine.
In summary, the present invention provides a method and an apparatus for sharing GPU resources, which can enable multiple virtual machines to directly use GPU resources, thereby improving the computing power of the virtual machines; the GPU resources can be dynamically adjusted according to the utilization rate of the virtual machine to the GPU physical resources, and the utilization rate of the GPU resources is effectively improved.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims (10)

1. A GPU resource sharing method is characterized by comprising the following steps:
comparing GPU physical resource requirements of the virtual machine with unallocated resources in the resource pool;
if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool;
otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
2. The method of claim 1, wherein adjusting the GPU physical resources allocated to the virtual machine based on the GPU physical resource utilization allocated to the virtual machine comprises:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
3. The method of claim 1, wherein the allocating GPU physical resources to the virtual machine according to unallocated resources in the adjusted resource pool comprises:
if the resources are not allocated in the adjusted resource pool to meet the GPU physical resource requirement of the virtual machine, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirement of the virtual machine;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
4. The method of claim 1 or 3, wherein the allocating GPU physical resources to the virtual machines according to GPU physical resource requirements of the virtual machines comprises:
determining a GPU physical resource allocation percentage for the virtual machine based on GPU physical resource requirements of the virtual machine;
allocating GPU physical resources with corresponding percentages in the resource pool to corresponding virtual machines;
marking allocated resources and unallocated resources in a resource pool;
the resource pool is formed by pooling GPU physical resources.
5. The method of claim 4, wherein determining the GPU physical resource allocation percentage for the virtual machine based on the GPU physical resource demand of the virtual machine comprises:
and taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources in the resource pool as the GPU physical resource allocation percentage of the virtual machine.
6. A GPU resource sharing apparatus, comprising:
the comparison unit is used for comparing the GPU physical resource requirement of the virtual machine with unallocated resources in the resource pool; when unallocated resources in the resource pool do not meet the GPU physical resource requirement of the virtual machine, jumping to an adjusting unit; otherwise, jumping to a second distribution unit;
the adjusting unit is used for adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine;
the first allocation unit allocates GPU physical resources to the virtual machine according to unallocated resources in the resource pool adjusted by the adjusting unit;
and the second allocation unit is used for allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
7. The apparatus of claim 6, wherein the adjustment unit is specifically configured to:
if the utilization rate of the virtual machine to the GPU physical resources does not reach 100%, reducing the distribution percentage of the GPU physical resources to the virtual machine until the virtual machine marks the GPU physical resources released in the reduction process as unallocated resources when the utilization rate of the GPU physical resources reaches 90%;
and if the utilization rate of the virtual machine to the GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until the utilization rate of the virtual machine to the GPU physical resources reaches 90%, and marking the GPU physical resources occupied in the increasing process as allocated resources.
8. The apparatus of claim 6, wherein the first allocation unit is further specifically configured to:
if the resources which are not allocated in the adjusted resource pool meet the GPU physical resource requirement of the virtual machine, jumping to a second allocation unit;
and if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirement of the virtual machine, sending alarm information to the virtual machine.
9. The apparatus of claim 6 or 8, wherein the second allocation unit comprises:
the calculating subunit is used for determining the GPU physical resource allocation percentage of the virtual machine based on the GPU physical resource demand of the virtual machine;
the allocation subunit is used for allocating the GPU physical resources with the corresponding percentage in the resource pool to the corresponding virtual machine according to the GPU physical resource allocation percentage of the virtual machine;
the marking subunit is used for marking the allocated resources and the unallocated resources in the resource pool;
the resource pool is formed by pooling GPU physical resources.
10. The apparatus of claim 9, wherein the computing subunit is specifically configured to:
and taking the ratio of the GPU physical resource demand of the virtual machine sending the demand response to all resources of the resource pool as the GPU physical resource allocation percentage of the virtual machine.
CN202010164599.8A 2020-03-11 GPU resource sharing method and device Active CN111506419B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010164599.8A CN111506419B (en) 2020-03-11 GPU resource sharing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010164599.8A CN111506419B (en) 2020-03-11 GPU resource sharing method and device

Publications (2)

Publication Number Publication Date
CN111506419A true CN111506419A (en) 2020-08-07
CN111506419B CN111506419B (en) 2024-07-02

Family

ID=

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101593134A (en) * 2009-06-29 2009-12-02 北京航空航天大学 Virtual machine cpu resource distribution method and device
US8103769B1 (en) * 2009-09-03 2012-01-24 Amazon Technologies, Inc. Dynamic isolation of shared resources
KR20120100833A (en) * 2011-03-04 2012-09-12 한국항공대학교산학협력단 Method and terminal for performing direct communication between terminals
CN103180830A (en) * 2012-08-31 2013-06-26 华为技术有限公司 Cpu resource distribution method and calculating node
CN105242957A (en) * 2015-09-28 2016-01-13 广州云晫信息科技有限公司 Method and system for cloud computing system to allocate GPU resources to virtual machine
CN105491569A (en) * 2014-09-19 2016-04-13 电信科学技术研究院 Spectrum resource distribution, acquisition method and device
CN105589751A (en) * 2015-11-27 2016-05-18 杭州华三通信技术有限公司 Physical resource scheduling method and device
WO2017059695A1 (en) * 2015-10-10 2017-04-13 百度在线网络技术(北京)有限公司 Resource management method and device based on stream calculation
CN107515786A (en) * 2017-08-04 2017-12-26 北京奇虎科技有限公司 Resource allocation methods, master device, from device and distributed computing system
CN109213555A (en) * 2018-08-16 2019-01-15 北京交通大学 A kind of resource dynamic dispatching method of Virtual desktop cloud
CN109558210A (en) * 2018-11-21 2019-04-02 郑州云海信息技术有限公司 A kind of method and system of virtual machine applied host machine GPU equipment
US20190220383A1 (en) * 2018-01-17 2019-07-18 Microsoft Technology Licensing, Llc Techniques for tracking graphics processing resource utilization

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101593134A (en) * 2009-06-29 2009-12-02 北京航空航天大学 Virtual machine cpu resource distribution method and device
US8103769B1 (en) * 2009-09-03 2012-01-24 Amazon Technologies, Inc. Dynamic isolation of shared resources
KR20120100833A (en) * 2011-03-04 2012-09-12 한국항공대학교산학협력단 Method and terminal for performing direct communication between terminals
CN103180830A (en) * 2012-08-31 2013-06-26 华为技术有限公司 Cpu resource distribution method and calculating node
CN105491569A (en) * 2014-09-19 2016-04-13 电信科学技术研究院 Spectrum resource distribution, acquisition method and device
CN105242957A (en) * 2015-09-28 2016-01-13 广州云晫信息科技有限公司 Method and system for cloud computing system to allocate GPU resources to virtual machine
WO2017059695A1 (en) * 2015-10-10 2017-04-13 百度在线网络技术(北京)有限公司 Resource management method and device based on stream calculation
CN105589751A (en) * 2015-11-27 2016-05-18 杭州华三通信技术有限公司 Physical resource scheduling method and device
CN107515786A (en) * 2017-08-04 2017-12-26 北京奇虎科技有限公司 Resource allocation methods, master device, from device and distributed computing system
US20190220383A1 (en) * 2018-01-17 2019-07-18 Microsoft Technology Licensing, Llc Techniques for tracking graphics processing resource utilization
WO2019143460A1 (en) * 2018-01-17 2019-07-25 Microsoft Technology Licensing, Llc Techniques for tracking graphics processing resource utilization
CN109213555A (en) * 2018-08-16 2019-01-15 北京交通大学 A kind of resource dynamic dispatching method of Virtual desktop cloud
CN109558210A (en) * 2018-11-21 2019-04-02 郑州云海信息技术有限公司 A kind of method and system of virtual machine applied host machine GPU equipment

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
QIANG WU 等: "《Dynamically Adujsting Scale of a Kubernetes Cluster under QoS Guarantee》", 《IEEE》, 31 December 2019 (2019-12-31) *
SHENGBO SONG: "《Gaia Scheduler: A Kubernetes-Based Scheduler Framework》", 《IEEE》, 31 December 2018 (2018-12-31) *
张远: "《基于开源平台的云资源管理***的研究与应用》", 《中国优秀硕士学位论文全文数据库》, 15 March 2016 (2016-03-15) *
邱悦: "《云媒体中基于效用最大化协商机制的资源分配算法研究》", 《中国优秀硕士学位论文全文数据库》, 15 January 2015 (2015-01-15) *

Similar Documents

Publication Publication Date Title
WO2018119952A1 (en) Device virtualization method, apparatus, system, and electronic device, and computer program product
US20160239333A1 (en) Apparatus and method for scheduling graphics processing unit workloads from virtual machines
CN111090521B (en) Memory allocation method and device, storage medium and electronic equipment
EP3748508A1 (en) Memory management in virtualized computing
CN115988217B (en) Virtualized video encoding and decoding system, electronic equipment and storage medium
CN103744716A (en) Dynamic interrupt balanced mapping method based on current virtual central processing unit (VCPU) scheduling state
CN109712060B (en) Cloud desktop display card sharing method and system based on GPU container technology
US8352702B2 (en) Data processing system memory allocation
WO2016202154A1 (en) Gpu resource allocation method and system
JP2020503609A (en) Memory access method, apparatus and electronic equipment for multi-operating system
CN103049328A (en) Distribution method of internal memory resources in computer system
CN111104219A (en) Binding method, device, equipment and storage medium of virtual core and physical core
CN106598696B (en) Method and device for data interaction between virtual machines
US20200201691A1 (en) Enhanced message control banks
CN105677481A (en) Method and system for processing data and electronic equipment
CN106796535B (en) Video display method, video display device, electronic equipment and computer program product
CN106775925B (en) Quota processing method and device for CPU of virtual machine
CN109558210B (en) Method and system for virtual machine to apply GPU (graphics processing Unit) equipment of host
CN102096594B (en) Method and device for using system resources in memory
CN111506419B (en) GPU resource sharing method and device
CN110908783A (en) Management and control method, system and equipment for virtual machine of cloud data center
CN111506419A (en) GPU resource sharing method and device
CN111158911A (en) Processor configuration method and device, processor and network equipment
CN110447019B (en) Memory allocation manager and method for managing memory allocation performed thereby
CN114489952A (en) Queue distribution method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant