
CN111506419B - GPU resource sharing method and device

Info

Publication number: CN111506419B
Application number: CN202010164599.8A
Authority: CN
Inventors: 郭彦辉
Original assignee: BEIJING LIANCHUANG XINAN TECHNOLOGY CO LTD
Current assignee: BEIJING LIANCHUANG XINAN TECHNOLOGY CO LTD
Priority date: 2020-03-11
Filing date: 2020-03-11
Publication date: 2024-07-02
Anticipated expiration: 2040-03-11
Also published as: CN111506419A

Abstract

The invention relates to a GPU resource sharing method and device, wherein the method comprises the steps of comparing GPU physical resource requirements of a virtual machine with unallocated resources in a resource pool; if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool; otherwise, GPU physical resources are allocated to the virtual machine according to GPU physical resource requirements of the virtual machine. The GPU resource sharing method and device provided by the invention can realize that a plurality of virtual machines directly use GPU resources, thereby improving the computing capacity of the virtual machines and effectively improving the utilization rate of the GPU resources.

Description

GPU resource sharing method and device

Technical Field

The invention belongs to the technical field of resource sharing, and particularly relates to a GPU resource sharing method and device.

Background

With the rapid development of technology, computing demands keep growing, and laboratories in particular face increasingly pressing computing needs. An ordinary scientific research laboratory cannot equip every person with a GPU device, and sGPU technology addresses this difficulty by allowing GPUs to be shared.

Current techniques for GPU resource sharing include GPU virtualization and GPU passthrough. With GPU passthrough, a GPU can only be bound to a single virtual machine: that virtual machine monopolizes the GPU device and other virtual machines cannot use it. Because a single virtual machine cannot fully use the resources of the GPU, considerable resources are wasted. GPU virtualization creates a plurality of virtual GPU devices that are distributed to a plurality of virtual machines, but every virtual machine receives the same share of GPU resources, and IO virtualization performance imposes a further limitation, so that neither reasonable resource allocation nor effective resource utilization can be achieved.

Disclosure of Invention

To address the shortcomings of the prior art, the invention provides a GPU resource sharing method and device that allocate GPU physical resources directly to virtual machines. This allows the physical performance of the graphics card to be fully exploited while the GPU resources are shared among multiple users, effectively improving the utilization rate of the GPU resources.

The object of the invention is achieved by the following technical solution:

A GPU resource sharing method, the improvement comprising:

comparing the GPU physical resource requirements of a virtual machine with the unallocated resources in a resource pool;

if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the utilization rate of the GPU physical resources allocated to the virtual machine, and then allocating GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool;

otherwise, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
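The three steps above can be sketched as a single decision function. This is only an illustrative sketch, not the patented implementation: the `Pool` class, the `share` function and the `reclaim` callback are hypothetical names, resources are expressed as percentages of one pooled GPU, and the adjustment is assumed to act on allocations that already exist in the pool.

```python
from dataclasses import dataclass, field


@dataclass
class Pool:
    """Hypothetical resource pool; values are percentages of the pooled GPU."""
    total: float = 100.0
    allocated: dict = field(default_factory=dict)  # VM name -> allocated percent

    @property
    def unallocated(self) -> float:
        # Whatever is not marked as allocated is unallocated.
        return self.total - sum(self.allocated.values())


def share(pool: Pool, vm: str, demand: float, reclaim) -> bool:
    """Compare, adjust if the pool is short, then allocate.

    `reclaim` is a caller-supplied callback standing in for the
    utilization-based adjustment step; it may modify `pool.allocated`.
    Returns True if the demand was granted, False otherwise.
    """
    if demand > pool.unallocated:
        # Unallocated resources do not meet the demand: adjust first.
        reclaim(pool)
        if demand > pool.unallocated:
            return False  # still short after the adjustment
    # Enough unallocated resources: allocate according to the demand.
    pool.allocated[vm] = pool.allocated.get(vm, 0.0) + demand
    return True
```

A caller would pass its own adjustment routine as `reclaim`; keeping it as a callback leaves the comparison logic independent of how the utilization rate is measured.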

Preferably, adjusting the GPU physical resources allocated to the virtual machine according to the utilization rate of the GPU physical resources allocated to the virtual machine includes:

if the virtual machine's utilization rate of its GPU physical resources is below 100%, reducing the GPU physical resource allocation percentage of the virtual machine until its utilization rate reaches 90%, and marking the GPU physical resources released by the reduction as unallocated resources;

if the virtual machine's utilization rate of its GPU physical resources reaches 100%, increasing the GPU physical resource allocation percentage of the virtual machine until its utilization rate falls to 90%, and marking the GPU physical resources taken up by the increase as allocated resources.
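A minimal sketch of this adjustment rule follows. It rests on assumptions the text does not state: the amount of GPU resource actually in use is taken to stay constant while the allocation is resized, so that utilization equals used divided by allocated, and an allocation whose utilization already lies between 90% and 100% is left unchanged because shrinking it could not bring utilization down to 90%. The function and constant names are hypothetical.

```python
TARGET_UTILIZATION = 0.90  # the 90% target named in the text


def adjust_allocation(alloc_pct: float, utilization: float):
    """Resize one virtual machine's allocation toward 90% utilization.

    alloc_pct   -- current allocation, as a percentage of the pool
    utilization -- fraction of that allocation in use (0.0 to 1.0)

    Returns (new_alloc_pct, released_pct). A positive released_pct is
    returned to the pool as unallocated; a negative value is the extra
    share taken from the pool and marked as allocated.
    """
    used = alloc_pct * utilization  # assumed constant during resizing
    if utilization >= 1.0:
        # Saturated: grow the allocation until utilization falls to 90%.
        new_alloc = used / TARGET_UTILIZATION
    elif utilization < TARGET_UTILIZATION:
        # Under-used: shrink the allocation until utilization rises to 90%.
        new_alloc = used / TARGET_UTILIZATION
    else:
        # Between 90% and 100%: left unchanged (assumption, see lead-in).
        new_alloc = alloc_pct
    return new_alloc, alloc_pct - new_alloc
```

For example, a virtual machine holding 40% of the pool at 45% utilization would be reduced to roughly 20%, releasing about 20% of the pool as unallocated resources.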

Preferably, the allocating GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool includes:

if the unallocated resources in the adjusted resource pool meet the GPU physical resource requirements of the virtual machine, allocating GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine;

if the unallocated resources in the adjusted resource pool still do not meet the GPU physical resource requirements of the virtual machine, sending alarm information to the virtual machine.
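The two outcomes after adjustment can be illustrated as follows. This is a sketch under assumptions: the text does not specify how alarm information is delivered, so `notify` is a hypothetical caller-supplied callback, and quantities are percentages of the pool.

```python
def allocate_or_alarm(vm: str, demand: float, unallocated: float, notify) -> float:
    """Grant the demand if the adjusted pool can cover it, else raise an alarm.

    `notify(vm, message)` is a hypothetical callback standing in for
    sending alarm information to the virtual machine.
    Returns the granted percentage (0.0 when the alarm is sent).
    """
    if demand <= unallocated:
        # The adjusted pool meets the requirement: allocate as requested.
        return demand
    # Still insufficient after adjustment: alarm instead of allocating.
    notify(vm, f"GPU demand of {demand}% exceeds the {unallocated}% still unallocated")
    return 0.0
```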

Preferably, the allocating GPU physical resources to the virtual machine according to GPU physical resource requirements of the virtual machine includes:

determining the GPU physical resource allocation percentage of the virtual machine based on the GPU physical resource requirements of the virtual machine;

allocating the corresponding percentage of the GPU physical resources in the resource pool to the corresponding virtual machine;

marking the allocated resources and unallocated resources in the resource pool;

wherein the resource pool is formed by pooling GPU physical resources.
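The determine, allocate and mark steps can be sketched with a small ledger. The ledger layout, the function name and the use of absolute units for the demand (for instance MiB of GPU memory) are assumptions made for illustration; the text itself only speaks of percentages of the pooled resources.

```python
def allocate(pool_total: float, ledger: dict, vm: str, demand: float) -> float:
    """Allocate `demand` units to `vm` and update the pool's marks.

    pool_total -- total pooled GPU resources, in the same units as `demand`
    ledger     -- {"allocated": {vm: percent}, "unallocated": percent}
    Returns the allocation percentage given to the virtual machine.
    """
    # Step 1: determine the allocation percentage from the requirement.
    pct = demand * 100.0 / pool_total
    if pct > ledger["unallocated"]:
        raise ValueError("demand exceeds the unallocated resources")
    # Step 2: allocate that percentage of the pool to the virtual machine.
    ledger["allocated"][vm] = ledger["allocated"].get(vm, 0.0) + pct
    # Step 3: mark what is now allocated and what remains unallocated.
    ledger["unallocated"] -= pct
    return pct
```

Starting from `{"allocated": {}, "unallocated": 100.0}`, a request for 4096 units out of a 16384-unit pool would be recorded as a 25% allocation, leaving 75% marked as unallocated.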

Preferably, the determining the GPU physical resource allocation percentage for the virtual machine based on the GPU physical resource requirement of the virtual machine includes:

taking the ratio of the GPU physical resource requirement of the requesting virtual machine to all resources in the resource pool as the GPU physical resource allocation percentage of that virtual machine.

Based on the same inventive concept, the invention also provides a GPU resource sharing device, the improvement being that the device comprises:

a comparison unit, configured to compare the GPU physical resource requirements of the virtual machine with the unallocated resources in the resource pool; when the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, control passes to the adjusting unit; otherwise, control passes to the second allocation unit;

an adjusting unit, configured to adjust the GPU physical resources allocated to the virtual machine according to the utilization rate of the GPU physical resources allocated to the virtual machine;

a first allocation unit, configured to allocate GPU physical resources to the virtual machine according to the unallocated resources in the resource pool as adjusted by the adjusting unit;

and a second allocation unit, configured to allocate GPU physical resources to the virtual machine according to the GPU physical resource requirements of the virtual machine.
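One way to picture how control passes between the four units is a class with one method per unit. This is a hypothetical wiring for illustration only: the class and method names are invented, resources are percentages of a single pool, utilization figures are supplied by the caller as a static mapping, and the adjusting unit is simplified to shrinking under-used allocations toward 90% utilization.

```python
class GpuSharingDevice:
    """Hypothetical arrangement of the four units described above."""

    def __init__(self, utilization: dict):
        self.allocated = {}             # VM name -> allocated percent
        self.utilization = utilization  # VM name -> fraction of its share in use

    def unallocated(self) -> float:
        return 100.0 - sum(self.allocated.values())

    def comparison_unit(self, vm: str, demand: float):
        # Pool too small: go through the adjusting unit, then the first
        # allocation unit; otherwise go straight to the second allocation unit.
        if demand > self.unallocated():
            self.adjusting_unit()
            return self.first_allocation_unit(vm, demand)
        return self.second_allocation_unit(vm, demand)

    def adjusting_unit(self) -> None:
        # Simplified: only shrink allocations whose utilization is below 90%.
        for name, pct in list(self.allocated.items()):
            used_fraction = self.utilization.get(name, 1.0)
            if used_fraction < 0.9:
                self.allocated[name] = pct * used_fraction / 0.9

    def first_allocation_unit(self, vm: str, demand: float):
        if demand <= self.unallocated():
            return self.second_allocation_unit(vm, demand)
        return None  # the text sends alarm information in this case

    def second_allocation_unit(self, vm: str, demand: float) -> float:
        self.allocated[vm] = self.allocated.get(vm, 0.0) + demand
        return demand
```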

Preferably, the adjusting unit is specifically configured to:

Preferably, the first allocation unit is further specifically configured to:

if the unallocated resources in the adjusted resource pool meet the GPU physical resource requirements of the virtual machine, pass control to the second allocation unit;

Preferably, the second allocation unit includes:

the computing subunit is used for determining the distribution percentage of the GPU physical resources of the virtual machine based on the GPU physical resource requirements of the virtual machine;

The allocation subunit is used for allocating the GPU physical resources with corresponding percentages in the resource pool to the corresponding virtual machines according to the allocation percentages of the GPU physical resources to the virtual machines;

a marking subunit, configured to mark allocated resources and unallocated resources in the resource pool;

the resource pool is formed by pooling GPU physical resources.

Preferably, the computing subunit is specifically configured to:

Compared with the closest prior art, the invention has the following beneficial effects:

The invention provides a GPU resource sharing method and device, comprising the steps of comparing GPU physical resource requirements of a virtual machine with unallocated resources in a resource pool; if the unallocated resources in the resource pool do not meet the GPU physical resource requirements of the virtual machine, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine, and allocating the GPU physical resources to the virtual machine according to the unallocated resources in the adjusted resource pool; otherwise, GPU physical resources are allocated to the virtual machine according to GPU physical resource requirements of the virtual machine. The GPU resource sharing method and device provided by the invention can realize that a plurality of virtual machines directly use GPU resources, thereby improving the computing capacity of the virtual machines and effectively improving the utilization rate of the GPU resources.

Drawings

FIG. 1 is a flowchart of a GPU resource sharing method according to the present invention;

FIG. 2 is a schematic diagram of a GPU resource allocation percentage in an embodiment;

FIG. 3 is a diagram of a GPU resource sharing device according to the present invention.

Detailed Description

The following describes the embodiments of the present invention in further detail with reference to the drawings.

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

In order to solve the problem of low GPU resource utilization rate caused by excessive or insufficient GPU resource allocation, the invention provides a GPU resource sharing method, as shown in FIG. 1, comprising the following steps:

For the purpose of illustrating the invention more clearly, the method of the invention is further explained below in connection with specific examples:

In an embodiment of the present invention, adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization allocated to the virtual machine may include:

In an embodiment of the present invention, allocating GPU physical resources to a virtual machine according to unallocated resources in an adjusted resource pool may include:

In an embodiment of the present invention, allocating GPU physical resources to a virtual machine according to GPU physical resource requirements of the virtual machine may include:

marking the allocated resources and unallocated resources in the resource pool;

the resource pool is formed by pooling GPU physical resources in advance, and in the initial state all resources in the resource pool together are marked as 100%.

In the embodiment of the invention, so that the use of allocated resources is not disturbed, the GPU physical resources of each virtual machine are also isolated from one another by an isolation technique.

In an embodiment of the present invention, determining the GPU physical resource allocation percentage for the virtual machine based on the GPU physical resource requirements of the virtual machine may include:

taking the ratio of the GPU physical resource requirement of the requesting virtual machine to all resources in the resource pool as the GPU physical resource allocation percentage of that virtual machine;

Wherein all resources in the resource pool include allocated resources and unallocated resources.

In the embodiment provided by the invention, as shown in FIG. 2, the GPU physical resources may be divided into shares of 20%, 30% and 50% according to the GPU physical resource requirements of the individual virtual machines, and each share is then allocated to the virtual machine with the corresponding requirement.
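The 20%, 30% and 50% split can be reproduced numerically. In this sketch the absolute demands (2, 3 and 5 units out of a 10-unit pool), the virtual machine names and the function name are invented for illustration; only the resulting percentages come from the embodiment.

```python
def split_pool(demands: dict, pool_total: float) -> dict:
    """Map each virtual machine's demand to its percentage of the pool."""
    shares = {vm: demand * 100.0 / pool_total for vm, demand in demands.items()}
    if sum(shares.values()) > 100.0:
        raise ValueError("combined demands exceed the resource pool")
    return shares


# Hypothetical demands chosen to match the split shown in FIG. 2.
shares = split_pool({"vm-1": 2, "vm-2": 3, "vm-3": 5}, pool_total=10)
remaining = 100.0 - sum(shares.values())  # percentage left unallocated
```

With these numbers the three virtual machines receive 20%, 30% and 50% respectively, and nothing remains unallocated.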

Based on the same inventive concept, the present invention also provides a GPU resource sharing device, as shown in fig. 3, including:

In an embodiment of the present invention, the adjusting unit is specifically configured to:

In an embodiment of the invention, the first allocation unit is specifically configured to:

In an embodiment of the present invention, the second allocation unit may include:

the resource pool is formed by pooling GPU physical resources.

In an embodiment of the invention, the computing subunit is specifically configured to:

In summary, the GPU resource sharing method and device allow a plurality of virtual machines to use GPU resources directly, improving the computing capacity of the virtual machines; the GPU resources can also be adjusted dynamically according to each virtual machine's utilization rate of its GPU physical resources, effectively improving the utilization rate of the GPU resources.

It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

Finally, it should be noted that: the above embodiments are only for illustrating the technical aspects of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the above embodiments, it should be understood by those of ordinary skill in the art that: modifications and equivalents may be made to the specific embodiments of the invention without departing from the spirit and scope of the invention, which is intended to be covered by the claims.

Claims

1. A method for sharing GPU resources, comprising:

Otherwise, distributing GPU physical resources to the virtual machine according to GPU physical resource requirements of the virtual machine;

The adjusting the GPU physical resources allocated to the virtual machine according to the GPU physical resource utilization rate allocated to the virtual machine comprises:

2. The method of claim 1, wherein the allocating GPU physical resources to the virtual machine based on unallocated resources in the adjusted resource pool comprises:

3. The method of claim 1 or 2, wherein the allocating GPU physical resources to the virtual machine according to GPU physical resource requirements of the virtual machine comprises:

marking the allocated resources and unallocated resources in the resource pool;

The resource pool is formed by pooling GPU physical resources.

4. The method of claim 3, wherein the determining the GPU physical resource allocation percentage for the virtual machine based on GPU physical resource requirements of the virtual machine comprises:

5. A GPU resource sharing device, comprising:

the second allocation unit is used for allocating GPU physical resources to the virtual machine according to GPU physical resource requirements of the virtual machine;

The adjusting unit is specifically configured to:

6. The apparatus according to claim 5, wherein the first allocation unit is further specifically configured to:

7. The apparatus of claim 5 or 6, wherein the second allocation unit comprises:

the resource pool is formed by pooling GPU physical resources.

8. The apparatus of claim 7, wherein the computing subunit is configured to:

taking the ratio of the GPU physical resource requirement of the requesting virtual machine to the total resources of the resource pool as the GPU physical resource allocation percentage of that virtual machine.