CN109471733A

CN109471733A - A kind of resource control method and device

Info

Publication number: CN109471733A
Application number: CN201811399681.8A
Authority: CN
Inventors: 张�浩
Original assignee: Zhengzhou Yunhai Information Technology Co Ltd
Current assignee: Zhengzhou Yunhai Information Technology Co Ltd
Priority date: 2018-11-22
Filing date: 2018-11-22
Publication date: 2019-03-15

Abstract

The application provides a kind of resource control method and device, which comprises obtains the graphics processor GPU information and memory information of each service unit；Flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity, and the flexible strategy is for controlling the quantity of service unit.Through the above technical solutions, can predict that the resource quantity of service is effectively controlled to deep learning.

Description

A kind of resource control method and device

Technical field

The present invention relates to computer field more particularly to a kind of resource control methods and device.

Background technique

Generally there are two types of methods for elastic telescopic: horizontal extension and vertical telescopic.Horizontal extension is the number of additions and deletions service unit Amount, vertical telescopic is the resource for changing each service unit.(one increasing income, more in cloud platform for managing by Kubernetes The application of containerization on a host) official use HPA (Horizontal service unit Autoscaling, horizontal extension) Strategy is that (the minimum deployment unit in Kubernetes is made of one group of tightly coupled container according to each service unit Pod Container group) CPU (Central Processing Unit, central processing unit) monitoring data, determine increase/deletion Pod number Amount.In the epoch of deep learning, many companies both provide the prediction service of deep learning, and many cloud service platforms are bases In Kubernetes clustered deploy(ment).The prediction service of deep learning is not different with common service in itself, but In operational process, common service relys more on CPU, and the prediction of deep learning service is smaller to the dependence of CPU, passes through CPU Monitoring data control can not effectively control the quantity of Pod.

Summary of the invention

The application technology to be solved is to provide a kind of resource control method and device, can predict to take to deep learning The resource quantity of business is effectively controlled.

In order to solve the above-mentioned technical problem, this application provides a kind of resource control methods, which comprises

Obtain the graphics processor GPU information and memory information of each service unit；

Flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity, The flexible strategy is for controlling the quantity of service unit.

Optionally, described true according to the GPU information of each service unit, memory information and current service unit total quantity Flexible strategy includes: calmly

The GPU utilization rate of each service unit is calculated according to the GPU information of each service unit；

The memory usage of each service unit is calculated according to the memory information of each service unit；

It is determined according to the GPU utilization rate of each service unit, memory usage and current service unit total quantity flexible Strategy.

Optionally, true according to the GPU utilization rate of each service unit, memory usage and current service unit total quantity Flexible strategy includes: calmly

When meeting following either condition, flexible strategy is determined to increase service unit:

The average GPU utilization rate of current all service units is greater than first threshold, and current service unit total quantity is less than Service unit quantity max-thresholds；

The average memory usage of current all service units is greater than second threshold, and current service unit total quantity is less than Service unit quantity max-thresholds.

Optionally, described according to the GPU utilization rate of each service unit, memory usage and current service unit sum Amount determines that flexible strategy includes:

If the average GPU utilization rate of current all service units and average memory usage are respectively less than third threshold value, and work as Preceding service unit total quantity is greater than service unit quantity minimum threshold, it is determined that flexible strategy is for deletion service unit and/or temporarily Stop service unit.

Optionally, described true according to the GPU information of each service unit, memory information and current service unit total quantity After fixed flexible strategy, the method also includes:

Execute the corresponding operation of the flexible strategy.

The application also provides a kind of resource control, comprising: memory and processor；The memory, for saving Program for resources control；

The processor executes the program for being used for resources control for reading, performs the following operations:

Optionally, the processor executes the program for being used for resources control for reading, also performs the following operations:

Described determined according to the GPU information of each service unit, memory information and current service unit total quantity is stretched After strategy, the corresponding operation of the flexible strategy is executed.

Compared with prior art, the application includes: to obtain the graphics processor GPU information and memory letter of each service unit Breath；Flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity, it is described Flexible strategy is for controlling the quantity of service unit.Through the above technical solutions, can predict deep learning the resource of service Quantity is effectively controlled.

Detailed description of the invention

Fig. 1 is the flow diagram of the resource control method of the embodiment of the present invention one；

Fig. 2 is another flow diagram of the resource control method of the embodiment of the present invention one；

Fig. 3 is the structural schematic diagram of the resource control of the embodiment of the present invention one.

Specific embodiment

To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with attached drawing to the present invention Embodiment be described in detail.It should be noted that in the absence of conflict, in the embodiment and embodiment in the application Feature can mutual any combination.

Step shown in the flowchart of the accompanying drawings can be in a computer system such as a set of computer executable instructions It executes.Also, although logical order is shown in flow charts, and it in some cases, can be to be different from herein suitable Sequence executes shown or described step.

Embodiment one

As shown in Figure 1, the present embodiment provides a kind of resource control methods, which comprises

Step S100, GPU (Graphics Processing Unit, the graphics processor) letter of each service unit is obtained Breath and memory information；

Step S102, it is determined according to the GPU information of each service unit, memory information and current service unit total quantity Flexible strategy, the flexible strategy is for controlling the quantity of service unit.

Optionally, described true according to the GPU information of each service unit, memory information and current service unit total quantity Flexible strategy may include: calmly

Optionally, true according to the GPU utilization rate of each service unit, memory usage and current service unit total quantity Flexible strategy may include: calmly

In addition to it can determine flexible strategy to increase service unit when meeting above-mentioned condition, in other embodiments, also Other conditions that can increase service unit can be set.For example, can set when in the GPU utilization rate of all service units Maximum value is greater than first threshold and when current service unit total quantity is less than service unit quantity max-thresholds, it is determined that flexible plan Slightly increase service unit；Or it can also set when the maximum value in the memory usage of all service units is greater than the second threshold Value and when current service unit total quantity is less than service unit quantity max-thresholds, it is determined that flexible strategy is single to increase service Member.

It above are only for example, in other embodiments, it can be according to the utilization rate of the GPU of each service unit, interior The total quantity of utilization rate and current service unit is deposited, setting can increase the various conditional combinations of service unit, different herein One enumerates.

Optionally, described according to the GPU utilization rate of each service unit, memory usage and current service unit sum Amount determines that flexible strategy may include:

In addition to it can determine flexible strategy for deletion and/or unit out of service when meeting above-mentioned condition, in other realities It applies in example, other can delete and/or the condition of unit out of service can also be set.For example, can set when all services The minimum of minimum value and memory usage in the GPU utilization rate of unit is respectively less than third threshold value and current service unit sum When measuring greater than service unit quantity minimum threshold, it is determined that flexible strategy is deletion and/or unit out of service.

It above are only for example, in other embodiments, it can be according to the utilization rate of the GPU of each service unit, interior The total quantity of utilization rate and current service unit is deposited, setting can delete and/or the various conditional combinations of unit out of service, It is numerous to list herein.

It should be noted that first threshold, second threshold and third threshold value between any two can be equal, it can not also phase Deng.

Optionally, as shown in Fig. 2, it is described according to the GPU information of each service unit, memory information and current service list After first total quantity determines flexible strategy, the method can also include:

Step S104, the corresponding operation of the flexible strategy is executed.

In the present embodiment, service unit can be the minimum deployment unit Pod in Kubernetes, and Pod is by one group of tight coupling The container group of the container composition of conjunction.Flexible strategy can be determined according to the GPU information and memory information of Pod by Kubernetes, After determining flexible strategy, Kubernetes controls the quantity of Pod according to the content for the strategy that stretches.

When needing to increase Pod quantity, Kubernetes can determine increased quantity according to system mechanism, such as often hold The primary operation for increasing Pod of row can increase by 4 Pod, alternatively, the primary operation for increasing Pod of every execution can increase by 8 Pod. It above are only example, specifically increase that how many Pod can set corresponding mechanism in Kubernetes.

Equally, when needing to delete Pod quantity, Kubernetes can determine the quantity deleted, example according to system mechanism If the primary operation for deleting Pod of every execution can delete 2 Pod, alternatively, the primary operation for deleting Pod of every execution can delete 4 A Pod.It above are only example, specifically delete that how many Pod can set corresponding mechanism in Kubernetes.

Through the above technical solutions, can predict that the resource quantity of service is effectively controlled to deep learning.

As shown in figure 3, the present embodiment also provides a kind of resource control, comprising: memory 10 and processor 20；

The memory 10, for saving the program for being used for resources control；

The processor 20 is performed the following operations for reading the program for executing the resources control:

Optionally, the processor 20, which is used to read, executes the program for being used for resources control, can also be performed as follows Operation:

Those of ordinary skill in the art will appreciate that all or part of the steps in the above method can be instructed by program Related hardware is completed, and described program can store in computer readable storage medium, such as read-only memory, disk or CD Deng.Optionally, one or more integrated circuits also can be used to realize, accordingly in all or part of the steps of above-described embodiment Ground, each module/unit in above-described embodiment can take the form of hardware realization, can also use the shape of software function module Formula is realized.The application is not limited to the combination of the hardware and software of any particular form.

The foregoing is merely preferred embodiment of the present application, are not intended to limit this application, for the skill of this field For art personnel, various changes and changes are possible in this application.Within the spirit and principles of this application, made any to repair Change, equivalent replacement, improvement etc., should be included within the scope of protection of this application.

Claims

1. a kind of resource control method, which is characterized in that the described method includes:

Flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity, it is described Flexible strategy is for controlling the quantity of service unit.

2. the method as described in claim 1, which is characterized in that described according to the GPU information of each service unit, memory information And current service unit total quantity determines that flexible strategy includes:

Flexible strategy is determined according to the GPU utilization rate of each service unit, memory usage and current service unit total quantity.

3. method according to claim 2, which is characterized in that according to the GPU utilization rate of each service unit, memory usage And current service unit total quantity determines that flexible strategy includes:

The average GPU utilization rate of current all service units is greater than first threshold, and current service unit total quantity is less than service Element number max-thresholds；

The average memory usage of current all service units is greater than second threshold, and current service unit total quantity is less than service Element number max-thresholds.

4. method according to claim 2, which is characterized in that described to be made according to the GPU utilization rate of each service unit, memory Determine that flexible strategy includes: with rate and current service unit total quantity

If the average GPU utilization rate of current all service units and average memory usage are respectively less than third threshold value, and current clothes Unit total quantity of being engaged in is greater than service unit quantity minimum threshold, it is determined that flexible strategy is to delete service unit and/or pause clothes Business unit.

5. the method as described in Claims 1-4 is any, it is characterised in that:

It is described that flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity Later, the method also includes:

Execute the corresponding operation of the flexible strategy.

6. a kind of resource control, comprising: memory and processor；It is characterized by:

The memory, for saving the program for being used for resources control；

7. device as claimed in claim 6, which is characterized in that described according to the GPU information of each service unit, memory information And current service unit total quantity determines that flexible strategy includes:

8. device as claimed in claim 7, which is characterized in that according to the GPU utilization rate of each service unit, memory usage And current service unit total quantity determines that flexible strategy includes:

9. device as claimed in claim 8, which is characterized in that described to be made according to the GPU utilization rate of each service unit, memory Determine that flexible strategy includes: with rate and current service unit total quantity

10. the device as described in claim 6 to 9 is any, it is characterised in that:

The processor executes the program for being used for resources control for reading, also performs the following operations:

It is described that flexible strategy is determined according to the GPU information of each service unit, memory information and current service unit total quantity Later, the corresponding operation of the flexible strategy is executed.