CN113032134B - Method and device for realizing cloud computing resource allocation and cloud management server - Google Patents

Method and device for realizing cloud computing resource allocation and cloud management server Download PDF

Info

Publication number
CN113032134B
CN113032134B CN201911344153.7A CN201911344153A CN113032134B CN 113032134 B CN113032134 B CN 113032134B CN 201911344153 A CN201911344153 A CN 201911344153A CN 113032134 B CN113032134 B CN 113032134B
Authority
CN
China
Prior art keywords
inventory
instance
available area
instance specification
sufficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911344153.7A
Other languages
Chinese (zh)
Other versions
CN113032134A (en
Inventor
田永军
贺荣徽
余洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201911344153.7A priority Critical patent/CN113032134B/en
Publication of CN113032134A publication Critical patent/CN113032134A/en
Application granted granted Critical
Publication of CN113032134B publication Critical patent/CN113032134B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Stored Programmes (AREA)

Abstract

The application discloses a method and a device for realizing cloud computing resource allocation and a cloud management server, which realize that computing resources are automatically created for users according to corresponding instance specifications acceptable by the users to finish computation according to the inventory real-time situation of public cloud services, fully utilize the elastic capability of a single data center, namely a single available area, and effectively help the users to acquire computing resources as soon as possible without the problem of waiting for resource release caused by the single instance specifications.

Description

Method and device for realizing cloud computing resource allocation and cloud management server
Technical Field
The present application relates to, but not limited to, cloud computing technologies, and in particular, to a method and apparatus for implementing cloud computing resource allocation, and a cloud management server.
Background
The cloud computing has the greatest advantage of elastic expansion, so that a user can use computing resources, storage resources and the like water, electricity and coal, and in summary, the user can consider resources on the cloud to be infinite.
Public cloud computing platforms typically deploy multiple data centers, also known as Available Zones (AZ), in multiple regions, each machine room may deploy different instance specifications, the inventory of each instance specification being real-time and different. For users of high performance computing (HPC, high Performance Computing), the cluster size is large and the resources requested by the users are typically centrally deployed in the same data center for performance. The scheduling and expansion requirements for resources are very high, and pulse capacity expansion clusters are often carried out to a very large scale, so that the problem of cluster capacity expansion failure caused by insufficient inventory can be encountered even on a cloud computing platform, and the computing time of a computing task is prolonged.
In the existing scheme for realizing resource allocation, only an example of one specification can be automatically stretched and contracted, and the expansion is limited in one region. For this case, for users who have not requested resources, the user waits until the resources in the data center are released and available.
Disclosure of Invention
The application provides a method and a device for realizing cloud computing resource allocation and a cloud management server, which can fully exert public cloud elasticity so as to ensure computing performance.
The embodiment of the invention provides a method for realizing cloud computing resource allocation, which comprises the following steps:
the cloud management server determines different types of instance specifications matched with the new task according to resource demand configuration information pre-configured by a user;
inquiring whether the inventory of the current priority instance specification in the available area is sufficient or not according to the determined priority order of the instance specifications of different types;
When the inventory of the current priority instance specification in the available area is inquired to be sufficient, selecting the current priority instance specification to create computing resources for a new task of the user; and when the inventory of the current priority instance specification in the available area is inquired to be insufficient, continuing to inquire the inventory of the next priority instance specification.
In an exemplary embodiment, the method further comprises:
the cloud management server receives the pre-configured resource demand configuration information;
wherein the resource requirement configuration information includes one or more resource configuration items, each resource configuration item including one or more different types of instance specifications.
In an exemplary embodiment, the available area includes an available area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
For different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types.
In one illustrative example, the method further comprises:
If the inventory of the current priority instance specification in the available area is determined to be sufficient, creating computing resources in the cluster according to the current instance specification, and running the new job to finish the flow;
If the inventory of the current priority instance specification in the available area is not sufficient, returning to the inquiring step, inquiring the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource allocation item are inquired, and ending the flow.
In one exemplary embodiment, the available area includes two or more than two and all belong to the same geographic area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory corresponding to the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; the available area corresponding to each resource configuration item comprises one or more than one available areas, and the available areas corresponding to different resource configuration items are different in provided example specification.
In one illustrative example, the method further comprises:
aiming at different resource configuration items, if the inventory of the current priority instance specification in the corresponding available area is determined to be sufficient, computing resources are created in the cluster according to the current instance specification, the new job is operated, and the process is ended;
If the inventory of the current priority instance specification in the corresponding available area is not sufficient, returning to the query step, and querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
In one illustrative example, the available region includes two or more and is distributed over different geographic areas; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available area of the corresponding cluster is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; wherein, one or more than one available area corresponding to each resource configuration item is distributed in one or more than one cluster, and the instance specifications provided by the available areas corresponding to different resource configuration items are different.
In one illustrative example, the method further comprises:
Aiming at different resource configuration items, if the stock of the current priority instance specification in the corresponding available area of the corresponding cluster is determined to be sufficient, computing resources are created in the cluster according to the current instance specification, the new job is operated, and the process is ended;
And if the inventory of the current priority instance specification in the corresponding available region of the corresponding cluster is determined to be insufficient, returning to the inquiring step, inquiring the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are inquired, and ending the flow.
The application also provides a computer readable storage medium storing computer executable instructions for performing the method for implementing cloud computing resource allocation according to any one of the above.
The application also provides a device for realizing cloud computing resource allocation, which comprises a memory and a processor, wherein the memory stores the following instructions executable by the processor: steps for performing the method of implementing cloud computing resource allocation of any of the above.
The application further provides a cloud management server, comprising: the system comprises a determining module, a query module and a processing module; wherein,
The determining module is used for determining different types of example specifications of the configurable new task according to the resource demand configuration information pre-configured by the user;
The query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient or not according to the determined priority order of the instance specifications of different types;
the processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the inventory of the current priority instance specification in the available area is queried to be sufficient; and when the inventory of the current priority instance specification in the available area is inquired to be insufficient, returning to the inquiry module to continuously inquire the inventory of the next priority instance specification.
According to the application, the calculation is completed by automatically creating the calculation resources for the user according to the corresponding instance specifications acceptable by the user according to the inventory real-time condition of the public cloud service, the elasticity of a single data center, namely a single available area, is fully utilized, the problem of waiting for resource release caused by the single instance specifications is avoided, and the user is effectively helped to acquire the calculation resources as soon as possible.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate and do not limit the application.
FIG. 1 is a flow chart of a method of implementing cloud computing resource allocation in accordance with the present application;
FIG. 2 is a schematic diagram of a first embodiment of the present application for implementing cloud computing resource allocation;
FIG. 3 is a flow chart of a first embodiment of the present application;
FIG. 4 is a schematic diagram of a scenario in which a second embodiment of cloud computing resource allocation is implemented according to the present application;
FIG. 5 is a schematic diagram of a third embodiment of a cloud computing resource allocation implementation of the present application;
FIG. 6 is a schematic diagram of a scenario in which a fourth embodiment of cloud computing resource allocation is implemented according to the present application;
Fig. 7 is a schematic diagram of a composition structure of a cloud management server according to the present application;
Fig. 8 is a schematic diagram of an application scenario for implementing cloud computing resource allocation according to the present application.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the present application more apparent, embodiments of the present application will be described in detail hereinafter with reference to the accompanying drawings. It should be noted that, without conflict, the embodiments of the present application and features of the embodiments may be arbitrarily combined with each other.
In one typical configuration of the application, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, random Access Memory (RAM) and/or nonvolatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of computer-readable media.
Computer readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of storage media for a computer include, but are not limited to, phase change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape disk storage or other magnetic storage devices, or any other non-transmission medium, which can be used to store information that can be accessed by a computing device. Computer readable media, as defined herein, does not include non-transitory computer readable media (transmission media), such as modulated data signals and carrier waves.
The steps illustrated in the flowchart of the figures may be performed in a computer system, such as a set of computer-executable instructions. Also, while a logical order is depicted in the flowchart, in some cases, the steps depicted or described may be performed in a different order than presented herein.
FIG. 1 is a flowchart of a method for implementing cloud computing resource allocation according to the present application, as shown in FIG. 1, including:
step 100: and the cloud management server determines different types of instance specifications matched with the new task according to the resource demand configuration information pre-configured by the user.
There are various types of instance specifications on public clouds, such as general purpose type, computing type, memory type, high dominant frequency type, etc. In an illustrative example, before this step, further comprising:
The cloud management server receives pre-configured resource requirement configuration information from a user (e.g., a cluster administrator), the resource requirement configuration information including one or more resource configuration items, each resource configuration item including one or more different types of instance specifications.
In one illustrative example, a user may set resource configuration items in a queue manner, setting a plurality of types of instance specifications that can be expanded/contracted for each queue, such as: example specifications of the expandable/contractible volume in the first queue include: specification 1, specification 2; and the following steps: example specifications of the expandable/contractible volume in the second queue include: specification 3, specification 4, etc.
In combination with practical application scenarios, for example: the first queue is configured as a 32-core example, and then the user can sequentially expand the following examples according to the characteristics of the computing task and the priority setting: general (32 core 128GB memory) -high-master (32 core 128GB memory) -compute (32 core 64GB memory) -memory type (32 core 256GB memory). And the following steps: the second queue is configured as a 16-core example, and then the user can set the following expansion examples according to the priority according to the characteristics of the computing task: general (16 core 64GB memory) -high-master frequency (16 core 64GB memory) -computational (16 core 32GB memory) -memory type (16 core 128GB memory). It should be noted that this is only an example and is not intended to limit the scope of the present application.
In an exemplary embodiment, the resource demand configuration information comprises two or more resource configuration items, namely, a plurality of instance specifications can be expanded/contracted in a plurality of available areas, so that the elasticity advantage of the public cloud is fully exerted.
In an illustrative example, step 100 may further include:
The cloud management server acquires the job information in the cluster according to a preset period or a specified time period, and learns that a new job waits for creating a computing resource to complete the computation from the job information.
Step 101: and inquiring whether the inventory of the current priority instance specification in the available area is sufficient according to the determined priority order of the instance specifications of different types.
In an exemplary embodiment, for the case that one cluster corresponds to one available area, as cluster 1 in fig. 8, the available area in this step includes one available area; the step may include:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types.
In an exemplary embodiment, for a case that one cluster corresponds to a plurality of available areas, such as cluster 2 and cluster 3 in fig. 8, the available areas in this step include two or more and all belong to the same geographical area; the step may include:
for different resource configuration items, the cloud management server inquires whether the inventory corresponding to the current instance specification in the applicable area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; the available area corresponding to each resource configuration item comprises one or more than one available areas, and the available areas corresponding to different resource configuration items are different in provided example specification.
In an exemplary embodiment, one available area may be used to expand/contract an instance specification included in one resource configuration item, and may also be used to expand/contract an instance specification included in two or more resource configuration items.
In an exemplary example, for a plurality of clusters such as cluster 1, cluster 2, cluster 3 as shown in fig. 8, each cluster corresponds to one or more available areas, the available areas in this step include two or more and are distributed in different geographical ranges; the step may include:
For different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available region of the corresponding cluster is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; wherein, one or more than one available area corresponding to each resource configuration item is distributed in one or more than one cluster, and the instance specifications provided by the available areas corresponding to different resource configuration items are different.
Step 102: when the inventory of the current priority instance specification in the available area is inquired to be sufficient, selecting the current priority instance specification to create computing resources for a new task of the user; when the inventory of the current priority instance specification in the available area is inquired to be insufficient, returning to the step 101 to continuously inquire the inventory of the next priority instance specification.
In one illustrative example, the available region includes one available region; the step may include:
if the inventory of the current priority instance specification in the available area is sufficient, creating computing resources in the cluster according to the current instance specification, and running a new job to end the process;
If it is determined that the inventory of the current priority instance specification is insufficient in the available area, returning to step 101, querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
Therefore, the method and the system realize the calculation by automatically creating the calculation resources for the user according to the corresponding instance specifications acceptable by the user according to the inventory real-time condition of the public cloud service, fully utilize the elastic capability of a single data center, namely a single available area, and effectively help the user to acquire the calculation resources as soon as possible without the problem of waiting for resource release caused by the single instance specifications.
In one illustrative example, the available area includes two or more and all belong to the same geographic area; the step may include:
If the inventory corresponding to the current priority instance specification in the available area is determined to be sufficient according to different resource configuration items, computing resources are created in the cluster according to the current instance specification, a new job is operated, and the process is ended;
If it is determined that the inventory corresponding to the current priority instance specification in the available area is insufficient, returning to step 101, querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
In this way, by configuring multiple data centers, namely multiple instance specifications of multiple available areas, in the resource demand configuration information, different capacity expansion/contraction demands of users are met in multiple data centers of one cluster, and the performance of the cluster is ensured.
In one illustrative example, the available area includes two or more and is distributed over different geographic areas; the step may include:
for different resource configuration items, if the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is determined to be sufficient, computing resources are created in the cluster according to the current instance specification, a new job is operated, and the process is ended;
If it is determined that the inventory of the current priority instance specification is insufficient in the corresponding available area of the corresponding cluster, returning to step 101, querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
In this way, by configuring multiple instance specifications of multiple data centers of multiple regions, namely multiple instance specifications of multiple available areas of multiple clusters, in the resource demand configuration information, different capacity expansion/contraction demands of users in the multiple data centers of multiple regions are met, and the performance of the clusters is ensured.
By the method for realizing cloud computing resource allocation, public cloud elasticity is fully exerted, and computing performance is guaranteed. Particularly, in the case of emergency of a user computing task, the method effectively helps the user to acquire computing resources as soon as possible.
The present application also provides a computer readable storage medium storing computer executable instructions for performing the method of implementing cloud computing resource allocation of any of the above.
The application further provides a device for realizing cloud computing resource allocation, which comprises a memory and a processor, wherein the memory stores the following instructions executable by the processor: steps for performing the method of implementing cloud computing resource allocation of any of the above.
The technical scheme of the present application will be described in detail with reference to specific embodiments.
Fig. 2 is a schematic view of a scenario of a first embodiment of implementing cloud computing resource allocation according to the present application, where a first embodiment uses a case where multiple instance specifications are expanded in an available area of a cluster (i.e. the same region) as an example, and it is assumed that a user sets the multiple instance specifications to be expanded in a queue, for example: the first queue (corresponding to the first resource configuration item) is configured as a 32-core example, and the following examples are assumed to be expanded in sequence according to the priority setting by the user according to the characteristics of the computing task: example specification 11 is a general-purpose (32 core 128GB memory) -example specification 12 is a high-dominant frequency (32 core 128GB memory), as shown in the flowchart of the first embodiment in fig. 3, including:
Step 300: the user (cluster administrator) sets the resource demand configuration information and submits it to the cloud management server.
In this embodiment, the resource configuration items and the instance specifications included in each queue are set in a queue manner, and it is assumed that in the first embodiment, a first queue is set, and according to the priority, the instance specification 11, the instance specification 12, and a second queue according to the priority, the instance specification 21, and the instance specification 22 are expanded.
Step 301: if a user submits a job to the first queue of the cluster service, the cloud management server periodically acquires the job information on the first queue from the cluster service and knows that a new job waits for creating a computing resource in the first queue to complete the computation.
Step 302: according to the resource demand configuration information configured by the user, the cloud management server determines an instance specification 11 and an instance specification 12 of a new task to be expanded; the cloud management server queries the cloud service for the real-time inventory condition of the instance specification 11 in the currently available area, assuming that the cloud service returns that the instance specification 11 is not in inventory.
Step 303: the cloud management server queries the cloud service for real-time inventory of instance specifications 12 in the currently available region, assuming the cloud service returns that instance specifications 12 are in inventory.
Step 304: the cloud management server initiates creation of compute nodes of instance specification 12 to the cloud service and automatically joins the compute nodes into a cluster to run new jobs.
Generally, there are various types of instance specifications on public clouds, such as general purpose type, computing type, memory type, high main frequency type, and the like. The first embodiment realizes that according to the inventory real-time condition of public cloud service, computing resources are automatically created for users according to corresponding instance specifications acceptable by the users to finish computation, the elastic capability of a single data center, namely a single available area, is fully utilized, the problem of waiting for resource release caused by the single instance specifications is avoided, and the users are effectively helped to acquire computing resources as soon as possible
Fig. 4 is a schematic view of a scenario of a second embodiment of implementing cloud computing resource allocation according to the present application, where in the second embodiment, multiple instance specifications are expanded in multiple available areas of a cluster, and the expanded instance specifications provided by each available area are different, and assume that a user sets resource requirement configuration information and submits the resource requirement configuration information to a cloud management server, in this embodiment, a resource configuration item and an instance specification included in each queue are set in a queue manner, and in this embodiment, it is assumed that a first queue is set, and according to priority expanded 4, the instance specification 11, and according to the verification of 4, the instance specification 12 are verified, and the second queue expands according to priority, the GPU instance specification 21 and the GPU instance specification 22.
When the cloud management server knows that a user submits a new job to the first queue, the cloud management server sequentially inquires the cloud service about the inventory condition of the instance specification 11 and the instance specification 12 in the available area 1 according to the corresponding first resource configuration item in the resource demand configuration information, namely the instance specification sequence configured by the first queue, and then automatically expands the capacity; when the cloud management server knows that the user submits a new job to the second queue, the cloud management server can sequentially inquire the cloud service about the instance specification 23 in the available area 3 according to the corresponding second resource configuration item in the resource demand configuration information, namely the instance specification sequence configured by the second queue, and the inventory of the instance specification 24 is automatically expanded.
Typically, the specifications of examples provided by different available areas on a public cloud vary, or there is a large variation in inventory, such as: in this embodiment, the available area 1 and the available area 2 in one cluster have different specifications. In the second embodiment, by configuring multiple data centers, that is, multiple instance specifications of multiple available areas, in the resource requirement configuration information, different capacity expansion/contraction requirements of users are met in multiple data centers of one cluster, and the performance of the cluster is ensured.
Fig. 5 is a schematic view of a scenario of a third embodiment of implementing cloud computing resource allocation according to the present application, where, as in the second embodiment, multiple instance specifications are expanded in multiple available areas of a cluster, unlike in the second embodiment, each available area may be included in one or more resource configuration items, that is, a single queue may attempt to expand in multiple available areas.
It should be noted that, there is a network delay for inter-node communication across the available areas compared to inter-node communication within the same available area, so the present application further includes:
Through the management of the cluster scheduling queues, a single queue is limited to be expanded in one available area, and after a user submits a new job to the queue, the new job is ensured to only run on the node of the same available area through scheduling, so that the computing performance is further ensured.
Fig. 6 is a schematic view of a scenario of a fourth embodiment of implementing cloud computing resource allocation according to the present application, as shown in fig. 6, for a requirement of a very large scale cluster, capacity expansion can be performed across regions, that is, a cloud management server can meet capacity expansion requirements of users between different regions. The specific implementation in different regions, such as region a and region B, in fig. 6 may be implemented by any combination of the first embodiment to the third embodiment.
The fourth embodiment realizes that the multiple data centers in the multiple regions meet different capacity expansion/contraction requirements of users and ensures the performance of the clusters by configuring multiple instance specifications of the multiple data centers in the multiple regions, namely multiple instance specifications of multiple available areas of the multiple clusters, in the resource requirement configuration information.
Fig. 7 is a schematic diagram of a composition structure of a cloud management server according to the present application, where, as shown in fig. 7, the cloud management server at least includes: the system comprises a determining module, a query module and a processing module; wherein,
The determining module is used for determining different types of example specifications of the configurable new task according to the resource demand configuration information pre-configured by the user;
The query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient or not according to the determined priority order of the instance specifications of different types;
the processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the inventory of the current priority instance specification in the available area is queried to be sufficient; and when the inventory of the current priority instance specification in the available area is inquired to be insufficient, returning to the inquiry module to continuously inquire the inventory of the next priority instance specification.
In one illustrative example, the determination module is further to:
The method comprises the steps of receiving pre-configured resource requirement configuration information of a user (such as a cluster administrator) reported by the user, wherein the resource requirement configuration information comprises one or more resource configuration items, and each resource configuration item comprises one or more different types of instance specifications.
Although the embodiments of the present application are described above, the embodiments are only used for facilitating understanding of the present application, and are not intended to limit the present application. Any person skilled in the art can make any modification and variation in form and detail without departing from the spirit and scope of the present disclosure, but the scope of the present disclosure is to be determined by the appended claims.

Claims (10)

1. A method of implementing cloud computing resource allocation, comprising:
the cloud management server determines different types of instance specifications matched with the new task according to resource demand configuration information pre-configured by a user;
inquiring whether the inventory of the current priority instance specification in the available area is sufficient or not according to the determined priority order of the instance specifications of different types;
When the inventory of the current priority instance specification in the available area is inquired to be sufficient, selecting the current priority instance specification to create computing resources for a new task of the user; when the inventory of the current priority instance specification in the available area is inquired to be insufficient, continuing to inquire the inventory of the next priority instance specification;
The available area comprises an available area, and when the inventory of the current priority instance specification in the available area is inquired to be sufficient, computing resources are created for new tasks of users by selecting the current priority instance specification; when the inventory of the current priority instance specification in the available area is inquired to be insufficient, continuing to inquire the inventory of the next priority instance specification, wherein the method comprises the following steps: if the inventory of the current priority instance specification in the available area is determined to be sufficient, creating computing resources in the cluster according to the current instance specification, and running a new job to end the process; and if the inventory of the current priority instance specification in the available area is not sufficient, returning to the query step, and querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
2. The method of claim 1, the method further comprising before:
the cloud management server receives the pre-configured resource demand configuration information;
wherein the resource requirement configuration information includes one or more resource configuration items, each resource configuration item including one or more different types of instance specifications.
3. The method of claim 2, wherein the querying whether the inventory of current priority instance specifications in the availability zone is sufficient comprises:
For different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types.
4. The method of claim 2, wherein the available area comprises two or more and all belong to the same geographic area; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory corresponding to the current instance specification in the available area is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; the available area corresponding to each resource configuration item comprises one or more than one available areas, and the available areas corresponding to different resource configuration items are different in provided example specification.
5. The method of claim 4, wherein the method further comprises:
Aiming at different resource configuration items, if the inventory of the current priority instance specification in the corresponding available area is determined to be sufficient, computing resources are created in the cluster according to the current instance specification, a new job is operated, and the process is ended;
If the inventory of the current priority instance specification in the corresponding available area is not sufficient, returning to the query step, and querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
6. The method of claim 2, wherein the available area comprises two or more and is distributed over different geographic areas; the querying whether the inventory of the current priority instance specification in the available area is sufficient includes:
for different resource configuration items, the cloud management server inquires whether the inventory of the current instance specification in the corresponding available area of the corresponding cluster is sufficient or not from the instance specification with the highest priority in the determined instance specifications of different types; wherein, one or more than one available area corresponding to each resource configuration item is distributed in one or more than one cluster, and the instance specifications provided by the available areas corresponding to different resource configuration items are different.
7. The method of claim 6, wherein the method further comprises:
aiming at different resource configuration items, if the inventory of the current priority instance specification in the corresponding available area of the corresponding cluster is determined to be sufficient, computing resources are created in the cluster according to the current instance specification, a new job is operated, and the process is ended;
And if the inventory of the current priority instance specification in the corresponding available region of the corresponding cluster is determined to be insufficient, returning to the inquiring step, inquiring the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are inquired, and ending the flow.
8. A computer-readable storage medium storing computer-executable instructions for performing the method of implementing cloud computing resource allocation of any of claims 1-7.
9. An apparatus for implementing cloud computing resource allocation, comprising a memory and a processor, wherein the memory has stored therein instructions executable by the processor to: a method for performing the method for implementing cloud computing resource allocation of any of claims 1-7.
10. A cloud management server, comprising: the system comprises a determining module, a query module and a processing module; wherein,
The determining module is used for determining different types of example specifications of the configurable new task according to the resource demand configuration information pre-configured by the user;
The query module is used for querying whether the inventory of the current priority instance specification in the available area is sufficient or not according to the determined priority order of the instance specifications of different types;
The processing module is used for selecting the current priority instance specification to create computing resources for a new task of a user when the inventory of the current priority instance specification in the available area is queried to be sufficient; when the inventory of the current priority instance specification in the available area is inquired to be insufficient, returning to the inquiry module to continuously inquire the inventory of the next priority instance specification; the available area comprises an available area, and when the inventory of the current priority instance specification in the available area is inquired to be sufficient, computing resources are created for new tasks of users by selecting the current priority instance specification; when the inventory of the current priority instance specification in the available area is inquired to be insufficient, continuing to inquire the inventory of the next priority instance specification, wherein the method comprises the following steps: if the inventory of the current priority instance specification in the available area is determined to be sufficient, creating computing resources in the cluster according to the current instance specification, and running a new job to end the process; and if the inventory of the current priority instance specification in the available area is not sufficient, returning to the query step, and querying the inventory of the next priority instance specification of the current priority instance specification until all the inventories of different types of instance specifications in the resource configuration item are queried, and ending the process.
CN201911344153.7A 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server Active CN113032134B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911344153.7A CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911344153.7A CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Publications (2)

Publication Number Publication Date
CN113032134A CN113032134A (en) 2021-06-25
CN113032134B true CN113032134B (en) 2024-06-18

Family

ID=76451449

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911344153.7A Active CN113032134B (en) 2019-12-24 2019-12-24 Method and device for realizing cloud computing resource allocation and cloud management server

Country Status (1)

Country Link
CN (1) CN113032134B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023179170A1 (en) * 2022-03-24 2023-09-28 华为云计算技术有限公司 Instance configuration method and apparatus, and device
CN115379000B (en) * 2022-10-26 2023-03-24 小米汽车科技有限公司 Service platform capacity expansion method and device, electronic equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
CN108337109A (en) * 2017-12-28 2018-07-27 中兴通讯股份有限公司 A kind of resource allocation methods and device and resource allocation system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108023742B (en) * 2016-10-31 2021-08-27 阿里巴巴集团控股有限公司 Application capacity expansion method, device and system
CN108429631B (en) * 2017-02-15 2021-01-15 华为技术有限公司 Method and device for instantiating network service
CN109766175A (en) * 2018-12-28 2019-05-17 深圳晶泰科技有限公司 Resource elastic telescopic system and its dispatching method towards high-performance calculation on cloud

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461744A (en) * 2014-12-18 2015-03-25 曙光云计算技术有限公司 Resource allocation method and device
CN108337109A (en) * 2017-12-28 2018-07-27 中兴通讯股份有限公司 A kind of resource allocation methods and device and resource allocation system

Also Published As

Publication number Publication date
CN113032134A (en) 2021-06-25

Similar Documents

Publication Publication Date Title
US9628438B2 (en) Consistent ring namespaces facilitating data storage and organization in network infrastructures
CN115328663A (en) Method, device, equipment and storage medium for scheduling resources based on PaaS platform
CN113032134B (en) Method and device for realizing cloud computing resource allocation and cloud management server
US9104501B2 (en) Preparing parallel tasks to use a synchronization register
JP2018525743A (en) Load balancing method and apparatus
WO2020042612A1 (en) Method and device for storing and reading a message, server, and storage medium
CN110908774B (en) Resource scheduling method, equipment, system and storage medium
CN112463290A (en) Method, system, apparatus and storage medium for dynamically adjusting the number of computing containers
US20220345521A1 (en) Network edge computing method, apparatus, device and medium
CN107562803B (en) Data supply system and method and terminal
WO2022111456A1 (en) Core sharing method and apparatus based on many-core system, electronic device, and medium
CN112631680B (en) Micro-service container scheduling system, method, device and computer equipment
JP7440007B2 (en) Systems, methods and apparatus for querying databases
CN111435319A (en) Cluster management method and device
CN105760215A (en) Map-reduce model based job running method for distributed file system
CN115134373A (en) Data synchronization method and device, storage medium and electronic equipment
CN113630317A (en) Data transmission method and device, nonvolatile storage medium and electronic device
CN108279982B (en) Method, system and equipment for managing pbs resources and hadoop resources
CN115878309A (en) Resource allocation method, device, processing core, equipment and computer readable medium
CN111221644B (en) Resource scheduling method, device and equipment
AU2018290375B2 (en) Resource allocation method and system
CN113873052B (en) Domain name resolution method, device and equipment of Kubernetes cluster
CN112559164A (en) Resource sharing method and device
CN111435925B (en) Snapshot data transmission method and device
CN111651276B (en) Scheduling method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant