CN110636388A - Service request distribution method, system, electronic equipment and storage medium - Google Patents

Service request distribution method, system, electronic equipment and storage medium Download PDF

Info

Publication number
CN110636388A
CN110636388A CN201910931291.9A
Authority
CN
China
Prior art keywords
slave node
performance data
service request
performance
time period
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910931291.9A
Other languages
Chinese (zh)
Inventor
谢全泉
李娟
苏楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd filed Critical Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN201910931291.9A priority Critical patent/CN110636388A/en
Publication of CN110636388A publication Critical patent/CN110636388A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/647Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
    • H04N21/64723Monitoring of network processes or resources, e.g. monitoring of network load
    • H04N21/6473Monitoring network processes errors
    • H04N21/64738Monitoring network characteristics, e.g. bandwidth, congestion level

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application discloses a service request distribution method, which comprises: obtaining historical performance data of all slave nodes; determining the performance change trend of each slave node according to the historical performance data, and determining the expected performance data of each slave node in a target time period according to the performance change trend, wherein the target time period is a time period after the current time; and setting the priority of each slave node according to the expected performance data, and distributing the service requests received in the target time period according to the priority. The method can distribute service requests while keeping the load balanced, thereby improving cluster reliability. The application also discloses a service request distribution system, an electronic device and a storage medium, which have the same beneficial effects.

Description

Service request distribution method, system, electronic equipment and storage medium
Technical Field
The present application relates to the field of server technologies, and in particular, to a method and a system for allocating service requests, an electronic device, and a storage medium.
Background
Load balancing means distributing a workload evenly across a plurality of operation units. For example, data processing tasks that need to be processed within a period of time can be distributed among FTP servers, Web servers, enterprise core application servers and other primary task servers, so that the work is completed cooperatively.
Load balancing is one of the effective means of improving system reliability. In the related art, load balancing is triggered after a node in the cluster fails or when the load of a node is too high: the priority with which each server receives requests is determined according to its current load state, so as to reduce the load difference between servers. However, this load balancing scheme reacts only after the fact (it lags behind the actual load), and cluster reliability is low.
Therefore, how to distribute service requests while keeping the load balanced and improving cluster reliability is a technical problem that currently needs to be solved by those skilled in the art.
Disclosure of Invention
The application aims to provide a service request distribution method, a service request distribution system, an electronic device and a storage medium that can distribute service requests while keeping the load balanced, thereby improving cluster reliability.
In order to solve the above technical problem, the present application provides a service request allocation method, where the service request allocation method includes:
acquiring historical performance data of all slave nodes;
determining the performance change trend of each slave node according to the historical performance data, and determining the expected performance data of each slave node in a target time period according to the performance change trend; wherein the target time period is a time period after the current time;
and setting the priority of each slave node according to the expected performance data, and distributing the service requests received in the target time period according to the priority.
Optionally, the setting the priority of each slave node according to the expected performance data includes:
determining an expected load of each said slave node for said target time period based on said expected performance data;
determining a priority for each of the slave nodes based on the expected load; wherein the expected load magnitude is inversely related to the priority level.
Optionally, allocating the service requests received in the target time period according to the priority includes:
updating a service request distribution probability table according to the priority corresponding to each slave node; the service request distribution probability table comprises request distribution probabilities corresponding to each slave node, and the sum of the request distribution probabilities corresponding to all the slave nodes is 1;
and distributing the service requests received in the target time period to the corresponding slave nodes according to the distribution probability table.
Optionally, the method further includes:
when node fault information is received, setting the request distribution probability of a slave node corresponding to the node fault information to be zero, and updating the request distribution probability table.
Optionally, the historical performance data includes any one or a combination of bandwidth, IOPS, CPU utilization, and memory utilization.
Optionally, the determining the performance trend of each slave node according to the historical performance data includes:
determining a performance evaluation value of the historical performance data acquired in each period according to a weight value corresponding to each type of performance parameter in the historical performance data;
and determining the performance change trend of each slave node according to all the performance evaluation values.
Optionally, determining the performance variation trend of each slave node according to all the performance evaluation values includes:
determining the time weight of the period corresponding to each performance evaluation value; wherein the farther the period corresponding to a performance evaluation value is from the current time, the smaller its time weight;
and determining the performance change trend of each slave node according to all the performance evaluation values and the time weight.
The present application also provides a service request distribution system, which includes:
the historical data acquisition module is used for acquiring historical performance data of all slave nodes;
the expectation module is used for determining the performance change trend of each slave node according to the historical performance data and determining the expected performance data of each slave node in a target time period according to the performance change trend; wherein the target time period is a time period after the current time;
and the request distribution module is used for setting the priority of each slave node according to the expected performance data and distributing the service request received in the target time period according to the priority.
The application also provides a storage medium having a computer program stored thereon, where the computer program, when executed, implements the steps of the above service request distribution method.
The application also provides an electronic device comprising a memory and a processor, where the memory stores a computer program and the processor, when calling the computer program in the memory, implements the steps of the above service request distribution method.
The application provides a service request distribution method, which comprises the steps of obtaining historical performance data of all slave nodes; determining the performance change trend of each slave node according to the historical performance data, and determining the expected performance data of each slave node in a target time period according to the performance change trend; wherein the target time period is a time period after the current time; and setting the priority of each slave node according to the expected performance data, and distributing the service requests received in the target time period according to the priority.
According to the method, the historical performance data of each slave node is obtained first, and the performance change trend of each slave node is determined from that data. The historical load conditions of the slave nodes can be learned from their historical performance data, so the expected load condition in the target time period can be determined from the expected performance data derived from the performance change trend. The priority of each slave node is then set based on the expected performance data, and the service requests received in the target time period are distributed according to the priority, so that the load of the slave nodes in the cluster is balanced. The method can therefore distribute service requests while keeping the load balanced, which improves cluster reliability. The application also provides a service request distribution system, a storage medium and an electronic device, which have the same beneficial effects and are not described again here.
Drawings
In order to more clearly illustrate the embodiments of the present application, the drawings needed for the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings can be obtained by those skilled in the art without inventive effort.
Fig. 1 is a flowchart of a service request allocation method according to an embodiment of the present application;
fig. 2 is a flowchart of another service request allocation method provided in the embodiment of the present application;
fig. 3 is a schematic structural diagram of a service request distribution system according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Referring to fig. 1, fig. 1 is a flowchart of a service request distribution method according to an embodiment of the present application.
The specific steps may include:
s101: acquiring historical performance data of all slave nodes;
the execution main body of this embodiment may be a master node of a cluster, the cluster may include a plurality of slave nodes, and the master node allocates a service request to each slave node, so that the slave node executes a service processing operation corresponding to the service request. The preset period of obtaining the historical performance data of the master node may be preset, so that the master node obtains the key performance index of each slave node at preset time intervals to obtain the historical performance data.
The historical performance data may include any one or a combination of bandwidth, IOPS (Input/Output Operations Per Second), CPU utilization, and memory utilization. As a possible implementation, the historical performance data may also include any one or a combination of bandwidth utilization, IOPS utilization, CPU utilization, and memory utilization, where bandwidth utilization is the ratio of the used bandwidth to the maximum bandwidth, and IOPS utilization is the ratio of the actual I/O operations per second to the maximum I/O operations per second.
As a possible implementation manner, the present embodiment may acquire historical performance data corresponding to a specific historical time period, and may record the generation time of the historical performance data after acquiring the historical performance data.
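To make the collection step concrete, the following is a minimal sketch (in Python) of how a master node might poll its slave nodes for key performance indicators at a preset interval and record the generation time of each sample. The node names, the collection interval, and the fetch_metrics() helper are illustrative assumptions, not details given in the application.

```python
# Minimal sketch of periodic performance-data collection on the master node.
# SLAVE_NODES, COLLECT_INTERVAL and fetch_metrics() are hypothetical.
import time
from collections import defaultdict

SLAVE_NODES = ["node-a", "node-b", "node-c"]   # hypothetical slave nodes
COLLECT_INTERVAL = 60                          # preset collection period, seconds

history = defaultdict(list)                    # node -> list of (timestamp, metrics)

def fetch_metrics(node):
    """Hypothetical helper: query one slave node for its key performance indicators."""
    # In practice this would call a monitoring agent on the slave node.
    return {"bandwidth_util": 0.4, "iops_util": 0.3,
            "cpu_util": 0.6, "mem_util": 0.5}

def collect_once():
    """Collect one sample per slave node and record its generation time."""
    now = time.time()
    for node in SLAVE_NODES:
        history[node].append((now, fetch_metrics(node)))

# The master would call collect_once() every COLLECT_INTERVAL seconds,
# e.g. from a scheduler loop or a timer thread.
```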
S102: determining the performance change trend of each slave node according to the historical performance data, and determining the expected performance data of each slave node in a target time period according to the performance change trend;
After the historical performance data is obtained, the performance change trend of each slave node can be determined according to the correspondence between each performance index and its generation time in the historical performance data, and the expected performance data of each slave node in a future time period can be determined from that performance change trend.
For example, suppose the current time is 12:00 and 11:55 is the time at which the historical performance data was last obtained, with node A reporting a CPU utilization of 60% and a memory utilization of 50% at 11:55. If the data collected from 10:55 to 11:55 shows that node A's performance trend is flat, the expected performance data for 12:00-12:05 is a CPU utilization of 60% and a memory utilization of 50%; if the trend is a linear rise, the expected performance data for 12:00-12:05 may be a CPU utilization of 66% and a memory utilization of 55%; if the trend is a linear decline, the expected performance data for 12:00-12:05 may be a CPU utilization of 54% and a memory utilization of 45%; if the trend is an exponential rise, the expected performance data for 12:00-12:05 may be a CPU utilization of 95% and a memory utilization of 90%; and if the trend is an exponential decline, the expected performance data for 12:00-12:05 may be a CPU utilization of 15% and a memory utilization of 10%.
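As an illustration of how a trend such as the linear rise above could be extrapolated, the sketch below fits a straight line to the samples of the preceding hour and evaluates it in the target window. The sampling schedule and sample values are made up for the example; the application does not prescribe a particular fitting method.

```python
# Minimal sketch: least-squares linear fit of one metric (e.g. CPU utilization)
# over the last hour, extrapolated to the midpoint of the target time period.
def linear_forecast(samples, target_time):
    """samples: list of (time, value); returns the extrapolated value at target_time."""
    n = len(samples)
    xs = [t for t, _ in samples]
    ys = [v for _, v in samples]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    denom = sum((x - mean_x) ** 2 for x in xs)
    slope = 0.0 if denom == 0 else sum((x - mean_x) * (y - mean_y) for x, y in samples) / denom
    intercept = mean_y - slope * mean_x
    value = intercept + slope * target_time
    return min(max(value, 0.0), 1.0)   # clamp utilization to [0, 1]

# Example: CPU utilization sampled every 5 minutes from 10:55 to 11:55
# (x axis = minutes since 10:55), forecast for the midpoint of 12:00-12:05 (minute 67.5).
cpu_samples = [(i * 5, 0.30 + 0.025 * i) for i in range(13)]   # linearly rising trend
print(round(linear_forecast(cpu_samples, 67.5), 3))
```

An exponential trend could be handled analogously, for instance by fitting in log space.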
As a feasible implementation manner, the time difference between the end of the time period covered by the historical performance data acquired in S101 and the current time may be kept within a preset range, so that the expected performance data derived from the historical performance data is more accurate.
S103: and setting the priority of each slave node according to the expected performance data, and distributing the service requests received in the target time period according to the priority.
In this embodiment, once the expected performance data of each slave node in the target time period has been obtained, the priority of each slave node may be set according to the level of that data. Specifically, the expected performance data may include several performance indexes such as bandwidth utilization, IOPS utilization, CPU utilization and memory utilization; an expected performance score for each slave node may be obtained by combining the weight coefficient corresponding to each performance index, and the priority of each slave node is then set according to its score. The service requests received by the cluster in the target time period are distributed according to these priorities. Because the expected performance data represents the resources each slave node is expected to consume in the target time period, the more resources a node is expected to consume, the lower its priority; when service requests are distributed, a slave node with a higher priority has a higher probability of receiving a request, so slave nodes with lower expected resource consumption in the target time period receive more service requests, thereby balancing the load of the slave nodes in the cluster.
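The following sketch illustrates one way the weighted expected-performance score and the resulting priority ordering might be computed; the specific weight coefficients and node values are hypothetical, since the application only states that each performance index has a corresponding weight coefficient.

```python
# Minimal sketch: weighted expected-performance score and priority ordering.
# METRIC_WEIGHTS and the example values are assumptions for illustration.
METRIC_WEIGHTS = {"bandwidth_util": 0.2, "iops_util": 0.2, "cpu_util": 0.4, "mem_util": 0.2}

def expected_score(expected_metrics):
    """Weighted sum of expected utilizations; a higher score means more expected load."""
    return sum(METRIC_WEIGHTS[k] * v for k, v in expected_metrics.items())

def rank_by_priority(expected_by_node):
    """Lower expected load -> higher priority (earlier in the returned list)."""
    return sorted(expected_by_node, key=lambda node: expected_score(expected_by_node[node]))

expected = {
    "node-a": {"bandwidth_util": 0.5, "iops_util": 0.4, "cpu_util": 0.66, "mem_util": 0.55},
    "node-b": {"bandwidth_util": 0.2, "iops_util": 0.3, "cpu_util": 0.15, "mem_util": 0.10},
}
print(rank_by_priority(expected))   # node-b first: lower expected load, higher priority
```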
As a further introduction to the above embodiment, a specific process of setting the priority of each slave node in S103 may include the steps of:
step 1: determining an expected load of each said slave node for said target time period based on said expected performance data;
step 2: determining a priority for each of the slave nodes based on the expected load;
the above manner may calculate the expected load of the slave node in the target time period according to the bandwidth utilization, the IOPS utilization, the CPU utilization, and the memory utilization in the expected performance data. The priority mentioned above is the priority of allocating the service request, and the higher the expected load, the lower the priority, and the lower the expected load, the higher the priority. That is, the priority setting manner can make the probability of receiving the service request from the node with smaller expected load higher than the probability of receiving the service request from the node with larger expected load, thereby balancing the load of the nodes in the cluster.
This embodiment first obtains the historical performance data of each slave node and determines the performance change trend of each slave node from that data. The historical load conditions of the slave nodes can be learned from their historical performance data, so the expected load condition in the target time period can be determined from the expected performance data derived from the performance change trend. In this embodiment, the priority of each slave node is set based on the expected performance data, and the service requests received in the target time period are distributed according to the priority, so that the load of the slave nodes in the cluster is balanced. The embodiment can therefore distribute service requests while keeping the load balanced, which improves cluster reliability.
As a further addition to the corresponding embodiment of fig. 1, the performance trend of the slave node may be determined by: firstly, determining a performance evaluation value of the historical performance data acquired in each period according to a weight value corresponding to each type of performance parameter in the historical performance data; and determining the performance change trend of each slave node according to all the performance evaluation values. Specifically, the determining the performance variation trend of each slave node according to all the performance evaluation values may include the following steps:
step 1: determining the time weight of the period corresponding to each performance evaluation value; wherein the farther the period corresponding to a performance evaluation value is from the current time, the smaller its time weight;
step 2: and determining the performance change trend of each slave node according to all the performance evaluation values and the time weight.
By introducing the time weight, this embodiment gives performance evaluation values closer to the current time a higher weight, which improves the accuracy of the performance change trend.
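A minimal sketch of this time-weighted variant is shown below: each period's metrics are first collapsed into a scalar performance evaluation value using per-metric weights, and the change between consecutive values is then averaged with time weights that decay for older periods. The exponential decay factor and all numeric values are assumptions for illustration.

```python
# Minimal sketch: time-weighted performance evaluation values and trend.
def evaluation_values(period_metrics, metric_weights):
    """period_metrics: list of metric dicts, oldest first -> one scalar per period."""
    return [sum(metric_weights[k] * v for k, v in m.items()) for m in period_metrics]

def weighted_trend(values, decay=0.8):
    """Average per-period change, weighting more recent changes more heavily."""
    deltas = [b - a for a, b in zip(values, values[1:])]
    if not deltas:
        return 0.0
    time_weights = [decay ** (len(deltas) - 1 - i) for i in range(len(deltas))]  # newest -> 1.0
    return sum(w * d for w, d in zip(time_weights, deltas)) / sum(time_weights)

vals = evaluation_values(
    [{"cpu_util": 0.3, "mem_util": 0.4}, {"cpu_util": 0.4, "mem_util": 0.45},
     {"cpu_util": 0.55, "mem_util": 0.5}],
    {"cpu_util": 0.6, "mem_util": 0.4},
)
print(round(weighted_trend(vals), 3))   # positive value: load is trending upwards
```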
Referring to fig. 2, fig. 2 is a flowchart of another service request allocation method provided in the embodiment of the present application; this embodiment is a specific description of how the service requests are allocated according to the priorities in the embodiment corresponding to fig. 1, and a more preferred implementation may be obtained by combining this embodiment with the embodiment corresponding to fig. 1. The embodiment may include the following steps:
S201: updating a service request distribution probability table according to the priority corresponding to each slave node;
the service request distribution probability table comprises request distribution probabilities corresponding to each slave node, and the sum of the request distribution probabilities corresponding to all the slave nodes is 1;
S202: distributing the service requests received in the target time period to the corresponding slave nodes according to the distribution probability table.
In the above embodiment, a corresponding request distribution probability is set for each priority level, and the service request distribution probability table is updated according to the distribution probability corresponding to each node. After a service request is received in the target time period, the target slave node to which the request should be distributed can be selected according to the distribution probability table. In this embodiment, the target slave node may be selected at random, with the random selection performed according to the request distribution probabilities recorded in the distribution probability table.
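The sketch below shows one possible way to build such a distribution probability table and to select a target slave node at random according to it. The rule used here, probability proportional to the inverse of the expected load, is an illustrative choice; the application only requires that the probabilities sum to 1 and favour higher-priority nodes.

```python
# Minimal sketch: request distribution probability table and weighted random selection.
import random

def build_probability_table(expected_load):
    """expected_load: node -> expected load in (0, 1]; returns node -> probability (sums to 1)."""
    inverse = {node: 1.0 / max(load, 1e-6) for node, load in expected_load.items()}
    total = sum(inverse.values())
    return {node: w / total for node, w in inverse.items()}

def pick_slave(prob_table):
    """Randomly select the target slave node according to the distribution probabilities."""
    nodes = list(prob_table)
    return random.choices(nodes, weights=[prob_table[n] for n in nodes], k=1)[0]

table = build_probability_table({"node-a": 0.6, "node-b": 0.2, "node-c": 0.4})
print(table)            # node-b gets the largest share of requests
print(pick_slave(table))
```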
As a further supplement to the above embodiment, when the master node receives the node failure information, the request allocation probability of the slave node corresponding to the node failure information may be set to zero, and the request allocation probability table may be updated.
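A small sketch of this failure-handling step follows, under the assumption that the remaining probabilities are simply renormalized so the table still sums to 1 (the application does not specify how the table is rebalanced).

```python
# Minimal sketch: zero the failed node's request distribution probability and renormalize.
def remove_failed_node(prob_table, failed_node):
    updated = {node: (0.0 if node == failed_node else p) for node, p in prob_table.items()}
    remaining = sum(updated.values())
    if remaining > 0:
        updated = {node: p / remaining for node, p in updated.items()}
    return updated

print(remove_failed_node({"node-a": 0.5, "node-b": 0.3, "node-c": 0.2}, "node-b"))
```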
The flow described in the above embodiments is explained below with a practical example.
This embodiment may be applied to a server cluster including a plurality of nodes. A performance prediction engine may be deployed on the master node of the server cluster and configured to collect and store historical data for the key performance indicators of each slave node (such as bandwidth, IOPS, CPU utilization and memory utilization). At a regular interval T, the prediction engine may predict the performance data for the coming time period based on the historical performance data. The service priority of each slave node is set according to its predicted performance data: the lower a slave node's predicted load, the higher its priority. When the master node receives a service request, the node that will process the request can be determined according to the priorities of the slave nodes, and the service request is forwarded to it.
In the process of predicting slave node performance, the master node may collect the historical performance data of each slave node at a certain frequency and, at the fixed time interval T, use the prediction engine to obtain the expected performance data of each slave node. From the performance index weights and this performance data, the master node can calculate the expected load state of each slave node. The master node may then assign each slave node a request distribution probability based on its priority: the lower a slave node's expected load, the higher its service priority and the higher its selection probability.
When the master node receives a service request, it may randomly select a slave node as the service node (that is, the target slave node) according to the slave nodes' selection probabilities; the higher a slave node's request distribution probability, the more likely it is to be selected. After determining the service node, the master node forwards the service request to it.
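Putting these pieces together, a compact sketch of the overall master-node flow might look as follows; predict_loads() and the printed forwarding step are hypothetical stand-ins for the prediction engine and the real request-forwarding logic, and the refresh interval T is left implicit.

```python
# Compact end-to-end sketch: re-predict per-node load every interval T, rebuild the
# probability table, and forward each incoming request to a randomly selected slave node.
import random

def predict_loads():
    # Stand-in for the prediction engine: expected load per slave node for the next window.
    return {"node-a": 0.6, "node-b": 0.2, "node-c": 0.4}

def rebuild_table(loads):
    inv = {n: 1.0 / max(l, 1e-6) for n, l in loads.items()}
    s = sum(inv.values())
    return {n: w / s for n, w in inv.items()}

def dispatch(request, table):
    nodes = list(table)
    target = random.choices(nodes, weights=[table[n] for n in nodes], k=1)[0]
    print(f"forwarding {request} to {target}")   # stand-in for the real forwarding step

table = rebuild_table(predict_loads())           # refreshed every interval T
for req in ["req-1", "req-2", "req-3"]:
    dispatch(req, table)
```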
This embodiment evaluates the load state of each slave node over a coming period of time by predicting the performance of each slave node in the cluster, so as to identify the low-load nodes. When a new service request arrives, a node with a low load has a higher probability of receiving it and a node with a high load has a lower probability, thereby balancing the load. Instead of relying only on the current system state to evaluate the load of each node, the embodiment balances the load of future requests according to the pressure the system is likely to face, which greatly improves system reliability.
Referring to fig. 3, fig. 3 is a schematic structural diagram of a service request distribution system according to an embodiment of the present application;
the system may include:
a historical data obtaining module 100, configured to obtain historical performance data of all slave nodes;
an expectation module 200, configured to determine a performance variation trend of each slave node according to the historical performance data, and determine expected performance data of each slave node in a target time period according to the performance variation trend; wherein the target time period is a time period after the current time;
a request allocating module 300, configured to set a priority of each slave node according to the expected performance data, and allocate a service request received in the target time period according to the priority.
This embodiment first obtains the historical performance data of each slave node and determines the performance change trend of each slave node from that data. The historical load conditions of the slave nodes can be learned from their historical performance data, so the expected load condition in the target time period can be determined from the expected performance data derived from the performance change trend. In this embodiment, the priority of each slave node is set based on the expected performance data, and the service requests received in the target time period are distributed according to the priority, so that the load of the slave nodes in the cluster is balanced. The embodiment can therefore distribute service requests while keeping the load balanced, which improves cluster reliability.
Further, the request distribution module 300 includes:
an expected load determining unit, configured to determine, according to the expected performance data, an expected load of each slave node in the target time period;
the priority setting unit is used for determining the priority of each slave node according to the expected load; wherein the expected load magnitude is inversely related to the priority level.
An allocation unit, configured to allocate the service requests received in the target time period according to the priority.
Further, the request distribution module 300 includes:
a node setting unit, configured to set a priority of each slave node according to the expected performance data;
the probability setting unit is used for updating a service request distribution probability table according to the priority corresponding to each slave node; the service request distribution probability table comprises request distribution probabilities corresponding to each slave node, and the sum of the request distribution probabilities corresponding to all the slave nodes is 1;
and the service request distribution unit is used for distributing the service requests received in the target time period to the corresponding slave nodes according to the distribution probability table.
Further, the system further comprises:
and the probability table updating unit is used for setting the request distribution probability of the slave node corresponding to the node fault information to be zero and updating the request distribution probability table when the node fault information is received.
Further, the historical performance data includes any one or a combination of bandwidth, IOPS, CPU utilization, and memory utilization.
Further, the expectation module 200 includes:
the performance evaluation value determining unit is used for determining the performance evaluation value of the historical performance data acquired in each period according to the weight value corresponding to each type of performance parameter in the historical performance data;
a trend determining unit, configured to determine a performance variation trend of each slave node according to all the performance evaluation values;
and the expected performance data determining unit is used for determining the expected performance data of each slave node in a target time period according to the performance change trend.
Further, the trend determining unit includes:
a time weight determining subunit, configured to determine a time weight of a period corresponding to each of the performance evaluation values; the earlier the cycle corresponding to the performance evaluation value is from the current moment, the smaller the time weight is;
and the performance change trend determining subunit is used for determining the performance change trend of each slave node according to all the performance evaluation values and the time weights.
Since the embodiment of the system part corresponds to the embodiment of the method part, the embodiment of the system part is described with reference to the embodiment of the method part, and is not repeated here.
The present application also provides a storage medium having a computer program stored thereon, which when executed, may implement the steps provided by the above-described embodiments. The storage medium may include: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The application further provides an electronic device, which may include a memory and a processor, where the memory stores a computer program, and the processor may implement the steps provided by the foregoing embodiments when calling the computer program in the memory. Of course, the electronic device may also include various network interfaces, power supplies, and the like.
The embodiments are described in a progressive manner in the specification, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description. It should be noted that, for those skilled in the art, it is possible to make several improvements and modifications to the present application without departing from the principle of the present application, and such improvements and modifications also fall within the scope of the claims of the present application.
It is further noted that, in the present specification, relational terms such as first and second are used solely to distinguish one entity or action from another entity or action, without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "comprising a …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.

Claims (10)

1. A service request distribution method is characterized by comprising the following steps:
acquiring historical performance data of all slave nodes;
determining the performance change trend of each slave node according to the historical performance data, and determining the expected performance data of each slave node in a target time period according to the performance change trend; wherein the target time period is a time period after the current time;
and setting the priority of each slave node according to the expected performance data, and distributing the service requests received in the target time period according to the priority.
2. The method of claim 1, wherein setting the priority of each slave node according to the expected performance data comprises:
determining an expected load of each said slave node for said target time period based on said expected performance data;
determining a priority for each of the slave nodes based on the expected load; wherein the expected load magnitude is inversely related to the priority level.
3. The method of claim 1, wherein allocating the service requests received in the target time period according to the priority comprises:
updating a service request distribution probability table according to the priority corresponding to each slave node; the service request distribution probability table comprises request distribution probabilities corresponding to each slave node, and the sum of the request distribution probabilities corresponding to all the slave nodes is 1;
and distributing the service requests received in the target time period to the corresponding slave nodes according to the distribution probability table.
4. The service request distribution method according to claim 3, further comprising:
when node fault information is received, setting the request distribution probability of a slave node corresponding to the node fault information to be zero, and updating the request distribution probability table.
5. The service request distribution method according to claim 1, wherein the historical performance data comprises any one or a combination of bandwidth, IOPS, CPU utilization, and memory utilization.
6. The service request distribution method according to any of claims 1 to 5, wherein determining a performance trend of each slave node according to the historical performance data comprises:
determining a performance evaluation value of the historical performance data acquired in each period according to a weight value corresponding to each type of performance parameter in the historical performance data;
and determining the performance change trend of each slave node according to all the performance evaluation values.
7. The method according to claim 6, wherein determining the performance trend of each slave node according to all the performance evaluation values comprises:
determining the time weight of the period corresponding to each performance evaluation value; wherein the farther the period corresponding to a performance evaluation value is from the current time, the smaller its time weight;
and determining the performance change trend of each slave node according to all the performance evaluation values and the time weight.
8. A service request distribution system, comprising:
the historical data acquisition module is used for acquiring historical performance data of all slave nodes;
the expectation module is used for determining the performance change trend of each slave node according to the historical performance data and determining the expected performance data of each slave node in a target time period according to the performance change trend; wherein the target time period is a time period after the current time;
and the request distribution module is used for setting the priority of each slave node according to the expected performance data and distributing the service request received in the target time period according to the priority.
9. An electronic device, comprising a memory in which a computer program is stored and a processor which, when invoked by the computer program in the memory, carries out the steps of the service request distribution method according to any of claims 1 to 7.
10. A storage medium having stored thereon computer-executable instructions which, when loaded and executed by a processor, carry out the steps of a service request distribution method according to any one of claims 1 to 7.
CN201910931291.9A 2019-09-29 2019-09-29 Service request distribution method, system, electronic equipment and storage medium Pending CN110636388A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910931291.9A CN110636388A (en) 2019-09-29 2019-09-29 Service request distribution method, system, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910931291.9A CN110636388A (en) 2019-09-29 2019-09-29 Service request distribution method, system, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN110636388A true CN110636388A (en) 2019-12-31

Family

ID=68973359

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910931291.9A Pending CN110636388A (en) 2019-09-29 2019-09-29 Service request distribution method, system, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110636388A (en)

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101945407A (en) * 2010-10-22 2011-01-12 东南大学 Load balancing method for content monitoring of mobile service
CN102624922A (en) * 2012-04-11 2012-08-01 武汉大学 Method for balancing load of network GIS heterogeneous cluster server
CN103368864A (en) * 2013-07-31 2013-10-23 北京华易互动科技有限公司 Intelligent load balancing method based on c/s (Client/Server) architecture
US20160011617A1 (en) * 2014-07-11 2016-01-14 Microsoft Technology Licensing, Llc Power management of server installations
CN107395708A (en) * 2017-07-14 2017-11-24 郑州云海信息技术有限公司 A kind of method and apparatus for handling download request
CN110278102A (en) * 2018-03-15 2019-09-24 勤智数码科技股份有限公司 A kind of IT automation operational system and method
CN109918198A (en) * 2019-02-18 2019-06-21 中国空间技术研究院 A kind of emulation cloud platform load dispatch system and method based on user characteristics prediction
CN110035122A (en) * 2019-04-04 2019-07-19 科讯嘉联信息技术有限公司 A kind of load-balancing method based on dynamic probability model, device and system

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022021858A1 (en) * 2020-07-29 2022-02-03 苏州浪潮智能科技有限公司 Method and system for achieving high service availability in high-load scene in distributed system
US11736562B1 (en) 2020-07-29 2023-08-22 Inspur Suzhou Intelligent Technology Co., Ltd. Method and system for achieving high availability of service under high-load scene in distributed system
CN113435754A (en) * 2021-06-30 2021-09-24 平安普惠企业管理有限公司 Service distribution method, device, terminal equipment and medium
CN113553186A (en) * 2021-07-29 2021-10-26 深信服科技股份有限公司 Load balancing method, system, storage medium and electronic equipment
CN114584605A (en) * 2022-03-21 2022-06-03 平安壹钱包电子商务有限公司 Service distribution method, device, electronic equipment and storage medium
CN114584605B (en) * 2022-03-21 2024-04-05 平安壹钱包电子商务有限公司 Service distribution method and device, electronic equipment and storage medium
CN117217920A (en) * 2023-11-08 2023-12-12 深圳海辰储能科技有限公司 Energy storage transaction data processing method, device and storage medium
CN117217920B (en) * 2023-11-08 2024-01-30 深圳海辰储能科技有限公司 Energy storage transaction data processing method, device and storage medium
CN117237004A (en) * 2023-11-10 2023-12-15 深圳海辰储能科技有限公司 Energy storage device transaction processing method and device and storage medium
CN117237004B (en) * 2023-11-10 2024-03-05 深圳海辰储能科技有限公司 Energy storage device transaction processing method and device and storage medium

Similar Documents

Publication Publication Date Title
CN110636388A (en) Service request distribution method, system, electronic equipment and storage medium
US7631034B1 (en) Optimizing node selection when handling client requests for a distributed file system (DFS) based on a dynamically determined performance index
US9405588B2 (en) Cloud resource allocation system and method
US11496413B2 (en) Allocating cloud computing resources in a cloud computing environment based on user predictability
KR101865318B1 (en) Burst mode control
CN106959894B (en) Resource allocation method and device
CN106453146B (en) Method, system, device and readable storage medium for allocating private cloud computing resources
CN106407190A (en) Event record querying method and device
CN108154298B (en) Distribution task allocation method and device, electronic equipment and computer storage medium
CN110740164B (en) Server determination method, regulation and control method, device, equipment and storage medium
CN109032800A (en) A kind of load equilibration scheduling method, load balancer, server and system
CN111737168A (en) Cache system, cache processing method, device, equipment and medium
CN112181664A (en) Load balancing method and device, computer readable storage medium and electronic equipment
CN108241535B (en) Resource management method and device and server equipment
JP7192645B2 (en) Information processing device, distributed processing system and distributed processing program
CN110096352A (en) Process management method, device and computer readable storage medium
CN116302477A (en) Dynamic allocation method and system of performance resources and related components
CN102200928A (en) Computation resource control apparatus, computation resource control method, and non-transitory computer-readable recording medium
JP2009252050A (en) System, method and program for server load management
CN111967938A (en) Cloud resource recommendation method and device, computer equipment and readable storage medium
CN108572871B (en) Resource allocation method and device, electronic equipment and storage medium
CN112910988A (en) Resource acquisition method and resource scheduling device
CN117076157B (en) Request management method, request management device, computer readable storage medium and computer equipment
CN117170881B (en) Resource regulation method and device, storage medium and processor
CN113849457B (en) Multi-data center dynamic copy placement method based on neural network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191231