CN111835854A

CN111835854A - Slow task prediction method based on grey prediction algorithm

Info

Publication number: CN111835854A
Application number: CN202010689837.7A
Authority: CN
Inventors: 杨海龙; 马冲; 李云春
Original assignee: Beihang University
Current assignee: Beihang University
Priority date: 2020-07-17
Filing date: 2020-07-17
Publication date: 2020-10-27
Anticipated expiration: 2040-07-17
Also published as: CN111835854B

Abstract

The invention provides a slow task prediction method based on a grey prediction algorithm, comprising the following steps. Step one: on a big data cluster, count the total number of tasks run and the number of slow tasks generated by each node per unit time. Step two: extract the slow task information of each node, establish a grey prediction model, and predict the density of slow tasks in a future time unit from the data collected over several time units. The method addresses the problem that slow tasks on current big data platforms degrade cluster performance. Unlike conventional slow task identification techniques, which suffer from poor timeliness and accuracy, the method uses a grey prediction algorithm to establish a differential equation model, analyzes the operating pattern of the cluster nodes, predicts the slow task situation of the cluster nodes over a future period, and provides effective optimization suggestions to cluster users.

Description

Slow task prediction method based on grey prediction algorithm

Technical Field

The invention relates to the field of slow task analysis of a big data platform, in particular to a slow task prediction method based on a grey prediction algorithm.

Background

The development of information technology drives continuous progress, and society has gradually entered the era of big data processing; the state also treats big data technology as a core strategy for seizing the commanding heights of economic and technological development. Big data applications tend to run in a distributed fashion on large-scale clusters or cloud platforms. A big data application may consist of thousands of processes, and the number of nodes can reach the thousands. Therefore, gaining a deep understanding of the running state of an application requires aggregating the performance data of a large number of nodes for correlation analysis.

Today's big data clusters often contain many heterogeneous nodes and process a wide variety of tasks, so performance problems, namely slow task problems, easily arise during program execution. Thousands of tasks may execute in a cluster within a given period, and their execution times can differ considerably; tasks whose execution time is much longer than that of most other tasks are called slow tasks. In the existing big data literature, a slow task is defined as a task whose execution time exceeds 1.5 times the median execution time of all tasks in the same stage, and this definition has been widely adopted in subsequent work. Slow tasks occupy a large amount of cluster resources, and their presence can significantly degrade the working performance of the cluster.

The identification and prediction of slow tasks gives cluster maintenance personnel a reference: new application processes can be preferentially assigned to nodes with fewer slow tasks, which effectively improves the utilization of cluster resources; meanwhile, nodes that are working abnormally can be found quickly from the density of their slow tasks, which guides the repair of those nodes.

Forecasting is the discipline that studies the theory, methods, evaluation and application of prediction. The basic principles behind prediction are mainly the inertia principle, the analogy principle and the correlation principle. The core problem of prediction is the technical method of prediction, that is, the mathematical model used to predict. Unlike most prediction methods, the grey prediction method works on a generated data sequence instead of the original data sequence and uses a differential equation to capture the underlying behaviour of the system, and can therefore achieve high precision.

The following problems mainly exist in the current slow task analysis algorithm:

The timeliness of the analysis algorithms is poor: many algorithms use data gathered while the cluster is running, so a result can only be calculated after the cluster has run for a long time, by which point slow tasks have already harmed the performance of the cluster. In addition, the accuracy of the analysis algorithms cannot be guaranteed: for nodes whose number of slow tasks fluctuates widely, the relation between the number of slow tasks and the number of tasks is not analyzed, so the prediction is poor.

Disclosure of Invention

To solve the above problems, the invention processes the data by counting the number of slow tasks and the number of tasks of each node separately, and takes the ratio of the two as the quantity to be predicted. This ratio expresses the density of slow tasks accurately and also avoids the influence of fluctuations in the number of slow tasks across different times. A model is then established with the grey prediction method, the intrinsic relation between the time period and the slow task situation is calculated and expressed as an equation, and finally the predicted value of the slow task density in a future time period is calculated.

The invention provides a slow task prediction method based on a gray prediction algorithm, which comprises the following steps of:

step one: on a big data cluster, counting the total number of tasks run and the number of slow tasks generated by each node in unit time;

step two: extracting information of node slow tasks, establishing a grey prediction model, and predicting the density of the slow tasks in a future time unit according to the collected data of a plurality of time units;

the first step comprises the following steps:

step (1.1) counting task running information on a big data platform;

in the cluster, collecting the starting time and the ending time of each task and the node number where the task runs according to a certain time interval as a time unit;

step (1.2) calculating the execution time of each task, and counting the total number of tasks of each node;

calculating the execution time of each task from its start time and end time, and counting the total number of tasks of each node;

step (1.3) identifying the slow task number of the corresponding node according to a slow task judgment rule;

following the common convention in the slow task literature, taking 1.5 times the median duration of all tasks as the slow task judgment threshold, judging a task whose running time exceeds the threshold to be a slow task, and counting the number of slow tasks of each node accordingly;

the second step comprises the following steps:

step (2.1) extracting slow task information;

in different time units, the use conditions of the clusters are different, the number of slow tasks is greatly changed, and the basic requirements of the gray prediction algorithm are difficult to meet due to excessive fluctuation of numerical values; in order to eliminate the influence of different use conditions and meet the condition of a gray prediction algorithm, the ratio of the number of slow tasks to the total number of tasks is adopted to participate in modeling;

step (2.2) transformation of input data;

the ratio between adjacent terms of a sequence is called the level ratio; grey prediction requires the level ratio of the sequence to fall within an admissible range, and if this condition cannot be met, the original sequence must first be transformed;

step (2.3) establishing a GM (1,1) prediction model;

GM(1,1) denotes a grey model that is a first-order differential equation containing only one variable. The main characteristic of grey prediction is that the model uses a generated data sequence instead of the original data sequence: in the modeling process the original data are accumulated, and modeling is carried out once an approximately exponential sequence has been obtained.

step (2.4) substituting the time points to be predicted into the model and calculating the prediction result;

modeling finally yields an equation that fits the original sequence, and substituting the time points to be predicted into it gives the predicted values; these values are still in accumulated, exponentially growing form, so they must be differenced term by term to obtain the actual prediction result.

Further, the step (1.3) is executed as follows:

substep (3-1): sorting the tasks by the calculated task duration, counting the total number of tasks across all nodes, and taking the duration at the position of half the total number of tasks in the sorted order as the median task duration;

substep (3-2): multiplying the median task duration by 1.5 to obtain the threshold for judging slow tasks;

substep (3-3): examining the source data again item by item, and whenever a task's duration exceeds the threshold, adding 1 to the slow task count of the corresponding node, so as to obtain the number of slow tasks of each node.
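
As an illustration of substeps (3-1) to (3-3), the following Python sketch counts the slow tasks of each node for one time unit. It is a minimal sketch and not the implementation of the invention: the function name count_slow_tasks, the (node, start, end) record layout and the factor parameter are assumptions introduced here, and the median is read literally as the duration at the position of half the total number of tasks in the sorted order.

```python
from collections import Counter


def count_slow_tasks(tasks, factor=1.5):
    """Count slow tasks per node for one time unit.

    tasks: list of (node_id, start_time, end_time) records (assumed layout).
    Returns (threshold, {node_id: (slow_count, total_count)}).
    """
    if not tasks:
        raise ValueError("no task records in this time unit")

    # Step (1.2): execution time of each task.
    durations = [(node, end - start) for node, start, end in tasks]

    # Substep (3-1): sort by duration, take the value at half the total count.
    ordered = sorted(d for _, d in durations)
    median = ordered[len(ordered) // 2]

    # Substep (3-2): the slow task threshold is `factor` times the median.
    threshold = factor * median

    # Substep (3-3): scan the records again and count per node.
    total = Counter(node for node, _ in durations)
    slow = Counter(node for node, d in durations if d > threshold)
    return threshold, {node: (slow[node], total[node]) for node in total}
```

For example, for five tasks with durations 10, 10, 12, 10 and 30, the median is 10 and the threshold is 15, so only the task with duration 30 is counted as slow.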

Further, the considerations behind extracting slow task information in step (2.1) are as follows:

predicting the number of slow tasks is the ultimate goal of the algorithm, but it is difficult to do directly, or gives poor results, because cluster usage differs greatly from one time to another, so the number of slow tasks varies widely and shows no obvious regularity.

To predict the density of slow tasks more accurately, an additional quantity must be established. The mean and the variance of the task duration are good quantitative indicators, but they cannot be obtained at the start of a unit time interval, which greatly reduces the timeliness of the prediction algorithm. The invention therefore describes the density of slow tasks by the ratio of the number of slow tasks to the number of tasks. Specifically, the number of slow tasks and the number of tasks of each node are counted separately, and their ratio is calculated for each node. This ratio is used as the input data of the prediction model.
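
The effect of using the ratio can be illustrated with a short Python sketch. The function name slow_task_ratios, the input layout and the sample counts are assumptions made here for illustration; in particular, the source does not state how a time unit without any tasks should be treated, and the sketch simply rejects it.

```python
def slow_task_ratios(counts_per_unit):
    """Convert the per-time-unit (slow_count, total_count) pairs of one node
    into the ratio sequence used as model input."""
    ratios = []
    for slow, total in counts_per_unit:
        if total <= 0:
            # Assumption: a time unit without tasks has no defined ratio.
            raise ValueError("time unit with no tasks: ratio is undefined")
        ratios.append(slow / total)
    return ratios


# Hypothetical counts for one node over four time units: the raw slow task
# count swings between 3 and 12, while the ratio stays close to 0.05.
history = [(5, 100), (12, 240), (3, 60), (7, 125)]
print(slow_task_ratios(history))
```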

Further, the condition that must be satisfied in step (2.2) before grey prediction can be carried out is as follows:

let the reference data be x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), and calculate the level ratio of the sequence:

λ(k)＝x⁽⁰⁾(k-1)/x⁽⁰⁾(k),k＝2,3,…,n.

If all the level ratios λ(k) fall within the admissible coverage

Θ＝(e^(-2/(n+1)),e^(2/(n+1))),

then the sequence x⁽⁰⁾ can be used as data for grey prediction with the model GM(1,1). Otherwise, the sequence x⁽⁰⁾ must be transformed so that it falls within the admissible coverage, namely by taking a suitable constant c and performing the translation transformation:

y⁽⁰⁾(k)＝x⁽⁰⁾(k)+c,k＝1,2,…,n,

so that the level ratio of the sequence y⁽⁰⁾＝(y⁽⁰⁾(1),y⁽⁰⁾(2),…,y⁽⁰⁾(n)) satisfies:

λ_y(k)＝y⁽⁰⁾(k-1)/y⁽⁰⁾(k)∈Θ,k＝2,3,…,n.
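
The level ratio check and the translation transformation can be sketched in Python as follows. The sketch assumes the usual GM(1,1) definitions, namely the level ratio λ(k)＝x⁽⁰⁾(k-1)/x⁽⁰⁾(k) and the admissible coverage (e^(-2/(n+1)),e^(2/(n+1))). The source only requires "a suitable constant c" and does not say how to choose it, so the fixed-step search below, together with all function and parameter names, is an assumption made for illustration.

```python
import math


def level_ratios(x):
    """Level ratios x(k-1)/x(k) for k = 2..n (all terms must be non-zero)."""
    return [x[k - 1] / x[k] for k in range(1, len(x))]


def admissible_interval(n):
    """Admissible coverage for a sequence of n data points."""
    return math.exp(-2 / (n + 1)), math.exp(2 / (n + 1))


def is_admissible(x):
    low, high = admissible_interval(len(x))
    return all(low < r < high for r in level_ratios(x))


def translate_until_admissible(x, step=None, max_iter=1000):
    """Find c >= 0 such that y(k) = x(k) + c passes the level ratio check.

    Assumed strategy: increase c in fixed steps until the check passes.
    Returns (c, y).
    """
    if step is None:
        step = max(abs(v) for v in x) or 1.0
    c = 0.0
    for _ in range(max_iter):
        y = [v + c for v in x]
        # Positivity is checked first so that no division by zero can occur.
        if all(v > 0 for v in y) and is_admissible(y):
            return c, y
        c += step
    raise ValueError("no suitable translation constant found")
```

For the ratio sequence (0.05, 0.06, 0.05, 0.07, 0.06), the level ratio 0.05/0.07 lies below the lower bound e^(-1/3), so the sequence is not admissible as it stands; after translation all level ratios move closer to 1 and the check passes. Predictions made on the translated sequence must have c subtracted again to return to the original scale.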

further, in the step (2.3), the grey prediction model modeling calculation process is as follows:

(2.3.1) analyzing the level ratio of the input data, and if the level ratio meets the condition of a gray prediction algorithm, directly modeling; if the gray prediction algorithm condition is not met, translation transformation is carried out, and the level ratio is enabled to fall within the acceptable coverage;

(2.3.2) obtaining a number sequence of an approximate exponential law through accumulation operation;

(2.3.3) calculating a symbolic solution of a differential equation through the fitting parameters;

(2.3.4) solving a sequence of predicted values, wherein the sequence of predicted values comprises the predicted values of a known time period and a future time period, and the predicted values at the time are also a sequence of exponential law;

and (2.3.5) calculating a final result, namely a slow task prediction value in a future time period through differential operation.

Preferably, the prediction method of the gray model GM (1,1) in step (2.3) is:

known reference data column x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), accumulating 1 time to generate a sequence:

x⁽¹⁾＝(x⁽¹⁾(1),x⁽¹⁾(2),…,x⁽¹⁾(n))

＝(x⁽⁰⁾(1),x⁽⁰⁾(1)+x⁽⁰⁾(2),…,x⁽⁰⁾(1)+…+x⁽⁰⁾(n)),

in the formula: x⁽¹⁾(k)＝x⁽⁰⁾(1)+x⁽⁰⁾(2)+…+x⁽⁰⁾(k),k＝1,2,…,n.

The mean generation sequence of x⁽¹⁾ is:

z⁽¹⁾＝(z⁽¹⁾(2),z⁽¹⁾(3),…,z⁽¹⁾(n)),

in the formula: z⁽¹⁾(k)＝0.5x⁽¹⁾(k)+0.5x⁽¹⁾(k-1),k＝2,3,…,n.

Establishing a grey differential equation:

x⁽⁰⁾(k)+az⁽¹⁾(k)＝b,k＝2,3,…,n,

the corresponding whitening differential equation is:

dx⁽¹⁾/dt+ax⁽¹⁾(t)＝b.

For convenience of presentation, let u＝[a,b]^T and Y＝[x⁽⁰⁾(2),x⁽⁰⁾(3),…,x⁽⁰⁾(n)]^T, and denote by B the matrix with n-1 rows and 2 columns whose row for index k (k＝2,3,…,n) is

[-z⁽¹⁾(k),1].

Then, by the least squares method, the estimate of u that minimizes J(u)＝(Y-Bu)^T(Y-Bu) is:

û＝[â,b̂]^T＝(B^T B)^(-1)B^T Y.

Solving the whitening differential equation then gives the predicted value:

x̂⁽¹⁾(k+1)＝(x⁽⁰⁾(1)-b̂/â)e^(-âk)+b̂/â,k＝0,1,…,n-1,…,

and also

x̂⁽⁰⁾(k+1)＝x̂⁽¹⁾(k+1)-x̂⁽¹⁾(k),k＝1,2,…,n-1,….
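
The GM(1,1) calculation can be sketched in plain Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the invention: it uses the standard GM(1,1) time response x̂⁽¹⁾(k+1)＝(x⁽⁰⁾(1)-b/a)e^(-ak)+b/a followed by term-by-term differencing, solves the two-parameter least squares problem in closed form, and the names gm11_fit and gm11_predict, the minimum length check and the handling of a≈0 are choices made here for illustration.

```python
import math


def gm11_fit(x0):
    """Estimate the GM(1,1) parameters (a, b) from the original sequence x0."""
    n = len(x0)
    if n < 3:
        raise ValueError("need at least 3 data points to fit a and b")

    # Accumulated generating sequence x1(k) = x0(1) + ... + x0(k).
    x1, running = [], 0.0
    for value in x0:
        running += value
        x1.append(running)

    # Mean generation sequence z1(k) = 0.5*x1(k) + 0.5*x1(k-1), k = 2..n.
    z1 = [0.5 * (x1[k] + x1[k - 1]) for k in range(1, n)]
    y = x0[1:]

    # Least squares for x0(k) = -a*z1(k) + b, i.e. u = (B^T B)^-1 B^T Y,
    # written out for the 2x2 case so that no matrix library is needed.
    m = len(z1)
    sz = sum(z1)
    szz = sum(z * z for z in z1)
    sy = sum(y)
    szy = sum(z * v for z, v in zip(z1, y))
    det = m * szz - sz * sz
    if det == 0:
        raise ValueError("degenerate data: B^T B is singular")
    a = (sz * sy - m * szy) / det
    b = (szz * sy - sz * szy) / det
    return a, b


def gm11_predict(x0, steps=1):
    """Return (fitted, future): model values for the known periods and
    predictions for the next `steps` periods."""
    a, b = gm11_fit(x0)
    n = len(x0)

    def x1_hat(k):
        # Time response x1_hat(k+1) for k = 0, 1, 2, ...
        if abs(a) < 1e-12:
            return x0[0] + b * k  # limit of the formula as a -> 0
        return (x0[0] - b / a) * math.exp(-a * k) + b / a

    # Difference the accumulated predictions to return to the original scale.
    restored = [x0[0]] + [x1_hat(k) - x1_hat(k - 1) for k in range(1, n + steps)]
    return restored[:n], restored[n:]
```

Calling gm11_predict(ratios, steps=1) on the ratio sequence of one node returns the fitted values for the known time units and the predicted ratio for the next time unit. The closed-form solution avoids a matrix library; an equivalent implementation could build B and Y explicitly and call a general least squares routine.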

Further, the method for counting task information on a big data platform in the step (1.1) comprises the following steps:

the task information mainly comes from log analysis. The log files generated while a big data platform such as Hadoop or Spark is running record the execution of each task on each node. The correspondence among nodes, tasks and task durations can be obtained by parsing the log files.

Further, the role of the ratio of the number of slow tasks to the number of tasks in step (2.1) is as follows:

to improve accuracy and practicality, the ratio of the number of slow tasks to the number of tasks is established as the quantity that participates in the model calculation. Although this quantity does not directly predict the number of slow tasks, it reflects the density of slow tasks well. A larger predicted value indicates poorer performance of the corresponding node, so the user should assign fewer program processes to that node and more processes to nodes with lower predicted values, which ultimately improves cluster performance.
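
One simple way to turn the predicted ratios into an allocation suggestion is to order the nodes by predicted value. The source only states that nodes with lower predicted values should receive more processes; the ranking rule, the function name and the sample values below are assumptions made for illustration.

```python
def rank_nodes_for_allocation(predicted_ratio):
    """Order node ids from lowest to highest predicted slow task ratio.

    Ties are broken by node id so that the result is deterministic.
    """
    return sorted(predicted_ratio, key=lambda node: (predicted_ratio[node], node))


# Hypothetical predicted ratios for three nodes.
predicted = {"node-1": 0.08, "node-2": 0.03, "node-3": 0.05}
print(rank_nodes_for_allocation(predicted))  # ['node-2', 'node-3', 'node-1']
```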

Advantageous effects:

on a big data platform, clusters often need to handle many kinds of processes and node performance varies, with the differences being more pronounced in heterogeneous clusters. Under the influence of various factors slow tasks keep emerging, yet the operating state of a given node at different times is relatively stable. Based on this regularity, the invention uses the grey prediction algorithm to establish a differential equation model of slow task generation for each node, analyzes historical data, and can finally predict the slow task situation of the node over a future period. The prediction result can guide the optimization of the cluster: more processes are allocated to nodes predicted to have sparse slow tasks, and fewer processes to nodes predicted to have dense slow tasks, thereby effectively improving the working performance of the cluster.

Drawings

FIG. 1 is a block diagram of a system for implementing a slow task prediction method based on a gray prediction algorithm in accordance with the present invention;

FIG. 2 is a flow chart of a slow task prediction method based on a gray prediction algorithm according to the present invention;

FIG. 3 is a flow chart of the gray prediction model modeling calculation of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.

The basic idea of the invention is to analyze the slow task situation by parsing the logs of the big data platform, extract data features from the long-term occurrence of slow tasks, and feed the data into the grey prediction model to predict the slow task density over a future period, thereby providing process allocation suggestions to the cluster manager and ultimately improving the working performance of the cluster.

FIG. 1 is a schematic diagram of the system architecture for implementing the slow task prediction method based on the grey prediction algorithm of the present invention. A big data platform such as Spark or Hadoop consists of a large number of nodes and generates logs at run time that record the execution of programs. The invention obtains the log information from the big data platform for analysis. The main function of the log analyzer is to collect the start time and end time of each task on each node and pass this task information to the slow task analyzer. The slow task analyzer is responsible for calculating the slow task threshold, counting the number of tasks and the number of slow tasks of each node, and calculating the ratio of the number of slow tasks to the number of tasks. The grey prediction model calculation module first processes the slow task information so that the data meet the requirements of the grey prediction model, in particular by bringing the level ratio of the data sequence into the admissible range. Modeling is then carried out, the ID of the time period to be predicted is substituted in, and the prediction result is calculated.

FIG. 2 is a flowchart of the slow task prediction method based on the gray prediction algorithm of the present invention, and the detailed flow includes the following steps:

the first step comprises the following steps:

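step (1.1) counting task running information on a big data platform;

in the cluster, the start time and end time of each task and the number of the node on which it runs are collected, using a certain time interval as the time unit.

Step (1.2) calculating the execution time of each task, and counting the total number of tasks of each node;

the execution time of each task is calculated from its start time and end time, and the total number of tasks of each node is counted.

Step (1.3) identifying the number of slow tasks of each node according to the slow task judgment rule;

following the common convention in the slow task literature, 1.5 times the median duration of all tasks is taken as the slow task judgment threshold, a task whose running time exceeds the threshold is judged to be a slow task, and the number of slow tasks of each node is counted accordingly. The specific steps are as follows:

Substep (3-1): sort the tasks by the calculated task duration, count the total number of tasks across all nodes, and take the duration at the position of half the total number of tasks in the sorted order as the median task duration.

Substep (3-2): multiply the median task duration by 1.5 to obtain the threshold for judging slow tasks.

Substep (3-3): examine the source data again item by item; whenever a task's duration exceeds the threshold, add 1 to the slow task count of the corresponding node, and so obtain the number of slow tasks of each node.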

The second step comprises the following steps:

step (2.1) extracting slow task information;

in different time units, the use conditions of the clusters are different, the number of slow tasks is greatly changed, and the basic requirements of the gray prediction algorithm are difficult to meet due to the excessive fluctuation of numerical values. In order to eliminate the effect of different use cases and meet the condition of the gray prediction algorithm, the ratio of the number of slow tasks to the total number of tasks is adopted to participate in modeling.

Step (2.2) transformation of input data;

the ratio between adjacent terms of a sequence is called the level ratio; grey prediction requires the level ratio of the sequence to fall within an admissible range, and if this condition cannot be met, the original sequence must first be transformed. The condition that must be satisfied before grey prediction can be carried out is:

let the reference data be x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), and calculate the level ratio of the sequence:

λ(k)＝x⁽⁰⁾(k-1)/x⁽⁰⁾(k),k＝2,3,…,n.

If all the level ratios λ(k) fall within the admissible coverage

Θ＝(e^(-2/(n+1)),e^(2/(n+1))),

then the sequence x⁽⁰⁾ can be used as data for grey prediction with the model GM(1,1). Otherwise, the sequence x⁽⁰⁾ must be transformed so that it falls within the admissible coverage, namely by taking a suitable constant c and performing the translation transformation:

y⁽⁰⁾(k)＝x⁽⁰⁾(k)+c,k＝1,2,…,n,

so that the level ratio of the sequence y⁽⁰⁾ satisfies λ_y(k)＝y⁽⁰⁾(k-1)/y⁽⁰⁾(k)∈Θ,k＝2,3,…,n.

step (2.3) establishing a GM (1,1) prediction model;

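GM(1,1) denotes a grey model that is a first-order differential equation containing only one variable. The main characteristic of grey prediction is that the model uses a generated data sequence instead of the original data sequence: in the modeling process the original data are accumulated, and modeling is carried out once an approximately exponential sequence has been obtained. The prediction method of the grey model GM(1,1) is as follows:

given the reference data sequence x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), accumulating once generates the sequence:

x⁽¹⁾＝(x⁽¹⁾(1),x⁽¹⁾(2),…,x⁽¹⁾(n)),

in the formula: x⁽¹⁾(k)＝x⁽⁰⁾(1)+x⁽⁰⁾(2)+…+x⁽⁰⁾(k),k＝1,2,…,n.

The mean generation sequence of x⁽¹⁾ is:

z⁽¹⁾＝(z⁽¹⁾(2),z⁽¹⁾(3),…,z⁽¹⁾(n)),

in the formula: z⁽¹⁾(k)＝0.5x⁽¹⁾(k)+0.5x⁽¹⁾(k-1),k＝2,3,…,n.

Establishing a grey differential equation:

x⁽⁰⁾(k)+az⁽¹⁾(k)＝b,k＝2,3,…,n,

the corresponding whitening differential equation is:

dx⁽¹⁾/dt+ax⁽¹⁾(t)＝b.

With u＝[a,b]^T, Y＝[x⁽⁰⁾(2),x⁽⁰⁾(3),…,x⁽⁰⁾(n)]^T and B the matrix whose row for index k (k＝2,3,…,n) is [-z⁽¹⁾(k),1], the least squares estimate of u is û＝[â,b̂]^T＝(B^T B)^(-1)B^T Y.

Solving the whitening differential equation then gives the predicted value:

x̂⁽¹⁾(k+1)＝(x⁽⁰⁾(1)-b̂/â)e^(-âk)+b̂/â,k＝0,1,…,n-1,…,

and also

x̂⁽⁰⁾(k+1)＝x̂⁽¹⁾(k+1)-x̂⁽¹⁾(k),k＝1,2,…,n-1,….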

Modeling finally yields an equation that fits the original sequence, and substituting the time points to be predicted into it gives the predicted values. These values are still in accumulated, exponentially growing form, so they must be differenced term by term to obtain the actual prediction result.

FIG. 3 is a flow chart of the gray prediction model modeling calculation of the present invention, the detailed flow includes the steps of:

(1) analyzing the level ratio of input data, and if the level ratio meets the condition of a gray prediction algorithm, directly modeling; if the gray prediction algorithm condition is not met, translation transformation is carried out, and the level ratio is enabled to fall within the acceptable coverage.

(2) Performing the accumulation operation to obtain a sequence that approximately follows an exponential law.

(3) Fitting the parameters and calculating the symbolic solution of the differential equation.

(4) Solving for the sequence of predicted values, which contains the predicted values for both the known time periods and the future time periods; at this point the predicted values still form a sequence following an exponential law.

(5) Performing the difference operation to calculate the final result, namely the slow task predicted value for the future time period.

Matters not described in detail in the present invention are well known to those skilled in the art.

The above description is only a part of the embodiments of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims

1. A slow task prediction method based on a gray prediction algorithm is characterized by comprising the following steps:

step one: on a big data cluster, counting the total number of tasks run and the number of slow tasks generated by each node in unit time;

step two: extracting information of node slow tasks, establishing a grey prediction model, and predicting the density of the slow tasks in a future time unit according to collected data of a plurality of time units;

the first step comprises the following steps:

step (1.1) counting task running information on a big data platform;

in the cluster, collecting the starting time and the ending time of each task and the node number where the task runs according to a preset time interval as a time unit;

step (1.2) calculating the execution time of each task, and counting the total number of tasks of each node;

calculating the execution time of each task according to the starting time and the ending time, and counting the total number of tasks of each node;

step (1.3) identifying the number of slow tasks of the corresponding node according to a slow task judgment rule;

taking 1.5 times the median duration of all tasks as the slow task judgment threshold, judging a task whose running time exceeds the threshold to be a slow task, and counting the number of slow tasks of each node according to the slow task judgment threshold;

the second step comprises the following steps:

step (2.1) extracting slow task information;

in different time units, the use conditions of the clusters are different, and the ratio of the slow task quantity to the total task quantity is adopted to participate in modeling;

step (2.2) transformation of input data;

the ratio between adjacent terms of the sequence is called the level ratio, and grey prediction requires that the level ratio of the sequence fall within the admissible coverage

Θ＝(e^(-2/(n+1)),e^(2/(n+1))),

where n is the total number of data points; if the condition cannot be met, the original sequence must be transformed;

step (2.3) establishing a grey prediction model;

GM(1,1) denotes a grey model that is a first-order differential equation containing only one variable; grey prediction uses a generated data sequence rather than the original data sequence, and in the modeling process the original data are accumulated and modeling is carried out once an approximately exponential sequence has been obtained;

modeling finally yields an equation that fits the original sequence, the time points to be predicted are substituted into it to calculate the predicted values, the predicted values are still in exponential growth form, and the actual prediction result is obtained by differencing term by term.

2. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that:

the step (1.3) is executed by the following specific steps:

substep (3-1): sorting the tasks by the calculated task duration, counting the total number of tasks across all nodes, and taking the duration at the position of half the total number of tasks in the sorted order as the median task duration;

substep (3-2): examining the counted data again item by item, and whenever a task's duration exceeds the threshold, adding 1 to the slow task count of the corresponding node, so as to obtain the number of slow tasks of each node.

3. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that:

the step (2.1) of extracting slow task information specifically comprises the following steps:

and respectively counting the slow task quantity and the task quantity of each node, calculating the ratio of the slow task quantity and the task quantity of each node, and taking the ratio as input data of a prediction model.

4. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that:

the condition that must be satisfied in step (2.2) before grey prediction can be carried out is as follows:

let the reference data be x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), and calculate the level ratio of the sequence:

λ(k)＝x⁽⁰⁾(k-1)/x⁽⁰⁾(k),k＝2,3,…,n,

where x⁽⁰⁾(k) represents the kth data point and n represents the total number of data points;

if all the level ratios λ(k) fall within the admissible coverage

Θ＝(e^(-2/(n+1)),e^(2/(n+1))),

then the sequence x⁽⁰⁾ can be used as data for grey prediction with the model GM(1,1); otherwise, the sequence x⁽⁰⁾ must be transformed so that it falls within the admissible coverage, namely by taking a constant c and performing the translation transformation:

y⁽⁰⁾(k)＝x⁽⁰⁾(k)+c,k＝1,2,…,n,

so that the level ratio of the sequence y⁽⁰⁾ falls within Θ.

5. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that:

the prediction method of the gray model GM (1,1) in the step (2.3) comprises the following steps:

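given the reference data sequence x⁽⁰⁾＝(x⁽⁰⁾(1),x⁽⁰⁾(2),…,x⁽⁰⁾(n)), accumulating once generates the sequence:

x⁽¹⁾＝(x⁽¹⁾(1),x⁽¹⁾(2),…,x⁽¹⁾(n)),

in the formula: x⁽¹⁾(k)＝x⁽⁰⁾(1)+x⁽⁰⁾(2)+…+x⁽⁰⁾(k),k＝1,2,…,n;

the mean generation sequence of x⁽¹⁾ is:

z⁽¹⁾＝(z⁽¹⁾(2),z⁽¹⁾(3),…,z⁽¹⁾(n)),

in the formula: z⁽¹⁾(k)＝0.5x⁽¹⁾(k)+0.5x⁽¹⁾(k-1),k＝2,3,…,n;

establishing a grey differential equation as follows, wherein a and b are undetermined coefficients calculated by substituting the data:

x⁽⁰⁾(k)+az⁽¹⁾(k)＝b,k＝2,3,…,n,

the corresponding whitening differential equation is:

dx⁽¹⁾/dt+ax⁽¹⁾(t)＝b;

for convenience of illustration, let u＝[a,b]^T and Y＝[x⁽⁰⁾(2),x⁽⁰⁾(3),…,x⁽⁰⁾(n)]^T, and denote by B the matrix with n-1 rows and 2 columns whose row for index k (k＝2,3,…,n) is [-z⁽¹⁾(k),1]; by the least squares method, the estimate of u is û＝[â,b̂]^T＝(B^T B)^(-1)B^T Y;

then, solving the equation to obtain a predicted value:

x̂⁽¹⁾(k+1)＝(x⁽⁰⁾(1)-b̂/â)e^(-âk)+b̂/â,k＝0,1,…,n-1,…,

and also

x̂⁽⁰⁾(k+1)＝x̂⁽¹⁾(k+1)-x̂⁽¹⁾(k),k＝1,2,…,n-1,….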

6. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that: the method for counting the task running information on the big data platform in the step (1.1) specifically comprises the following steps:

the task information is from log analysis, and a log file generated by a big data platform in the running process records the execution condition of each task of each node, including log files of Hadoop and Spark platforms; the corresponding relation among the nodes, the tasks and the task duration can be obtained by analyzing the log file.

7. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that:

the ratio of the number of slow tasks to the number of tasks in the step (2.1) reflects the density of slow tasks; the larger the predicted value of this ratio, the poorer the performance of the corresponding node, so that the user allocates fewer program processes to that node and more processes to nodes with lower predicted values, thereby improving the cluster performance.

8. The slow-task prediction method based on gray prediction algorithm according to claim 1, characterized in that: in the step (2.3), the grey prediction model modeling calculation process is as follows: