CN106502799A

CN106502799A - A kind of host load prediction method based on long memory network in short-term

Info

Publication number: CN106502799A
Application number: CN201611271570.XA
Authority: CN
Inventors: 周余; 宋斌斌; 于耀
Original assignee: Nanjing University
Current assignee: Nanjing University
Priority date: 2016-12-30
Filing date: 2016-12-30
Publication date: 2017-03-15

Abstract

The present invention is a kind of using long load predicting method of the memory network to cloud computing center main frame in short-term, belongs to cloud computing and deep learning field.The present invention solve problem be：In cloud computing environment, load on host computers changes violent problem.The present invention proposes the method that a kind of following load on host computers of prediction has improved cloud computing system scheduling.The core of the main algorithm of the present invention is to be modeled the relation between historical data and Future Data using the characteristic of long memory network in short-term, and there is network longterm memory function, output to be the load of prediction.The present invention utilizes the prediction loaded by the method for neutral net, by, compared with existing certain methods at present, method proposed by the present invention can be obtained and more accurately be predicted the outcome.

Description

A kind of host load prediction method based on long memory network in short-term

Technical field

The invention belongs to cloud computing and deep learning field, mainly for load on host computers change in cloud computing environment acutely, A kind of the features such as noise is big, it is proposed that new method based on the long host load prediction of memory network in short-term.

Background technology

In cloud computation data center, the change of load is generally very fierce, in order to tackle various complex situations in time, I Need to be predicted the situation of various resources in data center.In cloud computation data center, the service condition of cpu resource is anti- The ruuning situation of application program on main frame is answered, therefore when scheduling virtual machine is carried out, cpu resource is the resource of overriding concern. When the load of certain main frame exceedes threshold value, the performance for running virtual machine on the host can be severely impacted, and therefore may be used With by some virtual machine (vm) migrations on this main frame to other free hosts, so as to mitigate the load of main frame；When on some main frames Load be less than threshold value when, the virtual machine on these main frames can be merged into other main frames or closing, so as to reduce cloud computing The energy consumption at center.

Loading condition of some load predicting methods that presently, there are mainly for main frame in grid (Grid) calculating, but In cloud computing environment, the type of main frame is different, and running on main frame for task is also different, and this is resulted in cloud environment Load on host computers situation of change is more complicated, and certain methods before can not be entered to the loading condition of main frame in cloud environment well Row prediction.

In order to solve problems of the prior art, this paper presents a kind of based on long memory network (Long in short-term Short-Term Memory) method the context of load data is modeled, output is and final predicts the outcome.

Content of the invention

The purpose of the present invention：For the management that various types of main frames are carried out present cloud computing center centralization, unified It is supplied to various types of users to use, causes under cloud computing environment, running on each main frame for task is different, which is born The more complicated situation of the situation of change of load, it is proposed that a kind of can accurately predict following a period of time in load situation of change Method, is the scheduling virtual machine of next step, and resource integrated management is laid a good foundation.

For problems of the prior art, the present invention proposes a kind of new host load prediction method, Neng Gouyou Imitated is predicted to the load on host computers in following a period of time, and whole system is made up of following two main modulars：

Module one, the collection cloud computing load on host computers historical data of month, very long load sequence are divided into a lot of solid Sizing, continuous historical data and prediction data, select suitable model to be modeled data.

Module two, make using training set long memory network in short-term learn corresponding parameter, and selected by cross validation collection The best model of generalization is selected, output is final predicting the outcome.

For module one, related One-dimension Time Series, Ke Yitong before and after in the middle of cloud computing, the load of main frame is substantially The method of a lot of time series analyses is crossed the prediction that loaded.In this module, we pass through to fix greatly load partition Little subsequence is modeled so as to network.The input of network moment t is the historical data x=(d of one group of load_t-1, d_t-2..., d_t-n), wherein each x represent t before length for n load histories sequence data.Corresponding it is output asWherein o be length for length after t for the future load of m prediction data.Assume true Data be y=(d_t+1, d_t+2..., d_t+m), the problem that we want to solve is to find a history value between future value Mapping f：

Intuitively, closer to the history value of current time t, more related to predictive value.But, distant with current time Historic load be also possible to provide some useful information, the trend for such as changing etc., help that we are loaded is pre- Survey.So we want to construct a model, in-plant load information can be utilized, can also utilize load letter at a distance Breath.

f(x；N)=f (g₁(x_t-1, x_t-2..., x_t-k), g₂(x_t-(k+1)..., x_t-n))

Recognition with Recurrent Neural Network (Recurrent Neural Networks) is based on this simple thought, and tradition Unlike propagated forward network, Recognition with Recurrent Neural Network has an intermediateness layer by the mechanism that feeds back so that it is very suitable Preferably doing time series modeling.In addition, in order to be modeled to longterm memory, we are by the intermediateness layer of Recognition with Recurrent Neural Network Long mnemon in short-term is replaced with, the framework of whole model is as shown in Figure 1.

Module two carries out load estimation using long memory network in short-term.

Once being previously used in view of certain information, forget that the state in this Geju City may have for network very much With.In addition, compared to manually determining when to delete old state, it is intended that network oneself can decide when to delete Remove.In this case, long memory network in short-term is undoubtedly one and selects well.

The framework of long memory network in short-term is as illustrated in fig. 2, it is assumed that the input of network moment t is historic load x_t, output For h_t(final prediction o is obtained through a full articulamentum again_t).Context in order to enable the network to load sequence is entered Row modeling, we define a temporary location s_tFor store-memory.Meanwhile, in order to be able to enable load estimation to get long-time Information, we are using the long feature definitions of memory network in short-term three gate functions (input layer, intermediate layer, output layers)：

Carry out the transmission of control information.Wherein, σ is sigmoid nonlinear activation functions, the weight and partially of W, U, b for network Move.Then, we obtain finally entering for each moment network, intermediate value, output on this basis：

i_t=σ (b+Ux_t+Wh_t-1)

When output is calculated, we select ReLu nonlinear activation functions to carry out the training of accelerator nerve network.

Then for a certain group of training data (x, y), its corresponding cost function is defined as by we Wherein N be prediction length, o_iIt is predictive value, y_iIt is actual value.

Description of the drawings

Illustrate and technical solution of the present invention is further understood for providing, and constitute a part for description, with The enforcement of the present invention for explaining technical scheme, does not constitute the restriction to technical solution of the present invention together.Accompanying drawing It is described as follows：

Fig. 1 is the Organization Chart of whole model.

Fig. 2 is the Organization Chart of long memory network in short-term.

Specific embodiment

Describe embodiments of the present invention below with reference to accompanying drawing in detail, whereby to the present invention how application technology means Carry out solve problem, and reach technique effect realize that process can fully understand and implement according to this.Illustrate in the flow process of accompanying drawing Step can be executed in the different computer systems of such as one group of computer executable instructions, and, although in flow charts Logical order is shown, but in some cases, shown or described step can be executed with the order being different from herein.

The implementation procedure of algorithm is specifically described below

Step 1-1, data acquisition.The historical data of n days load on host computers before collection.Using the CPU detection instruments on main frame Every 5 minutes obtain CPU load data, which is divided into according to a certain percentage Training Set (training dataset), Validation Set (cross validation collection) and Test Set (test data set), and very long load sequence is divided into a lot of solid Sizing, continuous historical data and prediction data.

Step 1-2, e-learning.The long study of memory network in short-term is made using training set to corresponding parameter, by most Littleization cost function, the weight coefficient for obtaining and the value of bias term, and the best model of generalization is selected by cross validation collection.

Step 1-3, carry out load estimation.Using the parameter for obtaining network from the study of long short term memory networking, input data Collection, output are load on host computers.

Those skilled in the art should be understood that the system structure and each step of the above-mentioned present invention can be with general Realizing, they can concentrate on single computing device computing device, or be distributed in the net of multiple computing devices compositions On network, alternatively, they can be realized with the executable program code of computing device, it is thus possible to be stored in depositing Executed by computing device in storage device, or they are fabricated to each integrated circuit modules respectively, or by them Multiple modules or step are fabricated to single integrated circuit module to realize.So, the present invention is not restricted to any specific hardware Combine with software.

Although the embodiment shown or described by the present invention is as above, described content is only to facilitate understand this The embodiment that invents and adopt, is not limited to the present invention.Technical staff in any the technical field of the invention, Without departing from disclosed herein spirit and scope on the premise of, can implement formal and details on do any repairing Change and change, but the scope of patent protection of the present invention, still must be defined by the scope of which is defined in the appended claims.

Claims

1. a kind of method and system based on the long host load prediction of memory network in short-term, is characterized in that comprising following main step Suddenly：

Step one, the collection cloud computing load on host computers historical data of month, which is divided into training set, intersection according to a certain percentage Checking collection and test set.

Step 2, very long load sequence is divided into a lot of fixed sizes, continuous historical datas and prediction data.

Step 3, the long study of memory network in short-term is made using training set to corresponding parameter, and selected by cross validation collection The best model of generalization.

, in short-term in memory network, output is final predicting the outcome for step 4, the length for succeeding in school the data feeding of test set.

2. two it is characterised by the step of claim 1 methods described, we pass through the subsequence of load partition fixed size So that network is modeled.The input of network moment t is the historical data x=(d of one group of load_t-1, d_t-2..., d_t-n), its In each x represent t before length for n load histories sequence data.Corresponding it is output as Wherein o be length for length after t for the future load of m prediction data.Assume that real data are y=(d_t+1, d_t+2..., d_t+m), then for a certain group of training data (x, y), its corresponding cost function is defined as by weWherein N be prediction length, o_iIt is predictive value, y_iIt is actual value.

3. the step of claim 1 methods described three be characterised by our Selection utilization length in short-term memory network being loaded The modeling of prediction.Long memory network in short-term is one kind of Recognition with Recurrent Neural Network, it is assumed that the input of network moment t is historic load x_t, it is output as h_t(final prediction o is obtained through a full articulamentum again_t).Before and after enabling the network to load sequence Relation is modeled, and we define a temporary location s_tFor store-memory.Meanwhile, in order to be able to enable load estimation to get Prolonged information, we utilize the long feature definitions of memory network in short-term three gate functions (input layer, intermediate layer, output Layer)： To control The transmission of information processed.Then, we obtain finally entering for each moment network, intermediate value, output on this basis：i_t=σ (b+Ux_t+Wh_t-1),When output is calculated, we select ReLu activation letters Several training to accelerate network.