CN109033450B - Elevator equipment fault prediction method based on deep learning - Google Patents

Elevator equipment fault prediction method based on deep learning

Info

Publication number
CN109033450B
CN109033450B
Authority
CN
China
Prior art keywords
elevator
fault
time
sequence
model
Prior art date
Legal status
Active
Application number
CN201810962887.0A
Other languages
Chinese (zh)
Other versions
CN109033450A (en)
Inventor
王莉
江海洋
Current Assignee
Taiyuan University of Technology
Original Assignee
Taiyuan University of Technology
Priority date
Filing date
Publication date
Application filed by Taiyuan University of Technology filed Critical Taiyuan University of Technology
Priority to CN201810962887.0A priority Critical patent/CN109033450B/en
Publication of CN109033450A publication Critical patent/CN109033450A/en
Application granted granted Critical
Publication of CN109033450B publication Critical patent/CN109033450B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G06N 3/049 — Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs (Physics; Computing; Computing arrangements based on specific computational models; Neural networks; Architecture)
    • G06N 3/044 — Recurrent networks, e.g. Hopfield networks (Physics; Computing; Computing arrangements based on specific computational models; Neural networks; Architecture)
    • G06F 2216/03 — Data mining (Physics; Computing; Electric digital data processing; Indexing scheme relating to additional aspects of information retrieval)


Abstract

The invention relates to an elevator equipment fault prediction method based on deep learning, and belongs to the technical field of elevator fault prediction. The problem addressed is the timely and accurate prediction of both the elevator fault type and the elevator fault time. The method proceeds as follows: first, elevator fault records are collected to build a real-time elevator fault information base; the fault information is then processed into an event sequence and a time sequence; the two sequences are fed as inputs to a dual-LSTM network, and output embeddings for each sequence are obtained through iterative training of the recurrent neural networks; a joint layer combines the two embeddings to learn the background knowledge of the intensity function and a nonlinear representation of the historical influence; finally, the type and time of the next elevator fault are predicted from the learned intensity function. The method helps elevator maintenance personnel take preventive measures as early as possible and avoid fault events.

Description

Elevator equipment fault prediction method based on deep learning
Technical Field
The invention discloses an elevator equipment fault prediction method based on deep learning, and belongs to the technical field of elevator fault prediction.
Background
With the increasing number of high-rise buildings, elevator quality is receiving growing attention. Elevators that break down, run roughly, or even cause accidents disrupt people's daily lives. How to reduce the elevator failure rate and detect and remove faults in a timely and accurate manner therefore requires further research. Most existing elevator fault diagnosis and detection methods inspect the elevator's mechanical system, electrical control system and safety protection system based on the elevator's structure and operating principles, which consumes a large amount of analysis time.
Disclosure of Invention
The invention overcomes the defects of the prior art. The technical problem to be solved is to provide a timely and accurate elevator equipment fault prediction method based on deep learning, so that the type and time of elevator faults can be predicted, maintenance technicians can carry out regular inspections according to the prediction results, and the occurrence rate of elevator faults is reduced.
In order to solve the technical problems, the invention adopts the technical scheme that: the elevator equipment fault prediction method based on deep learning comprises the following steps:
step 1, establishing a real-time elevator fault information base, and screening effective elevator fault information as a preprocessing information sequence of a network according to four principles of fault information correlation, complete information, non-repeated information and non-human error operation of the fault information, wherein the elevator fault information comprises fault recording information and elevator basic information, and the fault recording information comprises: elevator fault type, fault reason and fault time, elevator basic information includes: the production date of the elevator, the position of the elevator, the type of the elevator and the service life of the elevator;
step 2, constructing a time sequence, which comprises two characteristics: 1) counting the failure times of various types of elevators, 2) basic information of the elevators, and combining the two characteristics to form a time sequence;
and 3, constructing an event sequence, wherein the event sequence comprises two characteristics: 1) the data recording sequence of the elevator fault types is arranged in sequence according to the fault occurrence time, 2) the time interval between two adjacent fault events, and the two characteristics are combined to form an event sequence;
step 4, constructing an LSTM neural network;
step 5, training a time sequence and an event sequence by using a double LSTM neural network to obtain a background knowledge representation and a history influence representation of the intensity function;
step 6, integrating background knowledge representation and historical influence representation through the joint layer;
step 7, predicting elevator fault types through a fault type prediction layer by using the strength function learned through double LSTM, and quantizing the loss values predicted by categories by using a classification loss layer;
step 8, predicting elevator fault time through a fault time prediction layer by using the strength function learned through double LSTM, and quantizing the loss value predicted by time by using a regression loss layer;
step 9, continuously and iteratively training a neural network model based on the step 7 and the step 8 to obtain an optimal network model, and then respectively predicting the type of the elevator fault and the elevator fault time through a fault type prediction layer and a fault time prediction layer by using the trained optimal model;
and step 10, perfecting the optimization model, updating the elevator fault information base in real time, inputting the real-time updated data into the model to test the accuracy of the model, and correcting the perfection model according to the actual feedback condition.
Specifically, the "constructing a time series" in step 2 includes: assuming that the tested elevators have M types, the effective fault types have N types, and the number of the elevator fault time windows is N, the number of the k-th fault types of the jth time window of the ith elevator can be recorded as the number of the k-th fault types of the jth time window of the ith elevator
Figure BDA0001774231350000021
1) Counting the times of various fault types by taking one week as a time window
Figure BDA0001774231350000022
2) The elevator basic information, the two characteristics are combined to form time series data, and the time series data are specifically expressed as follows:
Figure BDA0001774231350000023
wherein M represents the elevator model, d represents the elevator production date, l represents the elevator position, and N represents the number of elevator failure time windows, and double transverse lines are taken as examples in the representation to represent the statistical times of N failure types and the representation unit of the elevator basic information in the first time window of the Mth elevator.
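As an illustration of the time-series construction just described, the following is a minimal Python sketch of how a weekly fault-count series with appended basic information could be assembled from a fault log; the column names (elevator_id, fault_type, fault_time, model, production_date, location) and the use of pandas are illustrative assumptions, not the patent's implementation.

import pandas as pd

def build_time_series(fault_log: pd.DataFrame, basic_info: pd.DataFrame) -> pd.DataFrame:
    """Count faults of each type per elevator per one-week window and
    append the elevator's basic information (hypothetical schema)."""
    log = fault_log.copy()
    log["week"] = log["fault_time"].dt.to_period("W")            # one-week time window
    # feature 1: number of occurrences of each fault type per window
    counts = (log.groupby(["elevator_id", "week", "fault_type"])
                 .size()
                 .unstack(fill_value=0))                          # one column per fault type
    # feature 2: elevator basic information (model, production date, location)
    return (counts.reset_index()
                  .merge(basic_info, on="elevator_id", how="left")
                  .sort_values(["elevator_id", "week"]))

fault_log = pd.DataFrame({
    "elevator_id": [1, 1, 2],
    "fault_type": ["door_jam", "overload", "door_jam"],
    "fault_time": pd.to_datetime(["2018-01-02", "2018-01-05", "2018-01-03"]),
})
basic_info = pd.DataFrame({
    "elevator_id": [1, 2],
    "model": ["A", "B"],
    "production_date": ["2015-06", "2016-02"],
    "location": ["Building 1", "Building 2"],
})
print(build_time_series(fault_log, basic_info))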
Specifically, the "constructing an event sequence" in step 3 includes: 1) storing all elevator information separately according to different elevator ids, converting elevator fault time into timestamp data, arranging elevator fault types according to fault occurrence time for each elevator, 2) calculating interval time of adjacent fault events of each elevator, and 3) forming a minimum event sequence unit by the fault type T and the timestamp interval I, wherein the minimum event sequence unit is specifically represented as follows:
Figure BDA0001774231350000024
the elevator id represents the unique identifier of the elevator-the elevator number.
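A corresponding sketch for the event-sequence construction, under the same assumed schema: each elevator's faults are sorted by occurrence time and paired with the interval to the previous fault, giving the (fault type T, timestamp interval I) units described above.

import pandas as pd

def build_event_sequences(fault_log: pd.DataFrame) -> dict:
    """Return, per elevator id, a list of (fault_type, interval_seconds) pairs
    ordered by fault occurrence time (illustrative schema, not the patent's code)."""
    sequences = {}
    for elevator_id, group in fault_log.groupby("elevator_id"):
        g = group.sort_values("fault_time")
        timestamps = g["fault_time"].astype("int64") // 10**9   # convert to Unix timestamps
        intervals = timestamps.diff().fillna(0).astype(int)     # interval to the previous fault
        sequences[elevator_id] = list(zip(g["fault_type"], intervals))
    return sequences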
Specifically, the method of "constructing the LSTM neural network" in step 4 is as follows:
the specific formula of the recurrent neural network variant LSTM used in the invention is defined as follows:
i_t = g(W_i x_t + U_i y_{t-1} + V_i c_{t-1} + b_i),
f_t = g(W_f x_t + U_f y_{t-1} + V_f c_{t-1} + b_f),
c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W_c x_t + U_c y_{t-1} + b_c),
o_t = g(W_o x_t + U_o y_{t-1} + V_o c_t + b_o),
y_t = o_t ⊙ tanh(c_t)
The above are the update formulas for the input gate, forget gate, cell state and output gate of the LSTM, where i_t is the input gate, f_t the forget gate, c_t the cell state, and o_t and y_t together form the output: o_t selects, through the function g, which part of the cell state is output, and y_t is the cell state passed through tanh and multiplied by o_t, i.e. the part that is finally output. W, U and V are the parameter matrices to be learned during neural network training, ⊙ denotes element-wise multiplication of array elements, and the function g is the sigmoid activation function. Writing x_t for the input sequence and y_t for the generated hidden-layer states, the above formulas can be abbreviated as (y_t, c_t) = LSTM(x_t, y_{t-1}, c_{t-1}).
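To make the gate equations above concrete, here is a minimal NumPy sketch of a single LSTM step that follows them directly (sigmoid for g, element-wise products, peephole-style V matrices on the cell state); the dimensions and random initialization are illustrative assumptions.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, y_prev, c_prev, p):
    """One update (y_t, c_t) = LSTM(x_t, y_{t-1}, c_{t-1}) following the formulas above."""
    i_t = sigmoid(p["Wi"] @ x_t + p["Ui"] @ y_prev + p["Vi"] @ c_prev + p["bi"])    # input gate
    f_t = sigmoid(p["Wf"] @ x_t + p["Uf"] @ y_prev + p["Vf"] @ c_prev + p["bf"])    # forget gate
    c_t = f_t * c_prev + i_t * np.tanh(p["Wc"] @ x_t + p["Uc"] @ y_prev + p["bc"])  # cell state
    o_t = sigmoid(p["Wo"] @ x_t + p["Uo"] @ y_prev + p["Vo"] @ c_t + p["bo"])       # output gate
    y_t = o_t * np.tanh(c_t)                                                        # hidden state
    return y_t, c_t

# toy dimensions: input size 4, hidden size 3
rng = np.random.default_rng(0)
d_in, d_h = 4, 3
p = {k: rng.normal(scale=0.1, size=(d_h, d_in if k[0] == "W" else d_h))
     for k in ("Wi", "Wf", "Wc", "Wo", "Ui", "Uf", "Uc", "Uo", "Vi", "Vf", "Vo")}
p.update({k: np.zeros(d_h) for k in ("bi", "bf", "bc", "bo")})
y_t, c_t = lstm_step(rng.normal(size=d_in), np.zeros(d_h), np.zeros(d_h), p)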
The LSTM construction process comprises the following specific steps:
(1) Network initialization: determine the number of input-layer nodes i, hidden-layer nodes j and output-layer nodes k, and the cell-state dimension of the network; initialize the connection weights between the neurons of the input, hidden and output layers; initialize the connection weights W_i, W_f, W_o, W_c of the input gate, forget gate, output gate and cell unit; initialize the thresholds b_f, b_c, b_o, b_i; and set the learning rate and the neuron activation function;
(2) Compute the output value of each neuron: for the LSTM, f_t, i_t, c_t, o_t, h_t;
(3) Error calculation: back-propagate the error term of each neuron from the predicted output and the expected output matrix;
(4) Weight update: update the network connection weights W_i, W_f, W_o, W_c according to the network prediction error;
(5) Threshold update: update the network node thresholds b_f, b_c, b_o, b_i according to the network prediction error;
(6) Judge whether training is finished; if not, return to step (2);
(7) When finished, use the trained dual LSTM neural network and proceed to step 5.
specifically, the method of "training the time sequence and the event sequence using the dual LSTM neural network" in step 5 is as follows:
the specific formula is as follows:
[formula image: LSTM applied to the time series]
[formula image: LSTM applied to the event sequence]
In the above formulas, the first input sequence is the time series and the second is the event sequence, where z_i denotes the type of the i-th event in the event sequence and t_i the timestamp at which it occurs; the two sequences are used to learn the background knowledge and the historical influence, respectively.
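A hedged PyTorch sketch of this dual-LSTM step: one LSTM consumes the time series and another consumes the event sequence (an embedding of the event type concatenated with the inter-event interval), and their final hidden states serve as the background-knowledge and history-influence embeddings; the layer sizes and feature encodings are assumptions for illustration.

import torch
import torch.nn as nn

class DualLSTMEncoder(nn.Module):
    """Encode the time series and the event sequence with two separate LSTMs."""
    def __init__(self, ts_dim, n_event_types, emb_dim=16, hidden=32):
        super().__init__()
        self.ts_lstm = nn.LSTM(ts_dim, hidden, batch_first=True)       # background knowledge
        self.type_emb = nn.Embedding(n_event_types, emb_dim)
        self.ev_lstm = nn.LSTM(emb_dim + 1, hidden, batch_first=True)  # historical influence

    def forward(self, ts, ev_types, ev_intervals):
        # ts: (B, T_w, ts_dim); ev_types: (B, T_e) long; ev_intervals: (B, T_e) float
        _, (h_ts, _) = self.ts_lstm(ts)
        ev_in = torch.cat([self.type_emb(ev_types), ev_intervals.unsqueeze(-1)], dim=-1)
        _, (h_ev, _) = self.ev_lstm(ev_in)
        return h_ts[-1], h_ev[-1]   # background embedding, history embedding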
Specifically, the method of "fusing the background knowledge representation and the historical influence representation through the joint layer" in the step 6 is as follows: adopting the tanh function as the joint function of the join layer to construct the nonlinear mapping of the strength function of the point process,
the formula of the nonlinear mapping of the point process intensity function constructed by the joint layer join layer is as follows:
Figure BDA0001774231350000044
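Continuing the sketch, the joint layer could fuse the two embeddings with a tanh nonlinearity to give the intensity-function representation e_t; since the patent's exact formula is only available as an image, the concatenation-plus-linear-layer form below is an assumption.

import torch
import torch.nn as nn

class JointLayer(nn.Module):
    """Fuse the background and history embeddings: e_t = tanh(W_f [h_ts, h_ev] + b_f)."""
    def __init__(self, hidden=32, joint_dim=32):
        super().__init__()
        self.fuse = nn.Linear(2 * hidden, joint_dim)

    def forward(self, h_ts, h_ev):
        return torch.tanh(self.fuse(torch.cat([h_ts, h_ev], dim=-1)))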
specifically, in step 7, "predicting elevator fault types by fault type prediction layers using the strength function learned by the dual LSTM, and quantizing the loss values predicted by categories using the classification loss layers" includes:
the calculation formulas of the failure prediction main class and the failure prediction subclass are as follows:
Ut=softMax(Wuet+bU)
ut=softMax(Wu[et,Ut]+bu)
in the above formula, U and U represent the main and sub-classes, respectively, and WuAnd buRepresenting a parameter matrix to be learned by the model in the training process; first, e is divided by utilizing softMax functiontPredicting failure dominant categories as inputUt(ii) a Then e is processed again by utilizing the softMax functiontAnd the main class prediction value UtThe predicted value of the subclass is calculated as input.
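The two-level softmax described above could be implemented as in the following sketch, where the main-class probabilities are computed from e_t and the subclass probabilities from e_t concatenated with the main-class prediction; the layer sizes are illustrative.

import torch
import torch.nn as nn

class FaultTypeHead(nn.Module):
    """U_t = softMax(W_U e_t + b_U); u_t = softMax(W_u [e_t, U_t] + b_u)."""
    def __init__(self, joint_dim, n_main, n_sub):
        super().__init__()
        self.main = nn.Linear(joint_dim, n_main)
        self.sub = nn.Linear(joint_dim + n_main, n_sub)

    def forward(self, e_t):
        U_t = torch.softmax(self.main(e_t), dim=-1)                            # main class
        u_t = torch.softmax(self.sub(torch.cat([e_t, U_t], dim=-1)), dim=-1)   # subclass
        return U_t, u_t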
Specifically, step 8, "predicting the elevator fault time through the fault time prediction layer using the strength function learned by the dual LSTM, and quantizing the time-prediction loss using the regression loss layer", proceeds as follows:
s_t = W_s e_t + b_s
where s_t is the predicted timestamp of each event, and W_s and b_s are parameter matrices to be learned by the model during training.
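The fault-time prediction of step 8 reduces to a single linear regression layer on e_t; a minimal sketch:

import torch.nn as nn

class FaultTimeHead(nn.Module):
    """s_t = W_s e_t + b_s : predicted timestamp (or interval) of the next fault."""
    def __init__(self, joint_dim):
        super().__init__()
        self.linear = nn.Linear(joint_dim, 1)

    def forward(self, e_t):
        return self.linear(e_t).squeeze(-1)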
Specifically, in the step 9, "predicting the type of the elevator fault and the elevator fault time through the fault type prediction layer and the fault time prediction layer by using a trained optimal model" specifically includes:
The loss of the entire model is the sum of the time-prediction loss and the event-type-prediction loss: the event type is predicted with a cross-entropy loss function and the timestamp with a squared loss function. The loss function is as follows:

[formula image: overall loss function]

In the above formula, N denotes the total number of event points, which are indexed by l (i.e. l denotes the l-th event), and W_u denotes the event-class parameter matrix learned by the neural network model; the remaining terms in the formula image are the timestamp of the next event point and the historical information of the time points. A Gaussian penalty function is adopted for the time-loss part:

[formula image: Gaussian penalty function]

where σ denotes the variance, taken as σ² = 10.
The whole model is trained iteratively by minimizing this loss, finally yielding the optimal model.
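A hedged sketch of the combined objective: cross-entropy on the unnormalized class scores (logits) for the fault main class and subclass, plus a Gaussian penalty with σ² = 10 on the timestamp, minimized jointly; treating the penalty as the negative log of a Gaussian density is an assumption about how it enters the loss.

import torch
import torch.nn.functional as F

def combined_loss(main_logits, sub_logits, main_target, sub_target,
                  s_pred, s_true, sigma2: float = 10.0):
    """Event-type cross-entropy plus Gaussian time penalty, averaged over the batch."""
    type_loss = (F.cross_entropy(main_logits, main_target)
                 + F.cross_entropy(sub_logits, sub_target))
    # negative log-density of a Gaussian with variance sigma^2 centred at the prediction
    time_loss = (0.5 * torch.log(torch.tensor(2 * torch.pi * sigma2))
                 + (s_true - s_pred) ** 2 / (2 * sigma2)).mean()
    return type_loss + time_loss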
Specifically, the method of "perfecting the optimization model" in the step 10 is as follows: and obtaining an elevator fault prediction model through a training set, then obtaining a test set by utilizing a real-time updated fault information base, testing the accuracy of the model based on the test set, and perfecting the optimization model in real time.
Through the above steps, an elevator-system fault prediction model based on a dual-LSTM point-process strength function can be constructed: the elevator fault data are preprocessed into time-series and event-sequence data, fault representations are obtained through the dual LSTM neural network model, the two embeddings are combined to obtain a nonlinear representation of the strength function, the whole model is trained iteratively, and the strength function of the point process finally allows the fault time and fault type of the elevator to be predicted. The method is suitable for solving the practical problem of predicting elevator fault types and times: it helps elevator maintainers use known elevator fault information to predict what type of fault will occur and when, so that preventive measures can be taken in advance, elevator faults are avoided, and the losses and dangers caused by elevator faults are reduced; it therefore has high practical application value.
In summary, the invention discloses an elevator equipment fault prediction method based on deep learning: for the densely sampled online operation and measurement data of an elevator system, it proposes a data mining algorithm that trains an event sequence and a time sequence with dual LSTMs and then combines them through a point-process intensity function. Starting from the data, the method remotely monitors faulty elevators in real time on the basis of the historical elevator fault information sequence, uses the neural network model to mine the internal laws of elevator faults from massive elevator alarm information, and predicts the type and time of elevator faults in a timely and accurate manner, thereby helping elevator maintenance personnel take preventive measures as early as possible and avoiding fault events.
LSTM (Long Short-Term Memory) is a long short-term memory network, a recurrent neural network over time, suitable for processing and predicting important events with relatively long intervals and delays in a time series. LSTM differs from a plain RNN mainly in that it adds a "processor" that judges whether information is useful; the structure implementing this processor is called a cell. Three gates, namely an input gate, a forget gate and an output gate, are placed in each cell. When a piece of information enters the LSTM network, it is judged according to these rules: only information that passes the check is kept, while information that does not is discarded through the forget gate. LSTM has been proven to be an effective technique for the long-range dependence problem, and the technique is highly general; researchers have proposed many variants of LSTM, allowing it to handle a wide variety of sequence problems.
Application of RNNs to sequence data: from the perspective of application scenarios, the RNN, as a building block of the present invention, can be applied to two kinds of sequence data, the time series and the event sequence, which can be modeled cooperatively and are introduced as follows:
(1) Time series: a synchronous sequence that records statistics of related data over identical time intervals. A time series captures the time-varying features within a recent time window, so RNNs often take time series as input for sequence prediction problems. For example, video frames can be regarded as time-series data, and their internal relations can be analyzed from historical frames to predict the content of the next frame. RNNs have been widely used in video analysis and speech recognition.
(2) Event sequence: an asynchronous sequence of random event features recorded by timestamps. An event sequence takes the randomly generated timestamps as RNN input, so it can capture the long-term dependencies between events more efficiently. The invention therefore uses dual LSTMs to model the two kinds of sequence data cooperatively, capturing both the update regularity of synchronous information and the asynchronous randomness of bursty information, while the nonlinear mapping of the conditional intensity function generated by the recurrent neural network avoids the limitations imposed by parametric assumptions.
A point process is a stochastic model of a sequence of points in time. The point process is the mathematical framework behind sequence-modeling and recommendation algorithms, and its dynamic variability is measured by an intensity function. Point processes are divided into event point processes and object point processes. Event point processes include: earthquakes or other catastrophic events; server access events; sub-standard products produced in a factory; traffic accidents occurring at an intersection, and so on (marked point processes were first used to predict the occurrence patterns of earthquakes and aftershocks). Object point processes include: the positions of cars on a highway; DNA genes, and so on.
Introduction to the development of the intensity function: the point process serves as the mathematical framework for modeling sequence models, and its dynamic variability is measured with a conditional intensity function. Definition of the intensity function: in the time window [t, t + dt), λ(t) denotes the occurrence probability of a new event given the event history H_t = {z_i, t_i | t_i < t}:
λ(t) = E(dN(t) | H_t) / dt
where E(dN(t) | H_t) denotes the expected number of events occurring in the time interval [t, t + dt) given the history H_t. The conditional intensity function plays a key role in point-process sequence prediction algorithms, and different parameterized forms of the intensity function give different point processes. Variants of the point-process intensity function include: the Poisson process; the reinforced Poisson process; the Hawkes process; the reactive point process; the self-correcting point process; the RMTPP model; and the TRPP model.
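For intuition, the Hawkes process mentioned above has the classic conditional intensity λ(t) = μ + α Σ_{t_i < t} exp(−β (t − t_i)), which can be evaluated directly; this standard textbook form is given only as background and is not a formula from the patent.

import math

def hawkes_intensity(t, history, mu=0.2, alpha=0.8, beta=1.0):
    """lambda(t) = mu + alpha * sum over past events t_i < t of exp(-beta * (t - t_i))."""
    return mu + alpha * sum(math.exp(-beta * (t - t_i)) for t_i in history if t_i < t)

print(hawkes_intensity(5.0, history=[1.0, 2.5, 4.8]))   # intensity shortly after a burst of events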
The parameterized form of the intensity function consists of two parts: background knowledge and historical features. The above methods are summarized in Table 1, from which it can be seen that the first five intensity functions are parametric models assumed manually from prior knowledge; they have inherent limitations and cannot fully match the complex dynamics of real sequence problems. The RMTPP model instead learns the parameters from the historical features of the event sequence with an LSTM; although it ignores the background features of the time series, as a semi-parameterized intensity function it still achieves good results. The TRPP model uses dual LSTMs to model the time and event sequences separately, as the background knowledge and historical information of the intensity function. The model of the invention is therefore constructed mainly following the ideas of the last two models.
TABLE 1  Point-process intensity functions
[table provided as an image in the original]
Compared with the prior art, the invention has the following beneficial effects.
1. The invention combines the LSTM neural network and the point process for elevator fault prediction, and proposes a data mining algorithm that trains an event sequence and a time sequence with dual LSTMs and then combines them through a point-process intensity function. Starting from the data and based on the historical elevator fault information sequence, the neural network model learns the internal laws contained in the training data, and its nonlinear representation captures the changing pattern of elevator fault types, so that the elevator fault type and fault time can be predicted, maintenance technicians can carry out regular inspections according to the prediction results, and the occurrence rate of elevator faults is reduced.
2. The method uses massive elevator fault data and lets the neural network learn the internal laws governing the occurrence of elevator faults, thereby avoiding the misjudgments of manual assumptions and making the model more accurate and efficient.
3. The model uses a fault information base that is updated in real time, so the accuracy of the model can be verified and the model optimized in a timely manner, giving the model timeliness.
Drawings
FIG. 1 is a flow chart of the general steps of the present invention.
FIG. 2 is a diagram of the cooperative modeling of the time series and the event series in the present invention.
Fig. 3 is a frame diagram of the type of failure prediction and time of failure prediction in the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
As shown in fig. 1, the elevator equipment fault prediction method based on deep learning of the present invention includes the following steps:
step 1, establishing a real-time elevator fault information base, and screening effective elevator fault information as a preprocessing information sequence of a network according to four principles of fault information correlation, complete information, non-repeated information and non-human error operation of the fault information, wherein the elevator fault information comprises fault recording information and elevator basic information, and the fault recording information comprises: elevator fault type, fault reason and fault time, elevator basic information includes: the production date of the elevator, the position of the elevator, the type of the elevator and the service life of the elevator.
Step 2, constructing a time sequence, which comprises two characteristics: 1) counting the failure times of various types of elevators, 2) basic information of the elevators, and combining the two characteristics to form a time sequence;
specifically, the "constructing a time series" includes: assuming that the tested elevators have M types, the effective fault types have N types, and the number of the elevator fault time windows is N, the number of the k-th fault types of the jth time window of the ith elevator can be recorded as the number of the k-th fault types of the jth time window of the ith elevator
Figure BDA0001774231350000081
1) Counting the times of various fault types by taking one week as a time window
Figure BDA0001774231350000082
2) The elevator basic information, the two characteristics are combined to form time series data, and the time series data are specifically expressed as follows:
Figure BDA0001774231350000083
wherein M represents the elevator model, d represents the elevator production date, l represents the elevator position, and N represents the number of elevator failure time windows, and double transverse lines are taken as examples in the representation to represent the statistical times of N failure types and the representation unit of the elevator basic information in the first time window of the Mth elevator.
And 3, constructing an event sequence, wherein the event sequence comprises two characteristics: 1) the data recording sequence of the elevator fault types is arranged in sequence according to the fault occurrence time, 2) the time interval between two adjacent fault events, and the two characteristics are combined to form an event sequence;
specifically, the "construction of the event sequence" is performed by: 1) storing all elevator information separately according to different elevator ids, converting elevator fault time into timestamp data, arranging elevator fault types according to fault occurrence time for each elevator, 2) calculating interval time of adjacent fault events of each elevator, and 3) forming a minimum event sequence unit by the fault type T and the timestamp interval I, wherein the minimum event sequence unit is specifically represented as follows:
Figure BDA0001774231350000084
the elevator id represents the unique identifier of the elevator-the elevator number.
Step 4, constructing an LSTM neural network;
the specific method comprises the following steps:
the specific formula of the recurrent neural network variant LSTM used in the invention is defined as follows:
i_t = g(W_i x_t + U_i y_{t-1} + V_i c_{t-1} + b_i),
f_t = g(W_f x_t + U_f y_{t-1} + V_f c_{t-1} + b_f),
c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W_c x_t + U_c y_{t-1} + b_c),
o_t = g(W_o x_t + U_o y_{t-1} + V_o c_t + b_o),
y_t = o_t ⊙ tanh(c_t)
The above are the update formulas for the input gate, forget gate, cell state and output gate of the LSTM, where i_t is the input gate, f_t the forget gate, c_t the cell state, and o_t and y_t together form the output: o_t selects, through the function g, which part of the cell state is output, and y_t is the cell state passed through tanh and multiplied by o_t, i.e. the part that is finally output. W, U and V are the parameter matrices to be learned during neural network training, ⊙ denotes element-wise multiplication of array elements, and the function g is the sigmoid activation function. Writing x_t for the input sequence and y_t for the generated hidden-layer states, the above formulas can be abbreviated as (y_t, c_t) = LSTM(x_t, y_{t-1}, c_{t-1}).
The LSTM construction process comprises the following specific steps:
(1) Network initialization: determine the number of input-layer nodes i, hidden-layer nodes j and output-layer nodes k, and the cell-state dimension of the network; initialize the connection weights between the neurons of the input, hidden and output layers; initialize the connection weights W_i, W_f, W_o, W_c of the input gate, forget gate, output gate and cell unit; initialize the thresholds b_f, b_c, b_o, b_i; and set the learning rate and the neuron activation function;
(2) Compute the output value of each neuron: for the LSTM, f_t, i_t, c_t, o_t, h_t;
(3) Error calculation: back-propagate the error term of each neuron from the predicted output and the expected output matrix;
(4) Weight update: update the network connection weights W_i, W_f, W_o, W_c according to the network prediction error;
(5) Threshold update: update the network node thresholds b_f, b_c, b_o, b_i according to the network prediction error;
(6) Judge whether training is finished; if not, return to step (2);
(7) When finished, use the trained dual LSTM neural network and proceed to step 5.
step 5, training a time sequence and an event sequence by using a double LSTM neural network to obtain a background knowledge representation and a history influence representation of the intensity function;
specifically, the method for training the time sequence and the event sequence by using the double LSTM neural network comprises the following steps:
the specific formula is as follows:
[formula image: LSTM applied to the time series]
[formula image: LSTM applied to the event sequence]
In the above formulas, the first input sequence is the time series and the second is the event sequence, where z_i denotes the type of the i-th event in the event sequence and t_i the timestamp at which it occurs; the two sequences are used to learn the background knowledge and the historical influence, respectively.
Step 6, integrating background knowledge representation and historical influence representation through the joint layer;
The specific method is as follows: the tanh function is adopted as the fusion function of the joint layer to construct the nonlinear mapping of the point-process intensity function.
The formula of the nonlinear mapping of the point-process intensity function constructed by the joint layer is as follows:

[formula image: nonlinear mapping of the intensity function]
step 7, predicting elevator fault types through a fault type prediction layer by using the strength function learned through double LSTM, and quantizing the loss values predicted by categories by using a classification loss layer;
the specific method comprises the following steps:
The calculation formulas for the fault-prediction main class and subclass are:
U_t = softMax(W_U e_t + b_U)
u_t = softMax(W_u [e_t, U_t] + b_u)
In the above formulas, U_t and u_t denote the main class and the subclass respectively, and W_U, b_U, W_u and b_u denote parameter matrices to be learned by the model during training. First, the softMax function takes e_t as input to predict the fault main class U_t; then the softMax function takes e_t together with the main-class prediction U_t as input to compute the subclass prediction u_t.
Step 8, predicting elevator fault time through a fault time prediction layer by using the strength function learned through double LSTM, and quantizing the loss value predicted by time by using a regression loss layer;
the specific method comprises the following steps:
s_t = W_s e_t + b_s
where s_t is the predicted timestamp of each event, and W_s and b_s are parameter matrices to be learned by the model during training.
Step 9, continuously and iteratively training a neural network model based on the step 7 and the step 8 to obtain an optimal network model, and then respectively predicting the type of the elevator fault and the elevator fault time through a fault type prediction layer and a fault time prediction layer by using the trained optimal model;
specifically, the method for respectively predicting the type of the elevator fault and the elevator fault time through the fault type prediction layer and the fault time prediction layer by using the trained optimal model comprises the following steps:
The loss of the entire model is the sum of the time-prediction loss and the event-type-prediction loss: the event type is predicted with a cross-entropy loss function and the timestamp with a squared loss function. The loss function is as follows:

[formula image: overall loss function]

In the above formula, N denotes the total number of event points, which are indexed by l (i.e. l denotes the l-th event), and W_u denotes the event-class parameter matrix learned by the neural network model; the remaining terms in the formula image are the timestamp of the next event point and the historical information of the time points. A Gaussian penalty function is adopted for the time-loss part:

[formula image: Gaussian penalty function]

where σ denotes the variance, taken as σ² = 10.
The whole model is trained iteratively by minimizing this loss, finally yielding the optimal model.
And step 10, perfecting the optimization model, updating the elevator fault information base in real time, inputting the real-time updated data into the model to test the accuracy of the model, and correcting the perfection model according to the actual feedback condition.
Specifically, the method for "perfecting the optimization model" comprises the following steps: and obtaining an elevator fault prediction model through a training set, then obtaining a test set by utilizing a real-time updated fault information base, testing the accuracy of the model based on the test set, and perfecting the optimization model in real time.
Steps 2 and 3 above are illustrated in Fig. 2, which shows the generation of the time series and the event sequence and their cooperative modeling. Because the time series consists of basic information and statistical information, it carries the background knowledge of the data, while the asynchronously and randomly generated timestamp sequence of the event sequence carries the bursty information; only by modeling the two sequences cooperatively can the more complete information content be captured.
Fig. 3 is a main block diagram of elevator fault type prediction and time of failure prediction, as shown in steps 5-9.
The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.

Claims (10)

1. The elevator equipment fault prediction method based on deep learning is characterized by comprising the following steps:
step 1, establishing a real-time elevator fault information base, and screening effective elevator fault information as a preprocessing information sequence of a network according to four principles of fault information correlation, complete information, non-repeated information and non-human error operation of the fault information, wherein the elevator fault information comprises fault recording information and elevator basic information, and the fault recording information comprises: elevator fault type, fault reason and fault time, elevator basic information includes: the production date of the elevator, the position of the elevator, the type of the elevator and the service life of the elevator;
step 2, constructing a time sequence, which comprises two characteristics: 1) counting the failure times of various types of elevators, 2) basic information of the elevators, and combining the two characteristics to form a time sequence;
and 3, constructing an event sequence, wherein the event sequence comprises two characteristics: 1) the data recording sequence of the elevator fault types is arranged in sequence according to the fault occurrence time, 2) the time interval between two adjacent fault events, and the two characteristics are combined to form an event sequence;
step 4, constructing an LSTM neural network;
step 5, training a time sequence and an event sequence by using a double LSTM neural network to obtain a background knowledge representation and a history influence representation of the intensity function;
step 6, integrating background knowledge representation and historical influence representation through the joint layer;
step 7, predicting elevator fault types through a fault type prediction layer by using the strength function learned through double LSTM, and quantizing the loss values predicted by categories by using a classification loss layer;
step 8, predicting elevator fault time through a fault time prediction layer by using the strength function learned through double LSTM, and quantizing the loss value predicted by time by using a regression loss layer;
step 9, continuously and iteratively training a neural network model based on the step 7 and the step 8 to obtain an optimal network model, and then respectively predicting the type of the elevator fault and the elevator fault time through a fault type prediction layer and a fault time prediction layer by using the trained optimal model;
and step 10, perfecting the optimization model, updating the elevator fault information base in real time, inputting the real-time updated data into the model to test the accuracy of the model, and correcting the perfection model according to the actual feedback condition.
2. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
the method of "constructing a time series" in step 2 is as follows: assuming that the tested elevators have M types, the effective fault types have N types, and the number of the elevator fault time windows is N, the number of the k-th fault types of the jth time window of the ith elevator can be recorded as the number of the k-th fault types of the jth time window of the ith elevator
Figure FDA0003234677060000011
1) Counting the times of various fault types by taking one week as a time window
Figure FDA0003234677060000012
2) The elevator basic information, the two characteristics are combined to form time series data, and the time series data are specifically expressed as follows:
Figure FDA0003234677060000021
wherein M represents the elevator model, d represents the elevator production date, l represents the elevator position, and N represents the number of elevator failure time windows, and double transverse lines are taken as examples in the representation to represent the statistical times of N failure types and the representation unit of the elevator basic information in the first time window of the Mth elevator.
3. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
the method of "constructing an event sequence" in step 3 is as follows: 1) storing all elevator information separately according to different elevator ids, converting elevator fault time into timestamp data, arranging elevator fault types according to fault occurrence time for each elevator, 2) calculating interval time of adjacent fault events of each elevator, and 3) forming a minimum event sequence unit by the fault type T and the timestamp interval I, wherein the minimum event sequence unit is specifically represented as follows:
Figure FDA0003234677060000022
the elevator id represents the unique identifier of the elevator-the elevator number.
4. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
the method for constructing the LSTM neural network in the step 4 comprises the following steps:
the specific formula of the recurrent neural network variant LSTM used is defined as follows:
i_t = g(W_i x_t + U_i y_{t-1} + V_i c_{t-1} + b_i),
f_t = g(W_f x_t + U_f y_{t-1} + V_f c_{t-1} + b_f),
c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W_c x_t + U_c y_{t-1} + b_c),
o_t = g(W_o x_t + U_o y_{t-1} + V_o c_t + b_o),
y_t = o_t ⊙ tanh(c_t)
The above are the update formulas for the input gate, forget gate, cell state and output gate of the LSTM, where i_t is the input gate, f_t the forget gate, c_t the cell state, and o_t and y_t together form the output: o_t selects, through the function g, which part of the cell state is output, and y_t is the cell state passed through tanh and multiplied by o_t, i.e. the part that is finally output. W, U and V are the parameter matrices to be learned during neural network training, ⊙ denotes element-wise multiplication of array elements, and the function g is the sigmoid activation function. Writing x_t for the input sequence and y_t for the generated hidden-layer states, the above formulas can be abbreviated as (y_t, c_t) = LSTM(x_t, y_{t-1}, c_{t-1}).
5. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
the method for training the time sequence and the event sequence by using the double LSTM neural network in the step 5 comprises the following steps:
the specific formula is as follows:
[formula image: LSTM applied to the time series]
[formula image: LSTM applied to the event sequence]
where z_i denotes the type of the i-th event in the event sequence and t_i the timestamp at which it occurs; the two sequences are used to learn the background knowledge and the historical influence, respectively.
6. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
The method for fusing the background knowledge representation and the historical influence representation through the joint layer in step 6 is as follows: the tanh function is adopted as the fusion function of the joint layer to construct the nonlinear mapping of the point-process intensity function.
The formula of the nonlinear mapping of the point-process intensity function constructed by the joint layer is as follows:

[formula image: nonlinear mapping of the intensity function]
7. the deep learning based elevator equipment failure prediction method of claim 1, wherein:
in the step 7, "predicting the elevator fault type through a fault type prediction layer by using the strength function learned through the double LSTM, and quantizing the loss value predicted by the category by using the classification loss layer" includes:
the calculation formulas of the failure prediction main class and the failure prediction subclass are as follows:
Ut=softMax(Wuet+bU)
ut=softMax(Wu[et,Ut]+bu)
in the above formula, U and U represent the main and sub-classes, respectively, and WuAnd buRepresenting a parameter matrix to be learned by the model in the training process; first, e is divided by utilizing softMax functiontPredicting a failure primary category U as inputt(ii) a Then e is processed again by utilizing the softMax functiontAnd the main class prediction value UtThe predicted value of the subclass is calculated as input.
8. The deep learning based elevator equipment failure prediction method of claim 6, wherein:
In step 8, "the elevator fault time is predicted through a fault time prediction layer by using the strength function learned through the double LSTM, and the time-prediction loss is quantized by using the regression loss layer" comprises:
s_t = W_s e_t + b_s
where s_t is the predicted timestamp of each event, and W_s and b_s are parameter matrices to be learned by the model during training.
9. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
in the step 9, "the type of the elevator fault and the elevator fault time are predicted respectively through the fault type prediction layer and the fault time prediction layer by using the trained optimal model", the method specifically comprises the following steps:
The loss of the entire model is the sum of the time-prediction loss and the event-type-prediction loss: the event type is predicted with a cross-entropy loss function and the timestamp with a squared loss function. The loss function is as follows:

[formula image: overall loss function]

In the above formula, N denotes the total number of event points, which are indexed by l (i.e. l denotes the l-th event), and W_u denotes the event-class parameter matrix learned by the neural network model; the remaining terms in the formula image are the timestamp of the next event point, the historical information of the time points, and the fault-prediction subclass of the event point. A Gaussian penalty function is adopted for the time-loss part:

[formula image: Gaussian penalty function]

where σ denotes the variance, taken as σ² = 10.
The whole model is trained iteratively by minimizing this loss, finally yielding the optimal model.
10. The deep learning based elevator equipment failure prediction method of claim 1, wherein:
the method for "perfecting the optimization model" in the step 10 comprises the following steps: and obtaining an elevator fault prediction model through a training set, then obtaining a test set by utilizing a real-time updated fault information base, testing the accuracy of the model based on the test set, and perfecting the optimization model in real time.
CN201810962887.0A 2018-08-22 2018-08-22 Elevator equipment fault prediction method based on deep learning Active CN109033450B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810962887.0A CN109033450B (en) 2018-08-22 2018-08-22 Elevator equipment fault prediction method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810962887.0A CN109033450B (en) 2018-08-22 2018-08-22 Elevator equipment fault prediction method based on deep learning

Publications (2)

Publication Number Publication Date
CN109033450A CN109033450A (en) 2018-12-18
CN109033450B true CN109033450B (en) 2021-11-05

Family

ID=64627980

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810962887.0A Active CN109033450B (en) 2018-08-22 2018-08-22 Elevator equipment fault prediction method based on deep learning

Country Status (1)

Country Link
CN (1) CN109033450B (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110009384A (en) * 2019-01-07 2019-07-12 阿里巴巴集团控股有限公司 Predict the method and device of operational indicator
CN109782603A (en) * 2019-02-03 2019-05-21 中国石油大学(华东) The detection method and monitoring system of rotating machinery coupling fault
CN109978026B (en) * 2019-03-11 2021-03-09 浙江新再灵科技股份有限公司 Elevator position detection method and system based on LSTM network
CN110135241B (en) * 2019-03-27 2021-08-31 浙江新再灵科技股份有限公司 Statistical analysis system of elevator stroke
CN110018425B (en) * 2019-04-10 2021-01-29 北京理工大学 Power battery fault diagnosis method and system
CN110008928A (en) * 2019-04-16 2019-07-12 杭州再灵云梯信息科技有限公司 The analysis method to elevator accident based on deep learning method
CN110264270B (en) * 2019-06-20 2020-09-01 京东数字科技控股有限公司 Behavior prediction method, behavior prediction device, behavior prediction equipment and storage medium
CN110348513A (en) * 2019-07-10 2019-10-18 北京华电天仁电力控制技术有限公司 A kind of Wind turbines failure prediction method based on deep learning
CN110580213A (en) * 2019-09-16 2019-12-17 浪潮软件股份有限公司 Database anomaly detection method based on cyclic marking time point process
CN111027591B (en) * 2019-11-13 2022-07-12 西安交通大学 Node fault prediction method for large-scale cluster system
CN110996066B (en) * 2019-12-19 2021-12-24 浙江中控技术股份有限公司 Accident backtracking method and device
CN111931819A (en) * 2020-07-13 2020-11-13 江苏大学 Machine fault prediction and classification method based on deep learning
CN112101431A (en) * 2020-08-30 2020-12-18 西南电子技术研究所(中国电子科技集团公司第十研究所) Electronic equipment fault diagnosis system
CN112083244B (en) * 2020-08-30 2022-10-28 西南电子技术研究所(中国电子科技集团公司第十研究所) Integrated intelligent diagnosis system for faults of avionic equipment
CN112232370A (en) * 2020-09-16 2021-01-15 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) Fault analysis and prediction method for engine
CN112560338B (en) * 2020-12-10 2022-03-25 东北大学 Complex industrial system intelligent forecasting method, device, equipment and storage medium based on adaptive deep learning
CN113111585A (en) * 2021-04-15 2021-07-13 德州欧瑞电子通信设备制造有限公司 Intelligent cabinet fault prediction method and system and intelligent cabinet
CN113112004B (en) * 2021-04-21 2022-12-06 上海电力大学 Time-space prediction method based on Hox process and graph neural network
CN115385209B (en) * 2021-05-25 2023-08-29 太原理工大学 Alarm and password release method for abnormal opening of elevator door
CN113489744B (en) * 2021-07-27 2022-05-10 哈尔滨工业大学 Internet of things attack pattern recognition method based on hoxophone multivariate process modeling
CN113834184B (en) * 2021-08-18 2023-03-21 青岛海尔空调器有限总公司 Control method and device for air conditioner and server
CN113793227B (en) * 2021-09-16 2023-10-31 中国电子科技集团公司第二十八研究所 Intelligent human-like perception and prediction method for social network event
CN113884809A (en) * 2021-09-28 2022-01-04 国网黑龙江省电力有限公司 Secondary equipment fault positioning method and system based on neural network
CN114266201B (en) * 2022-03-01 2022-07-22 杭州市特种设备检测研究院(杭州市特种设备应急处置中心) Self-attention elevator trapping prediction method based on deep learning
CN114626467A (en) * 2022-03-17 2022-06-14 湖南优湖科技有限公司 Feature cross elevator trapping time series prediction model construction method based on deep learning, obtained model and prediction method
CN117034157B (en) * 2023-10-08 2024-01-12 广州健新科技有限责任公司 Hydropower equipment fault identification method and system combining multimodal operation data
CN117131457B (en) * 2023-10-26 2024-01-26 杭州海兴泽科信息技术有限公司 AI model-based electric power big data acquisition and processing method and system
CN118153919A (en) * 2024-05-10 2024-06-07 江苏中天互联科技有限公司 Equipment maintenance arrangement method and related equipment


Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003097506A1 (en) * 2002-05-14 2003-11-27 Otis Elevator Company Neural network detection of obstructions within and motion toward elevator doors
CN101833596A (en) * 2010-03-31 2010-09-15 桂林电子科技大学 Two-damage quantitative identification method of rectangular and round section beam structure
CN102491140A (en) * 2011-12-27 2012-06-13 阳西县电梯配件有限公司 Characteristic-signal-based elevator safety checking device and elevator safety checking method
CN105844050A (en) * 2016-04-12 2016-08-10 吉林大学 Time correlation based replacement time calculation method for numerical control machine tool components
CN106125713A (en) * 2016-06-14 2016-11-16 西安交通大学 The assessment of reliability growth and Forecasting Methodology in the case of a kind of Interval censoring
CN106516923A (en) * 2016-08-31 2017-03-22 江苏鸿信***集成有限公司 Elevator running failure prediction method based on technology of Internet of Things
CN107992693A (en) * 2017-12-08 2018-05-04 中国北方发动机研究所(天津) A kind of cylinder head reliable life evaluation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Sensor System for Fault Detection Identification and Accommodation of Elevator of UAV;P Panitsrisit;《SICE Annual Conference 2011》;20110918;第1035-1040页 *
Fault time series prediction based on LSTM recurrent neural networks; Wang Xin et al.; Journal of Beijing University of Aeronautics and Astronautics; April 2018; Vol. 44, No. 4; pp. 772-783 *

Also Published As

Publication number Publication date
CN109033450A (en) 2018-12-18

Similar Documents

Publication Publication Date Title
CN109033450B (en) Elevator equipment fault prediction method based on deep learning
CN108197845B (en) Transaction index abnormity monitoring method based on deep learning model LSTM
CN109934337B (en) Spacecraft telemetry data anomaly detection method based on integrated LSTM
CN116757534B (en) Intelligent refrigerator reliability analysis method based on neural training network
CN111899510A (en) Intelligent traffic system flow short-term prediction method and system based on divergent convolution and GAT
CN110555230B (en) Rotary machine residual life prediction method based on integrated GMDH framework
CN111461551B (en) Deep learning and SPC criterion-based electric submersible pump fault early warning method
CN112838946B (en) Method for constructing intelligent sensing and early warning model based on communication network faults
CN109471698B (en) System and method for detecting abnormal behavior of virtual machine in cloud environment
CN112101431A (en) Electronic equipment fault diagnosis system
CN111445010B (en) Distribution network voltage trend early warning method based on evidence theory fusion quantum network
CN111126658A (en) Coal mine gas prediction method based on deep learning
CN110210169A (en) A kind of shield machine failure prediction method based on LSTM
CN110991776A (en) Method and system for realizing water level prediction based on GRU network
CN114266278A (en) Dual-attention-network-based method for predicting residual service life of equipment
CN117193222A (en) Intelligent quality control system based on industrial Internet of things and big data and control method thereof
CN113688770B (en) Method and device for supplementing long-term wind pressure missing data of high-rise building
CN116894364A (en) Post-earthquake damage rapid prediction method for high-speed railway track bridge system
CN115470850A (en) Water quality abnormal event recognition early warning method based on pipe network water quality time-space data
CN113536671B (en) Lithium battery life prediction method based on LSTM
CN113988210A (en) Method and device for restoring distorted data of structure monitoring sensor network and storage medium
CN117010683A (en) Operation safety risk prediction method based on hybrid neural network and multiple agents
CN116432839A (en) Backfill cost prediction method, device and equipment based on BP neural network
CN114401135B (en) Internal threat detection method based on LSTM-Attention user and entity behavior analysis technology
CN115392618A (en) AHP-EW and AE-RNN fusion-based food safety risk early warning model and establishment method thereof

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant