CN110008253A

CN110008253A - The industrial data association rule mining and unusual service condition prediction technique of strategy are generated based on two stages frequent item set

Info

Publication number: CN110008253A
Application number: CN201910244856.6A
Authority: CN
Inventors: 徐正国; 王豆; 陈积明; 程鹏; 孙优贤
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2019-03-28
Filing date: 2019-03-28
Publication date: 2019-07-12
Anticipated expiration: 2039-03-28
Also published as: CN110008253B

Abstract

The invention discloses a kind of industrial data association rule minings and unusual service condition prediction technique that strategy is generated based on two stages frequent item set, can be applied to the prognostic and health management of industrial process.The present invention is introduced into association rule mining in industrial equipment failure predication, finds the relevance between operating parameter by association rules mining algorithm.For industrial data feature, start with from the variation tendency of equipment operating parameter, by being most important quota student into transaction set with operating parameter variation tendency, and the association rule mining between parameter and parameter is carried out based on this, then association rule mining result is introduced into the prediction of industrial equipment unusual service condition, to obtain more accurate prediction result.For in engineering failure predication and health control have major application value.

Description

The industrial data association rule mining and different of strategy is generated based on two stages frequent item set Normal operating condition prediction technique

Technical field

The invention belongs to reliability maintenance field of engineering technology, it is related to a kind of generating strategy based on two stages frequent item set Industrial data association rule mining and unusual service condition prediction technique.

Background technique

As the emergence of complication system and the demand of industrial process real-time monitoring are continuously increased, modern industrial equipment Multiple sensors are often equipped in the process of running to be monitored its operating status.Meanwhile it may in equipment running process There is various faults mode, a certain failure may correspond to several signs, and in the case, single-sensor information can not complete body Existing equipment running status, the failure predication based on multi-sensor information are come into being.Failure predication based on multi-sensor information It is intended to the operating status using comprehensive sensor information analytical equipment, to carry out more reliable device diagnostic and prediction.With The sustainable development of sensing technology, using multiple sensors carry out equipment status monitoring, fault diagnosis and prediction have become Development trend.

There are certain relevances between its operating parameter in equipment running process still rarely has at present in failure predication field The work that association rule mining is combined with failure predication.And in fact, for time series data, equipment fault or failure The correlated characteristic extracted often by parameter or from parameter is embodied, and prediction is often to parameter or correlated characteristic Variation tendency predicted.Excavate the correlation rule between parameter, available more complete parameter, that is, equipment running status letter Breath provides certain foundation for subsequent prediction.

Summary of the invention

For the status of the prior art, present invention aim to address rarely have consideration in the Predicting Technique of available data driving Sensing data the case where there are correlation rules, propose a kind of unit exception operating condition prediction side based on operating parameter correlation rule Method constructs more applicable wavelet neural network and carries out unusual service condition prediction (failure predication).

Now design of the invention is described below:

The present invention is portrayed using relevance of the correlation rule to industrial process operating parameter, have studied based on when ordinal number According to the unusual service condition forecasting problem of association rule mining.In order to which association rule are excavated in sequence level for time series data Then, the invention proposes a kind of time series data association rules mining algorithms that process is generated comprising two stages frequent item set. In the first stage, basic model of the variation tendency information of extraction time sequence as association rule mining, discovery time sequence The frequent item set of change shape；In second stage, based on the frequent item set of time series variation form, discovery sequence is base The frequent item set of this mode, and association rule mining has been carried out to sequence two-by-two.Then, the resulting correlation rule phase of excavation is utilized The system variable of pass carries out unusual service condition prediction, and correlation rule is introduced into wavelet neural network and improves forecasting accuracy.This hair The method of bright proposition accounts for operating parameter correlation rule, can obtain more accurate failure predication result.

According to the above inventive concept, the invention proposes a kind of industrial datas that strategy is generated based on two stages frequent item set Association rule mining and prediction technique, the specific steps are as follows:

Step 1: to time series data piece-wise linearization expression and symbolism, construction be suitable for association rule mining from Dissipate type data set；

Step 2: the frequent item set of data set is generated using two stage Frequent Itemsets Mining Algorithm；

Step 3: correlation rule being generated according to frequent item set, extracts the pass for meeting minimum support and minimal confidence threshold Connection rule；

Step 4: association rule mining result being introduced into wavelet neural network, and the unusual service condition for industrial equipment is pre- It surveys.

Based on above scheme, each step can specifically use following implementation:

Preferably, the step 1 includes following sub-step:

Step 1.1: note sensor measurement time series isN is sensor Quantity, k length of time series；Initially fitting starting point isInitially fitting terminal isNote is fitted starting pointBeing fitted terminal isError of fitting threshold value is ω_E；

Step 1.2: for eachPiecewise fitting is carried out as follows:

1.2.1 waypoint count value count=1 is initialized；

1.2.2 successively to each fitting starting pointExecute step 1)-step 4):

1) end=start+h is calculated first；

2) for dataAnd be fitted using least square method, it counts Calculate error of fitting ERR；

3) if error of fitting ERR is not more than error of fitting threshold value ω_E, then 1) h=h+1, gos to step again；

4) if error of fitting ERR is greater than error of fitting threshold value ω_E, obtainLine segment be fitted sequenceStart=start+h records waypointReset h=2, count=count+1；

1.2.3 circulation executes 1.2.2 and terminates greater than k until end, the line segment time series after being fittedAnd waypointThe segmentation point sequence P of compositionⁱ；

Step 1.3: the time series after any sensor is fittedIt is denoted as Y_k={ y₁,y₂,…,y_k, it extracts every and intends The trend and numerical information of zygonema section, and a matching line segment s is indicated by the way of following triple_i:

Wherein, k_iIndicate the slope of the line segment,Indicate the span of the line segment on a timeline, r_iIndicate the segment data Growth rate, data { y corresponding for the line segment_j,y_j+1,…,y_j+h,J is the starting point of the line segment；

To line segment time series Y_kIn all line segments carry out triple expression, obtain triad sequence S_n={ s₁, s₂,…,s_n, wherein n indicates time series X_kLine segment number after segmentation；

Step 1.4: cluster being carried out to a serial of line sections in triad sequence and symbolism is carried out to line segment, is set for indicating Standby or different system version describes line segment s using Euclidean distance_iAnd s_jSimilarity d_ij:

Wherein, d_ijIndicate line segment s_iAnd s_jSimilarity, d_ijIt is smaller, then it represents that two lines section has more like variation shape State, ω_kAnd ω_rFor weight；

Then according to index of similarity d_ij, using K-means clustering algorithm to S_nIt is clustered, and is same class line segment point The variation pattern that operating parameter is indicated with a same symbol, obtains the sequence F of symbolism_n={ f₁,f₂,…,f_n, f₁, f₂,…,f_nRespectively indicate the 1,2nd ..., the symbol that n line segment is assigned to；

Step 1.5: for the time of measuring sequence of every two sensorWithMerge it and is segmented point sequence PⁱAnd P^j, It is denoted as P^ij, n_ij- 1 is PⁱAnd P^jWaypoint number after merging；And by the waypoint after merging to its symbolism sequenceWith It is split reconstruct, the symbolism sequence after being reconstructedWith

Preferably, the step 2 includes following sub-step:

Step 2.1: for time of measuring sequenceWithCorresponding operating parameter VⁱAnd V^j, it is obtained by step 1 Measurement sequence symbolism data beWithTransaction set is constituted by it, I.e. each affairs are denoted as WithIncluded in line segment class code be denoted as respectivelyWithRemember that two stage minimum support threshold value is respectively minsup₁And minsup₂；

Step 2.2: by single sweep operation data set, calculating the support of each single item, obtain frequent 1- item collection, by as follows 2.2.1~2.2.3 process carries out:

2.2.1: note σ () is the support counting of item or item collection, is initially 0；IfClass code be t_k, t expression A or b；

2.2.2: for each affairsCalculate σ (t_k)=σ (t_k)+1；

2.2.3: for each t_kIfNot less than minimum support threshold value minsup₁, then it is assumed that t_kFor frequent 1- Item collection retains t_kAnd record corresponding support counting；IfLess than minimum support threshold value minsup₁, then it is assumed that t_k It is not frequent 1- item collection；

Step 2.3: using the frequent 1- item collection t obtained in step 2.2_k2- item collection is constituted, and calculates its support, to It was found that frequent 2- item collection, carries out according to the following procedure:

2.3.1: note a_pAnd b_qRespectively pass through step 2.2 from former line segment class codeWithThe item of middle reservation；

2.3.2 for each { a_p,b_q, execute following steps:

1) each is present inIn { a_p,b_q, calculate σ ({ a_p,b_q)=σ ({ a_p,b_q})+1

If 2)Not less than minsup₁, then it is assumed that { a_p,b_qIt is frequent 2- item collection, retain { a_p,b_qAnd record Corresponding support counting；

Step 2.4: using the frequent 2- item collection { a obtained in step 2.3_p,b_qCalculate every two operating parameter entirely counting According to the support of concentration, and the frequent item set of parameter level is obtained, carried out according to the following procedure: to every two operating parameter VⁱAnd V^j Item collection { the V of compositionⁱ,V^j, calculate σ ({ Vⁱ,V^j)=sum (σ ({ a_p,b_q)), ifNot less than minimum support threshold value minsup₂, then retain { Vⁱ,V^jAnd corresponding support is recorded, and calculate σ (Vⁱ)=sum (σ (a_p))；σ(V^j)=sum (σ (b_q))。

Preferably, the step 3 includes following sub-step:

Step 3.1: to the every group of { V for meeting support threshold obtained in step 2ⁱ,V^j, generate following correlation rule: V^j →VⁱAnd Vⁱ→V^j, note minimal confidence threshold is minconf；

Step 3.2: according to every group of correlation rule of generation, calculating its confidence threshold value, extract the process of correlation rule such as Under: for each correlation rule Vⁱ→V^j, calculateIf conf (Vⁱ→V^j) be not less than Minimal confidence threshold is minconf, then retains correlation rule Vⁱ→V^jAnd record corresponding support and confidence level ωⁱ。

Preferably, the step 4 includes following sub-step:

Step 4.1: for any group association parameter extracted from correlation rule, being denoted as { V¹,V²,…,V^u, wherein u table Show the quantity of relevant parameter, V^uFor rule it is consequent i.e. target component, for each correlation rule Vⁱ→V^u, i=1,2 ... u- 1, there is a confidence level, is denoted as ωⁱ；For target component V^u, using wavelet neural network, carry out unusual service condition prediction；

Step 4.2: construction training sample: remembering that preset prediction step is l, the one group of association extracted by association rule mining Parameter is { V¹,V²,…,V^u, the complete training dataset being made of it is denoted as Construct following matrix I_trainIt is inputted for the training of neural network:

Wherein, I_trainIn it is each be classified as a trained input sample, construct training output O_trainAre as follows:

Step 4.3: be trained using the training sample of construction to wavelet neural network: input parameter is Vⁱ, i=1, 2 ... u-1, output parameter V^u, wherein utilizing the confidence level ω obtained by correlation rule in netinitⁱ, i=1, The initial weight between network input layer and hidden layer is arranged in 2 ... u-1；

Step 4.4: new data prediction: remembering that threshold value occurs for preset unusual service condition is ω_p, for new collected sensor Measurement data carries out l step prediction using model trained in step 4.3, if obtained target component predicted value is relative to first The normal drift value that begins is more than set threshold value ω_p, then it is assumed that unusual service condition occurs.

Preferably, before equipment does not fail, with the update of data, after every measurement data for updating predetermined quantity, Simultaneously training pattern need to be reconfigured, to obtain more accurate prediction result.

It is proposed by the present invention a kind of the industrial data association rule mining of strategy and pre- to be generated based on two stages frequent item set Survey method can be used for the Complex Industrial Systems of sensor measurement.By being associated rule digging to industrial equipment operating parameter, Corresponding parameter association is obtained, and is introduced into wavelet neural network prediction, more accurate prediction effect can be obtained. This will provide solid support to subsequent plant maintenance plan, for the equipment maintenance and management stringent to reliability requirement It is of great advantage, there are bright prospects in terms of practical implementation.

Detailed description of the invention

Fig. 1 compares for 7 prediction result of IDV (13) variable in embodiment and with true value；

Fig. 2 compares for 11 prediction result of IDV (13) variable in embodiment and with true value；

Fig. 3 is that IDV (13) variable 7 predicts error rate in embodiment；

Fig. 4 is that IDV (13) variable 11 predicts error rate in embodiment.

Specific embodiment

A specific embodiment of the invention is further described now in conjunction with attached drawing.

This example is specifically described concrete operation step by Tennessee-Yi Siman (TE) process simulation data and tests below The effect of card method.

The sampling interval of the data set is 3 minutes, each collected change of sensor under the data set record sampling interval Measurement.Under each service condition (the failure operation state under normal operating condition and 21 kinds of preset failures), imitate The measurement data of true process will all generate two class data sets, i.e. training set and test set.Wherein, for the acquisition of training set Journey is the measured value of all 52 variables obtained in the case of simulation process runs 25 small, wherein except normal operation Outside the training set that state acquisition arrives, the acquisition of remaining 21 training set data is as a child to introduce failure in simulation process operation 1, And only record the measurement data of subsequent 24 hours.In other words, the training set of normal operating condition has 500 observation samples, The training set acquired under remaining 21 malfunction is 480 observation samples.In addition, for 22 test sets, data are Simulation process runs 48 collected all variable measurements of hour institute, that is to say, that includes 960 in each test set Sample data.It should be noted that corresponding failure is at simulation run 8 hours when emulating to 21 kinds of procedure faults It introduces afterwards.Therefore, for the test set under 21 failure operation states, preceding 160 observation samples are normal data, after 800 observation samples are fault data.In TE process simulation model, only IDV (13) is a soft fault, therefore, In this example, we are tested using the related data of IDV (13).Industrial data association rule mining and unusual service condition prediction side Detailed process is as follows for method:

Step 1: to time series data piece-wise linearization expression and symbolism, construction be suitable for association rule mining from Dissipate type data set.This step specifically includes following sub-step:

Step 1.1: note sensor measurement time series isN is sensor Quantity, k length of time series；Initially fitting starting point isInitially fitting terminal isNote is fitted starting pointBeing fitted terminal isError of fitting threshold value is ω_E.It should be noted that in the present invention, i, j are tables as subscript The number for showing sensor is only to indicate ordinal number as subscript, unrelated with sensor number.

Step 1.2: for eachPiecewise fitting is carried out as follows:

1.2.1 waypoint count value count=1 is initialized；

1.2.2 successively to each fitting starting pointExecute step 1)-step 4):

1) end=start+h is calculated first；

4) if error of fitting ERR is greater than error of fitting threshold value ω_E, obtainLine segment be fitted sequenceRecord waypointReset h=2, count=count+1；

1.2.3 circulation executes 1.2.2 and terminates greater than k until end, the line segment time sequence after obtaining least square method fitting ColumnAnd waypointThe segmentation point sequence P of compositionⁱ；

Step 1.3: the time series after any sensor is fittedIt is denoted as Y_k={ y₁,y₂,…,y_k, wherein having more The line segment of the aforementioned least square method fitting of item.The trend and numerical information of every matching line segment are extracted, and uses following ternary The mode of group indicates a matching line segment s_i:

Step 1.4: cluster being carried out to a serial of line sections in triad sequence and symbolism is carried out to line segment, is set for indicating Standby or different system version, to prepare for subsequent association rule mining.Line segment s is described using Euclidean distance_i And s_jSimilarity d_ij:

Step 1.5: for the time of measuring sequence of every two sensorWithMerge it and is segmented point sequence PⁱAnd P^j, It is denoted as P^ij, n_ij- 1 is PⁱAnd P^jWaypoint number after merging；And by the waypoint after merging respectively to its symbolism sequence WithIt is split reconstruct, the symbolism sequence after being reconstructedWith

Step 2: the frequent item set of data set is generated using two stage Frequent Itemsets Mining Algorithm.This step specifically includes Following sub-step:

Step 2.1: for time of measuring sequenceWithCorresponding operating parameter VⁱAnd V^j, it is obtained by step 1 Measurement sequence symbolism data beWithTransaction set is constituted by it, I.e. each affairs are denoted as WithIncluded in line segment class code be denoted as respectivelyWithRemember that two stage minimum support threshold value is respectively minsup₁And minsup₂.? In this example, minimum support threshold value is set are as follows: minsup₁=0.2, minsup₂=0.2.

2.2.2: for each affairsCalculate σ (t_k)=σ (t_k)+1；

2.2.3: for each t_kIfNot less than minimum support threshold value minsup₁, then it is assumed that t_kIt is frequent 1- item collection retains t_kAnd record corresponding support counting；IfLess than minimum support threshold value minsup₁, then it is assumed that t_kIt is not frequent 1- item collection；

2.3.2 for each { a_p,b_q, execute following steps:

1) each is present inIn { a_p,b_q, calculate σ ({ a_p,b_q)=σ ({ a_p,b_q})+1

Step 3: correlation rule being generated according to frequent item set, extracts the pass for meeting minimum support and minimal confidence threshold Connection rule.This step specifically includes following sub-step:

Step 3.1: to the every group of { V for meeting support threshold obtained in step 2ⁱ,V^j, generate following correlation rule: V^j →VⁱAnd Vⁱ→V^j, note minimal confidence threshold is minconf；In this example, in this example, minimal confidence threshold is set are as follows: Minconf=0.7；

This step generates the correlation rule for meeting threshold condition, and extracts part relevant parameter and its confidence value such as table 1 It is shown.As seen from the results in Table 1, this example will use variable 7 and variable 11 to carry out predicted operation as target component.

Step 4: association rule mining result being introduced into wavelet neural network, and the unusual service condition for industrial equipment is pre- It surveys.This step specifically includes following sub-step:

Step 4.2: construction training sample: remembering that preset prediction step is l, in this example, prediction step is set as 10.By The group association parameter that association rule mining extracts is { V¹,V²,…,V^u, the complete training dataset being made of it is denoted asConstruct following matrix I_trainIt is inputted for the training of neural network:

Specifically, training set herein not merely uses the fault data of IDV (13) correlated variables, while also using Data under correlated variables normal operating condition.

Step 4.3: be trained using the training sample of construction to wavelet neural network: input parameter is Vⁱ, i=1, 2 ... u-1, output parameter V^u, wherein utilizing the confidence level ω obtained by correlation rule in netinitⁱ, i=1, The initial weight between network input layer and hidden layer is arranged in 2 ... u-1.In this example, for variable 7, input layer is 4 sections Point, hidden layer are 8 nodes；For variable 11, input layer is 3 nodes, and hidden layer is 6 nodes, the output of two variables Layer is 1 node, wherein the wavelet basis function used is Morlet morther wavelet basic function, and with the related confidence in table 1 Initialization weight of the angle value as neural network input layer and hidden layer；

Step 4.4: new data prediction: remembering that threshold value occurs for preset unusual service condition is ω_p, for new collected sensor Measurement data carries out l step prediction using model trained in step 4.3, if obtained target component predicted value is relative to first The normal drift value that begins is more than set threshold value ω_p, then it is assumed that unusual service condition occurs.Before equipment does not fail, with data It updates, every update predetermined quantity N^lMeasurement data after, need to reconfigure and training pattern, to obtain more accurately predicting knot Fruit, wherein N^lDepending on sensor sample frequency and actual industrial field demand.This example utilizes test set (totally 960 samplings Point) preceding 300 data to verify prediction effect, and neural network is updated according to every 10 data.It is arranged herein different The threshold value that (i.e. its normal value certain percentage of parameter drift-out) occurs for normal operating condition (failure) is ω_p=0.015.

1 correlation rule of table

Regular preceding paragraph	Rule is consequent	Confidence level
			Variable 13	Variable 7	0.7527
Variable 16	Variable 7	0.7446
			Variable 36	Variable 7	0.7017
Variable 35	Variable 11	0.7513
			Variable 36	Variable 11	0.7390

Table 2 always predicts error rate

	Introduce correlation rule	It is not introduced into correlation rule
			Variable 7	1.0482	1.8548
Variable 11	0.8536	1.2135

Fig. 1 and Fig. 2 is 13 prediction result of variable 7 and variable, and the advantage of correlation rule is introduced for verifying, and this example ties prediction Fruit is compared with the neural network prediction result being not introduced under conditions of correlation rule.In fig. 1 and 2, perpendicular solid line is The actual unusual service condition time of origin under our threshold value setting, erect dotted line and perpendicular chain-dotted line be respectively introduce correlation rule and It is not introduced into the predicted value of abnormal time of origin under the premise of correlation rule.By Fig. 1 and Fig. 2 it is found that the method institute that the present invention is mentioned Obtained prediction result can preferably approaching to reality value obtain very well especially in the prediction of first half test data Prediction result, this is because first half is the operation data under normal condition, training set is more complete and numerical value is relatively concentrated. In the prediction of out-of-service time, the method that is mentioned of the present invention has also obtained preferable prediction result, in Fig. 1, predicted value and true Real value is compared to 8 sampled points have been lagged, and in Fig. 2, predicted value has then lagged 5 sampled points compared with true value.Be not introduced into The prediction result of correlation rule is compared, and the mentioned method of the present invention obviously obtains more accurate prediction result.Variable 7 and variable 11 Prediction error rate calculated result it is as shown in Figure 3 and Figure 4.Meanwhile being further quantized result, this example also calculates total prediction and misses Rate, as shown in table 2.From the point of view of integrally predicting error, the introducing of correlation rule significantly reduces the prediction error of neural network, This point has also obtained good embodiment in the data that chart 2 is presented.

Claims

1. a kind of industrial data association rule mining and unusual service condition prediction technique that strategy is generated based on two stages frequent item set, It is characterized in that, specific step is as follows:

Step 1: to time series data piece-wise linearization expression and symbolism, construction is suitable for the discrete type of association rule mining Data set；

Step 3: correlation rule being generated according to frequent item set, extracts the association rule for meeting minimum support and minimal confidence threshold Then；

Step 4: association rule mining result being introduced into wavelet neural network, and is predicted for the unusual service condition of industrial equipment.

2. a kind of industrial data association rule mining for generating strategy based on two stages frequent item set according to claim 1 And unusual service condition prediction technique, it is characterised in that the step 1 includes following sub-step:

Step 1.1: note sensor measurement time series isN is number of sensors, K length of time series；Initially fitting starting point isInitially fitting terminal isNote is fitted starting pointIt is quasi- Closing terminal isError of fitting threshold value is ω_E；

Step 1.2: for eachPiecewise fitting is carried out as follows:

1.2.1 waypoint count value count=1 is initialized；

1.2.2 successively to each fitting starting pointExecute step 1)-step 4):

1) end=start+h is calculated first；

2) for dataAnd be fitted using least square method, it calculates quasi- Close error E RR；

4) if error of fitting ERR is greater than error of fitting threshold value ω_E, obtainLine segment be fitted sequence Start=start+h records waypointReset h=2, count=count+1；

Step 1.3: the time series after any sensor is fittedIt is denoted as Y_k={ y₁,y₂,…,y_k, extract every fit line The trend and numerical information of section, and a matching line segment s is indicated by the way of following triple_i:

Wherein, k_iIndicate the slope of the line segment,Indicate the span of the line segment on a timeline, r_iIndicate the growth of the segment data Rate, data { y corresponding for the line segment_j,y_j+1,…,y_j+h,J is the starting point of the line segment；

To line segment time series Y_kIn all line segments carry out triple expression, obtain triad sequence S_n={ s₁,s₂,…, s_n, wherein n indicates time series X_kLine segment number after segmentation；

Step 1.4: to a serial of line sections in triad sequence carry out cluster and to line segment carry out symbolism, for indicate equipment or The different version of system describes line segment s using Euclidean distance_iAnd s_jSimilarity d_ij:

Wherein, d_ijIndicate line segment s_iAnd s_jSimilarity, d_ijIt is smaller, then it represents that two lines section has more like change shape, ω_k And ω_rFor weight；

Then according to index of similarity d_ij, using K-means clustering algorithm to S_nIt is clustered, and is same class line segment distribution one A the same symbol obtains the sequence F of symbolism to indicate the variation pattern of operating parameter_n={ f₁,f₂,…,f_n, f₁,f₂,…, f_nRespectively indicate the 1,2nd ..., the symbol that n line segment is assigned to；

Step 1.5: for the time of measuring sequence of every two sensorWithMerge it and is segmented point sequence PⁱAnd P^j, it is denoted as P^ij, n_ij- 1 is PⁱAnd P^jWaypoint number after merging；And by the waypoint after merging to its symbolism sequenceWithIt carries out Segmentation reconstruct, the symbolism sequence after being reconstructedWith

3. a kind of industrial data association rule mining for generating strategy based on two stages frequent item set according to claim 2 And unusual service condition prediction technique, it is characterised in that the step 2 includes following sub-step:

Step 2.1: for time of measuring sequenceWithCorresponding operating parameter VⁱAnd V^j, it is obtained by step 1 and measures sequence The symbolism data of column areWithTransaction set is made of it, i.e., it is each Affairs are denoted as WithIncluded in line segment class code be denoted as respectivelyWithRemember that two stage minimum support threshold value is respectively minsup₁And minsup₂；

Step 2.2: by single sweep operation data set, calculating the support of each single item, frequent 1- item collection is obtained, by following 2.2.1 ~2.2.3 process carries out:

2.2.2: for each affairsCalculate σ (t_k)=σ (t_k)+1；

2.2.3: for each t_kIfNot less than minimum support threshold value minsup₁, then it is assumed that t_kIt is 1- frequent Collection retains t_kAnd record corresponding support counting；IfLess than minimum support threshold value minsup₁, then it is assumed that t_kIt is not Frequent 1- item collection；

Step 2.3: using the frequent 1- item collection t obtained in step 2.2_k2- item collection is constituted, and calculates its support, to find Frequent 2- item collection, carries out according to the following procedure:

2.3.1: note a_pAnd b_qRespectively pass through step 2.2 from former line segment class codeWith The item of middle reservation；

2.3.2 for each { a_p,b_q, execute following steps:

1) each is present inIn { a_p,b_q, calculate σ ({ a_p,b_q)=σ ({ a_p,b_q})+1

Step 2.4: using the frequent 2- item collection { a obtained in step 2.3_p,b_qEvery two operating parameter is calculated in entire data set In support, and obtain the frequent item set of parameter level, carry out according to the following procedure: to every two operating parameter VⁱAnd V^jIt constitutes Item collection { Vⁱ,V^j, calculate σ ({ Vⁱ,V^j)=sum (σ ({ a_p,b_q)), ifNot less than minimum support threshold value minsup₂, then retain { Vⁱ,V^jAnd corresponding support is recorded, and calculate σ (Vⁱ)=sum (σ (a_p))；σ(V^j)=sum (σ (b_q))。

4. a kind of industrial data association rule mining for generating strategy based on two stages frequent item set according to claim 3 And unusual service condition prediction technique, it is characterised in that the step 3 includes following sub-step:

Step 3.1: to the every group of { V for meeting support threshold obtained in step 2ⁱ,V^j, generate following correlation rule: V^j→Vⁱ And Vⁱ→V^j, note minimal confidence threshold is minconf；

Step 3.2: according to every group of correlation rule of generation, calculating its confidence threshold value, the process for extracting correlation rule is as follows: right In each correlation rule Vⁱ→V^j, calculateIf conf (Vⁱ→V^j) set not less than minimum Confidence threshold is minconf, then retains correlation rule Vⁱ→V^jAnd record corresponding support and confidence level ωⁱ。

5. a kind of industrial data association rule mining for generating strategy based on two stages frequent item set according to claim 4 And unusual service condition prediction technique, it is characterised in that the step 4 includes following sub-step:

Step 4.1: for any group association parameter extracted from correlation rule, being denoted as { V¹,V²,…,V^u, wherein u indicates to close Join the quantity of parameter, V^uFor rule it is consequent i.e. target component, for each correlation rule Vⁱ→V^u, i=1,2 ... u-1 have One confidence level, is denoted as ωⁱ；For target component V^u, using wavelet neural network, carry out unusual service condition prediction；

Step 4.2: construction training sample: remembering that preset prediction step is l, the group association parameter extracted by association rule mining For { V¹,V²,…,V^u, the complete training dataset being made of it is denoted asConstruction Following matrix I_trainIt is inputted for the training of neural network:

Step 4.3: be trained using the training sample of construction to wavelet neural network: input parameter is Vⁱ, i=1,2 ... u- 1, output parameter V^u, wherein utilizing the confidence level ω obtained by correlation rule in netinitⁱ, i=1,2 ... u-1, Initial weight between network input layer and hidden layer is set；

Step 4.4: new data prediction: remembering that threshold value occurs for preset unusual service condition is ω_p, for new collected sensor measurement Data carry out l step prediction using model trained in step 4.3, if obtained target component predicted value is relative to initially just Normal drift value is more than set threshold value ω_p, then it is assumed that unusual service condition occurs.

6. a kind of industrial data association rule mining for generating strategy based on two stages frequent item set according to claim 1 And unusual service condition prediction technique, it is characterised in that before equipment does not fail, with the update of data, every update predetermined quantity After measurement data, simultaneously training pattern need to be reconfigured, to obtain more accurate prediction result.