CN112257917A

CN112257917A - Time series abnormal mode detection method based on entropy characteristics and neural network

Info

Publication number: CN112257917A
Application number: CN202011116876.4A
Authority: CN
Inventors: 苏维均; 牛雨晴; 于重重; 赵霞; 韩璐
Original assignee: Beijing Technology and Business University
Current assignee: Beijing Technology and Business University
Priority date: 2020-10-19
Filing date: 2020-10-19
Publication date: 2021-01-22
Anticipated expiration: 2040-10-19
Also published as: CN112257917B

Abstract

The invention provides a time series abnormal pattern detection method based on entropy characteristics and a neural network, which comprises the following steps: 1) extracting a second-order difference ratio sample entropy characteristic sequence from the time sequence in the training data set; 2) training a generated confrontation network model to obtain a generator and a corresponding discriminator; 3) calculating the abnormal score of the characteristic sequence and constructing a threshold value; 4) and carrying out abnormity judgment on the input data to be detected according to the threshold value. The method has the advantages that the time series data are subjected to feature extraction by utilizing the difference rate sample entropy, so that the abnormal mode is more obvious; a new abnormal score calculation method is established, and the accuracy and the generalization of model identification are improved, so that the method has higher practicability and application value.

Description

Time series abnormal mode detection method based on entropy characteristics and neural network

Technical Field

The invention relates to prediction of coal mine thermodynamic composite disasters, in particular to a time series abnormal mode detection method based on entropy characteristics and a neural network, and belongs to the field of emergency safety.

Background

Coal as a main energy source occupies an irreplaceable important position in the energy structure of China, the area left after coal mining is a goaf, ventilation in the goaf is poor, more coal is left, and combustible gas is generated by continuous oxidation, so that coal spontaneous combustion, gas explosion and other coal mine thermal power disasters are easily caused. The concentration change of the released combustible gas shows a certain rule along with the development of time, the inflection points of monitoring data in different stages are effectively detected, and when the gas concentration is greatly changed, the gas concentration can be considered to enter an abnormal mode, so that the possibility of disasters such as coal spontaneous combustion and the like is shown. The gas generation content of different coal mines is different, and if the gas content value is only used as the judgment standard of disaster occurrence, great errors can be caused when the method is applied to other coal mines, so that the detection of the abnormal mode can improve the generalization of disaster judgment, and a new thought is provided for the detection of the coal composite disaster.

With the research and the deepening of the artificial intelligence theory by people, the prediction of coal and gas by applying the time sequence prediction method becomes a new trend, people introduce the prediction into the quantitative evaluation and analysis of coal and gas disasters and integrate the theory of computer technology, support vector machines, artificial neural networks and the like for research, but the prediction methods are difficult to apply to complex data, have the problems of easy falling into local minimum values, appear an overfitting phenomenon, have low accuracy and are large in limitation.

With the improvement of information technology, the problem of abnormality detection in time series has become a research focus in recent years. Time series anomalies generally refer to a series of data that is significantly different from other data, and such anomalies do not refer to random bias, but rather to differences due to different mechanisms. The abnormal mode of the gas time sequence data is detected, and a theoretical basis can be provided for the coal mine thermal power disaster. If the time series data has the abnormal mode, the change trend of the data is greatly changed, and the abnormal mode can be used as a judgment basis for disaster occurrence.

In the conventional method (CN201910809956.9), GAN is used for anomaly detection of a time sequence, which mainly uses an optimized GAN generator and a discriminator to build an anomaly detection model, and uses a generated residual and an identification loss output by the model as a judgment basis for judging abnormal data. But most of time series do not change significantly, and the time series are directly used as input data of the GAN, so that the characteristics are not significant enough; meanwhile, a more effective judgment criterion is obtained by using the generated residual error and the identification loss output by the model, and how to improve the accuracy and universality of the abnormity judgment is yet to be researched.

Disclosure of Invention

The invention aims to realize a time series abnormal mode detection method based on entropy characteristics and a neural network. The method of the invention is divided into 4 stages: extracting a second-order difference ratio sample entropy characteristic sequence from the time sequence in the training data set; training a generated confrontation network model to obtain a generator and a corresponding discriminator; calculating the abnormal score of the characteristic sequence and constructing a threshold value; and carrying out abnormity judgment on the input data to be detected according to the threshold value. Specifically, the method of the present invention comprises the steps of:

A. the method specifically comprises the following steps of extracting a second-order difference ratio sample entropy characteristic sequence from a time sequence in a training data set:

A1. dividing a training data set into two sets which are respectively marked as a training data set 1 and a training data set 2;

all the training data set 1 is normal data, and the training data set 2 comprises normal data and abnormal data;

A2. for training data set 1 time series

Carrying out segmentation by using the formula 1 according to the window size W and the step length d in a sliding manner to obtain a sequence segment set W with the length of L, wherein the ith time sequence segment is recorded as s_i；

s_i＝[x_1+(i-1)d,x_2+(i-1)d,…,x_1+(i-1)d+w](formula 1)

SaidT_train1 × T representing the number of time series of training data sets_trainRepresenting a training data set time series dimension;

A3. performing difference ratio operation on each sequence segment in the sequence segment set W to obtain a second-order difference ratio sequence of all the sequence segments, wherein the specific implementation is as follows:

A3.1. for sequence segment s_iCalculating a second order difference ratio sequence G ═ G using equation 2₁,g₂,…,g_w′Solving the standard deviation std;

said

Is the e-order difference value of the u time point,

is the e-order difference value of the u-1 time point;

A3.2. dividing a second-order difference rate sequence with w 'data points by taking m time sequence data points as a subsequence, totaling w' -m +1 subsequence segments, and marking as K2_i＝{q₁,q₂,…,q_w′-m+1}；

A4. Carrying out sample entropy feature extraction on the second-order difference rate sequences of all the sequence segments to obtain the second-order difference rate sample entropy feature sequences of all the sequence segments, and concretely realizing the following steps:

A4.1. calculating any two subsequence fragments q_aAnd q is_bA distance D [ q ] between_a,q_b]The distance is determined by the maximum difference of the corresponding position elements in the two subsequence segments;

A4.2. calculating the subsequence fragment q_aObtaining the similarity probability of the subsequences with the distance between subsequences smaller than the threshold value by formula 3, and obtaining the average similarity probability of the second-order difference rate sequence by formula 4;

r is a similarity threshold;

A4.3. according to the steps A4.1-A4.2, the average similarity probability B is recalculated by taking m +1 as the subsequence length^m+1(r) obtaining a second-order difference ratio sample entropy feature SE by formula 5;

A5. carrying out sectional average preprocessing on the difference rate sample entropy sequence to obtain the difference rate sample entropy sequence, and concretely realizing the following steps:

A5.1. from X_t(t-1, 2.. t-w), and extracting a sequence segment S with the length w_t＝{X_t,X_t+1,...,X_w+t-1}^1×tSumming according to formula 6, and then averaging according to formula 7;

sum_t＝X_t+X_t+1...X_w+t-1(formula 6)

sum_t＝sum_tW; (formula 7)

A5.2. Repeating the step A4.1, taking out t-w sequence segments in total, and adding sum_tForming a new entropy sequence S of difference ratio samples_t'＝{sum₁,sum₂,…,sum_t-w}^1×t；

B. Training a generated confrontation network model to obtain a generator and a corresponding discriminator, and the specific implementation is as follows:

B1. randomly sampled noisy data Z ═ Z_iI is 1,2, …, n, where n corresponds to the number of samples. The generator model G is a plurality of LSTM memory units, the number of the memory units is set, Z is input into the generator model G, and reconstructed sample sequence data G (Z) is generated;

B2. entropy sequencing of new difference ratio samplesS_t' inputting the generated reconstructed sample sequence data G (Z) into a built discriminator model D;

B3. updating the model parameters by using a random gradient descent algorithm according to the value of the loss function, updating the parameters of the discriminator, and then updating the parameters of the generator according to the noise data by using an Adam optimization algorithm;

B4. saving the model parameters, repeating the steps B1-B3 to carry out loop iteration, and finally obtaining a trained generator model G capable of generating a normal time sequence and a corresponding discriminator model D;

C. calculating the abnormal score of the characteristic sequence and constructing a threshold, wherein the method is specifically realized as follows:

C1. using time series in training data set 2

Repeating the steps A2-A5, and extracting the features to obtain a new feature sequence

C2. Randomly sampling noise data Z_valInputting the data into a generator G which is completed in training, and generating a reconstructed sample G (Z)_val) Calculating the abnormal score R of the input sample by using the generated error_scoreThe method is concretely realized as follows:

C2.1. for the reconstructed sample G (Z) with the length of n_val) New signature sequence with training data set 2

The elements in the absolute error E are sorted from small to large to obtain the sorted absolute error E_i′＝{e′₁,e′₂,…,e′_nGet the absolute error E after sorting_i′＝{e′₁,e′₂,…,e′_nMean value M of;

C2.2. e'_iComparing the extracted elements with the average value M, and taking out E'_iMiddle { e'_k,e′_k+1,…,e′_nAre data elements greater than the mean M, the number beingn-k + 1; initializing weight sequence W_i′＝{w′₁,w′₂,…,w′_n}^T,w′_1～n-2X 'is provided'_nCorresponding weight w'_nIs lambda, x'_n-1Corresponding weight w'_n-1Is 1-lambda, the weight sequence W is updated_i' size of element in, W is represented by formula 8_i' update is performed;

C2.3. using the updated weight sequence W_i'and sequenced sample E'_iCalculating the generation abnormality score R of the training sample set 2 by equation 9_score；

R_score＝E_i′·W_i' (formula 9)

C3. Outputting a generated sample and a new characteristic sequence by using the discriminator D trained in the step B

The similarity probability P of (2), calculating the discriminant anomaly score D_scoreIs 1-P;

C4. using discriminant anomaly score D_scoreAnd generating an anomaly score R_scoreThe anomaly score O is calculated by equation 10, and a threshold is established according to the training data set 2, specifically implemented as follows:

O＝W_D×D_score+W_G×R_score(formula 10)

W is_DAnd W_GGenerating weights of the abnormal scores for the discrimination abnormal scores and the samples respectively;

C4.1. will train the data set

The maximum abnormal score and the minimum abnormal score in the result are used as the maximum boundary and the minimum boundary, the maximum abnormal score and the minimum abnormal score are divided averagely, and the abnormal score of the q-th training data set 2 is calculated through an equation 11;

C4.2. the abnormal score corresponding to the maximum F1 score is used as a threshold, and the calculation mode of F1 is as shown in formula 12;

the Pre is the proportion of the positive sample predicted to be positive in all the positive samples, and the Rec is the proportion of the positive sample predicted to be positive in all the positive samples; TP is the positive sample predicted to be positive by the model; FP is a negative sample predicted to be positive by the model; FN is the positive sample predicted as negative by the model;

D. the method specifically comprises the following steps of judging the abnormity of input data to be detected according to a threshold value:

D1. inputting a time series of data sets to be detected

Repeating the steps A1-A5, and extracting the entropy characteristics of the difference rate samples to obtain a new time sequence

D2. Repeating steps C1-C4 to obtain

Inputting the data into a trained generation countermeasure network, and calculating the abnormal score O of the data to be detected by using the formula 10_real；

D3. Abnormality score O obtained by calculation_realAnd C, comparing the data to be detected with the threshold value obtained by calculation in the step C, if the abnormal score is larger than the threshold value, judging that the data to be detected contains an abnormal mode, otherwise, judging that the data to be detected does not contain the abnormal mode.

The method has the advantages that the time series data are subjected to feature extraction by utilizing the difference rate sample entropy, so that the abnormal mode is more obvious; a new abnormal score calculation method is established, the accuracy and the generalization of time series abnormal pattern detection are improved, and the method has higher practicability and application value.

Drawings

FIG. 1: general flow chart of abnormal pattern detection

Detailed Description

The present invention will be further described below as an example by performing CO time series prediction on experimental data and performing a description of a time series abnormal pattern detection method based on a difference ratio entropy characteristic and generation of a countermeasure network according to a time series data amount, an input-output dimension, and the like with reference to the accompanying drawings.

The general flow chart of the method is shown in figure 1. The method comprises the following steps: 1) extracting a second-order difference ratio sample entropy characteristic sequence from the time sequence in the training data set; 2) training a generated confrontation network model to obtain a generator and a corresponding discriminator; 3) calculating the abnormal score of the characteristic sequence and constructing a threshold value; 4) and carrying out abnormity judgment on the input data to be detected according to the threshold value. The invention is further described below by way of example according to the following steps:

A1. selecting experimental data, wherein a research object is a CO gas concentration one-dimensional time sequence, selecting a training data set, dividing the training data set into two sets which are respectively marked as a training data set 1 and a training data set 2;

A2. setting the sliding window size of the sequence segment to be 10 and the step length to be 1 for the training data set 1 which is all normal data to slide for segmentation;

A3. performing difference rate operation on each sequence segment in the sequence segment set to obtain a second-order difference rate sequence of all the sequence segments, wherein the specific implementation is as follows:

A3.1. the CO gas concentration sequence is totally 348 data, and the formula is utilized

The second order difference ratio series was obtained to have 345 parts of data, G ═ G, as shown in table 2₁,g₂,…,g_w′And find the standard deviation std to be 0.11, and the partial data are as follows:

A3.2. dividing a second-order difference rate sequence with 345 data points by taking 6 time sequence data points as a subsegment, and counting 340 subsequences which are marked as K2_i＝{q₁,q₂,…,q_w′-m+1Part of the data are as follows:

A4. carrying out sample entropy feature extraction on the second-order difference rate sequences of all the sequence segments to obtain second-order difference rate sample entropy feature sequences of all the sequence segments, and concretely realizing the following steps;

A4.1. calculating the second-order difference ratio sample entropy characteristics of each sequence segment to finally obtain a complete second-order difference ratio sample entropy sequence, wherein partial data are as follows:

A5.1. from X_t(t 1,2.. t-w), and taking out the sequence with the length wColumn segment S_t＝{X_t,X_t+1,...,X_w+t-1}^1×tSumming and then averaging;

A5.2. repeating the step A4.1, taking out t-w sequence segments in total, and adding sum_tConstitute a new sequence S_t'＝{sum₁,sum₂,…,sum_t-w}^1×tPart of the data are as follows:

B1. randomly sampled noisy data Z ═ Z_iI ═ 1,2, …, n }, where n is 330. The generator model is a plurality of LSTM memory units, the number of the memory units is set, Z is input into the built generator model, and reconstructed sample sequence data G (Z) is generated;

B2. entropy S of new difference ratio samples_t' and the generated reconstructed sample sequence data G (Z) are input into a built discriminator model D, and partial parameter data are as follows:

B4. saving the model parameters, returning to B2 for 1000 times of loop iteration, setting the learning rate to be 0.1, and finally obtaining a trained generator model G and a discriminant model D;

C1. the steps A2-A5 are first repeated for a time series of training data set 2 containing normal data and abnormal data

Extracting features to obtain new feature sequence

Part of the data are as follows:

C2. using discriminant anomaly score D_scoreAnd sample generation anomaly score R_scoreCalculating an anomaly score O;

C2.1. will train the data set

The maximum abnormal score and the minimum abnormal score in the result are used as the maximum and minimum boundaries, and the maximum abnormal score and the minimum abnormal score are averagely divided to obtain the abnormal score of the training data set 2 of the q-th section

C2.2. The maximum F1 score is 0.8916, and the corresponding abnormal score O is used as a threshold value, so that the threshold value is 0.375;

D1. inputting time series samples of a data set to be detected

Repeating the steps A2-A5, and extracting the entropy characteristics of the difference rate samples to obtain a new time sequence

Part of the data are as follows:

D2. repeating steps C1-C4 to obtain

Inputting the abnormal score O into a trained generation countermeasure network, and calculating the abnormal score O of an actual data sample_realIs 0.572;

D3. abnormality score O obtained by calculation_realAnd C, comparing the abnormal score with the threshold value calculated in the step C, if the abnormal score is larger than the threshold value, judging that the sample is an abnormal sample, and actually processing the whole sample as follows:

the method realizes a time series abnormal mode detection method based on the difference rate entropy characteristics and generation of the countermeasure network, and can detect whether the sequence section contains an abnormal mode, thereby achieving the purpose of providing judgment basis for the occurrence of coal mine thermal dynamic disasters; a new abnormal score calculation method is established, so that the accuracy and the generalization of model identification are improved, and the method has higher application value.

Finally, it is noted that the disclosed embodiments are intended to aid in further understanding of the invention, but those skilled in the art will appreciate that: various substitutions and modifications are possible without departing from the spirit and scope of the invention and the appended claims. Therefore, the invention should not be limited to the embodiments disclosed, but the scope of the invention is defined by the appended claims.

Claims

1. A time series abnormal pattern detection method based on entropy characteristics and a neural network comprises the following steps:

A2. for training data set 1 time series

The window size W and the step length d are used for sliding segmentation to obtain a sequence segment set W with the length of L, wherein the ith time sequence segment is marked as s_iThe calculation formula is as follows:

s_i＝[x_1+(i-1)d,x_2+(i-1)d,…,x_1+(i-1)d+w]

the T is_train1 × T representing the number of time series of training data sets_trainRepresenting a training data set time series dimension;

A3. performing difference rate operation on each sequence segment in the sequence segment set W to obtain a second-order difference rate sequence of all the sequence segments;

A4. carrying out sample entropy feature extraction on the second-order difference rate sequences of all the sequence segments to obtain second-order difference rate sample entropy feature sequences of all the sequence segments;

A5. carrying out sectional average pretreatment on the difference rate sample entropy sequence to obtain a difference rate sample entropy sequence;

B2. entropy sequencing S of new difference ratio samples_t' inputting the generated reconstructed sample sequence data G (Z) into a built discriminator model D;

C1. using time series in training data set 2

C2. Randomly sampling noise data Z_valInputting the data into a generator G which is completed in training, and generating a reconstructed sample G (Z)_val) Calculating the abnormal score R of the input sample by using the generated error_score；

C4. using discriminant anomaly score D_scoreAnd generating an anomaly score R_scoreCalculating an abnormal score O, and establishing a threshold value according to the training data set 2, wherein the calculation formula is as follows:

O＝W_D×D_score+W_G×R_score

D1. inputting a time series of data sets to be detected

D2. Repeating steps C1-C4 to obtain

2. The method for detecting the abnormal pattern of the time series based on the entropy features and the neural network as claimed in claim 1, wherein the difference ratio operation is performed on each sequence segment in the sequence segment set W to obtain the second order difference ratio sequence of all the sequence segments, and the method is implemented as follows:

A3.1. for sequence segment s_iCalculating the second order difference rate sequence G ═ G₁,g₂,…,g_w′And solving the standard deviation std thereof, wherein the calculation formula is as follows:

said

Is the e-order difference value of the u time point,

is the e-order difference value of the u-1 time point;

A3.2. dividing m time sequence data points into w' dataThe second order difference ratio sequence of points, totaling w' -m +1 subsequences fragments, is denoted as K2_i＝{q₁,q₂,…,q_w′-m+1}。

3. The method for detecting the abnormal pattern of the time series based on the entropy characteristics and the neural network as claimed in claim 1, wherein the sample entropy characteristics are extracted from the second order difference ratio sequences of all the sequence segments to obtain the second order difference ratio sample entropy characteristic sequences of all the sequence segments, and the specific implementation steps are as follows:

A4.2. calculating the subsequence fragment q_aProbability of similarity to the remainder of the subsequence fragment. Using the occupation ratio of the subsequence segments with the distance between the subsequence segments smaller than a threshold value and the average similarity probability of the second-order difference rate sequence as the second-order difference rate sample entropy, wherein the calculation formula is as follows:

r is a similarity threshold;

A4.3. according to the steps A4.1-A4.2, the average similarity probability B is recalculated by taking m +1 as the subsequence length^m+1(r), the second order difference ratio sample entropy characteristic SE, the calculation mode is as follows:

4. the method for detecting the abnormal pattern of the time series based on the entropy characteristics and the neural network as claimed in claim 1, wherein the difference ratio sample entropy sequence is subjected to the segmented average preprocessing to obtain the difference ratio sample entropy sequence, and the method is specifically realized as follows:

A5.1. from X_t(t-1, 2.. t-w), and extracting a sequence segment S with the length w_t＝{X_t,X_t+1,...,X_w+t-1}^1×tSumming and then averaging, wherein the calculation formula is as follows:

sum_t＝X_t+X_t+1...X_w+t-1

sum_t＝sum_t/w；

A5.2. repeating the step A4.1, taking out t-w sequence segments in total, and adding sum_tForming a new entropy sequence S of difference ratio samples_t'＝{sum₁,sum₂,…,sum_t-w}^1×t。

5. The entropy feature and neural network-based time series abnormal pattern detection method of claim 1, wherein the noise data Z is randomly sampled_valInputting the data into a generator G which is completed in training, and generating a reconstructed sample G (Z)_val) Calculating the abnormal score R of the input sample by using the generated error_scoreThe method is concretely realized as follows:

The elements in the absolute error E are sorted from small to large to obtain the sorted absolute error E_i′＝{e′₁,e′₂,…,e′_nGet absolute error E 'after sorting'_i＝{e′₁,e′₂,…,e′_nMean value M of;

C2.2. e'_iComparing the extracted elements with the average value M, and taking out E'_iMiddle { e'_k,e′_k+1,…,e′_n-data elements larger than the mean M, number n-k + 1; initializing weight sequence W_i′＝{w′₁,w′₂,…,w′_n}^T,w′_1～n-2X 'is provided'_nCorresponding weight w'_nIs lambda, x'_n-1Corresponding weight w'_n-1Is 1-lambda, the weight sequence W is updated_i' the size of the middle element, the calculation formula is:

C2.3. using the updated weight sequence W_i'and sequenced sample E'_iCalculating the abnormal score R of training sample set 2_scoreThe calculation formula is as follows:

R_score＝E_i′·W_i′。

6. the entropy feature and neural network-based time series abnormal pattern detection method of claim 1, wherein a discriminant abnormality score D is used_scoreAnd generating an anomaly score R_scoreThe anomaly score O is calculated by equation 10, and a threshold is established according to the training data set 2, specifically implemented as follows:

C4.1. will train the data set

The maximum abnormal score and the minimum abnormal score in the result are used as the maximum boundary and the minimum boundary, the maximum abnormal score and the minimum abnormal score are averagely divided, and the abnormal score of the q-th training data set 2 is calculated, wherein the calculation formula is as follows:

C4.2. the abnormal score corresponding to the maximum F1 score is used as a threshold, and the calculation formula of F1 is as follows:

the Pre is the proportion of positive samples predicted to be positive in all the positive samples predicted to be positive; rec is the proportion of positive samples predicted to be positive among all positive samples. TP is the positive sample predicted to be positive by the model; FP is a negative sample predicted to be positive by the model; FN is the positive sample that is predicted to be negative by the model.