CN116975755A - Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system - Google Patents


Info

Publication number
CN116975755A
CN116975755A
Authority
CN
China
Prior art keywords
sample
classifier
complementary
noise
robust base
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310656040.0A
Other languages
Chinese (zh)
Inventor
叶建君
沈晓兵
车斌
滕昊真
孙健
张铁汉
赵春晖
杨佳阳
陈旭
宋鹏宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiaoshan Power Plant Of Zhejiang Zhengneng Electric Power Co ltd
Zhejiang University ZJU
Original Assignee
Xiaoshan Power Plant Of Zhejiang Zhengneng Electric Power Co ltd
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiaoshan Power Plant Of Zhejiang Zhengneng Electric Power Co ltd, Zhejiang University ZJU filed Critical Xiaoshan Power Plant Of Zhejiang Zhengneng Electric Power Co ltd
Priority to CN202310656040.0A priority Critical patent/CN116975755A/en
Publication of CN116975755A publication Critical patent/CN116975755A/en
Pending legal-status Critical Current


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2431Multiple classes
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01DMEASURING NOT SPECIALLY ADAPTED FOR A SPECIFIC VARIABLE; ARRANGEMENTS FOR MEASURING TWO OR MORE VARIABLES NOT COVERED IN A SINGLE OTHER SUBCLASS; TARIFF METERING APPARATUS; MEASURING OR TESTING NOT OTHERWISE PROVIDED FOR
    • G01D21/00Measuring or testing not otherwise provided for
    • G01D21/02Measuring two or more variables by means not covered by a single other subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/10Pre-processing; Data cleansing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a stepwise robust semi-supervised gas turbine anomaly detection method and system oriented to noise label scenarios. The invention designs an arctangent support vector machine model and an average absolute error neural network model based on symmetric loss theory to construct a complementary robust base classifier, which is strongly robust to noise labels; secondly, a complementary noise filtering mechanism is provided, which filters noise labels in a stepwise manner and updates the complementary robust base classifier, further reducing the risk of overfitting to the noise labels; in addition, the invention also designs an adaptive data supplementing strategy, which overcomes the scarcity of gas turbine data labels by using high-confidence pseudo-label samples to expand the labeled data set. The invention can effectively complete the task of detecting anomalies in gas turbine operation when sample labels are scarce and label noise exists, and provides a new method for anomaly detection and safe operation and maintenance of gas turbine generator sets.

Description

Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system
Technical Field
The invention discloses a stepwise robust semi-supervised gas turbine anomaly detection method and system oriented to noise label scenarios. The invention belongs to the field of industrial anomaly detection, and particularly relates to anomaly detection on industrial process data with scarce labels and label noise.
Background
The gas turbine generator set is a typical piece of large-scale industrial equipment, characterized by its huge scale, the difficulty of manual inspection, and frequently changing operating loads. As modern industry accelerates toward high-end, complex and intelligent production, efficient operation and maintenance of the gas turbine generator set becomes more difficult. Anomaly detection is an important task in the operation and maintenance of a gas turbine; it specifically includes evaluating the state of the gas turbine operating process and determining whether there is an anomaly in equipment such as the gas turbine, steam turbine, and generator. Reliable anomaly detection is of great importance for ensuring the safety of the power generation process. Therefore, how to detect anomalies of the gas turbine generator set timely and accurately during operation has long been a research focus for technical staff.
With the advent of Internet of Things (IoT) technology and artificial intelligence (AI), one can conveniently store large amounts of industrial process data, extract useful information from these data, and ultimately understand the underlying operating rules of an industrial process. Over the past three decades, many pattern recognition methods have been applied to anomaly detection tasks in industrial processes and have achieved satisfactory results. Most existing anomaly detection methods based on pattern recognition use supervised learning algorithms to build classification models; with such models, effective information indicating the underlying operating behavior can be extracted from industrial process samples and labels, finally accomplishing anomaly recognition. However, such methods require a large number of labeled samples as support, which is a significant limitation in practical applications. During actual gas turbine operation, unlabeled process data are relatively easy to obtain, but judging the operating state of thousands of process samples one by one to obtain sufficient data with clear indications of the equipment operating state is time-consuming and expensive. The lack of labeled samples severely limits the performance of deep supervised learning models in gas turbine anomaly detection tasks. In addition, due to lack of expert knowledge, occasional operator mistakes, and similar causes, some data may be marked incorrectly during manual labeling of gas turbine data, i.e., carry noisy labels. Such inaccurate label information may misguide the training of the model, eventually reducing the accuracy of anomaly detection.
In actual gas turbine anomaly detection tasks, these two problems often occur simultaneously, posing a great challenge to traditional supervised anomaly detection.
Disclosure of Invention
The invention aims to design a stepwise iterative complementary noise filtering strategy and on that basis establish an anomaly detection method, addressing the problems that gas turbine data labels are scarce and noisy. Firstly, an arctangent support vector machine model and an average absolute error neural network model are designed based on symmetric loss theory to construct a complementary robust base classifier, which is strongly robust to noise labels; secondly, a complementary noise filtering mechanism is provided, which filters noise labels in a stepwise manner and updates the complementary robust base classifier, further reducing the risk of the system overfitting to the noise labels; in addition, the invention designs an adaptive data supplementing strategy, which overcomes the scarcity of gas turbine data labels by using high-confidence pseudo-label samples to expand the labeled data set.
The aim of the invention is realized by the following technical scheme:
a stepwise robust semi-supervised gas turbine anomaly detection method oriented to noise label scenarios comprises: obtaining gas turbine operation data and inputting the gas turbine operation data into a trained complementary robust base classifier to obtain an anomaly detection result; wherein the complementary robust base classifier is obtained by training with the following steps:
step 1: constructing a training set based on the collected gas turbine operation data, wherein the training set comprises a labeled training set and an unlabeled training set;
step 2: constructing a complementary robust base classifier, wherein the complementary robust base classifier model comprises a parallel arctangent support vector machine model and an average absolute error neural network model; the loss function adopted in training the arctangent support vector machine model is:
$\min_{w_1,b_1}\ \frac{1}{2}\lVert w_1\rVert_2^2+\frac{\xi}{L}\sum_{i=1}^{L}\Big(\frac{\pi}{2}-\arctan\big(y_i(w_1^{\mathrm T}x_i+b_1)\big)\Big)$
wherein w₁ and b₁ represent the parameters of the arctangent support vector machine model to be optimized, ξ represents a penalty coefficient, L represents the number of labeled samples, ‖·‖₂² represents the square of the two-norm, yᵢ is the label of the i-th labeled sample, and xᵢ is the gas turbine operation data of the i-th labeled sample; the loss function adopted in training the average absolute error neural network model is the mean absolute error; the complementary robust base classifier is trained with the labeled training set to obtain a first complementary robust base classifier;
step 3: predicting each sample of the labeled training set with the first complementary robust base classifier, judging whether the corresponding sample carries a noise label according to the risk evaluation value of the prediction result, and filtering out the noise label samples; retraining the complementary robust base classifier with the noise-removed labeled training set to obtain a second complementary robust base classifier;
step 4: randomly selecting samples from the unlabeled training set to form an unlabeled subset, predicting each sample in the unlabeled subset with the second complementary robust base classifier to obtain corresponding pseudo labels, sorting by the confidence of the pseudo labels, selecting the one or more samples with the highest confidence, deleting them from the unlabeled training set, forming labeled samples with their corresponding pseudo labels, and adding them to the labeled training set;
the complementary noise label filtering and the adaptive data supplementing are performed alternately according to steps 2-4 until a set number of rounds T is reached, obtaining the final trained complementary robust base classifier.
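The alternation of steps 2-4 can be sketched as a generic training loop. The helper callables below (fit, filter_noise, pseudo_label) are hypothetical stand-ins for the classifier training, complementary noise filtering, and adaptive data supplementing described above, not the patent's actual implementation:

```python
def train_stepwise(labeled, unlabeled, rounds, fit, filter_noise, pseudo_label):
    """Alternate complementary noise filtering and adaptive data supplementing
    for a set number of rounds T (a sketch of steps 2-4)."""
    for _ in range(rounds):
        clf = fit(labeled)                        # step 2: first robust base classifier
        labeled = filter_noise(clf, labeled)      # step 3: drop suspected noise labels
        clf = fit(labeled)                        # step 3: second robust base classifier
        new, unlabeled = pseudo_label(clf, unlabeled)  # step 4: high-confidence pseudo labels
        labeled = labeled + new                   # expand the labeled training set
    return fit(labeled), labeled
```

Each round retrains from the cleaned set, so the classifier never keeps fitting samples that an earlier round flagged as noisy.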
Further, the collected gas turbine operation data is standardized gas turbine operation data.
Further, in step 3, each sample in the labeled training set is predicted with the first complementary robust base classifier, whether the corresponding sample carries a noise label is judged according to the risk evaluation value of the prediction result, and the noise label samples are filtered out, specifically:
each sample of the labeled training set is predicted with the first complementary robust base classifier, and the risk evaluation value of the prediction result is calculated; the risk evaluation value calculated based on the arctangent support vector machine model is the negative value of the product of the prediction output corresponding to each sample and the label carried by that sample; the risk evaluation value calculated based on the average absolute error neural network model is the absolute value of the difference between the label carried by the sample and the prediction output after it has been linearly transformed and mapped to the [−1, 1] interval;
according to the risk evaluation values of the prediction results, all labeled samples are divided into 4 categories, including:
safe samples L_c: samples whose risk evaluation value calculated based on the arctangent support vector machine model does not exceed a first threshold μ₁ and whose risk evaluation value calculated based on the average absolute error neural network model does not exceed a second threshold μ₂;
class-1 risk samples L_d1: samples whose risk evaluation value calculated based on the arctangent support vector machine model exceeds the first threshold μ₁ while the risk evaluation value calculated based on the average absolute error neural network model does not exceed the second threshold μ₂;
class-2 risk samples L_d2: samples whose risk evaluation value calculated based on the average absolute error neural network model exceeds the second threshold μ₂ while the risk evaluation value calculated based on the arctangent support vector machine model does not exceed the first threshold μ₁;
dangerous samples L_e: samples whose risk evaluation value calculated based on the arctangent support vector machine model exceeds the first threshold μ₁ and whose risk evaluation value calculated based on the average absolute error neural network model exceeds the second threshold μ₂;
the dangerous samples are judged to be samples carrying noise labels, and the noise label samples are filtered out.
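A minimal sketch of the four-way partition, under two assumptions about conventions not fully reproduced above: the SVM risk is taken as the negative product of output and label, and the network output is mapped to [−1, 1] by the assumed linear transform 2·o₂ − 1 before comparing with the label:

```python
def partition_samples(samples, mu1, mu2):
    """Split labeled samples (x, y, o1, o2) into safe, class-1 risk,
    class-2 risk and dangerous sets by the two complementary risk scores."""
    safe, risk1, risk2, danger = [], [], [], []
    for x, y, o1, o2 in samples:
        r1 = -y * o1                  # arctangent-SVM risk evaluation value
        r2 = abs((2 * o2 - 1) - y)    # MAE-network risk after mapping to [-1, 1]
        if r1 > mu1 and r2 > mu2:
            danger.append((x, y))     # flagged by both: treated as a noise label
        elif r1 > mu1:
            risk1.append((x, y))
        elif r2 > mu2:
            risk2.append((x, y))
        else:
            safe.append((x, y))
    return safe, risk1, risk2, danger
```

Only the dangerous set is removed outright; the two single-classifier risk sets are kept for the secondary mean-difference check.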
Further, the method further comprises the following steps:
retraining the complementary robust base classifier with the labeled training set from which the dangerous samples have been filtered, to obtain a third complementary robust base classifier;
predicting each sample of the filtered labeled training set with the third complementary robust base classifier and calculating the risk evaluation values r₁,ᵢ and r₂,ᵢ of the prediction results, wherein r₁,ᵢ and r₂,ᵢ are the risk evaluation values calculated based on the arctangent support vector machine model and the average absolute error neural network model respectively, and the subscript i is the serial number of the sample;
calculating the difference δ₁ between the mean risk evaluation value of the class-1 risk samples and that of the filtered labeled training set; if δ₁ > 0, the class-1 risk samples are judged to be samples carrying noise labels;
and/or calculating the difference δ₂ between the mean risk evaluation value of the class-2 risk samples and that of the filtered labeled training set; if δ₂ > 0, the class-2 risk samples are judged to be samples carrying noise labels.
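The secondary check can be sketched as a mean comparison: under the retrained (third) classifier, a risk class is flagged as noisy when its mean risk exceeds that of the cleaned labeled set. The direction of the comparison is an assumption consistent with δ > 0 indicating noise:

```python
def flag_residual_noise(clean_risks, class1_risks, class2_risks):
    """Return (noisy1, noisy2): whether the class-1 / class-2 risk samples
    should also be treated as noise-label samples, based on the differences
    between their mean risks and the mean risk of the cleaned labeled set."""
    mean = lambda values: sum(values) / len(values)
    delta1 = mean(class1_risks) - mean(clean_risks)  # delta_1 in the text
    delta2 = mean(class2_risks) - mean(clean_risks)  # delta_2 in the text
    return delta1 > 0, delta2 > 0
```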
Further, the confidence of the pseudo labels is expressed as follows:
c₁,ⱼ represents the confidence of the pseudo label given by the arctangent support vector machine model, and c₂,ⱼ represents the confidence of the pseudo label given by the average absolute error neural network model; o₁,ⱼ and o₂,ⱼ represent the outputs of the arctangent support vector machine model and the average absolute error neural network model in the second complementary robust base classifier corresponding to the j-th sample of the unlabeled subset, respectively; H is the number of samples in the unlabeled subset.
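Because the exact confidence formula is not reproduced above, the sketch below substitutes a simple stand-in scheme: assign a pseudo label only when both base classifiers agree on the sign, score it by the smaller of the two output magnitudes, and keep the k most confident samples from the random unlabeled subset:

```python
def select_pseudo_labels(subset, k):
    """subset: (sample_id, o1, o2) triples, with both outputs assumed to be
    on a signed scale. Returns up to k (sample_id, pseudo_label) pairs."""
    candidates = []
    for sample_id, o1, o2 in subset:
        sign1 = 1 if o1 >= 0 else -1
        sign2 = 1 if o2 >= 0 else -1
        if sign1 == sign2:                      # complementary agreement
            confidence = min(abs(o1), abs(o2))  # conservative confidence score
            candidates.append((confidence, sample_id, sign1))
    candidates.sort(reverse=True)               # most confident first
    return [(sample_id, label) for _, sample_id, label in candidates[:k]]
```

The selected samples would then be removed from the unlabeled training set and appended, with their pseudo labels, to the labeled training set.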
Further, the method further comprises the following steps:
if the anomaly detection result of either the arctangent support vector machine model or the average absolute error neural network model in the trained complementary robust base classifier is abnormal, an anomaly alarm is triggered;
if the anomaly detection results of both the arctangent support vector machine model and the average absolute error neural network model in the trained complementary robust base classifier are normal, no alarm is triggered.
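The OR-rule above amounts to a one-line check; predictions are assumed here to be −1 for anomalous and +1 for normal:

```python
def should_alarm(pred_svm, pred_nn):
    """Trigger the anomaly alarm if either base classifier reports an anomaly;
    stay silent only when both agree the operation state is normal."""
    return pred_svm == -1 or pred_nn == -1
```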
Further, the structure of the average absolute error neural network model is a fully connected neural network.
Further, the structure of the arctangent support vector machine model is a soft-margin support vector machine.
The system comprises a memory, a processor, and a computer program stored in the memory and runnable on the processor, wherein the processor, when executing the computer program, implements the noise-label-scenario-oriented stepwise robust semi-supervised gas turbine anomaly detection method.
The beneficial effects of the invention are as follows: addressing the problems that gas turbine data labels are scarce and noisy, the invention designs a stepwise iterative complementary noise filtering strategy and on that basis establishes an anomaly detection model. Firstly, an arctangent support vector machine model and an average absolute error neural network model are designed based on symmetric loss theory to construct a classifier that is strongly robust to noise labels; secondly, a complementary noise filtering mechanism is provided, which filters noise labels stepwise and updates the classifier, further reducing the risk of the system overfitting to the noise labels; in addition, the invention designs an adaptive data supplementing strategy, which overcomes the scarcity of gas turbine data labels by using high-confidence pseudo-label samples to expand the labeled data set. Compared with traditional anomaly detection methods based on supervised learning, the proposed method makes reasonable use of the insufficient labeled samples, cleans the labeled data set with a stepwise noise label filtering mechanism, and builds a complementary robust base classifier as a reliable anomaly detection model, improving the accuracy and F1 score of anomaly detection; it can effectively resist the influence of noise labels and can establish an anomaly detection model with good performance even when labels are scarce.
Drawings
FIG. 1 is a flow chart of a stepwise robust semi-supervised combustion engine anomaly detection method for a noise label scenario;
FIG. 2 is a schematic diagram of noise risk assessment;
FIG. 3 is a schematic diagram of the operation of the base classifier for complementary verification of the evaluation result;
Detailed Description
The present embodiment uses the actual engine operating data to verify the validity of the present method. The invention will now be described in further detail with reference to the drawings and to specific examples.
In order to prevent over-temperature damage to the hot gas path parts during operation of the gas turbine generator set, the gas turbine is provided with a cooling air system. In this embodiment, 1000 samples from before and after an abnormal compressor cooling air pressure event of the unit's gas turbine in one period are collected as the training set to build the model, and 400 samples from before and after the same abnormal state in another period are taken as two test sets for the anomaly detection test; 50 of the training samples are labeled samples, and the remaining training samples are unlabeled; each sample contains 33 process variables, mainly collected by sensors located at the front-end module, turbine, combustion chamber and compressor of the gas turbine generator set. The process variables specifically include: natural gas volumetric flow of the front-end module; natural gas mass flow of the front-end module; compressor inlet air temperature (substituting for ambient temperature); compressor outlet pressure 1 and 2; compressor outlet temperature; compressor bearing temperature 1-3; compressor thrust pad bearing generator-end temperature 1-3; compressor thrust pad bearing turbine-end temperature 1-3; compressor bearing vibration 1 and 2; compressor-side main shaft vibration 1 and 2; compressor inlet duct differential pressure 1 and 2; turbine exhaust average temperature 1 and 2; turbine-side main shaft vibration; combustion chamber differential pressure 1 and 2; gas turbine cooling air regulating valve position 1 and 2; gas turbine humming; gas turbine rotating speed; gas turbine power; stage-2 stator blade ring cavity cooling air pressure 1 and 2; and stage-3 stator blade ring cavity cooling air pressure 1 and 2.
According to the noise-label-scenario-oriented stepwise robust semi-supervised gas turbine anomaly detection method, as shown in fig. 1, in the online stage, gas turbine operation data are acquired and input into the trained complementary robust base classifier to obtain the anomaly detection result.
Specifically, the complementary robust base classifier is obtained in the offline modeling stage by training as follows:
step 1: collecting the operation data of the gas engine as a training set, wherein the training set is divided into two parts, namely, the data which are collected during the operation of the gas engine and are used for clearly indicating the operation state, namely, a labeled sample; data which are collected during the running of the gas engine and do not explicitly indicate the running state, namely a label-free sample;
collecting the operation data of the combustion engine as a training set, and recording asWherein N is 1 And N 2 Respectively represent the number of labeled and unlabeled exemplars, wherein X train The data dimension of each sample is V, that is, each sample is the data of the running state corresponding to one time, including V process variables, in this embodiment v=33; x is X train Is of N 1 The samples correspond to the tag +.>The part of the sample and the label thereof form a label sample set which is marked asWith N 2 The samples are unlabeled samples, and form an unlabeled sample set, which is marked as +.>In the present embodiment, N 1 =50,N 2 =950; wherein the sample tag-> Indicating that the sample is marked as normal; />Indicating that the sample is marked as abnormal; sample tags have a certain proportion of noise conditions, i.e. for some samples, the tag marked is wrong;
in general, in order to ensure detection accuracy, data is subjected to normalization processing, specifically:
for a pair ofX train And (3) carrying out standardization processing, wherein the calculation formula is as follows:
wherein, the liquid crystal display device comprises a liquid crystal display device,x i,v,train representing the value of the v-th process variable in the i-th training sample collected;representing the mean of the v-th process variable in all training samples; s is(s) v,train Representing standard deviations of the v-th process variable in all training samples; x is x i,v,train Representing the standardized numerical values of the corresponding samples and the corresponding variables;
for training dataX train Data set obtained after normalizationX train With N 1 The samples correspond to the labels indicating the running state of the gas engine, and the samples and the labels form a label sample set which is recorded asWith N 2 The samples are unlabeled samples, and form an unlabeled sample set, which is marked as +.>
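The standardization step is ordinary column-wise z-scoring; a minimal sketch (using the population standard deviation, which is an assumption):

```python
import math

def standardize(X):
    """Column-wise z-score: subtract each process variable's mean over all
    training samples and divide by its standard deviation."""
    n, num_vars = len(X), len(X[0])
    out = [[0.0] * num_vars for _ in range(n)]
    for v in range(num_vars):
        column = [X[i][v] for i in range(n)]
        mean = sum(column) / n
        std = math.sqrt(sum((c - mean) ** 2 for c in column) / n)
        for i in range(n):
            out[i][v] = (X[i][v] - mean) / std
    return out
```

At test time the training means and standard deviations would be reused rather than recomputed, so online samples are mapped into the same scale as the model saw during training.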
Step 2: constructing a complementary robust base classifier, wherein the complementary robust base classifier model comprises a parallel arctangent support vector machine model and an average absolute error neural network model; specifically:
(2.1) constructing an arctangent support vector machine model, i.e., a support vector machine model whose loss function is built with an arctangent function; it has the structure of a conventional support vector machine, such as a soft-margin support vector machine; specifically, the loss function for training the arctangent support vector machine classifier f₁ is:
$\min_{w_1,b_1}\ \frac{1}{2}\lVert w_1\rVert_2^2+\frac{\xi}{L}\sum_{i=1}^{L}\Big(\frac{\pi}{2}-\arctan\big(y_i(w_1^{\mathrm T}x_i+b_1)\big)\Big)$
wherein w₁ and b₁ represent the parameters of the arctangent support vector machine model to be optimized, ξ represents a penalty coefficient, L represents the number of labeled samples, and ‖·‖₂² represents the square of the two-norm; in this embodiment the penalty coefficient ξ is set to 0.078; this loss function is symmetric and therefore strongly robust to label noise; for a particular sample x, the output of the arctangent support vector machine model is recorded as
o₁ = f₁(x) = w₁ᵀx + b₁
The model parameters of the arctangent support vector machine classifier f₁ are updated by gradient descent to complete training; the parameters are updated as
w₁ ← w₁ − ε₁ ∂J/∂w₁,  b₁ ← b₁ − ε₁ ∂J/∂b₁
wherein ε₁ represents the learning rate of the parameter update, set to 0.5 in this embodiment, J denotes the loss function above, and ← is the assignment operator by which the parameters are updated;
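One gradient step can be sketched as follows; the symmetric loss is assumed to be ℓ(z) = π/2 − arctan(z) on the margin z = yᵢ(w₁ᵀxᵢ + b₁), whose derivative −1/(1 + z²) yields the update below (the patent's exact loss form may differ):

```python
def atan_svm_step(w, b, X, Y, xi, lr):
    """One gradient-descent step on 0.5*||w||^2 + (xi/L)*sum l(y*(w.x+b)),
    with the assumed symmetric loss l(z) = pi/2 - arctan(z).
    Returns the updated (w, b)."""
    L = len(X)
    grad_w = list(w)   # gradient of the regularization term 0.5*||w||^2 is w
    grad_b = 0.0
    for x, y in zip(X, Y):
        z = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
        coef = -(xi / L) * y / (1.0 + z * z)   # chain rule through -arctan(z)
        for j in range(len(w)):
            grad_w[j] += coef * x[j]
        grad_b += coef
    w_new = [wj - lr * gj for wj, gj in zip(w, grad_w)]
    return w_new, b - lr * grad_b
```

Because the loss saturates for large |z|, a grossly mislabeled sample far from the boundary contributes almost no gradient, which is the source of the claimed robustness to label noise.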
(2.2) constructing an average absolute error neural network model, i.e., a neural network model whose loss function is the mean absolute error function; a conventional neural network may be adopted, such as a fully connected neural network comprising an input layer and an output layer; in this embodiment the number of input-layer nodes is 33 and the number of output-layer nodes is 1; for a particular sample x, the output of the average absolute error neural network model is recorded as o₂ = f₂(x), wherein w₂ and b₂ represent the model parameters of the average absolute error neural network model; the parameters of the average absolute error neural network classifier f₂ are updated by batch gradient descent to complete model training, with parameter-update learning rate ε₂ and M samples per training batch; in this embodiment ε₂ is set to 0.001 and M is set to 10; the loss function for training classifier f₂ is the mean absolute error:
$\frac{1}{M}\sum_{i=1}^{M}\big|o_{2,\mathrm{train}}^{(i)}-y_{\mathrm{train}}^{(i)}\big|$
wherein y_train denotes the label of a labeled training sample x_train, and o₂,train denotes the output of the average absolute error neural network model for that training sample; the mean absolute error loss is also symmetric and strongly robust to label noise;
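The mean-absolute-error training can be sketched with a single linear output unit (matching the embodiment's input and output layers) and a batch subgradient step; this is an illustrative simplification, not the patent's exact network:

```python
def mae_batch_step(w, b, X, Y, lr):
    """One batch-gradient step minimising (1/M)*sum |o2 - y| for a linear
    unit o2 = w.x + b, using the subgradient sign(o2 - y) of |o2 - y|."""
    M = len(X)
    grad_w = [0.0] * len(w)
    grad_b = 0.0
    for x, y in zip(X, Y):
        o2 = sum(wj * xj for wj, xj in zip(w, x)) + b
        s = (1.0 if o2 > y else -1.0) / M   # subgradient of the absolute error
        for j in range(len(w)):
            grad_w[j] += s * x[j]
        grad_b += s
    return [wj - lr * gj for wj, gj in zip(w, grad_w)], b - lr * grad_b
```

Unlike a squared-error step, the gradient magnitude here does not grow with the size of the residual, so a mislabeled sample cannot dominate the batch update.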
(2.3) training the complementary robust base classifier with the labeled training set to obtain the first complementary robust base classifier; the arctangent support vector machine model and the average absolute error neural network model can be trained simultaneously or sequentially, with the same or different numbers of training iterations; in this embodiment the number of training iterations t₁ of the arctangent support vector machine classifier f₁ is set to 10, and the number of training iterations t₂ of the average absolute error neural network classifier f₂ is set to 20;
the trained arctangent support vector machine model and average absolute error neural network model form the first complementary robust base classifier, which is strongly robust to noise labels;
step 3: predicting each sample of the labeled training set by using a first complementary Lu Bangji classifier, judging whether the corresponding sample is a noise label or not according to a risk evaluation value of a prediction result, and filtering samples of the noise label; retraining the complementary Lu Bangji classifier with the noise-removed labeled training set to obtain a second complementary Lu Bangji classifier; specifically:
(3.1) noise risk assessment, which is schematically shown in figs. 2-3; specifically:
the two trained classifiers f₁ and f₂ are used to perform the noise risk assessment; the two classifiers each evaluate the label risk of the existing labeled data, and each selects the samples exceeding its risk threshold; the threshold corresponding to each classifier can be given in advance according to experience; the threshold of f₁ is denoted μ₁ and the threshold of f₂ is denoted μ₂; in this embodiment μ₁ and μ₂ are 0.2 and 0.9, respectively; the risk evaluation value of the prediction result calculated based on the arctangent support vector machine model f₁ is the negative value of the product of the prediction output corresponding to each sample and the label carried by that sample, expressed as:
r₁,ᵢ = −yᵢ · o₁,ᵢ
wherein r₁,ᵢ represents the risk that the arctangent support vector machine classifier f₁ assigns to the i-th labeled training sample, and o₁,ᵢ represents the output of classifier f₁ for sample xᵢ;
The risk evaluation value of the prediction result calculated based on the mean absolute error neural network model f2 is the absolute value of the difference between the label carried by the sample and the prediction output linearly mapped to the interval [-1, 1], expressed as:

$$r_{2,i} = \left| y_i^l - \left( 2 f_2\!\left(x_i^l\right) - 1 \right) \right|$$

where $r_{2,i}$ denotes the risk that the mean absolute error neural network model f2 assigns to the i-th labeled training sample $x_i^l$, and $f_2(x_i^l)$ denotes the output of model f2 for sample $x_i^l$; the linear map $2 f_2(x_i^l) - 1$ carries the network output from [0, 1] to [-1, 1];
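The two risk evaluation values can be sketched as follows, assuming labels in {-1, 1}, a real-valued SVM output, and a network output in [0, 1] mapped linearly to [-1, 1] (the mapping 2o - 1 is an assumption consistent with the 0.5 decision threshold used later):

```python
def svm_risk(label, svm_output):
    # Risk of the arctangent SVM: negative product of the model output and
    # the label carried by the sample; large when the prediction confidently
    # contradicts the label.
    return -label * svm_output

def mae_nn_risk(label, nn_output):
    # Risk of the MAE neural network: map the [0, 1] output linearly to
    # [-1, 1], then take the absolute difference from the carried label.
    return abs(label - (2.0 * nn_output - 1.0))

# A sample whose label agrees with both models has low risk ...
low1, low2 = svm_risk(1, 0.8), mae_nn_risk(1, 0.9)
# ... while a contradicted (possibly noisy) label yields high risk.
high1, high2 = svm_risk(-1, 0.8), mae_nn_risk(-1, 0.9)
```

Samples with high risk under one or both measures are the candidates for the four-way partition described next.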
According to the result of the noise risk assessment, all labeled samples are divided into 4 categories, including:
safe samples L_c: samples for which the risk evaluation value calculated based on the arctangent support vector machine model, i.e. the noise risk assigned by classifier f1, does not exceed the corresponding threshold μ1, and the risk evaluation value calculated based on the mean absolute error neural network model, i.e. the noise risk assigned by classifier f2, does not exceed the corresponding threshold μ2;

first-class risk samples L_d1: samples for which the noise risk assigned by the arctangent support vector machine classifier f1 exceeds the corresponding threshold μ1 while the noise risk assigned by the mean absolute error neural network classifier f2 does not exceed the corresponding threshold μ2;

second-class risk samples L_d2: samples for which the noise risk assigned by the mean absolute error neural network classifier f2 exceeds the corresponding threshold μ2 while the noise risk assigned by the arctangent support vector machine classifier f1 does not exceed the corresponding threshold μ1;

dangerous samples L_e: samples for which the noise risk assigned by the arctangent support vector machine classifier f1 exceeds the corresponding threshold μ1 and the noise risk assigned by the mean absolute error neural network classifier f2 exceeds the corresponding threshold μ2;
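With the thresholds of this embodiment (μ1 = 0.2, μ2 = 0.9), the four-way partition can be sketched as:

```python
MU1, MU2 = 0.2, 0.9  # thresholds of f1 and f2 in this embodiment

def partition(samples):
    """Split (sample_id, r1, r2) triples into safe / first-class risk /
    second-class risk / dangerous sets according to the two thresholds."""
    L_c, L_d1, L_d2, L_e = [], [], [], []
    for sid, r1, r2 in samples:
        over1, over2 = r1 > MU1, r2 > MU2
        if over1 and over2:
            L_e.append(sid)    # dangerous: both classifiers flag the label
        elif over1:
            L_d1.append(sid)   # first-class risk: only f1 flags it
        elif over2:
            L_d2.append(sid)   # second-class risk: only f2 flags it
        else:
            L_c.append(sid)    # safe: neither classifier flags it
    return L_c, L_d1, L_d2, L_e

L_c, L_d1, L_d2, L_e = partition([
    ("a", 0.1, 0.3), ("b", 0.5, 0.4), ("c", 0.1, 1.2), ("d", 0.7, 1.5),
])
```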
(3.2) complementary verification of the noise evaluation results
(3.2.1) The dangerous samples are judged to be samples containing noise labels, and the dangerous sample set L_e is filtered out directly, i.e.

L′ = L − L_e

where L′ denotes the labeled sample set obtained after filtering out the dangerous samples L_e;
(3.2.2) It is further judged whether the first-class and second-class risk samples are noise-label samples, specifically as follows:
(3.2.2.1) The complementary robust base classifier is retrained with the remaining labeled samples L′ to obtain a third complementary robust base classifier, whose arctangent support vector machine classifier and mean absolute error neural network classifier are denoted f1′ and f2′, respectively; the risks of the remaining labeled samples are recomputed and denoted $r'_{1,i}$ and $r'_{2,i}$.
(3.2.2.2) The mean risk difference δ1 between the labeled training set after filtering the dangerous samples and the labeled training set after additionally filtering the first-class risk samples is calculated; if δ1 > 0, the first-class risk samples are judged to be samples containing noise labels. Specifically:

the average risk of the labeled training set L′ after the dangerous samples are filtered is calculated and denoted $\bar r(L')$;

the average risk of the labeled training set L′ − L_d1 after the dangerous samples and the first-class risk samples are both filtered is calculated and denoted $\bar r(L' - L_{d1})$;

the difference between the two is calculated as $\delta_1 = \bar r(L') - \bar r(L' - L_{d1})$.
(3.2.2.3) The mean risk difference δ2 between the labeled training set after filtering the dangerous samples and the labeled training set after additionally filtering the second-class risk samples is calculated; if δ2 > 0, the second-class risk samples are judged to be samples containing noise labels. Specifically:

the average risk of the labeled training set L′ after the dangerous samples are filtered is calculated and denoted $\bar r(L')$;

the average risk of the labeled training set L′ − L_d2 after the dangerous samples and the second-class risk samples are both filtered is calculated and denoted $\bar r(L' - L_{d2})$;

the difference between the two is calculated as $\delta_2 = \bar r(L') - \bar r(L' - L_{d2})$.
(3.2.3) The labeled data set is further cleaned according to the judgment of whether the first-class and second-class risk samples are noise-label samples, yielding the final remaining labeled sample set L″, specifically as follows:
if δ1 > 0 and δ2 > 0, the filtering operation is performed on both the first-class risk samples L_d1 and the second-class risk samples L_d2, i.e. the following operation is performed:

L″ = L′ − L_d1 − L_d2

if δ1 > 0 and δ2 ≤ 0, the filtering operation is performed on the first-class risk samples L_d1 only:

L″ = L′ − L_d1

if δ1 ≤ 0 and δ2 > 0, the filtering operation is performed on the second-class risk samples L_d2 only:

L″ = L′ − L_d2

if δ1 ≤ 0 and δ2 ≤ 0, both the first-class risk samples L_d1 and the second-class risk samples L_d2 are retained, i.e. the following operation is performed:

L″ = L′
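The complementary verification above can be sketched as follows; combining each sample's two retrained risks by a simple mean is one plausible reading of the "average risk", not necessarily the patent's exact aggregation:

```python
def mean_risk(risks, keep):
    # Average risk over the samples in `keep`; risks maps id -> (r1', r2'),
    # the two retrained classifiers' risk values, averaged per sample here.
    vals = [(risks[s][0] + risks[s][1]) / 2.0 for s in keep]
    return sum(vals) / len(vals)

def complementary_filter(L_prime, L_d1, L_d2, risks):
    """Drop a risk class only when removing it lowers the mean risk
    (delta > 0), mirroring the delta-based decisions above."""
    base = mean_risk(risks, L_prime)
    delta1 = base - mean_risk(risks, [s for s in L_prime if s not in L_d1])
    delta2 = base - mean_risk(risks, [s for s in L_prime if s not in L_d2])
    kept = [s for s in L_prime
            if not (delta1 > 0 and s in L_d1)
            and not (delta2 > 0 and s in L_d2)]
    return kept, delta1, delta2

# Toy example: sample "b" is a first-class risk sample with high risk,
# so removing it lowers the mean risk and it gets filtered.
risks = {"a": (0.0, 0.1), "b": (0.9, 0.8), "c": (0.1, 0.0)}
kept, d1, d2 = complementary_filter(["a", "b", "c"], ["b"], [], risks)
```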
The classifier is retrained with the remaining labeled samples L″ obtained after the complementary noise label filtering to obtain the second complementary robust base classifier, whose arctangent support vector machine classifier and mean absolute error neural network classifier are denoted f1″ and f2″, respectively.
Step 4: randomly selecting samples from the unlabeled training set to form an unlabeled subset, predicting each sample in the unlabeled subset with the second complementary robust base classifier to obtain corresponding pseudo labels, sorting by pseudo-label confidence, selecting the samples with the highest confidence, deleting them from the unlabeled training set, forming labeled samples together with their pseudo labels, and adding them to the labeled training set to perform the adaptive data supplementation. This may be carried out separately for the arctangent support vector machine model and the mean absolute error neural network model, specifically as follows:
H samples are randomly selected from the unlabeled data set U to form an unlabeled subset U1; in this embodiment, H is set to 400. The arctangent support vector machine classifier f1″ predicts a pseudo label for each unlabeled sample in U1, specifically:

$$\hat y_{1,j} = \operatorname{sign}\!\left( f_1''\!\left(x_j^u\right) \right)$$

where $\hat y_{1,j}$ denotes the pseudo label that classifier f1″ assigns to sample $x_j^u$, $f_1''(x_j^u)$ denotes the output of classifier f1″ for $x_j^u$, and sign(·) denotes the sign function, specifically:

$$\operatorname{sign}(z) = \begin{cases} 1, & z \ge 0 \\ -1, & z < 0 \end{cases}$$

where z denotes the argument of the sign function;
the confidence of each pseudo label is then computed and evaluated, where $c_{1,j}$ denotes the confidence of the pseudo label $\hat y_{1,j}$ corresponding to sample $x_j^u$; all unlabeled samples are sorted by their confidence $c_{1,j}$ in descending order, and the h1 samples with the highest pseudo-label confidence form a candidate sample set, denoted P1; the pseudo labels corresponding to these samples are collected together with them to form a pseudo-label sample set, denoted Q1. In this embodiment, h1 is set to 1;
similarly, H samples are randomly selected from the unlabeled data set U to form an unlabeled subset U2, and the mean absolute error neural network classifier f2″ predicts a pseudo label for each unlabeled sample in U2, where $\hat y_{2,j}$ denotes the pseudo label that classifier f2″ assigns to sample $x_j^u$ and $f_2''(x_j^u)$ denotes the output of classifier f2″ for $x_j^u$; the confidence of each pseudo label is evaluated, where $c_{2,j}$ denotes the confidence of the pseudo label $\hat y_{2,j}$ corresponding to sample $x_j^u$; all unlabeled samples are sorted by their confidence $c_{2,j}$ in descending order, and the h2 samples with the highest pseudo-label confidence form a candidate sample set, denoted P2; the pseudo labels corresponding to these samples are collected together with them to form a pseudo-label sample set, denoted Q2. In this embodiment, h2 is set to 1;
the samples in the sample sets Q1 and Q2, together with their pseudo labels, are added to the labeled data set, i.e. the following operation is performed:

L ← L ∪ Q1 ∪ Q2

where ∪ denotes the set union operation;

the samples in the sample sets P1 and P2 are removed from the unlabeled data set, i.e. the following operation is performed:

U ← U − (P1 ∪ P2).
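One classifier's half of the adaptive data supplementation can be sketched as follows, with a hypothetical `confidence` function standing in for the classifier-specific confidences c1,j and c2,j (their exact form is given by the respective classifier):

```python
import random

def supplement(labeled, unlabeled, predict, confidence, H=400, h=1):
    """Move the h most confident pseudo-labeled samples from the unlabeled
    pool into the labeled set (one classifier's half of step 4)."""
    subset = random.sample(unlabeled, min(H, len(unlabeled)))
    scored = [(confidence(x), x, predict(x)) for x in subset]
    scored.sort(key=lambda t: t[0], reverse=True)  # highest confidence first
    picked = scored[:h]
    labeled = labeled + [(x, y) for _, x, y in picked]     # L <- L ∪ Q
    chosen = {x for _, x, _ in picked}
    unlabeled = [x for x in unlabeled if x not in chosen]  # U <- U - P
    return labeled, unlabeled

# Toy example: confidence is |x|, pseudo label is the sign of x.
random.seed(0)
L, U = supplement([], [-3.0, 0.5, 2.0],
                  lambda x: 1 if x >= 0 else -1, abs, H=3, h=1)
```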
The operations of noise risk assessment, complementary verification of the noise evaluation results, and adaptive data supplementation in steps 2-4 are repeated until the set number of rounds T is reached, yielding the final trained complementary robust base classifier; in this embodiment, T is set to 20. The finally trained arctangent support vector machine classifier and mean absolute error neural network classifier are denoted f1^final and f2^final.
Anomaly detection can then be performed with the trained complementary robust base classifier, specifically as follows:

a gas turbine operation data sample collected in real time is input into the trained complementary robust base classifier, and the base classifiers' judgments of the online sample's operating state are fused to detect gas turbine anomalies;
the gas turbine operation data collected in real time are first standardized. Taking a test sample x_test as an example, the dimension of x_test is V, i.e. the sample corresponds to V process variables; the standardization is performed as follows:

$$\bar x_{v,test} = \frac{x_{v,test} - m_v}{s_v}, \quad v = 1, \dots, V$$

where $x_{v,test}$ and $\bar x_{v,test}$ denote the values of the v-th process variable of the collected test sample before and after standardization, and $m_v$ and $s_v$ denote the mean and standard deviation of the v-th process variable over the training data; the standardized test data x_test are input into the trained classifier f1^final, which gives the prediction result, specifically:
result_1 = sign(o_{1,test})

where o_{1,test} denotes the output of classifier f1^final for x_test, specifically expressed as

o_{1,test} = f1^final(x_test)

result_1 = 1 indicates that the judgment result is normal, and result_1 = −1 indicates that the judgment result is abnormal;
the standardized test data x_test are likewise input into the trained classifier f2^final, which gives the prediction result:

result_2 = sign(o_{2,test} − 0.5)

where o_{2,test} denotes the output of classifier f2^final for x_test, specifically expressed as

o_{2,test} = f2^final(x_test)

result_2 = 1 indicates that the judgment result is normal, and result_2 = −1 indicates that the judgment result is abnormal;
further, if the anomaly detection result of either the arctangent support vector machine model or the mean absolute error neural network model is abnormal, i.e. classifier f1^final or classifier f2^final judges the online sample to be abnormal, the online sample is determined to be an abnormal sample and an anomaly alarm is triggered;

if the anomaly detection results of both the arctangent support vector machine model and the mean absolute error neural network model are normal, i.e. both classifier f1^final and classifier f2^final judge the online sample to be normal, the online sample is determined to be a normal sample and no alarm is triggered.
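The online fusion rule above, which raises an alarm as soon as either base classifier flags the sample, can be sketched as:

```python
def sign(z):
    # Sign function as defined earlier: +1 for z >= 0, -1 otherwise.
    return 1 if z >= 0 else -1

def detect(o1_test, o2_test):
    """OR-fusion of the two base classifiers' judgments: the sample is
    declared abnormal (alarm) as soon as either classifier says abnormal."""
    result1 = sign(o1_test)        # arctangent SVM: threshold at 0
    result2 = sign(o2_test - 0.5)  # MAE neural network: threshold at 0.5
    alarm = (result1 == -1) or (result2 == -1)
    return result1, result2, alarm

r1, r2, alarm_a = detect(0.7, 0.9)   # both normal -> no alarm
_, _, alarm_b = detect(0.7, 0.2)     # f2 abnormal -> alarm
_, _, alarm_c = detect(-0.3, 0.9)    # f1 abnormal -> alarm
```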
The effect of the invention is illustrated by a fault detection analysis using samples with abnormal gas turbine compressor cooling air pressure as the test set. For the noise setting, initial label noise proportions of 20% and 30% are examined; each set of experiments is repeated 5 times under the same parameter settings.
To demonstrate more clearly the superiority of the complementary noise-filtering robust semi-supervised gas turbine anomaly detection, two existing noise label learning methods are selected for comparison: the mutual-teaching strategy Co-teaching (Han B, Yao Q, Yu X, et al. Co-teaching: Robust training of deep neural networks with extremely noisy labels. Advances in Neural Information Processing Systems, 2018, 31) and importance reweighting (Liu T, Tao D. Classification with noisy labels by importance reweighting. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 38(3): 447-461). In the experiments, a common self-training strategy is added on top of the Co-teaching method to complete the adaptive data supplementation, and likewise a common self-training strategy is added on top of the importance reweighting method to complete the adaptive data supplementation.
For a more intuitive comparison, Tables 1 and 2 list the mean and variance of the anomaly detection accuracy (Accuracy) and F1 score (F1-score) on the test set over the 5 repeated experiments, for initial label noise proportions of 20% and 30%, respectively. Accuracy and F1-score are computed as:

$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \qquad \text{F1-score} = \frac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}$$

where Precision = TP/(TP + FP), Recall = TP/(TP + FN), and TP, TN, FP, and FN denote the numbers of true positives, true negatives, false positives, and false negatives, respectively.
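The two metrics can be computed from the confusion counts as follows; treating the abnormal class as the positive class, encoded here as +1, is an illustrative choice:

```python
def accuracy_f1(y_true, y_pred):
    """Accuracy and F1-score from paired label lists (+1 = positive class,
    -1 = negative class; the encoding is illustrative)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == -1 and p == -1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == -1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == -1)
    acc = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return acc, f1

acc, f1 = accuracy_f1([1, 1, -1, -1], [1, -1, -1, 1])
```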
It can be seen that when the noise proportion in the labeled samples is low (20%), all three methods achieve good anomaly detection performance; over the five experiments, the accuracy and F1 score of the proposed method are higher than those of the other two methods, with smaller variance. When the noise proportion is higher (30%), the anomaly detection performance of the comparison methods drops markedly, while the accuracy and F1 score of the proposed method remain at a high level and are optimal among the three methods in both mean and variance. The proposed method is thus clearly robust to noise labels and stable in model training, which further demonstrates its feasibility and effectiveness.
Table 1 comparison of anomaly detection performance at 20% tag noise ratio for different methods
Table 2 comparison of anomaly detection performance at a tag noise ratio of 30% for different methods
Corresponding to the foregoing embodiments of the noise-tag-scene-oriented step-by-step robust semi-supervised gas turbine anomaly detection method, the invention also provides embodiments of a noise-tag-scene-oriented step-by-step robust semi-supervised gas turbine anomaly detection system.
The invention discloses a noise-tag-scene-oriented stepping type robust semi-supervised gas turbine anomaly detection system, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor realizes the noise-tag-scene-oriented stepping type robust semi-supervised gas turbine anomaly detection method when executing the computer program.
For the system embodiments, since they substantially correspond to the method embodiments, reference may be made to the description of the method embodiments for the relevant points. The system embodiments described above are merely illustrative: the units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units, i.e. they may be located in one place or distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purposes of the invention. Those of ordinary skill in the art can understand and implement the invention without creative effort.
It is apparent that the above embodiments are merely examples given for clarity of illustration and are not limiting. Other variations or modifications based on the above description will be apparent to those of ordinary skill in the art; it is neither necessary nor possible to enumerate all embodiments here. Obvious variations or modifications derived therefrom remain within the protection scope of the invention.

Claims (9)

1. A step-by-step robust semi-supervised gas turbine anomaly detection method for noise label scenes, characterized by comprising: acquiring gas turbine operation data and inputting the gas turbine operation data into a trained complementary robust base classifier to obtain an anomaly detection result; wherein the complementary robust base classifier is obtained by training through the following steps:
step 1: constructing a training set based on the collected combustion engine operation data, wherein the training set comprises a labeled training set and a non-labeled training set;
step 2: constructing a complementary robust base classifier, wherein the complementary robust base classifier comprises a parallel arctangent support vector machine model and a mean absolute error neural network model; the loss function adopted in training the arctangent support vector machine model is:

$$\min_{w_1, b_1} \ \frac{1}{2}\lVert w_1 \rVert_2^2 + \xi \sum_{i=1}^{l} \left( \frac{\pi}{2} + \arctan\!\left( -y_i^l \left( w_1^{\top} x_i^l + b_1 \right) \right) \right)$$

where w_1 and b_1 denote the parameters of the arctangent support vector machine model to be optimized, ξ denotes the penalty coefficient, l denotes the number of labeled samples, $\lVert \cdot \rVert_2^2$ denotes the squared two-norm, $y_i^l$ is the label of the i-th labeled sample, and $x_i^l$ is the gas turbine operation data of the i-th labeled sample; the loss function adopted in training the mean absolute error neural network model is the mean absolute error; the complementary robust base classifier is trained with the labeled training set to obtain a first complementary robust base classifier;
step 3: predicting each sample of the labeled training set with the first complementary robust base classifier, judging from the risk evaluation value of the prediction result whether the corresponding sample carries a noise label, and filtering out the noise-label samples; retraining the complementary robust base classifier with the denoised labeled training set to obtain a second complementary robust base classifier;
step 4: randomly selecting samples from the unlabeled training set to form an unlabeled subset, predicting each sample in the unlabeled subset with the second complementary robust base classifier to obtain corresponding pseudo labels, sorting by pseudo-label confidence, selecting the one or more samples with the highest confidence, deleting them from the unlabeled training set, forming labeled samples together with their corresponding pseudo labels, and adding them to the labeled training set;
alternately performing the complementary noise label filtering and adaptive data supplementation of steps 2-4 until the set number of rounds T is reached, obtaining the final trained complementary robust base classifier.
2. The method of claim 1, wherein the collected gas turbine operation data are normalized gas turbine operation data.
3. The method of claim 1, wherein in step 3, each sample in the labeled training set is predicted with the first complementary robust base classifier, whether the corresponding sample carries a noise label is judged from the risk evaluation value of the prediction result, and the noise-label samples are filtered, specifically:

predicting each sample of the labeled training set with the first complementary robust base classifier and calculating the risk evaluation value of the prediction result; the risk evaluation value calculated based on the arctangent support vector machine model is the negative of the product of the model's prediction for each sample and the label carried by the sample; the risk evaluation value calculated based on the mean absolute error neural network model is the absolute value of the difference between the label carried by the sample and the model's prediction linearly mapped to the interval [−1, 1];

dividing all labeled samples into 4 categories according to the risk evaluation values of the prediction results, including:

safe samples L_c: samples whose risk evaluation value calculated based on the arctangent support vector machine model does not exceed the first threshold μ1 and whose risk evaluation value calculated based on the mean absolute error neural network model does not exceed the second threshold μ2;

first-class risk samples L_d1: samples whose risk evaluation value calculated based on the arctangent support vector machine model exceeds the first threshold μ1 while the risk evaluation value calculated based on the mean absolute error neural network model does not exceed the second threshold μ2;

second-class risk samples L_d2: samples whose risk evaluation value calculated based on the mean absolute error neural network model exceeds the second threshold μ2 while the risk evaluation value calculated based on the arctangent support vector machine model does not exceed the first threshold μ1;

dangerous samples L_e: samples whose risk evaluation value calculated based on the arctangent support vector machine model exceeds the first threshold μ1 and whose risk evaluation value calculated based on the mean absolute error neural network model exceeds the second threshold μ2;

the dangerous samples are judged to be samples containing noise labels, and the noise-label samples are filtered out.
4. The method according to claim 3, further comprising:

retraining the complementary robust base classifier with the labeled training set after filtering the dangerous samples to obtain a third complementary robust base classifier;

predicting each sample of the labeled training set after filtering the dangerous samples with the third complementary robust base classifier, and calculating the risk evaluation values $r'_{1,i}$ and $r'_{2,i}$ of the prediction results, where $r'_{1,i}$ and $r'_{2,i}$ are the risk evaluation values calculated based on the arctangent support vector machine model and the mean absolute error neural network model, respectively, with the subscript i indexing the sample;

calculating the mean difference δ1 of the risk evaluation values between the labeled training set after filtering the dangerous samples and the labeled training set after additionally filtering the first-class risk samples; if δ1 > 0, the first-class risk samples are judged to be samples containing noise labels;

and/or calculating the mean difference δ2 of the risk evaluation values between the labeled training set after filtering the dangerous samples and the labeled training set after additionally filtering the second-class risk samples; if δ2 > 0, the second-class risk samples are judged to be samples containing noise labels.
5. The method of claim 1, wherein the confidence of a pseudo label is expressed as follows: $c_{1,j}$ denotes the confidence of the pseudo label $\hat y_{1,j}$ and $c_{2,j}$ denotes the confidence of the pseudo label $\hat y_{2,j}$; $o_{1,j}$ and $o_{2,j}$ denote the outputs of the arctangent support vector machine model and the mean absolute error neural network model in the second complementary robust base classifier for the j-th unlabeled sample, respectively; H is the number of samples of the unlabeled subset.
6. The method as recited in claim 1, further comprising:

triggering an anomaly alarm if the anomaly detection result of either the arctangent support vector machine model or the mean absolute error neural network model in the trained complementary robust base classifier is abnormal;

triggering no alarm if the anomaly detection results of both the arctangent support vector machine model and the mean absolute error neural network model in the trained complementary robust base classifier are normal.
7. The method of claim 1, wherein the mean absolute error neural network model is structured as a fully connected neural network.
8. The method of claim 1, wherein the arctangent support vector machine model is structured as a soft-margin support vector machine.
9. A noise-tag-scene-oriented step-by-step robust semi-supervised gas turbine anomaly detection system, comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor implements the noise-tag-scene-oriented step-by-step robust semi-supervised gas turbine anomaly detection method as recited in any one of claims 1-8 when executing the computer program.
CN202310656040.0A 2023-06-05 2023-06-05 Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system Pending CN116975755A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310656040.0A CN116975755A (en) 2023-06-05 2023-06-05 Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system


Publications (1)

Publication Number Publication Date
CN116975755A true CN116975755A (en) 2023-10-31

Family

ID=88478579

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310656040.0A Pending CN116975755A (en) 2023-06-05 2023-06-05 Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system

Country Status (1)

Country Link
CN (1) CN116975755A (en)

Similar Documents

Publication Publication Date Title
JP7069269B2 (en) Semi-supervised methods and systems for deep anomaly detection for large industrial surveillance systems based on time series data using digital twin simulation data
Jang et al. Adversarial autoencoder based feature learning for fault detection in industrial processes
JP2022037241A (en) Abnormality detection system, abnormality detection method, abnormality detection program, and method for generating learned model
US7251582B2 (en) Fault diagnosis
CN113065581B (en) Vibration fault migration diagnosis method for reactance domain self-adaptive network based on parameter sharing
CN110757510B (en) Method and system for predicting remaining life of robot
CN116448419A (en) Zero sample bearing fault diagnosis method based on depth model high-dimensional parameter multi-target efficient optimization
CN114282443B (en) Residual service life prediction method based on MLP-LSTM supervised joint model
CN112418306B (en) Gas turbine compressor fault early warning method based on LSTM-SVM
CN112926642B (en) Multi-source domain self-adaptive intelligent mechanical fault diagnosis method and system
CN109298633A (en) Chemical production process fault monitoring method based on adaptive piecemeal Non-negative Matrix Factorization
CN114297918A (en) Aero-engine residual life prediction method based on full-attention depth network and dynamic ensemble learning
Farahani et al. On the application of domain adversarial neural network to fault detection and isolation in power plants
CN116929815A (en) Equipment working state monitoring system and method based on Internet of things
CN116336400A (en) Baseline detection method for oil and gas gathering and transportation pipeline
CN113239610A (en) Domain self-adaptive rolling bearing fault diagnosis method based on Wasserstein distance
Huang et al. Static and dynamic joint analysis for operation condition division of industrial process with incremental learning
CN116842323A (en) Abnormal detection method for operation data of water supply pipeline
Alinezhad et al. A modified bag-of-words representation for industrial alarm floods
CN116975755A (en) Noise-tag-scene-oriented stepping robust semi-supervised gas turbine anomaly detection method and system
CN116070134A (en) Intelligent equipment fault diagnosis method and system based on prototype learning
CN115618506A (en) Method for predicting power of single-shaft combined cycle gas turbine
CN115017978A (en) Fault classification method based on weighted probability neural network
Lu et al. Visual monitoring of industrial operation states based on kernel fisher vector and self-organizing map networks
Lööv Comparison of undersampling methods for prediction of casting defects based on process parameters

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination