CN115358274A - DCGAN-CNN-based fault classification method - Google Patents

DCGAN-CNN-based fault classification method Download PDF

Info

Publication number
CN115358274A
CN115358274A CN202211046543.8A CN202211046543A CN115358274A CN 115358274 A CN115358274 A CN 115358274A CN 202211046543 A CN202211046543 A CN 202211046543A CN 115358274 A CN115358274 A CN 115358274A
Authority
CN
China
Prior art keywords
data
network
dcgan
cnn
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211046543.8A
Other languages
Chinese (zh)
Inventor
黑新宏
张宽
费蓉
姬文江
谢国
高苗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Technology
Original Assignee
Xian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Technology filed Critical Xian University of Technology
Priority to CN202211046543.8A priority Critical patent/CN115358274A/en
Publication of CN115358274A publication Critical patent/CN115358274A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Complex Calculations (AREA)

Abstract

The invention discloses a DCGAN-CNN-based fault classification method, which comprises the steps of firstly, extracting time-frequency characteristic information from a fault signal by using wavelet transform to form a new characteristic fault data set; then constructing a generator and a discriminator based on a convolutional network; inputting the characteristic fault data set into a DCGAN network model for training, and generating a synthesized fault data sample after training is finished; adding the synthesized fault data sample into a characteristic fault data set, and dividing a training set and a testing set; constructing a classifier based on the CNN, and training a network by using a training set to obtain a CNN classification model; and testing the classification model by using the test set, and performing model evaluation through accuracy, precision, recall rate and F1-Score evaluation indexes. The invention enhances the data by combining the wavelet transformation with the DCGAN to improve the fault diagnosis accuracy under the condition of data imbalance.

Description

DCGAN-CNN-based fault classification method
Technical Field
The invention belongs to the technical field of fault diagnosis, and particularly relates to a fault classification method based on DCGAN-CNN.
Background
The modern industrial process system has irreplaceable effect in promoting social progress and development. Along with the improvement of the science and technology level, industrial equipment is increasingly complicated, informationized and intelligentized, and a fault occurs in equipment, so that other parts can be influenced to a certain extent, and therefore, the safety and the reliability of machinery can be effectively improved by accurately diagnosing the fault. In recent years, the intelligent fault diagnosis technology based on deep learning develops rapidly, and reliable guarantee is provided for normal operation of industrial machines. Compared with the traditional method for manually selecting the features, the fault diagnosis framework based on deep learning has strong feature learning capability, and can automatically select the distinguishing features beneficial to accurate classification. Due to the particularity of deep learning training, the amount of data required is large, but in practice training samples between different machine operating states are often unbalanced. For a mechanical system in operation, which works under normal conditions most of the time, sufficient normal samples are collected, but the frequency of mechanical failure is low, the number of collected samples is limited, and therefore an imbalance exists between normal and failed samples.
For fault sample classification of unbalanced data, generating a countermeasure network (GAN) can solve the data imbalance problem by generating a small number of class samples. As a generation model, GAN and its derivative models are commonly used for generating samples for data enhancement and data preprocessing methods in deep learning, and have wide application scenarios in the fields of image processing, biomedicine, network, information security, and the like. The invention adopts wavelet transformation to extract the fault signal characteristics, generates a countermeasure network (DCGAN) based on the variant-depth convolution generated countermeasure network to solve the problem of data imbalance, and constructs a classifier by combining with the CNN to realize the fault classification task.
Disclosure of Invention
The invention aims to provide a DCGAN-CNN-based fault classification method, which can improve the fault diagnosis accuracy under data imbalance.
The technical scheme adopted by the invention is that a DCGAN-CNN fault classification method is implemented according to the following steps:
step 1, extracting time-frequency characteristic information from a fault signal by using wavelet transform to form a new characteristic fault data set;
step 2, constructing a DCGAN network model: constructing a generator and a discriminator based on a convolutional network;
step 3, inputting the characteristic fault data set in the step 1 into a DCGAN network model for training, and generating a synthesized fault data sample after training is finished;
step 4, adding the synthesized fault data sample obtained in the step 3 into the characteristic fault data set obtained in the step 1, and dividing a training set and a testing set;
step 5, constructing a classifier based on the CNN, and training the network by using a training set to obtain a CNN classification model;
and 6, testing the classification model obtained in the step 5 by using the test set in the step 4, and performing model evaluation according to the accuracy, precision, recall rate and F1-Score evaluation index.
The present invention is also characterized in that,
the step 1 is implemented according to the following steps:
step 1.1, selecting a wavelet function and a scale a, and aligning the wavelet function with a starting point of a signal to be analyzed;
step 1.2, starting from the signal initial position, calculating the approximation degree of the signal to be analyzed and the wavelet function at the moment, namely calculating to obtain the wavelet coefficient by using a formula (1), wherein the larger the wavelet coefficient is, the closer the waveform of the signal and the selected wavelet function at the moment is represented;
step 1.3, shifting the wavelet function to the right for a unit time along a time axis, namely scanning along the time axis, and repeating the step 1.2 to obtain the wavelet coefficient at the moment until the whole length of the signal to be analyzed is scanned;
step 1.4, changing the scale a, and repeating the steps 1.2-1.3 to complete the scanning of the frequency axis;
and respectively completing the time domain and frequency domain feature analysis of the fault signal based on the step 1.3 and the step 1.4.
The DCGAN network model in step 2 consists of two modules: the generator network G and the discriminator network D, the task of the generator is to receive the noise z distributed randomly, make the sample G (z) synthesized accord with the true sample distribution; the task of the discriminator is to receive the data G (z) of the generator and the real sample data x and to discriminate between true and false of the received data.
Step 2 the network construction process is specifically implemented according to the following steps:
step 2.1, constructing a generator network G: the input of the generator network G is random noise z, the random noise z is converted into the size of a real sample through three-layer deconvolution operation, batch normalization is carried out among the three-layer deconvolution operation, and the output of the generator is an image with the same size as the data set in the step 1;
step 2.2, constructing a discriminator network D: the input of the discriminator network D is the real sample data x in the step 1 and the synthetic sample data G (z) output by the generator G in the step 2, the discriminator network D passes the input data through three convolution layers, and finally the output layer calculates the probability of whether the input image is real by using a sigmoid activation function.
Step 3 is specifically implemented according to the following steps:
the loss function of DCGAN is given by the following formula (2):
Figure BDA0003822605680000041
wherein G, D represent the generator and the discriminator, x-p data (x) Represents the distribution of real data, z-p z (z) represents a random noise distribution, D (x) represents an output result of the real data passing through the discriminator, and D (G (z)) represents an output result of the generator synthetic data G (z) passing through the discriminator;
optimizing the DCGAN network model based on the formula (2), specifically as follows:
step 3.1, updating the parameters of the generator network by minimizing the loss of the formula (2), so that the output of the generator network is more similar to the real data;
step 3.2, updating network parameters of the discriminator by maximizing the loss result of the formula (2), so that the discriminator network can more accurately distinguish real data from synthetic data;
the countermeasure training is carried out through the step 3.1 and the step 3.2, the DCGAN network parameters are updated, the generator and the discriminator reach a relative balance state, the trained DCGAN can be used for generating data, and the problem of unbalanced original data is solved.
Step 5 is specifically implemented according to the following steps:
and (3) adopting the CNN as a classifier, taking the data in the step (4) as input, forming a CNN network by using 3 convolution kernels of 3 × 3 convolution layers, two 2 × 2 maximum pooling layers, a full connection layer and a softmax output layer, and training a CNN network model by using a training set to obtain a CNN classification model.
The invention has the advantages that the DCGAN-CNN-based fault classification method improves the problems of gradient disappearance, mode collapse and the like of a basic GAN model, simultaneously uses wavelet transformation, better utilizes time domain and frequency domain information of fault signals, extracts time-frequency image characteristics to facilitate DCGAN processing, effectively expands fault data and improves classification accuracy.
Drawings
FIG. 1 is an overall flow chart of a DCGAN-CNN-based fault classification method according to the present invention.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.
The invention relates to a DCGAN-CNN fault classification based method, a flow chart is shown in figure 1, and the method is implemented according to the following steps:
step 1, extracting time-frequency characteristic information from a fault signal by using wavelet transform to form a new characteristic fault data set;
the step 1 is implemented according to the following steps:
wavelet transform is a time domain-frequency domain analysis method of signals, and has the capability of representing local characteristics of the signals in both time domain and frequency domain. The meaning is that after the basic wavelet function is shifted by tau, the basic wavelet function and the signal to be analyzed x (t) are subjected to inner product under different scales a, namely
Figure BDA0003822605680000051
Wherein WT (a, τ) represents wavelet coefficients; a is more than 0 and is called as a scale factor, the expansion and contraction of the wavelet function are controlled, and the frequency of the corresponding signal is controlled; and tau represents displacement, controls the translation of the wavelet function and corresponds to the time information of the signal.
Step 1.1, selecting a wavelet function and a scale a, and aligning the wavelet function with a starting point of a signal to be analyzed;
step 1.2, starting from the initial position of the signal, calculating the approximation degree of the signal to be analyzed and the wavelet function at the moment, namely calculating to obtain the wavelet coefficient by using a formula (1), wherein the larger the wavelet coefficient is, the closer the waveform of the signal and the selected wavelet function at the moment is represented;
step 1.3, shifting the wavelet function to the right for a unit time along a time axis, namely scanning along the time axis, and repeating the step 1.2 to obtain the wavelet coefficient at the moment until the whole length of the signal to be analyzed is scanned;
step 1.4, changing the scale a, and repeating the steps 1.2-1.3 to complete the scanning of the frequency axis;
and respectively completing the time domain and frequency domain feature analysis of the fault signal based on the step 1.3 and the step 1.4.
Step 2, constructing a DCGAN network model: constructing a generator and a discriminator based on a convolutional network;
the DCGAN network model in step 2 consists of two modules: the generator network G and the discriminator network D, the task of the generator is to receive the noise z distributed randomly, make the sample G (z) synthesized accord with the true sample distribution; the task of the discriminator is to receive the data G (z) of the generator and the real sample data x and to discriminate the authenticity of the received data.
Step 2, the network construction process is implemented according to the following steps:
step 2.1, constructing a generator network G: the input of the generator network G is random noise z, the random noise z is converted into the size of a real sample through three layers of deconvolution operations, batch normalization is carried out among the three layers of deconvolution operations, and the output of the generator is an image with the same size as the data set in the step 1; the method is beneficial to processing the training problem caused by poor initialization, and meanwhile, the model training is accelerated, so that the training stability is improved;
step 2.2, constructing a discriminator network D: the input of the discriminator network D is the real sample data x in the step 1 and the synthetic sample data G (z) output by the generator G in the step 2, the discriminator network D passes the input data through three convolution layers, and finally the output layer calculates the probability of whether the input image is real by using a sigmoid activation function.
Step 3, inputting the characteristic fault data set in the step 1 into a DCGAN network model for training, and generating a synthesized fault data sample after training is finished;
step 3 is specifically implemented according to the following steps:
the loss function of DCGAN is given by the following formula (2):
Figure BDA0003822605680000071
wherein G, D represent the generator and the discriminator, x-p data (x) Represents the distribution of real data, z-p z (z) represents a random noise distribution, D (x) represents an output result of the real data passing through the discriminator, and D (G (z)) represents an output result of the generator synthetic data G (z) passing through the discriminator;
optimizing the DCGAN network model based on the formula (2), specifically as follows:
step 3.1, updating generator network parameters through minimizing the loss of the formula (2), so that the output of the generator network is more similar to real data;
step 3.2, updating network parameters of the discriminator by maximizing the loss result of the formula (2), so that the discriminator network can more accurately distinguish real data from synthetic data;
the countermeasure training is carried out through the step 3.1 and the step 3.2, the DCGAN network parameters are updated, so that the generator and the discriminator reach a relative balance state, the trained DCGAN can be used for generating data, and the problem of original data imbalance is solved.
Step 4, adding the synthesized fault data sample obtained in the step 3 into the characteristic fault data set obtained in the step 1, and dividing a training set and a testing set;
step 5, constructing a classifier based on the CNN, and training the network by using a training set to obtain a CNN classification model;
step 5 is specifically implemented according to the following steps:
and (3) adopting the CNN as a classifier, taking the data in the step (4) as input, forming a CNN network by using 3 convolution kernels of 3 × 3 convolution layers, two 2 × 2 maximum pooling layers, a full connection layer and a softmax output layer, and training a CNN network model by using a training set to obtain a CNN classification model.
And 6, testing the classification model obtained in the step 5 by using the test set in the step 4, and performing model evaluation according to the accuracy, precision, recall rate and F1-Score evaluation index.

Claims (6)

1. A DCGAN-CNN fault classification based method is characterized by being implemented according to the following steps:
step 1, extracting time-frequency characteristic information from a fault signal by using wavelet transform to form a new characteristic fault data set;
step 2, constructing a DCGAN network model: constructing a generator and a discriminator based on a convolutional network;
step 3, inputting the characteristic fault data set in the step 1 into a DCGAN network model for training, and generating a synthesized fault data sample after training is finished;
step 4, adding the synthesized fault data sample obtained in the step 3 into the characteristic fault data set obtained in the step 1, and dividing a training set and a testing set;
step 5, constructing a classifier based on the CNN, and training the network by using a training set to obtain a CNN classification model;
and 6, testing the classification model obtained in the step 5 by using the test set in the step 4, and performing model evaluation according to the accuracy, precision, recall rate and F1-Score evaluation index.
2. The method for fault classification of DCGAN-CNN according to claim 1, wherein the step 1 is specifically implemented according to the following steps:
step 1.1, selecting a wavelet function and a scale a, and aligning the wavelet function with a starting point of a signal to be analyzed;
step 1.2, starting from the initial position of the signal, calculating the approximation degree of the signal to be analyzed and the wavelet function at the moment, namely calculating to obtain the wavelet coefficient by using a formula (1), wherein the larger the wavelet coefficient is, the closer the waveform of the signal and the selected wavelet function at the moment is represented;
step 1.3, shifting the wavelet function to the right for a unit time along a time axis, namely scanning along the time axis, and repeating the step 1.2 to obtain the wavelet coefficient at the moment until the whole length of the signal to be analyzed is scanned;
step 1.4, changing the scale a, and repeating the steps 1.2-1.3 to complete the scanning of the frequency axis;
and respectively completing the time domain and frequency domain feature analysis of the fault signal based on the step 1.3 and the step 1.4.
3. The method according to claim 2, wherein the DCGAN network model in step 2 comprises two modules: the generator network G and the discriminator network D, the task of the generator is to receive the noise z distributed randomly, make the sample G (z) synthesized accord with the true sample distribution; the task of the discriminator is to receive the data G (z) of the generator and the real sample data x and to discriminate the authenticity of the received data.
4. The method for fault classification based on DCGAN-CNN according to claim 3, wherein the step 2 network construction process is specifically implemented according to the following steps:
step 2.1, constructing a generator network G: the input of the generator network G is random noise z, the random noise z is converted into the size of a real sample through three-layer deconvolution operation, batch normalization is carried out among the three-layer deconvolution operation, and the output of the generator is an image with the same size as the data set in the step 1;
step 2.2, constructing a discriminator network D: and (3) inputting the real sample data x in the step (1) and the synthetic sample data G (z) output by the generator G in the step (2) by the discriminator network D, passing the input data through three convolution layers by the discriminator network D, and finally calculating the probability of whether the input image is real or not by using a sigmoid activation function by the output layer.
5. The method for fault classification of DCGAN-CNN according to claim 4, wherein the step 3 is specifically implemented according to the following steps:
the loss function of DCGAN is given by the following formula (2):
Figure FDA0003822605670000031
wherein G and D represent a generator and a discriminator, respectively, and x to p data (x) Represents the distribution of real data, z-p z (z) represents a random noise distribution, D (x) represents an output result of the real data passing through the discriminator, and D (G (z)) represents an output result of the generator synthetic data G (z) passing through the discriminator;
based on the above formula (2), the DCGAN network model is optimized, specifically as follows:
step 3.1, updating generator network parameters through minimizing the loss of the formula (2), so that the output of the generator network is more similar to real data;
step 3.2, updating network parameters of the discriminator by maximizing the loss result of the formula (2), so that the discriminator network can more accurately distinguish real data from synthetic data;
the countermeasure training is carried out through the step 3.1 and the step 3.2, the DCGAN network parameters are updated, so that the generator and the discriminator reach a relative balance state, the trained DCGAN can be used for generating data, and the problem of original data imbalance is solved.
6. The method for fault classification of DCGAN-CNN according to claim 5, wherein the step 5 is specifically implemented according to the following steps:
and (3) adopting the CNN as a classifier, taking the data in the step (4) as input, forming a CNN network by using a convolution layer with 3 convolution kernels of 3 x 3, two maximum pooling layers of 2 x 2, a full connection layer and a softmax output layer, and training a CNN network model by using a training set to obtain a CNN classification model.
CN202211046543.8A 2022-08-30 2022-08-30 DCGAN-CNN-based fault classification method Pending CN115358274A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211046543.8A CN115358274A (en) 2022-08-30 2022-08-30 DCGAN-CNN-based fault classification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211046543.8A CN115358274A (en) 2022-08-30 2022-08-30 DCGAN-CNN-based fault classification method

Publications (1)

Publication Number Publication Date
CN115358274A true CN115358274A (en) 2022-11-18

Family

ID=84004639

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211046543.8A Pending CN115358274A (en) 2022-08-30 2022-08-30 DCGAN-CNN-based fault classification method

Country Status (1)

Country Link
CN (1) CN115358274A (en)

Similar Documents

Publication Publication Date Title
CN111914883B (en) Spindle bearing state evaluation method and device based on deep fusion network
CN109655259A (en) Combined failure diagnostic method and device based on depth decoupling convolutional neural networks
CN109472817B (en) Multi-sequence magnetic resonance image registration method based on loop generation countermeasure network
CN110297479B (en) Hydroelectric generating set fault diagnosis method based on convolutional neural network information fusion
CN109781411A (en) A kind of combination improves the Method for Bearing Fault Diagnosis of sparse filter and KELM
CN110110768A (en) Fault Diagnosis of Roller Bearings based on Concurrent Feature study and multi-categorizer
CN115127814A (en) Unsupervised bearing fault diagnosis method based on self-adaptive residual error countermeasure network
CN113887136A (en) Improved GAN and ResNet based electric vehicle motor bearing fault diagnosis method
CN111275108A (en) Method for performing sample expansion on partial discharge data based on generation countermeasure network
CN112926728B (en) Small sample turn-to-turn short circuit fault diagnosis method for permanent magnet synchronous motor
CN115374820A (en) Rotary machine cross-domain fault diagnosis method based on multi-source sub-domain adaptive network
CN114091539A (en) Multi-mode deep learning rolling bearing fault diagnosis method
CN115962946A (en) Bearing fault diagnosis method based on improved WGAN-GP and Alxnet
CN116340859A (en) Marine wind turbine generator gearbox fault diagnosis method based on vibration signals under noise background
CN116403032A (en) Breaker fault evaluation method based on multi-domain information fusion and deep learning
CN116401603A (en) Multi-mode bearing fault intelligent diagnosis method based on transfer learning
CN117113078A (en) Small sample bearing fault mode identification method and system based on multi-source data integration
CN117475191A (en) Bearing fault diagnosis method for feature alignment domain antagonistic neural network
CN116541771A (en) Unbalanced sample bearing fault diagnosis method based on multi-scale feature fusion
CN116610935A (en) Mechanical fault detection method based on engine vibration signal multi-mode analysis
CN115358274A (en) DCGAN-CNN-based fault classification method
CN115982566A (en) Multi-channel fault diagnosis method for hydroelectric generating set
CN115828088A (en) High-voltage shunt reactor vibration abnormity fault diagnosis method based on deep learning
CN110631827B (en) Gearbox fault diagnosis method based on improved collaborative representation classification
CN112801955B (en) Plankton detection method under unbalanced population distribution condition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination