CN113310689B

CN113310689B - Aeroengine transmission system fault diagnosis method based on domain self-adaptive graph convolution network

Info

Publication number: CN113310689B
Application number: CN202110588075.6A
Authority: CN
Inventors: 孙闯; 李天福; 赵志斌; 王诗彬; 田绍华; 严如强; 陈雪峰
Original assignee: Xian Jiaotong University
Current assignee: Xian Jiaotong University
Priority date: 2021-05-27
Filing date: 2021-05-27
Publication date: 2022-07-12
Anticipated expiration: 2041-05-27
Also published as: CN113310689A

Abstract

The invention discloses a method for diagnosing faults of an aeroengine transmission system based on a domain self-adaptive graph convolution network, which comprises the following steps: acquiring first vibration signals of an aircraft engine transmission system with unknown faults at different rotating speeds, and taking the first vibration signals as label-free target domain data; constructing a domain self-adaptive graph convolution network, acquiring second vibration signals of an aircraft engine transmission system with known faults at different rotating speeds, taking the second vibration signals as source domain data with labels, and taking the source domain data and part of target domain data as input to train the domain self-adaptive graph convolution network; inputting the other part of target domain data into the trained domain self-adaptive graph convolution network to obtain a prediction label of the target domain data, and realizing the migration diagnosis of the faults of the transmission system of the aircraft engine at different rotating speeds.

Description

Aeroengine transmission system fault diagnosis method based on domain self-adaptive graph convolution network

Technical Field

The disclosure belongs to the field of mechanical fault diagnosis, and particularly relates to a method for diagnosing faults of an aeroengine transmission system based on a domain self-adaptive graph convolution network.

Background

Intelligent diagnostics plays an important role in health management systems for aircraft engine transmission systems, which have been widely used in modern industry and whose primary purpose is to monitor equipment status and reduce down time. At present, an unsupervised domain self-adaptive method is successfully applied to mechanical fault diagnosis under variable working conditions. In the unsupervised domain self-adaptive method, three types of information such as class labels, domain labels and data structures are important in the process of realizing migration from a labeled source domain to an unlabeled target domain. However, most existing unsupervised domain adaptation methods only use the first two kinds of information, and neglect modeling of the data structure, which makes the information contained in the features extracted by the deep network incomplete. Therefore, there is a need for a model that can embed data structure information also in the extracted features.

The above information disclosed in this background section is only for enhancement of understanding of the background of the invention and therefore it may contain information that does not form the prior art that is already known in this country to a person of ordinary skill in the art.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide a domain-adaptive graph convolution network-based aeroengine transmission system fault diagnosis method, which converts the features extracted by a convolution neural network into association graph data through a design graph generation layer, and then uses the graph convolution network to model the generated association graph, so that the structural information among the data is also embedded into the extracted features, the information contained in the features is more complete, and the distinguishability of the extracted features and the robustness of a model are improved.

In order to achieve the above purpose, the present disclosure provides the following technical solutions:

a method for diagnosing faults of an aeroengine transmission system based on a domain self-adaptive graph convolution network comprises the following steps:

s100: acquiring first vibration signals of an aircraft engine transmission system with unknown faults at different rotating speeds, and taking the first vibration signals as label-free target domain data;

s200: constructing a domain self-adaptive graph convolution network, acquiring second vibration signals of an aircraft engine transmission system with known faults at different rotating speeds, taking the second vibration signals as source domain data with labels, and taking the source domain data and part of target domain data as input to train the domain self-adaptive graph convolution network;

s300: inputting the other part of target domain data into the trained domain self-adaptive graph convolution network to obtain a prediction label of the target domain data, and realizing the migration diagnosis of the faults of the transmission system of the aircraft engine at different rotating speeds.

Preferably, in step S200, the domain adaptive graph convolution network includes:

the characteristic extractor F comprises four layers of convolutional neural networks, one graph generation layer and two layers of graph convolution networks, and is used for extracting characteristic values of source domain data and part of target domain data;

the fault classifier C comprises a full connection layer and is used for identifying the fault type carried by the source domain data;

a domain discriminator D including three layers of map convolutional layers for discriminating a difference between source domain data and target domain data;

a data structure aligner S, said data structure aligner S comprising a maximum mean difference distance estimator for aligning data structures of source domain data and target domain data.

Preferably, the expression of the feature extractor F is:

and is

Wherein GConv (-) and Conv (-) denote graph convolution layer operation and convolution operation, respectively, GGL (-) denotes a graph generation layer,

x represents an adjacency matrix and a node characteristic matrix of the correlation diagram constructed by the diagram generation layer, respectively.

Preferably, the expression of the fault classifier C is:

C(X)＝aX+b

where a and b represent the weight and bias of the fully-connected layer, respectively.

Preferably, the expression of the domain discriminator D is:

D(X_s，X_t)＝Ｃonv(|X_s，X_t|)

wherein, X_sAnd X_tRespectively representing the learned source domain data features and target domain data features. | represents the splicing operation.

Preferably, the expression of the data structure aligner S is:

wherein the content of the first and second substances,

representing a set of mapping functions, k (·) representing

M and n represent the number of source domain data and target domain data, respectively,

and

respectively representing the ith sample from the source domain and the target domain,

and

respectively, the jth sample from the source domain and the target domain, s and t respectively, the source domain and the target domain, and i and j respectively, the position of the sample in the sample set.

Preferably, in step S200, the training of the domain adaptive graph convolution network includes the following steps:

s201: dividing the labeled source domain data into a first training sample and a first test sample, dividing part of unlabeled target domain data into a second training sample and a second test sample, and simultaneously inputting the first training sample and the second training sample into a feature extractor FThe convolutional neural network carries out feature extraction to obtain source domain features

And target domain characteristics

S202: characterizing the source domain

Inputting the data into a fault classifier C to obtain a predicted value of the fault category of the source domain data

And taking the cross entropy loss as a fault classification loss, wherein the fault classification loss is expressed as:

where E represents the mathematical expectation, CE represents the cross-entropy loss,

representing source domain samples

The label of (1);

s203: characterizing a source domain

And target domain characteristics

Inputting the prediction value of each type of data domain label into a domain discriminator D, recording the prediction value as 0 or 1, calculating the two-classification cross entropy loss with the real domain label after obtaining the domain label, and obtaining the domain classification loss, wherein the domain classification loss is expressed as:

wherein f () takes 0 or 1 to indicate whether this type of data belongs to the source domain or the target domain;

s204: characterizing a source domain

And target domain characteristics

Inputting into a data structure aligner S, measuring the structural difference between the source domain features and the target domain features through an MMD distance estimator to obtain a structural difference loss, wherein the structural difference loss is expressed as:

where φ (-) represents a non-linear mapping function, Ω represents a measure of this distance by embedding the extracted features into the regenerative kernel Hilbert space, | | - | computationally |²Representing a calculation of Euclidean distance;

s205: constructing an objective function using the fault classification loss, the domain classification loss, and the structural difference loss, the objective function being represented as:

L_Total＝L_C+γL_D+κL_MMD

wherein L is_TotalRepresenting the overall objective function, γ and κ representing the equilibrium coefficients;

s206: updating parameters of the domain adaptive graph convolution network according to the objective function, and repeatedly executing the steps S201 to S205 until reaching a specified training time, wherein the parameters of the domain adaptive graph convolution network are updated according to the following formula:

wherein the content of the first and second substances,

representing a differential operator, eta a learning rate, theta_F、θ_CAnd theta_DLearnable parameters, θ ', representing the feature extractor, the fault classifier, and the domain discriminator, respectively'_CAnd θ'_DRespectively representing parameters updated by learning, L_Total、L_CAnd L_DRespectively representing an overall objective function, a fault classification loss and a domain classification loss;

s207: and testing the trained domain adaptive graph convolution network by using the first test sample and the second test sample.

Preferably, the source domain characteristics are obtained

And target domain characteristics

Then, the extracted source domain features and target domain features need to be converted into associated graph data through a graph generation layer, and the specific conversion process comprises the following steps:

a. inputting the node characteristic matrix X extracted by the convolutional neural network into a full connection layer of a fault classifier C for nonlinear mapping;

b. calculating the product of the output of the full connection layer of the fault classifier C and the output transposition to obtain a product result matrix, and normalizing the matrix according to rows to obtain a normalized matrix A;

c. using Top-k sorting mechanism to select the Top k values with maximum value of each row in matrix A, thereby obtaining adjacent matrix

After the node characteristic matrix X and the adjacency matrix of the graph are obtained

The dependency graph data is obtained.

Preferably, the expression of the map-generating layer is:

wherein X represents a node feature matrix extracted by a convolutional neural network, MLP (-) represents a fully connected layer,

the output of the MLP is represented as,

to represent

Transpose of (a), normalization (b), A denotes the constructed adjacency matrix, Top-k (c) denotes the index returning the first k maxima of A line by line,

is the finally obtained adjacency matrix.

Preferably, after obtaining the association map data, modeling the association map data by using a graph convolution network so that the structural information between the source domain and the target domain data is embedded into the extracted source domain feature and the extracted target domain feature.

Compared with the prior art, the beneficial effect that this disclosure brought does:

the method and the device model class labels, domain labels and data structure information contained in unsupervised domain self-adaptation, so that the information contained in the extracted vibration signal features is more complete, the domain difference between source domain data and target domain data is reduced, and the distinguishability of the features and the diagnosis precision on the target domain data are improved.

Drawings

FIG. 1 is a flow chart of a method for diagnosing faults of an aeronautical engine transmission system based on a domain adaptive graph convolution network according to an embodiment of the present disclosure;

FIG. 2 is a schematic diagram of a domain adaptive graph convolution network provided by another embodiment of the present disclosure;

FIG. 3 is a schematic diagram of a process for generating a correlation diagram in a diagram generation layer according to another embodiment of the disclosure;

4(a) to 4(c) are schematic diagrams of correlation diagram data modeling provided by another embodiment of the present disclosure;

5(a) -5 (d) are vibration signals at different faults provided by another embodiment of the present disclosure;

FIG. 6 is a schematic diagram of the results of a fault migration diagnosis of an aircraft engine driveline under different operating conditions according to another embodiment of the present disclosure.

Detailed Description

Specific embodiments of the present disclosure will be described in detail below with reference to fig. 1 to 6 of the accompanying drawings. While specific embodiments of the disclosure are shown in the drawings, it should be understood that the disclosure can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

It should be noted that certain terms are used throughout the description and claims to refer to particular components. As one skilled in the art will appreciate, various names may be used to refer to a component. This specification and claims do not intend to distinguish between components that differ in name but not function. In the following description and in the claims, the terms "include" and "comprise" are used in an open-ended fashion, and thus should be interpreted to mean "include, but not limited to. The description which follows is a preferred embodiment of the invention, but is made for the purpose of illustrating the general principles of the invention and not for the purpose of limiting the scope of the invention. The scope of the present disclosure is to be determined by the terms of the appended claims.

To facilitate an understanding of the embodiments of the present disclosure, the following detailed description is to be considered in conjunction with the accompanying drawings, and the drawings are not to be construed as limiting the embodiments of the present disclosure.

In one embodiment, as shown in fig. 1, a method for diagnosing a failure of an aeronautical engine transmission system based on a domain adaptive graph convolution network includes the following steps:

s100: collecting vibration signals of an aircraft engine transmission system at different rotating speeds;

in the step, bevel gear breakage, bevel gear abrasion, bevel gear return to factory overhaul and normal state of the transmission system of the aircraft engine at different rotating speeds are classified. Firstly, sample division is carried out on collected signals in four states, and corresponding labels are marked on the divided samples, namely 0 represents broken teeth, 1 represents gear wear, 2 represents factory return overhaul, and 3 represents normal. After the corresponding training samples are obtained, the domain adaptive graph convolution network disclosed by the disclosure is trained. And finally, inputting the test sample into a trained network, outputting a prediction result of the fault classifier C, if the output value is 0, indicating that the fault is broken teeth, if the output value is 1, indicating that the gear is worn, if the output value is 2, indicating that the gear is returned to a factory for overhaul, and if the output value is 3, indicating that the gear is in a normal state.

S200: constructing a domain self-adaptive graph convolution network, and training the domain self-adaptive graph convolution network by taking labeled source domain data and part of unlabeled target domain data as input data;

in this step, 20% of the target domain data is generally selected to train the domain adaptive graph convolution network, so that the feature extractor learns how to effectively extract features in the target domain data.

S300: and inputting the residual label-free target domain data into the trained domain self-adaptive graph convolution network, outputting the prediction result of the fault classifier, and realizing fault migration diagnosis of the aero-engine transmission system at different rotating speeds.

The above embodiments constitute a complete technical solution of the present disclosure, and by modeling the class label, the domain label and the data structure information included in the unsupervised domain adaptation, the information included in the extracted vibration signal feature is more complete, thereby reducing the domain difference between the source domain and the target domain data, and improving the distinguishability of the feature and the diagnosis precision on the target domain data.

In another embodiment, in step S200, as shown in fig. 2, the domain adaptive graph convolution network includes:

the characteristic extractor F comprises four layers of convolutional neural networks, one graph generation layer and two graph convolution layers, and is used for extracting a vibration signal characteristic value;

the fault classifier C comprises a full connection layer and is used for classifying and identifying faults;

The expression of the feature extractor F is:

and is

x represents an adjacency matrix and a node feature matrix of the correlation diagram constructed by the diagram generation layer, respectively.

The expression of the fault classifier C is as follows:

C(X)＝aX+b

The expression of the domain discriminator D is:

D(X_s，X_t)＝Ｃonv(|X_s，X_t|)

The expression of the data structure aligner S is:

wherein the content of the first and second substances,

representing a set of mapping functions, k (-) representing

and

and

respectively, j-th samples from the source domain and the target domain, s and t respectively, samples from the source domain and the target domain, and i and j respectively, the positions of the samples in the sample set.

In another embodiment, in step S200, training the domain adaptive graph convolution network includes the following steps:

s201: dividing the labeled source domain data into a first training sample and a first test sample, dividing part of unlabeled target domain data into a second training sample and a second test sample, and dividing the first training sample and the second training sampleSimultaneously inputting the convolution neural network in the feature extractor F for feature extraction to obtain source domain features

And target domain characteristics

S202: characterizing the source domain

representing source domain samples

The label of (1);

s203: characterizing a source domain

And target domain characteristics

wherein Γ (·) takes a value of 0 or 1 to indicate whether the class of data belongs to a source domain or a target domain;

s204: characterizing a source domain

And target domain characteristics

s205: constructing an objective function using the fault classification loss, the domain classification loss, and the structural difference loss, the objective function being expressed as:

L_Total＝L_C+γL_D+κL_MMD

wherein the content of the first and second substances,

representing a differential operator, eta a learning rate, theta_F、θ_ＣAnd theta_DLearnable parameters, θ ', representing the feature extractor, the fault classifier, and the domain discriminator, respectively'_CAnd θ'_DRespectively representing parameters updated by learning, L_Total、L_CAnd L_DRespectively representing the overall objective function, the fault classification loss and the domain classification loss.

In this step, a parameter θ is set_FIs 1, the learning rate is 0.001, and the parameter theta is corrected according to the overall objective function_FObtaining a deviation derivative

With a value of 3, θ can be obtained according to the above formula_FThe updated value was 0.997.

In the step, the performance of the domain self-adaptive graph convolution network is evaluated through a total precision index, and the evaluation method comprises the following steps: the total number of correctly classified samples is divided by the total number of samples.

If the overall accuracy index is lower than the set value in the test process, the number of training samples needs to be increased to train the model again until the index reaches the set value.

In another embodiment, after obtaining the source domain features and the target domain features, the extracted source domain features and target domain features need to be converted into association map data through a map generation layer, as shown in fig. 3, a specific conversion process includes the following steps:

a. inputting the node characteristic matrix X extracted by the convolutional neural network into a full connection layer for nonlinear mapping;

b. calculating the product of the output of the full-connection layer and the output transpose to obtain a product result matrix, and normalizing the matrix according to rows to obtain a normalized matrix A;

The dependency graph data is obtained.

In this embodiment, the expression of the map generation layer is:

and represents the output of the MLP (or MLP),

to represent

Normalized, A represents the constructed adjacency matrix, Top-k (-) represents the index that returns the first k maxima of A row by row,

is the finally obtained adjacency matrix.

In another embodiment, after obtaining the association graph data, modeling the association graph data through a graph convolution network, so that the structural information between the source domain data and the target domain data is embedded into the extracted source domain features and target domain features.

In this embodiment, as shown in fig. 4(a) to 4(c), modeling the association graph data by the graph convolution network includes the following steps:

a. as shown in fig. 4(a), determining the range of the aggregation neighborhood node at each convolution, where K ═ 1 indicates that the range of the aggregation neighborhood node is a node at a distance of 1 from the central node;

b. as shown in fig. 4(b), each node iterates to aggregate node information with a distance of 1 until a specified number of training times is reached;

c. as shown in fig. 4(c), the node feature output is learned and can be used for node classification and fault diagnosis.

In one embodiment, the acquired vibration signals at 4 rotation speeds are respectively numbered as 0, 1, 2 and 3, wherein, as shown in fig. 5(a), 0 represents a normal state; as shown in fig. 5(b), 1 represents surface wear; as shown in fig. 5(c), 2 denotes a broken tooth; the tip end collapse is shown at 3 in FIG. 5 (d). One group of data collected under any two different working conditions is used as source domain data and is labeled, and the other group of data is used as target domain data without labels, so that 12 migration diagnosis tasks can be performed in total. In the experiment, 5 existing methods, namely, a Convolutional Neural Network (CNN), a maximum mean difference method (MMD), a relationship alignment method (CORAL), a domain adaptive network (DANN), and a conditional domain adaptive network (CDANN), were compared in addition to the Domain Adaptive Graph Convolution Network (DAGCN), and the experimental results are shown in table 1.

TABLE 1

FIG. 6 is a schematic illustration of the aeroengine driveline fault migration diagnostic results of the present disclosure at different rotational speeds.

From the results of table 1 and fig. 6, it can be seen that the average value of the diagnosis results obtained by the domain adaptive graph convolution network in each migration task is 78.81%, which is better than all the comparison methods, and the domain adaptive graph convolution network can obtain the best diagnosis effect in each migration task. The model can learn the characteristics with the domain invariant characteristics, so that the effective migration diagnosis of the aircraft engine transmission system under different working conditions is realized.

The foregoing describes the general principles of the present disclosure in conjunction with specific embodiments, however, it is noted that the advantages, effects, etc. mentioned in the present disclosure are merely examples and are not limiting, and they should not be considered essential to the various embodiments of the present disclosure. Furthermore, the foregoing disclosure of specific details is for the purpose of illustration and description and is not intended to be limiting, since the foregoing disclosure is not intended to be exhaustive or to limit the disclosure to the precise details disclosed.

Claims

1. A method for diagnosing faults of an aeroengine transmission system based on a domain self-adaptive graph convolution network comprises the following steps:

s200: constructing a domain self-adaptive graph convolution network, acquiring a second vibration signal of an aircraft engine transmission system with known faults at different rotating speeds, taking the second vibration signal as source domain data with labels, and taking the source domain data and part of target domain data as input to train the domain self-adaptive graph convolution network, wherein the domain self-adaptive graph convolution network comprises:

the characteristic extractor F comprises four layers of convolutional neural networks, a graph generation layer and two layers of graph convolutional networks, is used for extracting characteristic values of source domain data and part of target domain data, and is also used for converting the extracted characteristic values of the source domain data and the target domain data into association graph data and modeling the association graph data;

a data structure aligner S, said data structure aligner S comprising a maximum mean difference distance estimator for aligning data structures of source domain data and target domain data;

s300: and inputting the other part of target domain data into the trained domain self-adaptive graph convolution network to obtain a prediction label of the target domain data, thereby realizing the migration diagnosis of the faults of the transmission system of the aero-engine at different rotating speeds.

2. The method of claim 1, wherein the feature extractor F is expressed as:

and is

3. The method of claim 1, wherein the fault classifier C has the expression:

C(X)＝aX+b

4. The method of claim 1, wherein the domain discriminator D has the expression:

D(X_s,X_t)＝Conv(|X_s,X_t|)

wherein, X_sAnd X_tRespectively representing the learned source domain data characteristics and target domain data characteristics, |, representing the splicing operation.

5. The method of claim 1, wherein the data structure aligner, S, is expressed as:

wherein the content of the first and second substances,

representing a set of mapping functions, k (·) representing

and

and

6. The method of claim 1, wherein the training of the domain adaptive graph convolution network in step S200 comprises the steps of:

s201: dividing source domain data with labels into a first training sample and a first test sample, dividing part of target domain data without labels into a second training sample and a second test sample, simultaneously inputting the first training sample and the second training sample into a convolutional neural network in a feature extractor F for feature extraction to obtain source domain features

And target domain characteristics

S202: characterizing the source domain

Inputting the data into a fault classifier C to obtain a predicted value of a source domain data fault category

representing source domain samples

The label of (1);

s203: characterizing a source domain

And target domain characteristics

wherein Γ (·) takes either 0 or 1 to indicate whether the class of data belongs to the source domain or the target domain;

s204: characterizing a source domain

And target domain characteristics

L_Total＝L_C+γL_D+κL_MMD

wherein L is_TotalRepresents the overall objective function, γ and κ represent the balance coefficients;

wherein the content of the first and second substances,

representing a differential operator, eta a learning rate, theta_F、θ_CAnd theta_DLearnable parameters, θ ', of the feature extractor, the fault classifier and the domain discriminator, respectively'_CAnd θ'_DRespectively representing parameters updated by learning, L_Total、L_CAnd L_DRespectively representing an overall objective function, a fault classification loss and a domain classification loss;

7. The method of claim 6, wherein the source domain characteristics are obtained

And target domain characteristics

b. calculating the product of the output of the full-connection layer of the fault classifier C and the output transpose to obtain a product result matrix, and normalizing the matrix according to rows to obtain a normalized matrix A;

The dependency graph data is obtained.

8. The method of claim 7, wherein the graph generation layer is expressed as:

the output of the MLP is represented as,

to represent

Transpose of (a), normaize (-) denotes normalization, A denotes the constructed adjacency matrix, Top-k (-) denotes the index returning the first k maxima of A line by line,

is the finally obtained adjacency matrix.

9. The method of claim 7, wherein after obtaining the association graph data, the association graph data is modeled by using a graph convolution network such that structural information between the source domain and the target domain data is embedded in the extracted source domain features and target domain features.