CN111983994B - V-PCA fault diagnosis method based on complex industrial chemical process - Google Patents
V-PCA fault diagnosis method based on complex industrial chemical process Download PDFInfo
- Publication number
- CN111983994B CN111983994B CN202010813193.8A CN202010813193A CN111983994B CN 111983994 B CN111983994 B CN 111983994B CN 202010813193 A CN202010813193 A CN 202010813193A CN 111983994 B CN111983994 B CN 111983994B
- Authority
- CN
- China
- Prior art keywords
- matrix
- pca
- fault
- data
- fault diagnosis
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B23/00—Testing or monitoring of control systems or parts thereof
- G05B23/02—Electric testing or monitoring
- G05B23/0205—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
- G05B23/0218—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
- G05B23/0243—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model
- G05B23/0254—Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model based on a quantitative model, e.g. mathematical relationships between inputs and outputs; functions: observer, Kalman filter, residual calculation, Neural Networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computational Mathematics (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Algebra (AREA)
- Life Sciences & Earth Sciences (AREA)
- Operations Research (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Evolutionary Biology (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Automation & Control Theory (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a V-PCA fault diagnosis method based on a complex industrial chemical process. Due to inaccurate feature extraction of the traditional Principal Component Analysis (PCA), the phenomena of high false alarm rate and high false missing report rate frequently exist when the method is used for fault diagnosis. To this end, the present invention employs a new data preprocessing method to improve the performance index of PCA in fault diagnosis. Compared with the Euclidean distance preprocessing method of the traditional PCA method, the method cannot accurately reflect the correlation between data. Therefore, a change rate-based preprocessing method is provided, and the conditions of high false alarm rate and high missing report rate of PCA in fault diagnosis are effectively improved. Finally, the traditional PCA method and the PCA method based on the change rate preprocessing are verified through a simulation example. Experimental simulation results show that the method has better detection performance.
Description
Technical Field
The invention belongs to the field of industrial and chemical engineering fault diagnosis, and particularly relates to a V-PCA (principal component analysis based on change rate preprocessing) fault diagnosis method based on a complex industrial and chemical process.
Background
In the complex industrial chemical process, the failure of equipment and system can produce adverse effect on production safety, product quality or efficiency, and even cause personal casualty or pollution to the environment in serious cases. With the improvement of the integration level of the equipment, the industrial environment is increasingly complex and the requirements on the quality of products are continuously improved, and the requirements on the safety and the reliability of the equipment are increasingly high, so that the fault diagnosis technology is more and more emphasized.
The core of fault diagnosis is a diagnosis method, and the existing fault diagnosis methods can be mainly divided into two types: the fault diagnosis method based on the analytical model and the fault diagnosis method based on the data driving. The data-driven method is to transform process data from a measurement space to a feature space and then analyze the process data under the condition that an accurate mathematical model is difficult to establish by a target, so as to realize fault diagnosis of a system, and a multivariate statistical method is a representative of the method, wherein the most common method is a Principal Component Analysis (PCA) method. Although the PCA method is widely applied in the field of fault detection and fault diagnosis, the conventional PCA data preprocessing generally adopts normalization to make data dimensionless, and neglects the influence of dimension on the correlation between variables, thereby resulting in the loss of information after normalization. In an actual system, because the dimensions of the component variables are different, some variables playing an important role in the system cannot detect a fault with a smaller absolute value because the absolute value of the variables is smaller. Minor faults of these important variables play a key role in the stability and safety of system operation, and if the faults cannot be eliminated, the faults can affect the normal operation of the system, and even cause accidents.
Disclosure of Invention
In order to overcome the defects and shortcomings of the prior art, the invention designs a V-PCA method based on a complex industrial chemical process. Compared with the traditional PCA method, the fault diagnosis method based on the V-PCA has better effect on process system detection from the relative change of the samples.
In order to achieve the purpose, the invention is realized by the following technical scheme: the method comprises a data preprocessing stage, a V-PCA model establishing stage, an online fault detection and fault diagnosis stage and a simulation test stage.
Wherein the data preprocessing stage comprises the steps of:
step X1, constructing the collected original complex industrial chemical engineering TE data set into a multi-dimensional variable system matrix Xn∈Rm×n,XnAs follows:
wherein, the column vector xi(j)=[x1(j),x2(j),...xm(j)]TI 1,2, m, j 1,2, n represents a system variable, the data matrix is preprocessed, XnHas a mean vector of bn
Step X2, calculating the mean vector b of each variable samplen
Wherein lm=[1,1,...1]∈R1×mThe original data is transformed to obtain a matrix X*。
Wherein the transformation method is as follows
The physical significance of this equation is centered on the sample mean of each variable, and the rate of change represents the degree to which the raw data deviates from the center. Then, whether the test sample data collected in real time falls in a normal range is checked through the change rate obtained after the normal sample data is preprocessed, and if the projection is outside the normal region, a fault occurs.
Wherein the stage of establishing the V-PCA model comprises the following steps:
y1, for the data matrix X*Covariance decomposition is performed, and the number of principal elements is selected to obtain the following formula
Wherein Λ ═ diag { λ ═ λ1,λ2,…,λmIs a matrix of eigenvalues of S, and the elements on its diagonal satisfy λ1>λ2>…>λm,V∈Rm×mIs the eigenvector matrix of S, P ∈ Rm×rThe first r column of V contains all pivot information,is the remaining m-r columns of V, containing residual information.
Y2, converting the original data matrix X*Decomposing to obtain principal component subspace and residual subspace
In the above formula, T ∈ Rn×rTo score the matrix, P ∈ Rm×rIn the form of a load matrix, the load matrix,is a main element space, and is characterized in that,is the residual space.
The online fault detection and fault diagnosis phase comprises the following steps:
step Z1, compute SPE statistics. SPE statistics for measuring changes in sample vector projections in residual space
Here, the first and second liquid crystal display panels are,for the control limit with a confidence level of α, it is calculated from the following equation
Wherein the content of the first and second substances,λjeigenvalues of a covariance matrix of X, cαIs a threshold value of a standard normal distribution at a confidence level alpha.
Step Z2, calculating Hotelling T2Statistics are obtained. It is used to measure the variation of sample vector in principal component space
In the above formula, Λ ═ diag { λ1,λ2,…,λA},For the control limit with confidence α, it is calculated by the following formula
Here, Fr,n-A;rIs the value of the F distribution with r and n-r degrees of freedom with a confidence level of alpha.
And step Z3, fault isolation based on SPE contribution rate. The contribution graph of the SPE is defined as follows:
wherein the content of the first and second substances,for each variable contribution to the statistic SPE, whereξiIs in identity matrix ImColumn i.
Step Z4, based on T2Fault isolation of the contribution rate. T is2The contribution ratio of (c) is defined as follows:
whereinD=PTΛ-1P calculates the contribution rate of each variable of the system to the fault according to the formula, and the larger is regarded as the cause variable of the fault, thereby carrying out fault isolation.
Compared with the prior art, the invention has the following advantages and beneficial effects: compared with the traditional PCA method for preprocessing in the Euclidean space, the preprocessing method based on the change rate can better reflect the relevant information among all variables, can better reflect the tiny faults in the TE process through the change rate, and can accurately diagnose the faults of the variables which are smaller but play an important role. The simulation result shows that the method provided by the invention can well detect the abnormity, and the failure report-missing rate of the method is obviously reduced compared with the traditional PCA (principal component analysis) as can be seen from the SPE statistic.
Drawings
FIG. 1 is a graph of PCA statistic variation;
FIG. 2 is a graph of the variance of the V-PCA statistic.
Detailed Description
The invention is further described below with reference to the drawings.
The invention provides a V-PCA (principal component analysis based on change rate preprocessing) fault diagnosis method based on a complex industrial chemical process. The fault diagnosis method based on the change rate has better effect on process system detection from the relative change of the sample compared with the traditional PCA method. Through the conversion rate thought, the conversion rate conversion is carried out on the complex industrial and chemical data set TE, then the fault diagnosis is carried out by applying the PCA method, the provided method is verified to be capable of effectively detecting the fault and reducing the occurrence of fault false alarm.
The invention is realized by the following technical scheme: a V-PCA fault diagnosis method based on a complex industrial chemical process. The method mainly comprises four stages, namely a data preprocessing stage and a V-PCA model establishing stage; online fault detection and fault diagnosis; and (5) a simulation test stage.
Wherein the data preprocessing stage comprises the steps of:
step X1, collecting original complex industrial chemical TE numberThe data set is constructed into a multi-dimensional variable system matrix Xn∈Rm ×n,XnAs follows.
Wherein, the column vector xi(j)=[x1(j),x2(j),...xm(j)]TI 1,2, m, j 1,2, n represents a system variable, the data matrix is preprocessed, XnIs b as the mean vector of each variable samplen
Step X2, calculating XnMean vector of samples of variables of
Wherein lm=[1,1,...1]∈R1×mThe original data is transformed to obtain a matrix X*。
Wherein the transformation method is as follows
The physical meaning of the formula is that the mean value of each variable sample is taken as the center, and the change rate represents the degree of deviation of the original data from the center. And then, checking whether the test sample data acquired in real time falls in a normal range or not by the change rate obtained after preprocessing the normal data sample, and if the projection is outside the normal region, determining that a fault occurs. Wherein the stage of establishing the V-PCA model comprises the following steps:
y1, for the data matrix X*Covariance decomposition is performed, and the number of principal elements is selected to obtain the following formula
Wherein Λ ═ diag { λ ═ λ1,λ2,…,λmIs a matrix of eigenvalues of S, and the elements on its diagonal satisfy λ1>λ2>…>λm,V∈Rm×mIs the eigenvector matrix of S, P ∈ Rm×rThe first r column of V contains all pivot information,is the remaining m-r columns of V, containing residual information.
Y2, converting the original data matrix X*Decomposing to obtain principal component subspace and residual subspace
In the above formula, T ∈ Rn×rTo score the matrix, P ∈ Rm×rIn the form of a load matrix, the load matrix,is a main element space, and is characterized in that,is the residual space.
The online fault detection and fault diagnosis phase comprises the following steps:
step Z1, compute SPE statistics. SPE statistics for measuring changes in sample vector projections in residual space
Here, the first and second liquid crystal display panels are,for the control limit with a confidence level of α, it is calculated from the following equation
Wherein the content of the first and second substances,λjis X*Characteristic value of the covariance matrix of (c)αIs a threshold value of a standard normal distribution at a confidence level alpha.
Step Z2, calculating HotellingT2Statistics are obtained. It is used to measure the variation of sample vector in principal component space
In the above formula, Λ ═ diag { λ1,λ2,…,λA},For the control limit with confidence α, it is calculated by the following formula
Here, Fr,n-A;rIs the value of the F distribution with r and n-r degrees of freedom with a confidence level of alpha.
And step Z3, fault isolation based on SPE contribution rate. The contribution graph of the SPE is defined as follows:
wherein the content of the first and second substances,for each variable's contribution to the statistics SPE,ξiis in identity matrix ImColumn i.
Step Z4, based on T2Fault isolation of the contribution rate. T is2The contribution ratio of (c) is defined as follows:
wherein D ═ PTΛ-1P calculates the contribution rate of each variable of the system to the fault according to the formula, and the larger is regarded as the cause variable of the fault, thereby carrying out fault isolation.
A simulation test stage:
to verify the effectiveness of the above method and compare it with the conventional PCA method. Simulation tests are performed by TE processes, which are models established from actual chemical processes, and have been widely used as reference processes for control and monitoring research. Acquiring a test data set from the samples by acquiring data on line, and calculating T for the processed data by a V-PCA method2And the statistic and the SPE statistic are used for monitoring whether the data exceed the control limit of the normal state. For samples that exceed the control limit, each process variable pair T is calculated2And the contribution rate of the statistic and the SPE statistic, wherein the variable with the largest contribution rate is the variable which can cause the fault.
The TE process simulates 21-medium failure modes common in real industrial processes, and a detailed description of these 21 failure modes is shown in table 1. This example collected 22 simulation data sets (1 normal data set, 21 fault data sets, one for each fault mode) each including 960 sample points. In these 21 failure modes, the process initially runs in normal mode, and at the 160 th sampling point, a failure occurs, which persists until the end of the process.
Table 1 the 21 failure modes in the TE process are described below
For fault (5) condenser cooling water inlet temperature step change, fault detection of the conventional PCA method is shown in fig. 1, and fault detection of V-PCA is shown in fig. 2. Because the control loop configuration can compensate for the condenser cooling water inlet temperature step change and compensate most of the variables to near their steady state values, the conventional PCA method only detects a system fault between the 160 th sample point to the 340 th sample point, and then the statistics can hardly detect the fault, however, the fault is always present and affects the process system. T of conventional PCA method2The failure false-positive rates of the statistic and the SPE statistic are 0.7388 and 0.57 respectively, compared with the T of the V-PCA method based on the change rate2And the failure and false-positive rates of the SPE statistics are 0.7 and 0.0038 respectively, and the comparison results of the two methods are shown in the following table 2.
Claims (1)
1. A V-PCA fault diagnosis method based on a complex industrial chemical process is characterized by comprising the following steps:
a data preprocessing stage:
step X1, constructing the collected original complex industrial chemical engineering TE data set into a multi-dimensional variable system matrix Xn∈Rm×n,XnAs follows;
wherein, the column vector xi(j)=[x1(j),x2(j),...xm(j)]TI 1,2, m, j 1,2, n represents a system variable, the data matrix is preprocessed, XnHas a mean vector of bn
Step X2, calculating XnMean vector of samples of variables of
Wherein lm=[1,1,...1]∈R1×mThe original data is transformed to obtain a matrix X*;
Wherein the transformation method is as follows
The physical significance of the formula is that the sample mean value of each variable is taken as the center, and the change rate represents the degree of deviation of original data from the center; then, whether the test sample data acquired in real time falls in a normal range is checked through the change rate obtained after the normal sample data is preprocessed, and if the projection is outside the normal region, a fault occurs;
establishing a V-PCA model stage:
y1, for the data matrix X*Covariance decomposition is performed, and the number of principal elements is selected to obtain the following formula
Wherein Λ ═ diag { λ ═ λ1,λ2,…,λmIs a matrix of eigenvalues of S, and the elements on its diagonal satisfy λ1>λ2>…>λm,V∈Rm×mIs the eigenvector matrix of S, P ∈ Rm×rThe first r column of V contains all pivot information,is the remaining m-r columns of V, containing residual information;
y2 step, mixing X*Decomposing to obtain principal component subspace and residual subspace
In the above formula, T ∈ Rn×rTo score the matrix, P ∈ Rm×rIn the form of a load matrix, the load matrix,is a main element space, and is characterized in that,is a residual space;
and (3) in an online fault detection and fault diagnosis stage:
step Z1, calculating SPE statistic; SPE statistics for measuring changes in sample vector projections in residual space
Here, the first and second liquid crystal display panels are,for the control limit with a confidence level of alpha,calculated from the following equation
Wherein the content of the first and second substances, λjeigenvalues of a covariance matrix of X, cαA threshold value of a standard normal distribution at a confidence level α;
step Z2, calculating Hotelling T2Statistics; it is used to measure the variation of sample vector in principal component space
In the above formula, Λ ═ diag { λ1,λ2,…,λA},For the control limit with confidence α, it is calculated by the following formula
Here, Fr,n-A;rIs the F distribution value with r and n-r degrees of freedom and a confidence level of alpha;
step Z3, based on the fault isolation of SPE contribution rate, the contribution graph of SPE is defined as follows:
wherein the content of the first and second substances,for each variable contribution to the statistic SPE, whereξiIs in identity matrix ImThe ith column;
step Z4, based on T2Fault isolation of contribution rate; t is2The contribution ratio of (c) is defined as follows:
wherein D ═ PTΛ-1P calculates the contribution rate of each variable of the system to the fault according to the formula, and the larger is regarded as the cause variable of the fault, thereby carrying out fault isolation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010813193.8A CN111983994B (en) | 2020-08-13 | 2020-08-13 | V-PCA fault diagnosis method based on complex industrial chemical process |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010813193.8A CN111983994B (en) | 2020-08-13 | 2020-08-13 | V-PCA fault diagnosis method based on complex industrial chemical process |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111983994A CN111983994A (en) | 2020-11-24 |
CN111983994B true CN111983994B (en) | 2021-08-20 |
Family
ID=73435437
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010813193.8A Active CN111983994B (en) | 2020-08-13 | 2020-08-13 | V-PCA fault diagnosis method based on complex industrial chemical process |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111983994B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115933604B (en) * | 2022-12-20 | 2023-11-07 | 广东石油化工学院 | Fault detection and prediction control method based on data driving |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018006073A1 (en) * | 2016-07-01 | 2018-01-04 | Ayasdi, Inc. | Scalable topological data analysis |
CA3047930A1 (en) * | 2016-12-22 | 2018-06-28 | Liveramp, Inc. | Mixed data fingerprinting with principal components analysis |
CN109327421A (en) * | 2017-08-01 | 2019-02-12 | 阿里巴巴集团控股有限公司 | Data encryption, machine learning model training method, device and electronic equipment |
CN108052954B (en) * | 2017-11-01 | 2021-08-03 | 佛山科学技术学院 | Fault diagnosis method of sample space based on multi-stage high-dimensional features |
CN111142501B (en) * | 2019-12-27 | 2021-10-22 | 浙江科技学院 | Fault detection method based on semi-supervised autoregressive dynamic hidden variable model |
-
2020
- 2020-08-13 CN CN202010813193.8A patent/CN111983994B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN111983994A (en) | 2020-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108062565B (en) | Double-principal element-dynamic core principal element analysis fault diagnosis method based on chemical engineering TE process | |
CN108549908B (en) | Chemical process fault detection method based on multi-sampling probability kernel principal component model | |
CN109739214B (en) | Method for detecting intermittent faults in industrial process | |
CN110244692B (en) | Chemical process micro-fault detection method | |
CN108958226B (en) | TE process fault detection method based on survival information potential-principal component analysis algorithm | |
CN109407652A (en) | Multivariable industrial process fault detection method based on major-minor pca model | |
CN111368428B (en) | Sensor precision degradation fault detection method based on monitoring second-order statistics | |
CN115358297A (en) | Injection molding machine abnormity detection method and system based on improved MKECA method | |
CN111983994B (en) | V-PCA fault diagnosis method based on complex industrial chemical process | |
CN110119579B (en) | OICA-based complex industrial process fault monitoring method | |
CN110942258B (en) | Performance-driven industrial process anomaly monitoring method | |
CN112149054B (en) | Construction and application of orthogonal neighborhood preserving embedding model based on time sequence expansion | |
CN110751217A (en) | Equipment energy consumption ratio early warning analysis method based on principal component analysis | |
CN116627116B (en) | Process industry fault positioning method and system and electronic equipment | |
CN111563685A (en) | Power generation equipment state early warning method based on auto-associative kernel regression algorithm | |
CN114995338A (en) | Industrial process micro-fault detection method based on normative variable analysis and JS divergence fusion | |
CN113253682B (en) | Nonlinear chemical process fault detection method | |
CN115047853A (en) | Micro fault detection method based on recursion standard variable residual error and kernel principal component analysis | |
CN114371677A (en) | Industrial process state monitoring method based on spectral radius-interval principal component analysis | |
Ma et al. | Process monitoring of the pneumatic control valve using canonical variate analysis | |
CN116401577B (en) | Quality-related fault detection method based on MCF-OCCA | |
Gao et al. | Principal component analysis for process monitoring in distributed system environment | |
Wu et al. | Fault detection and diagnosis based on sparse representation classification (SRC) | |
CN117556202B (en) | Industrial process micro fault detection method based on probability correlation slow feature analysis | |
Peng et al. | An online performance monitoring using statistics pattern based kernel independent component analysis for non-Gaussian process |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |