CN112001115A - Soft measurement modeling method of semi-supervised dynamic soft measurement network - Google Patents

Soft measurement modeling method of semi-supervised dynamic soft measurement network Download PDF

Info

Publication number
CN112001115A
CN112001115A CN202010690011.2A CN202010690011A CN112001115A CN 112001115 A CN112001115 A CN 112001115A CN 202010690011 A CN202010690011 A CN 202010690011A CN 112001115 A CN112001115 A CN 112001115A
Authority
CN
China
Prior art keywords
soft measurement
data
model
network
semi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010690011.2A
Other languages
Chinese (zh)
Other versions
CN112001115B (en
Inventor
刘涵
郭润元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Technology
Original Assignee
Xian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Technology filed Critical Xian University of Technology
Priority to CN202010690011.2A priority Critical patent/CN112001115B/en
Publication of CN112001115A publication Critical patent/CN112001115A/en
Application granted granted Critical
Publication of CN112001115B publication Critical patent/CN112001115B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/02Preprocessing
    • G06F2218/04Denoising

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Pure & Applied Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Evolutionary Biology (AREA)
  • Mathematical Analysis (AREA)
  • Geometry (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Medical Informatics (AREA)
  • Algebra (AREA)
  • Computer Hardware Design (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a soft measurement modeling method of a semi-supervised dynamic soft measurement network, which is implemented according to the following steps: denoising and redundancy removing are carried out on the training set data based on a CEEMD and Isomap method; carrying out serialization and normalization processing on the training set data; and finishing the establishment of a soft measurement model of the semi-supervised dynamic soft measurement network based on the training set data. The modeling method provided by the invention removes noise and redundancy of data by using a method combining CEEMD and Isomap, the CEEMD has completeness, the result has no obvious modal aliasing phenomenon, the Isomap has strong nonlinear feature transformation capability, the advantages of the CEEMD and the Isomap are combined, so that the noise and the redundancy in the original data are effectively removed, the information loss is reduced to the maximum extent, the data are serialized, the historical data are introduced for dynamic modeling, and compared with the traditional soft measurement method, the method provided by the invention can predict variables more accurately by using the model.

Description

Soft measurement modeling method of semi-supervised dynamic soft measurement network
Technical Field
The invention belongs to the technical field of intelligent signal processing and industrial artificial intelligence, and particularly relates to a soft measurement modeling method of a semi-supervised dynamic soft measurement network.
Background
In a complex industrial process, due to the reasons of severe production environment, complex working conditions, limited detection technology or cost and the like, key quality variables in the process often cannot be measured reliably and in real time. In order to overcome the problem, a soft measurement technology is developed, which takes an auxiliary variable which is easy to measure in the process as an input and a main variable which is desired to be measured as an output, and establishes a model which can predict the main variable, thereby realizing accurate estimation of a key quality variable.
With the development of information technology and the wide application of a distributed control system in a complex industrial process, massive industrial process big data can be collected and used for establishing a soft measurement model, but external environment disturbance factors and process fluctuation comprise raw material composition change, random interference in data transmission and storage processes and other factors, so that industrial data often have a large amount of data noise, and meanwhile, massive industrial production data inevitably introduce a data redundancy problem, namely the same production condition and the colinearity among different auxiliary variables which repeatedly occur in the production process, and the noise and redundancy contained in the data can seriously influence the accuracy of a data-driven soft measurement model and need to be removed in a preprocessing stage. In addition, industrial process data is often a continuous time series, and if static soft measurement modeling is performed on the industrial process data, the front-back relation of the data cannot be captured, so that the model estimation precision is low and the robustness is poor in practical application. Therefore, to build an accurate data-driven soft measurement model, dynamic modeling is required.
Disclosure of Invention
The invention aims to provide a soft measurement modeling method of a semi-supervised dynamic soft measurement network, which can remove the noise and redundancy of variable data and capture the dynamic characteristics among data so as to establish an accurate dynamic soft measurement prediction model.
In order to solve the technical problem, the invention discloses a soft measurement modeling method of a semi-supervised dynamic soft measurement network, which is implemented according to the following steps:
step 1, denoising and redundancy removing processing are carried out on training set data based on a Complementary integrated Empirical Mode Decomposition (CEEMD) and an Isomap method;
step 2, carrying out serialization and normalization processing on the training set data processed in the step 1;
and 3, completing soft measurement model establishment of a semi-supervised dynamic soft measurement network (SSDGRU-MLR) based on the training set data processed in the step 2.
Further, step 1, denoising and redundancy removing processing are carried out on the training set data based on a CEEMD and Isomap method, and the specific steps are as follows:
step 1.1, applying a CEEMD algorithm to an original auxiliary variable training data set X to obtain IMFs of various orders;
step 1.2, calculating correlation coefficient indexes of IMFs of all orders and original variable signals, judging whether the IMFs are noise or not based on a set threshold constant, and eliminating the IMFs judged to be noise, wherein a calculation formula of the correlation coefficient is as follows:
Figure BDA0002589014100000021
in the formula (1) (. rho. (X))v(t),cvi(t)) represents the original auxiliary variable Xv(t) and its ith IMFcvi(t), i 1, N, the covariance between N,
Figure BDA0002589014100000022
and
Figure BDA0002589014100000023
are each Xv(t) and the standard deviation of the ith IMF, wherein the value range of | rho | is between 0 and 1, and the closer to 1, the higher the similarity is;
and step 1.3, performing nonlinear feature transformation on the residual IMF through an Isomap algorithm, and then performing data reconstruction by using a new mode function and an original remainder item obtained after dimensionality reduction to finally obtain an auxiliary variable X' after denoising and redundancy removal.
Further, in step 1.3, in the implementation process of the Isomap algorithm, the method for calculating the geodesic distance is as follows: the geodesic distance between a sample point and its neighborhood is replaced by the Euclidean distance between them; the sample point and points outside its neighborhood are replaced by the shortest path between them on the manifold.
Further, step 2, performing serialization and normalization processing on the training set data processed in step 1, specifically comprising the following steps:
step 2.1, after denoising and redundancy removing operation, performing serialization operation on auxiliary variable data, and predicting a dominant variable at the t + ts + z moment according to the auxiliary variable data of a total ts time step from the t moment to the t + ts moment, wherein ts is the time window length of input data, and z is the time step length of the dominant variable needing to be predicted, so as to obtain data X 'after serialization of the input data X';
step 2.2, carrying out standardization treatment by using a Z-SCORE method, converting the data into data with a mean value of 0 and a variance of 1, wherein the formula is as follows:
Figure BDA0002589014100000031
wherein in formula (2), X' is the data after serialization, mu and sigma respectively represent the mean and variance, and XinExpressing the sequence data which is used for inputting into the neural network after standardization, wherein only the characteristic data is subjected to standardization treatment, and the original value of the label data is kept unchanged;
further, step 3, completing the establishment of the SSDGRU-MLR soft measurement model based on the training set data processed in step 2, and specifically comprising the following steps:
step 3.1, the SSDGRU-MLR is a network formed by combining a GRU unit with an MLP in supervised learning after multi-layer stacking, the output of the last layer of the DGRU outputs a soft measurement prediction result through a fully connected MLP network, wherein the MLP is a neural network of a single hidden layer and is used for regression fitting of the final key quality variable;
step 3.2, before training the model, firstly, initializing the parameters of the model, and adopting an XVaier initialization mode to make the number of nodes of the current network layer be ninThe number of output nodes is noutThe way of Xvaier initialization is to achieve uniform distribution as follows:
Figure BDA0002589014100000041
after initialization is completed, serialized data X is processedinThe loss function of the whole model training process can be defined as follows when the loss function is input into the soft measurement model:
Figure BDA0002589014100000042
wherein in the formula (4), ytRepresents the label output corresponding to the t-th sequence sample, n-ts +1 represents the number of samples after serialization,
Figure BDA0002589014100000043
a prediction output representing the t-th sequence sample; and based on the optimization target of the minimum loss function, updating and adjusting parameters of the whole model through a BPTT algorithm, and finally completing batch training of the whole SSDGRU-MLR soft measurement model through multiple iterations.
Compared with the prior art, the invention can obtain the following technical effects:
1) the invention discloses a soft measurement modeling method of a semi-supervised dynamic soft measurement network, which removes noise and redundancy of data by using a method of combining CEEMD and Isomap, wherein CEEMD has completeness and the result has no obvious modal aliasing phenomenon, and Isomap has strong nonlinear feature transformation capability, and combines the advantages of CEEMD and Isomap so as to effectively remove the noise and redundancy in the original data and reduce information loss to the maximum extent, and meanwhile, data is serialized so as to introduce historical data for dynamic modeling. A Semi-Supervised depth Gated loop sensor network (SSDGRU-MLR) formed by DGRUs and MLPs is adopted to carry out Semi-Supervised dynamic modeling on the preprocessed serialized data, the SSDGRU-MLR not only can utilize a large number of non-label samples in the process, but also can help to extract high-level representation in variables, and meanwhile, GRU Units in the structure can capture dynamic characteristics of the data and spread the data along with time, so that the modeling effect is improved.
2) In the training process of the SSDGRU-MLR, a dropout technology is used for avoiding the generation of overfitting, and meanwhile, a callback function is designed to ensure the smooth training of the model. Through the soft measurement prediction experiment, the experimental result can be analyzed, compared with the traditional soft measurement method, the variable prediction by using the model of the invention is more accurate, and the effectiveness and superiority of the method are proved in a comparison experiment based on the industrial example of the air preheater.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and not to limit the invention. In the drawings:
FIG. 1 is a flow chart of a soft measurement modeling method of a semi-supervised dynamic soft measurement network according to the present invention;
FIG. 2 is a SSDGRU-MLR model diagram of a soft measurement modeling method via a semi-supervised dynamic soft measurement network of the present invention;
FIG. 3 is a cross-sectional view of an air preheater in an industrial example of a method of soft measurement modeling via a semi-supervised dynamic soft measurement network of the present invention;
FIG. 4 is a graph of a loss function obtained when trained by the soft measurement modeling method of a semi-supervised dynamic soft measurement network of the present invention;
FIG. 5 is a CEEMD decomposition result diagram of an inlet temperature variable obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network of the present invention;
FIG. 6 is a diagram of the predicted rotor thermal deformation results obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network of the present invention;
fig. 7 is a diagram of soft measurement prediction error analysis obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network of the present invention.
Detailed Description
The following embodiments are described in detail with reference to the accompanying drawings, so that how to implement the technical features of the present invention to solve the technical problems and achieve the technical effects can be fully understood and implemented.
The invention discloses a soft measurement modeling method of a semi-supervised dynamic soft measurement network, which is implemented according to the following steps as shown in figure 1:
step 1, denoising and redundancy removing processing are carried out on training set data based on a Complementary integrated Empirical Mode Decomposition (CEEMD) and an Isomap method;
the method comprises the following specific steps:
step 1.1, applying a CEEMD algorithm to an original auxiliary variable training data set X to obtain IMFs of various orders;
step 1.2, calculating correlation coefficient indexes of IMFs of all orders and original variable signals, judging whether the IMFs are noise or not based on a set threshold constant, and eliminating the IMFs judged to be noise, wherein a calculation formula of the correlation coefficient is as follows:
Figure BDA0002589014100000061
in the formula (1) (. rho. (X))v(t),cvi(t)) represents the original auxiliary variable Xv(t) and its ith IMFcvi(t), i 1, N, the covariance between N,
Figure BDA0002589014100000062
and
Figure BDA0002589014100000063
are each Xv(t) and the standard deviation of the ith IMF, wherein the value range of | rho | is between 0 and 1, and the closer to 1, the higher the similarity is;
step 1.3, performing nonlinear feature transformation on the residual IMF through an Isomap algorithm, and then performing data reconstruction by using a new mode function and an original remainder obtained after dimensionality reduction to finally obtain an auxiliary variable X' after denoising and redundancy removal;
in the implementation process of the Isomap algorithm, the geodesic distance is calculated as follows: the geodesic distance between a sample point and its neighborhood is replaced by the Euclidean distance between them; the sample point and points outside its neighborhood are replaced by the shortest path between them on the manifold;
step 2, carrying out serialization and normalization processing on the training set data processed in the step 1;
the method comprises the following specific steps:
step 2.1, after denoising and redundancy removing operation, performing serialization operation on auxiliary variable data, and predicting a dominant variable at the t + ts + z moment according to the auxiliary variable data of a total ts time step from the t moment to the t + ts moment, wherein ts is the time window length of input data, z is the time step of the dominant variable needing to be predicted, and the two parameters (ts and z) need to be set by combining an industrial background to obtain data X 'after serialization of the input data X';
step 2.2, carrying out standardization treatment by using a Z-SCORE method, converting the data into data with a mean value of 0 and a variance of 1, wherein the formula is as follows:
Figure BDA0002589014100000071
wherein in formula (2), X' is the data after serialization, mu and sigma respectively represent the mean and variance, and XinExpressing the sequence data which is used for inputting into the neural network after standardization, wherein only the characteristic data is subjected to standardization treatment, and the original value of the label data is kept unchanged;
step 3, completing the soft measurement model establishment of a semi-supervised dynamic soft measurement network (SSDGRU-MLR) based on the training set data processed in the step 2;
the method comprises the following specific steps:
step 3.1, the SSDGRU-MLR is a network formed by combining a GRU unit with an MLP in supervised learning after multi-layer stacking, the overall structure of the network is shown in FIG. 2, and the output of the last layer of the DGRU outputs the soft measurement prediction result through the MLP network which is completely connected. Wherein, the MLP is a neural network of a single hidden layer and is used for regression fitting of final key quality variables;
step 3.2, before training the model, firstly, the parameters of the model need to be initialized, including the parameter W inside each GRU unitr、Ur、Wz、UzW, U and the weights and offsets W between layers of the network1、b1、Wj、bj、Wl、blAnd the node number of the current network layer is n by adopting an XVaier initialization modeinThe number of output nodes is noutThe way of Xvaier initialization is to achieve uniform distribution as follows:
Figure BDA0002589014100000072
after initialization is finished, the serialized data is input into a soft measurement model, hidden layer output h corresponding to the last GRU layer at the last time step is obtained through DGRU forward propagation, and then a data set is formed
Figure BDA0002589014100000081
Inputting the variables into an MLP network, and obtaining predicted values of key dominant variables through MLP forward propagation
Figure BDA0002589014100000082
Wherein h istRepresenting the hidden layer output, y, obtained from DGRU in the t-th sequence sample input modeltAnd (4) representing the label output corresponding to the t-th sequence sample, wherein n-ts +1 represents the number of the serialized samples.
Figure BDA0002589014100000083
Representing the predicted output of the t-th sequence sample. The prediction error can be obtained through the label value and the predicted value, and therefore, the loss function of the whole model training process can be defined as follows:
Figure BDA0002589014100000084
wherein y istRepresents the t-th sequence sampleAnd outputting the corresponding label, wherein n-ts +1 represents the number of the samples after serialization.
Figure BDA0002589014100000085
Representing the predicted output of the t-th sequence sample. And based on the optimization target of the minimum loss function, updating and adjusting parameters of the whole model through a BPTT algorithm, and finally completing batch training of the whole SSDGRU-MLR soft measurement model through multiple iterations.
The following experiments show that the soft measurement modeling method of the semi-supervised dynamic soft measurement network is effective and feasible and has certain superiority:
based on the industrial example of the air preheater rotor thermal deformation soft measurement, the test set data is input into the established soft measurement model, the deformation prediction results obtained by adopting other soft measurement methods and the method of the invention are compared, the effectiveness and superiority of the modeling method of the invention are analyzed,
the method comprises the following specific steps:
(1) FIG. 3 is a cross-sectional view of an air preheater rotor, and based on an industrial example of soft measurement of air preheater rotor thermal deformation, the serialized real-time preheater temperature data is divided into a training set, a validation set and a test set, wherein the number of samples in the training set is 8571, the number of samples in the validation set is 2143, and the number of samples in the test set is 1000. Completing model training according to the steps 1-3, and in the training process, using ticks to ensure that the model is effectively trained: carrying out disorder processing on the data; using the same dropout mask for each time step; setting a callback function in a program to monitor the change of the verification loss;
(2) the input test set tests the effectiveness and superiority of the prediction capability of the soft measurement model, 10 soft measurement models are established to carry out a comparison test experiment, and the 10 soft measurement models comprise a traditional method MLP and a support vector regression SVR in the field of machine learning, deep learning DBN, DLSTM and DGRU methods and corresponding deep learning methods which are subjected to denoising and redundancy removal. Among the 10 comparative experiments, the effect of EMD and CEEMD on the decomposition of the auxiliary variables was also compared. Use ofSNR and overall orthogonality index IOTThe denoising and redundancy removing result is measured numerically with the root mean square error RMSE; mean Absolute Error (MAE), Mean Square Error (MSE) and coefficient of measure R are used2As an evaluation index of the predictive performance of the dynamic soft measurement model.
Table 1 shows the test results of comparative experiments using 10 soft measurement models, and the results verify the necessity and superiority of introducing deep learning into soft measurements to perform semi-supervised modeling.
For 6 models, namely No. 3-8 models, each deep learning model is provided with two groups of experiments which are preprocessed and not preprocessed, and as can be seen from Table 1, no matter which model among DBN, DLSTM and DGRU is adopted, the model which is subjected to denoising and redundancy removing processing achieves higher prediction precision than the corresponding model which is not preprocessed, and the result also shows that the operation of denoising and redundancy removing processing on auxiliary variable data is necessary. Meanwhile, the DBN in the model No. 3 and the model No. 4 is established based on the static assumption of the industrial process, compared with the static DBN model, the estimation accuracy of the dynamic DLSTM and DGRU model is greatly improved, the MAE value and the MSE value are obviously smaller, the feedback structure of the DLSTM and DGRU model obtains the dynamic characteristics in the data, and therefore the prediction performance of the soft measurement model is improved;
in addition, effective training of the dynamic model is needed to be taken as a basis for realizing good dynamic performance, in 6 models, the model No. 8 realizes the optimal prediction result, the model is taken as an example to be drawn to obtain a graph 4, the graph 4 is a loss function curve graph during training drawn by the soft measurement modeling method of the semi-supervised dynamic soft measurement network, a triangular curve in the graph represents the loss of a training set, a fork curve represents the loss of a verification set, and the two loss curves tend to be stable after continuously descending trend along with the increase of the number of iteration rounds and are closely adjacent and well fitted. The prediction result of the DGRU is better than that of the DLSTM, so that the DGRU structure is more applicable to solving the problem, and compared with the widely used DLSTM, the application value of the SSDGRU-MLR model in the field of soft measurement modeling is worthy of further popularization;
FIG. 5 is a CEEMD decomposition result diagram of an inlet temperature variable obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network, taking the decomposition of the inlet temperature variable as an example, the first action is an inlet temperature variable signal, the rest actions are IMFs obtained by CEEMD decomposition, and the IMFs obtained by CEEMD has no obvious modal aliasing phenomenon, so that noise signals can be accurately expressed, meanwhile, the denoising and redundancy removing results in Table 2 show that the RMSE indexes of the model 8 and the model 10 are one order of magnitude smaller than that of the model 9 and the SNR is one order of magnitude larger than that of the model 10, which shows that the favorable decomposition result based on CEEMD makes the denoising performance of the model 8 and the model 10 more excellent, and then I is subjected to I-EMD decompositionOTThe indexes are analyzed, although both the PCA method and the Isomap method are suitable for dimension reduction processing, the linear PCA method emphasizes the orthogonality of principal elements, and in the face of IMF data with strong nonlinearity, the Isomap method keeps the geodesic distance unchanged in the feature transformation process, so that more important information of original variables is kept, and high-quality reconstruction signals with less information loss are obtained. Through the analysis, the effectiveness and the superiority of the noise reduction and redundancy removal method combining CEEMD and Isomap are further verified.
Fig. 6 is a rotor thermal deformation prediction result diagram obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network, 1000 actual continuous test samples are selected, and meanwhile, a model 1, a model 3, a model 4 and a model 8 in table 1 are used for comparison experiments, and as can be seen from fig. 6, the rotor deformation prediction value of the model 8 is better tracked and adapted to the change of a real value compared with other 3 static models, the prediction error is smaller, and the most accurate prediction is realized. The model 8 is not only modeled by applying serialized data, but also adopts an SSDGRU-MLR model to further capture dynamic characteristics among data, so that the prediction precision is remarkably improved, and the dynamic performance is strongest in the four models.
In order to further evaluate the dynamic prediction performance of the model, the rotor thermal-type variation prediction error map of the model 8 is also analyzed, fig. 7 is a prediction error analysis map obtained by the soft measurement modeling method of the semi-supervised dynamic soft measurement network, wherein fig. 7(a) is a prediction error map at each moment, fig. 7(b) is a frequency histogram (and a kernel density estimation curve) of the prediction error, and fig. 7(c) is a time delay scatter map of the prediction error. As can be seen from the observation of FIG. 7, the prediction error drift of each point is small, the nuclear density curve is approximate to zero mean value, the shape is bell-shaped, the prediction error of the model is approximate to normal distribution, and the dynamic soft measurement result is real and reliable. The point distribution does not show a correlation relation on the whole, the useful information of each auxiliary variable and the dynamic characteristics among data are effectively utilized by the proposed model, and no redundant effective information in errors can be used for prediction.
By observing fig. 4-fig. 7 and table 1-table 2 and combining the above analysis, it is clear that the soft measurement modeling method of the semi-supervised dynamic soft measurement network of the present invention is effective and feasible and has certain advantages.
The invention relates to a soft measurement modeling method of a semi-supervised dynamic soft measurement network based on CEEMD, Isomap and DGRU, which removes noise and redundancy of data by using a method of combining CEEMD and Isomap. A Semi-Supervised depth Gated loop sensor network (SSDGRU-MLR) formed by DGRUs and MLPs is adopted to carry out Semi-Supervised dynamic modeling on the preprocessed serialized data, the SSDGRU-MLR not only can utilize a large number of non-label samples in the process, but also can help to extract high-level representation in variables, and meanwhile, GRU Units in the structure can capture dynamic characteristics of the data and spread the data along with time, so that the modeling effect is improved. In the training process of the SSDGRU-MLR, a dropout technology is used for avoiding the generation of overfitting, and meanwhile, a callback function is designed to ensure the smooth training of the model. Through the soft measurement prediction experiment that carries on, it can be reachd to carry out the analysis to the experimental result, compares in traditional soft measurement method, uses the novel semi-supervised dynamic soft measurement method in this patent to predict the variable can be more accurate, and validity and the superiority of method have obtained the proof in the contrast experiment based on air heater industry example.
Table 1 shows the results of comparative experiments using 10 soft measurement models in the examples
Figure BDA0002589014100000121
Table 2 shows the results of the de-noising and de-redundancy experiments in the examples
Figure BDA0002589014100000122
While the foregoing description shows and describes several preferred embodiments of the invention, it is to be understood, as noted above, that the invention is not limited to the forms disclosed herein, but is not to be construed as excluding other embodiments and is capable of use in various other combinations, modifications, and environments and is capable of changes within the scope of the inventive concept as expressed herein, commensurate with the above teachings, or the skill or knowledge of the relevant art. And that modifications and variations may be effected by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (5)

1. A soft measurement modeling method of a semi-supervised dynamic soft measurement network is characterized by comprising the following steps:
step 1, denoising and redundancy removing processing are carried out on training set data based on a CEEMD and Isomap method;
step 2, carrying out serialization and normalization processing on the training set data processed in the step 1;
and 3, finishing the establishment of a soft measurement model of the semi-supervised dynamic soft measurement network based on the training set data processed in the step 2.
2. The soft measurement modeling method of the semi-supervised dynamic soft measurement network as recited in claim 1, wherein step 1, denoising and redundancy removing processing are performed on training set data based on a CEEMD and Isomap method, and the specific steps are as follows:
step 1.1, applying a CEEMD algorithm to an original auxiliary variable training data set X to obtain IMFs of various orders;
step 1.2, calculating correlation coefficient indexes of IMFs of all orders and original variable signals, judging whether the IMFs are noise or not based on a set threshold constant, and eliminating the IMFs judged to be noise, wherein a calculation formula of the correlation coefficient is as follows:
Figure FDA0002589014090000011
in the formula (1) (. rho. (X))v(t),cvi(t)) represents the original auxiliary variable Xv(t) and its ith IMFcvi(t), i 1, N, the covariance between N,
Figure FDA0002589014090000012
and
Figure FDA0002589014090000013
are each Xv(t) and the standard deviation of the ith IMF, wherein the value range of | rho | is between 0 and 1, and the closer to 1, the higher the similarity is;
and step 1.3, performing nonlinear feature transformation on the residual IMF through an Isomap algorithm, and then performing data reconstruction by using a new mode function and an original remainder item obtained after dimensionality reduction to finally obtain an auxiliary variable X' after denoising and redundancy removal.
3. The soft measurement modeling method of the semi-supervised dynamic soft measurement network of claim 2, wherein in the implementation process of the Isomap algorithm, the step 1.3 is to calculate the geodetic distance by the following method: the geodesic distance between a sample point and its neighborhood is replaced by the Euclidean distance between them; the sample point and points outside its neighborhood are replaced by the shortest path between them on the manifold.
4. The soft measurement modeling method of the semi-supervised dynamic soft measurement network as recited in claim 3, wherein in step 2, the training set data processed in step 1 is serialized and normalized, and the specific steps are as follows:
step 2.1, after denoising and redundancy removing operation, performing serialization operation on auxiliary variable data, and predicting a dominant variable at the t + ts + z moment according to the auxiliary variable data of a total ts time step from the t moment to the t + ts moment, wherein ts is the time window length of input data, and z is the time step length of the dominant variable needing to be predicted, so as to obtain data X 'after serialization of the input data X';
step 2.2, carrying out standardization treatment by using a Z-SCORE method, converting the data into data with a mean value of 0 and a variance of 1, wherein the formula is as follows:
Figure FDA0002589014090000021
wherein in formula (2), X' is the data after serialization, mu and sigma respectively represent the mean and variance, and XinThe normalized sequence data is used for input into the neural network, and it should be noted that only the feature data is normalized, and the tag data is kept unchanged.
5. The soft measurement modeling method of the semi-supervised dynamic soft measurement network as recited in claim 4, wherein in step 3, the establishment of the SSDGRU-MLR soft measurement model is completed based on the training set data processed in step 2, and the specific steps are as follows:
step 3.1, the SSDGRU-MLR is a network formed by combining a GRU unit with an MLP in supervised learning after multi-layer stacking, the output of the last layer of the DGRU outputs a soft measurement prediction result through a fully connected MLP network, wherein the MLP is a neural network of a single hidden layer and is used for regression fitting of the final key quality variable;
step 3.2, before training the model, firstly, initializing the parameters of the model, and adopting an XVaier initialization mode to make the number of nodes of the current network layer be ninThe number of output nodes is noutThe way of Xvaier initialization is to achieve uniform distribution as follows:
Figure FDA0002589014090000031
after initialization is completed, serialized data X is processedinThe loss function of the whole model training process can be defined as follows when the loss function is input into the soft measurement model:
Figure FDA0002589014090000032
wherein in the formula (4), ytRepresents the label output corresponding to the t-th sequence sample, n-ts +1 represents the number of the sequenced samples, yt predictA prediction output representing the t-th sequence sample; and based on the optimization target of the minimum loss function, updating and adjusting parameters of the whole model through a BPTT algorithm, and finally completing batch training of the whole SSDGRU-MLR soft measurement model through multiple iterations.
CN202010690011.2A 2020-07-17 2020-07-17 Soft measurement modeling method of semi-supervised dynamic soft measurement network Active CN112001115B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010690011.2A CN112001115B (en) 2020-07-17 2020-07-17 Soft measurement modeling method of semi-supervised dynamic soft measurement network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010690011.2A CN112001115B (en) 2020-07-17 2020-07-17 Soft measurement modeling method of semi-supervised dynamic soft measurement network

Publications (2)

Publication Number Publication Date
CN112001115A true CN112001115A (en) 2020-11-27
CN112001115B CN112001115B (en) 2024-04-02

Family

ID=73467052

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010690011.2A Active CN112001115B (en) 2020-07-17 2020-07-17 Soft measurement modeling method of semi-supervised dynamic soft measurement network

Country Status (1)

Country Link
CN (1) CN112001115B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112668196A (en) * 2021-01-04 2021-04-16 西安理工大学 Mechanism and data hybrid driven generation type countermeasure network soft measurement modeling method
CN113642247A (en) * 2021-08-29 2021-11-12 浙江工业大学 Fermentation process soft measurement method based on depth map network analysis
CN114841000A (en) * 2022-05-09 2022-08-02 杭州师范大学 Soft measurement modeling method based on modal common feature separation

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105354170A (en) * 2015-10-26 2016-02-24 河海大学 CEEMD and singular value decomposition based recognition method
CN107451102A (en) * 2017-07-28 2017-12-08 江南大学 A kind of semi-supervised Gaussian process for improving self-training algorithm returns soft-measuring modeling method
CN108197743A (en) * 2017-12-31 2018-06-22 北京化工大学 A kind of prediction model flexible measurement method based on deep learning
CN108628164A (en) * 2018-03-30 2018-10-09 浙江大学 A kind of semi-supervised flexible measurement method of industrial process based on Recognition with Recurrent Neural Network model
CN108760316A (en) * 2018-08-16 2018-11-06 苏州大学 Information fusion method is joined in the change of variation mode decomposition
WO2019179340A1 (en) * 2018-03-19 2019-09-26 河北工业大学 Eemd- and msb-based failure feature extraction method for rolling-element bearing
CN110852527A (en) * 2019-11-20 2020-02-28 成都理工大学 Reservoir physical property parameter prediction method combining deep learning
US10678196B1 (en) * 2020-01-27 2020-06-09 King Abdulaziz University Soft sensing of a nonlinear and multimode processes based on semi-supervised weighted Gaussian regression

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105354170A (en) * 2015-10-26 2016-02-24 河海大学 CEEMD and singular value decomposition based recognition method
CN107451102A (en) * 2017-07-28 2017-12-08 江南大学 A kind of semi-supervised Gaussian process for improving self-training algorithm returns soft-measuring modeling method
CN108197743A (en) * 2017-12-31 2018-06-22 北京化工大学 A kind of prediction model flexible measurement method based on deep learning
WO2019179340A1 (en) * 2018-03-19 2019-09-26 河北工业大学 Eemd- and msb-based failure feature extraction method for rolling-element bearing
CN108628164A (en) * 2018-03-30 2018-10-09 浙江大学 A kind of semi-supervised flexible measurement method of industrial process based on Recognition with Recurrent Neural Network model
CN108760316A (en) * 2018-08-16 2018-11-06 苏州大学 Information fusion method is joined in the change of variation mode decomposition
CN110852527A (en) * 2019-11-20 2020-02-28 成都理工大学 Reservoir physical property parameter prediction method combining deep learning
US10678196B1 (en) * 2020-01-27 2020-06-09 King Abdulaziz University Soft sensing of a nonlinear and multimode processes based on semi-supervised weighted Gaussian regression

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112668196A (en) * 2021-01-04 2021-04-16 西安理工大学 Mechanism and data hybrid driven generation type countermeasure network soft measurement modeling method
CN113642247A (en) * 2021-08-29 2021-11-12 浙江工业大学 Fermentation process soft measurement method based on depth map network analysis
CN114841000A (en) * 2022-05-09 2022-08-02 杭州师范大学 Soft measurement modeling method based on modal common feature separation
CN114841000B (en) * 2022-05-09 2024-05-14 杭州师范大学 Soft measurement modeling method based on modal common feature separation

Also Published As

Publication number Publication date
CN112001115B (en) 2024-04-02

Similar Documents

Publication Publication Date Title
Zhang et al. LSTM-based analysis of industrial IoT equipment
CN109060001B (en) Multi-working-condition process soft measurement modeling method based on feature transfer learning
CN112001115A (en) Soft measurement modeling method of semi-supervised dynamic soft measurement network
CN107423503B (en) Method for predicting butane concentration at bottom of debutanizer by Gaussian process regression modeling based on nuclear slow characteristic analysis and time-lag estimation
CN108399434B (en) Analysis and prediction method of high-dimensional time series data based on feature extraction
Su et al. Prediction model of permeability index for blast furnace based on the improved multi-layer extreme learning machine and wavelet transform
CN110276784B (en) Correlation filtering moving target tracking method based on memory mechanism and convolution characteristics
CN111539132A (en) Dynamic load time domain identification method based on convolutional neural network
CN113780420B (en) GRU-GCN-based method for predicting concentration of dissolved gas in transformer oil
CN115828754A (en) Cutter wear state monitoring method based on multi-scale space-time fusion network model
CN114298134A (en) Wind power prediction method and device and electronic equipment
CN113780242A (en) Cross-scene underwater sound target classification method based on model transfer learning
CN112504682A (en) Chassis engine fault diagnosis method and system based on particle swarm optimization algorithm
CN115761868A (en) Robust self-adaptive updating method for facial expression classification in uncertain environment
CN116226739A (en) Map convolution network industrial process fault diagnosis method based on space-time fusion
Dubbs Test set sizing via random matrix theory
CN108984851B (en) Weighted Gaussian model soft measurement modeling method with time delay estimation
CN111610514B (en) Inversion method and device for propagation characteristics of evaporation waveguide
CN113126489A (en) CNN-GRU-BINN-based intelligent BIT design method for heavy-duty gas turbine control system
CN116680639A (en) Deep-learning-based anomaly detection method for sensor data of deep-sea submersible
CN115062764B (en) Intelligent illuminance adjustment and environmental parameter Internet of things big data system
CN116415505A (en) System fault diagnosis and state prediction method based on SBR-DBN model
CN115564021A (en) Fault root cause sequencing method in polyester fiber polymerization process
CN111160464B (en) Industrial high-order dynamic process soft measurement method based on multi-hidden-layer weighted dynamic model
CN115062542A (en) Polymerization reaction process quality prediction method based on two-dimensional steady LSTM

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant