CN105278520A - Complex industrial process running state evaluation method and application based on T-KPRM - Google Patents

Complex industrial process running state evaluation method and application based on T-KPRM Download PDF

Info

Publication number
CN105278520A
CN105278520A CN201510369102.5A CN201510369102A CN105278520A CN 105278520 A CN105278520 A CN 105278520A CN 201510369102 A CN201510369102 A CN 201510369102A CN 105278520 A CN105278520 A CN 105278520A
Authority
CN
China
Prior art keywords
matrix
running status
overbar
data
evaluation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510369102.5A
Other languages
Chinese (zh)
Other versions
CN105278520B (en
Inventor
褚菲
代伟
马小平
吴奇
叶俊锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Mining and Technology CUMT
Original Assignee
China University of Mining and Technology CUMT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Mining and Technology CUMT filed Critical China University of Mining and Technology CUMT
Priority to CN201510369102.5A priority Critical patent/CN105278520B/en
Publication of CN105278520A publication Critical patent/CN105278520A/en
Application granted granted Critical
Publication of CN105278520B publication Critical patent/CN105278520B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B23/00Testing or monitoring of control systems or parts thereof
    • G05B23/02Electric testing or monitoring
    • G05B23/0205Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults
    • G05B23/0218Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults
    • G05B23/0243Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model
    • G05B23/0254Electric testing or monitoring by means of a monitoring system capable of detecting and responding to faults characterised by the fault detection method dealing with either existing or incipient faults model based detection method, e.g. first-principles knowledge model based on a quantitative model, e.g. mathematical relationships between inputs and outputs; functions: observer, Kalman filter, residual calculation, Neural Networks

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a complex industrial process running state evaluation method and application based on T-KPRM. Combined with advantages of PRM and T-KPLS, the method comprises the steps of: further decomposing a high-dimension principal component subspace and a residual error subspace of KPLS, separating the part relevant to output from the part irrelevant to output, separating the part having a relatively large residual error from the part of final noise, and accurately extracting variable information relevant to the output. By adopting the method, the field industrial process running state can be conveniently grasped. According to the invention, an off-line evaluation module of the operation state is established, a sliding window technology is introduced, and the similarity between an online data window and corresponding evaluation grades is utilized for carrying out complex industrial process operation state online evaluation; in addition, according to an Euclidean distance between the sliding data window and an optimal evaluation grade, contribution rates of corresponding variables are calculated, and non-optimum factors of the operation state are identified, so that filed operators can adjust and improve the production strategy in time, and the production efficiency is improved.

Description

Based on T-KPRM complex industrial process evaluation of running status method and application
Technical field
The present invention relates to a kind of industrial process evaluation of running status method based on T-KPRM and application thereof, belong to industrial processes evaluation of running status technical field.
Background technology
Good industrial process running status is effective guarantee of enterprise product quality and economic benefit.But in complex industrial process, running status can be subject to the impact of various uncertain factor often, and causes it to depart from optimum operating point.Meanwhile, from the data of industry spot, in gatherer process, also can be subject to various interference (as sensor fault etc.), and cause there is outlier in sample data.Therefore, the industry park plan state on-line evaluation strategy proposing a kind of robust is necessary for grasping the ruuning situation of complex industrial in real time.And utilize dense medium to carry out the sorting of raw coal, a complicated industry park plan process just.Therebetween, need constantly to carry out measuring and adjustation to coal preparation technique parameter, to ensure the stable of product quality and quantity, also will ensure that the safety of production run is carried out simultaneously.
Running status on-line evaluation refers under commercial production remains on the normal condition run, and makes further differentiation to the good and bad situation of production run.Operating personnel according to evaluation result, can carry out production adjustment, to guarantee that production run process is in optimum state all the time.
No matter dense medium coal cleaning process, as domestic application coal preparation technique more widely, is dual medium cyclone dressing or gravity dense medium separation technique, and carrying out observing and controlling to Media density is the key ensureing product quality.To obtain good separating effect, just must ensure that Media density can be stabilized in a comparatively ideal scope.By dense medium coal separation density observing and controlling process operation state on-line evaluation, site operation personnel can be allowed to understand the concrete condition of Media density observing and controlling process operation state in time.Then according to evaluation result, corresponding adjustment is made to coal preparation technique, improve the efficiency of separation of coal separation process, increase the output capacity of cleaned coal, thus improve the economic benefit of enterprise.In addition, the raising of the efficiency of separation also contributes to development clean energy resource and recycling economy.
Dense medium cyclone is utilized to carry out the technological process of raw coal separation as shown in Figure 1, wherein, raw coal sends into mixing tank together with circulatory mediator, sorting is carried out again through two product heavy-medium cyclones, its underflow and overflow carry out de-mediated, dehydration and relevant treatment respectively, finally obtain cleaned coal, middle coal and spoil product, medium is then undertaken reclaiming, concentrating by medium system, and finally recycles.Whole assorting room is divided into coal stream flow process and medium flow process etc., and the sensor related in technique mainly contains densitometer, magnetic-substance content meter, liquid level gauge etc.Utilize the density in the involutory Jie's bucket of densitometer to monitor, utilize liquid level gauge to monitor the liquid level of closing suspending liquid in Jie's bucket; Utilize magnetic-substance content meter to monitor the coal slime in assorting room, this external system is equipped with all kinds of solenoid valve and is used for controlling the flow such as medium, clear water.Major control object in dense medium assorting room is close the heavy-media suspension density in Jie's bucket, and the control of density relates to Density feedback and ash content feeds back, the density of suspending liquid that Density feedback is used in involutory Jie's bucket carries out closed loop adjustment, ash content feedback is used for monitoring sorted product quality, adjust according to ash content result dense medium density of suspending liquid set-point, to reach the object of stabilized product quality.Coal preparation plant fast ash chemical examination detect be former pit ash degree, grey calibration is the basic index of Coal Quality.Ash calibration is less, and show that the coal impurity that washing goes out is fewer, Coal Quality is better.
At present, existing relevant scholar has done some researchs to production run evaluation of running status.To the research of dense-medium separation density observing and controlling process aspect, as far away in South Korea etc., Ding Yanjuan etc., proposed method only rests on density TT&C system on the theoretical analysis of whole dense-medium separation process importance, does not propose concrete density monitoring strategies.Liu Yan etc., propose a kind of evaluation of running status strategy based on T-KPLS algorithm.But, the data of industry spot are nonlinear often, but also include the outlier of some, the evaluation model using existing method (as T-KPLS, Fisher discriminatory analysis) to set up easily is subject to the impact of outlier, lose the due generalization ability of the method, have impact on the accuracy of evaluation result.
Summary of the invention
For above-mentioned prior art Problems existing, the invention provides a kind of complex industrial process evaluation of running status method based on T-KPRM, not only can solve the nonlinear problem of data rapidly, and can the information of each variable in leaching process data exactly, the impact of outlier on evaluation model precision can also be overcome; The application of a kind of complex industrial process evaluation of running status method (T-KPRM) based on T-KPRM provided refers to the complex industrial process evaluation of running status method based on core inclined robust latent variable technology, namely utilizes core inclined robust latent variable technology the method to can be used for setting up the evaluated off-line model of running status; Then, introduce sliding window technique, calculate the similarity between online data window and corresponding opinion rating, utilize similarity to carry out the on-line evaluation of complex industrial process running status.Simultaneously, Euclidean distance between slip data window and optimum opinion rating can be utilized, calculate the contribution rate of relevant variable, the non-optimal factor of running status is identified, there is provided decision-making foundation for site operation personnel adjusts production strategy in real time, thus improve the production efficiency of enterprise.
To achieve these goals, a kind of complex industrial process evaluation of running status method based on T-KPRM that the present invention adopts, supposes that process data matrix is X ∈ R n × J, N is sample number, and J is process variable number; Exporting data matrix is Y ∈ R n, comprise output procedure variable; Concrete steps then based on the complex industrial process evaluation of running status method of core inclined robust latent variable technology are as follows:
Step one, zero-mean and unit variance process are carried out to each row of input data matrix X; In like manner, also standardization is carried out to output data matrix Y;
Step 2, by input data matrix X through Nonlinear Mapping Φ: x i∈ R n→ Φ (x i) ∈ F projects to high-dimensional feature space F, and calculate nuclear matrix K:K=Φ in F space tΦ;
Step 3, standardization is carried out to nuclear matrix K;
Step 4, PRM algorithm is run to input nucleus matrix K and output matrix Y, specific as follows:
A1, to establish be the lever weights of i-th sample data, can be represented by following formula:
w i x = f ( || t i - med L 1 ( T ) || med i || t i - med L 1 ( T ) || , c ) , i = 1 , 2 , ... n - - - ( 1 ) ,
And f ( z , c ) = 1 ( 1 + | z / c | ) 2 - - - ( 2 ) ,
Wherein, || || represent Euclidean distance, med represents median, med l1represent L1 median, t ibe the PLS score of i-th sample data, c is constant;
If the residual error weights of i-th sample data can be defined by following formula:
Wherein, r irepresent the residual error between i-th sample data predicted value and actual value, represent the robust size estimation of residual error, can be calculated by following formula: r ~ = med i | r i - med j ( r j ) | , i , j = 1 , 2 , ... n - - - ( 4 ) ;
The then comprehensive weight w of i-th sample data i, can be determined by following formula: utilize formula (1), (3), (5) initialization weights W i;
B1, respectively input nucleus matrix K and output matrix Y to be weighted, after weighting, to obtain input nucleus matrix K wwith output matrix Y w, then PLS1 regression model is set up to the inputoutput data after weighting, revises its score vector simultaneously;
C1, calculate the residual error r of each data sample i, utilize formula (1), (3), (5) to upgrade sample weights W i;
If the relative difference of the double q of calculating in d1 front and back is less than threshold value (as 10 -2), then enter step 5, otherwise return b1;
Step 5, now input data matrix become K w, export data matrix and become Y w, from output matrix Y wthe middle u extracting convergence i, make i=1, K wi=K w, Y wi=Y w;
A2, make Y wiin any one row equal u i;
B2, calculating K wscore vector, t i=K wiu i, t i← t i/ || t i||;
c2、 q i = Y W i T t i ;
D2, calculating Y wiscore vector u i=Y wiq i, u i← u i/ || u i||;
E2, judge u iwhether restrain, if convergence, proceed to step 6, otherwise return a2;
Step 6, calculating K wiload matrix:
Step 7, extract whole pivot, calculate input data matrix K wscore matrix T, input data matrix K wload matrix P, export data matrix Y wscore matrix U, export data matrix Y wload matrix Q, specific as follows:
A3, order K W i + 1 = K W i - t i p i T ; Y W i + 1 = Y W i - u i q i T ;
B3, make i=i+1, repeat step 5, six until extract A pivot, pivot number A can be determined by cross-validation method;
c3、T=[t 1,...,t A],P=[p 1,...,p A],U=[u 1,...,u A],Q=[q 1,...,q A];
Step 8, K w=TP t+ E, Y w=UQ t+ F;
Step 9, to pivot TP trun PCA algorithm:
Step 10, PCA algorithm is run to residual error E:
If exporting data matrix Y is single output variable, then the expression formula of T-KPRM model is as follows:
K W = t y p y T + T o P o T + T r P r T + E r y W = t y + F - - - ( 6 )
Above-mentioned score vector t ythat represent is in T and output y wdirectly related part, namely sets up the most critical variable drawn required for evaluation model, can be used for the foundation of running status evaluated off-line model; T orepresent in T with output y worthogonal part; ; T rit is the part having larger variance in residual error E; E rfinal residual error, namely noise;
When introducing new sample k new, its score matrix will be drawn by following formula:
t y = y W T T ( U T K W T ) - 1 U T k n e w = g y k n e w - - - ( 7 ) ,
Wherein, k newnew samples x newkernel function, can be calculated by following formula:
k new=Φ(X)Φ(x new)=[k(x 1,x new),...k(x n,x new)] T(8),
To k newcarry out equalization can obtain: k ‾ n e w = k n e w - K 1 t - 1 n k n e w + 1 n K 1 t - - - ( 9 ) ,
Wherein, 1 t=1/n [11...1] t∈ R n.Above-mentioned score vector t ycan be used for the foundation of running status evaluated off-line model.
A kind of application of the complex industrial process evaluation of running status method based on T-KPRM, specifically refer to the identification utilizing core inclined robust latent variable technology to set up the evaluated off-line model of running status, the on-line evaluation of complex industrial process running status and non-optimal factor, specifically first utilize core inclined robust latent variable technology to set up evaluated off-line model, the evaluated off-line model score vector of corresponding running status grade is wherein, c is the number of running status grade; Then, introduce sliding window technique, calculate the similarity between online data window and corresponding opinion rating, utilize similarity to carry out the on-line evaluation of complex industrial process running status; Euclidean distance between recycling slip data window and optimum opinion rating, calculates the contribution rate of relevant variable, identifies the non-optimal factor of running status.
Carrying out in evaluation of running status process, consider that single sample data is not enough to characterize the running status of production run, but also be easily subject to the interference of various factors, therefore, introducing a width is that the data window of H is as basic analytic unit; Then, the similarity between k time data window and corresponding opinion rating is calculated for carrying out the evaluation of process operation state; Meanwhile, a similarity threshold ε (0.5 < ε < 1) is set, for distinguishing the transition period of running status grade and the corresponding running status grade determined; Suppose the maximal value of similarity larger than threshold epsilon, then can deterministic process state grade be p; When running status is in transition period, time namely from the stage that a upper running status changes to next running status, the value of its similarity presents a rule slowly changed; Suppose that similarity is less than threshold epsilon and its size meets the rule increased progressively in turn, then can think that process operation state is in a transition period; Be enough to two hypothesis if discontented, just can think that running status is disturbed by other factors, result of its evaluation and being consistent before.
The flow process of on-line operation state evaluation is as follows:
The first step: the line window data matrix in structure k moment, X on, k=[x on, k-H+1..., x on, k] t, H is the width of data window;
Second step: standardization X on, k, the evaluated off-line model established before utilization, the line window data matrix after asking for standardization, X &OverBar; o n , k c , c = 1 , 2 , 3 , 4 ;
3rd step: calculate score vector
Wherein, by k o n , k c = &lsqb; k ( x 1 c , x &OverBar; o n , k c ) , k ( x 2 c , x &OverBar; o n , k c ) , ... , k ( x n c c , x &OverBar; o n , k c ) &rsqb; T Average centralization gained;
4th step: calculate the Euclidean distance between online data window and corresponding opinion rating c, wherein be equalization vector; Via T-KPRM method gained, then
5th step: utilize value, ask for the similarity between data window and corresponding opinion rating c:
Suppose d o n , k c &NotEqual; 0 , Then &gamma; o n , k c = 1 / d o n , k c &Sigma; p = 1 4 1 / d o n , k p - - - ( 10 ) , Wherein &Sigma; c = 1 4 &gamma; o n , k c = 1 , 0 &le; &gamma; o n , k c &le; 1 ;
6th step: judge running status grade according to similarity, the interpretational criteria of on-line operation state is as follows:
If criterion one then judge that this online data window belongs to some running status grade p determined;
If criterion two does not meet criterion one, and meets formula p = argmax { &gamma; o n , k p | &gamma; o n , k - l + 1 p < ... < &gamma; o n , k p , 1 &le; p &le; 4 } , Then judge that this data window is in the transition period of running status, be namely in the stage transformed to next running status from a upper running status, wherein, l is a positive integer, and size is determined by knowhow;
If criterion three is discontented be enough to two criterions, then prove that this data window is subject to the interference of other factors, the judged result of its running status be consistent before;
By the judgement of above three criterions, just can effectively determine to belong to some running status grades or the transition period be between state grade at the data window in k moment.
Non-optimal running status refers to all running statuses outside removing optimal operational condition, and this is comprising kilter, the transition period between general state, poor state and steady state (SS).Generally speaking, the supvr of enterprise wishes that production run runs on best state.But, when commercial production is inevitably in the non-optimal stage, goes to review those and cause running with regard to needing to go for a kind of suitable method the reason being in non-optimal conditions, thus provide foundation effectively for production adjustment.
In the research of the fault diagnosis based on data, the utilization of contribution plot method is commonplace.Particularly be directed to linear model, the method intuitively can find out the correlated variables that those causing trouble occur effectively.But traditional contribution plot method is not also suitable for the model set up by kernel function.Proposed here a kind of recognition methods of the non-optimal factor based on kernel function: first by calculating the partial derivative of Euclidean distance between online data window and optimal operational condition grade, obtain the contribution rate of sample data variable, the variable of larger contribution rate is wherein had to cause producing by being considered to the factor being in non-optimal running status, then to these factors Discern and judge in addition.
Euclidean distance between online data window and optimal operational condition grade is expressed as again
d o n , k 4 = t o n , k 4 T t o n , k 4 = k &OverBar; o n , k 4 T g y 4 T g y 4 k &OverBar; o n , k 4 = t r ( g y 4 k &OverBar; o n , k 4 k &OverBar; o n , k 4 T g y 4 T ) - - - ( 11 ) ,
In formula, represent the off-line model matrix of optimal operational condition, represent the core Mean Matrix of the online data window of corresponding optimal operational condition;
? in formula, the contribution rate of a jth variable be can be calculated by following formula:
Contr j = | &part; d o n , k 4 &part; x &OverBar; o n , k , j 4 | &CenterDot; x &OverBar; o n , k , j 4 = | &part; &part; x &OverBar; o n , k , j 4 t r ( g y 4 k &OverBar; o n , k 4 k &OverBar; o n , k 4 T g y 4 T ) | &CenterDot; x &OverBar; o n , k , j 4 = | t r &lsqb; g y 4 ( &part; k &OverBar; o n , k 4 k &OverBar; o n , k 4 T &part; x &OverBar; o n , k , j 4 ) g y 4 T &rsqb; | &CenterDot; x &OverBar; o n , k , j 4 , j = 1 , 2 , ... , m , - - - ( 12 ) ,
In formula, it is sample data window a jth variable, tr () is expressed as matrix trace;
In order to try to achieve above formula Contr jexpression formula accurately, must draw partial derivative, therefore, need the process carrying out some necessity:
A, nuclear matrix partial derivative be: &part; k ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x i , j - x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) - - - ( 13 ) ,
B, partial derivative is asked to two nuclear matrix simultaneously, can obtain:
&part; k ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = &part; k ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 k ( x q , x &OverBar; o n , k 4 ) + k ( x i , x &OverBar; o n , k 4 ) &part; k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x i , j + x q , j - 2 x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) - - - ( 14 ) ,
C, to calculate in the partial derivative of element that arranges of the i-th row q:
&part; ( k &OverBar; o n , k 4 , k &OverBar; o n , k 4 T ) p q &part; x &OverBar; o n , k , j 4 = &part; ( x p , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( A - a q ) &part; ( x p , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( A - a p ) &part; ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 - 1 n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x p , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 - 1 n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( a p + q q - 2 A ) n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + 1 n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x i &prime; , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x p , j + x q , j - 2 x &OverBar; o n , k , j 4 ) k ( x p , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) + ( A - a q ) &sigma; ( x p , j - x &OverBar; o n , k , j 4 ) k ( x p , x &OverBar; o n , k 4 ) + ( A - a p ) &sigma; ( x q , j - x &OverBar; o n , k , j 4 ) k ( x q , x &OverBar; o n , k 4 ) - 1 &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j + x p . j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x p , x &OverBar; o n , k 4 ) - 1 &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j + x q . j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) + ( a p + a q - 2 A ) &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j - x &OverBar; o n , k 4 ) k ( x i , x &OverBar; o n , k 4 ) + 1 &sigma;n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 ( x i , j + x i &prime; , j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x i &prime; , x &OverBar; o n , k 4 ) - - - ( 15 ) ,
In formula, a p = 1 n 4 &Sigma; i = 1 n 4 k ( x p , x i ) , A = 1 n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 k ( x i , x i &prime; ) , N 4it is the sample number of the modeling data of optimal operational condition.
Compared with prior art, the complex industrial process evaluation of running status method (T-KPRM) based on core inclined robust latent variable technology that the present invention proposes combines the advantage of both PRM and T-KPLS, can grasp the running status of situ industrial process real-time and accurately, T-KPPM can decompose higher-dimension principal component subspace and the residual error subspace of KPLS further.In principal component subspace, the part irrelevant with output with exporting relevant part is separated.Meanwhile, in residual error subspace, separate there being the part of the part of larger residual error and final noise.Therefore, T-KPRM can extract exactly and export relevant variable information.The method can be used for the evaluated off-line model setting up running status.Meanwhile, introduce sliding window technique, calculate the similarity between online data window and corresponding opinion rating, similarity can be utilized to carry out the on-line evaluation of complex industrial process running status.Afterwards, utilize Euclidean distance between slip data window and optimum opinion rating, calculate the contribution rate of relevant variable, the non-optimal factor of running status is identified.Contrasted by simulation analysis, the evaluated off-line model of the running status utilizing the complex industrial process evaluation of running status method (T-KPRM) based on core inclined robust latent variable technology to set up, dense medium coal separation density observing and controlling process carries out running status on-line evaluation and the identification of non-optimal factor is of practical significance very much.By means of evaluating and the result of factor identification, operating personnel just can adjust in time and improve production strategy, improve production efficiency.In addition, this running status on-line evaluation method is all applicable to dual medium cyclone dressing and gravity dense medium separation technique.
Accompanying drawing explanation
Fig. 1 is the process chart utilizing dense medium cyclone to carry out raw coal separation;
Fig. 2 is the on-line operation state evaluation and the non-optimal factor identification process figure that the present invention is based on T-KPRM;
Fig. 3 is similarity and density observing and controlling running status on-line evaluation result schematic diagram;
Fig. 4 is existing method (T-KPLS) the density observing and controlling evaluation of running status result not having very noisy and outlier;
Fig. 5 is not for having institute's extracting method (T-KPRM) density observing and controlling evaluation of running status result of very noisy and outlier;
Fig. 6 is existing method (T-KPLS) the density observing and controlling process operation state evaluation result having very noisy and outlier;
Fig. 7 is for there being institute's extracting method (T-KPRM) density observing and controlling evaluation of running status result of very noisy and outlier;
Fig. 8 is the non-optimal factor identification of state grade " poor ";
Fig. 9 be " poor " arrive " in " between the non-optimal factor identification of transition period;
Figure 10 be state grade " in " the identification of non-optimal factor;
Figure 11 be " in " arrive the non-optimal factor identification of transition period between " good ";
Figure 12 is the non-optimal factor identification of state grade " good ";
Figure 13 is the non-optimal factor identification that " good " arrives transition period between " excellent ".
Embodiment
Below in conjunction with accompanying drawing and simulation analysis, the invention will be further described.
Based on a complex industrial process evaluation of running status method of T-KPRM, suppose that process data matrix is X ∈ R n × J, N is sample number, and J is process variable number; Exporting data matrix is Y ∈ R n, comprise output procedure variable; Concrete steps then based on the complex industrial process evaluation of running status method of core inclined robust latent variable technology are as follows:
Step one, zero-mean and unit variance process are carried out to each row of input data matrix X; In like manner, also standardization is carried out to output data matrix Y;
Step 2, by input data matrix X through Nonlinear Mapping Φ: x i∈ R n→ Φ (x i) ∈ F projects to high-dimensional feature space F, and calculate nuclear matrix K:K=Φ in F space tΦ;
Step 3, standardization is carried out to nuclear matrix K;
Step 4, PRM algorithm is run to input nucleus matrix K and output matrix Y, specific as follows:
A1, to establish be the lever weights of i-th sample data, can be represented by following formula:
w i x = f ( || t i - med L 1 ( T ) || med i || t i - med L 1 ( T ) || , c ) , i = 1 , 2 , ... n - - - ( 1 ) ,
And f ( z , c ) = 1 ( 1 + | z / c | ) 2 - - - ( 2 ) ,
Wherein, || || represent Euclidean distance, med represents median, med l1represent L1 median, t ibe the PLS score of i-th sample data, c is constant;
If the residual error weights of i-th sample data can be defined by following formula:
Wherein, r irepresent the residual error between i-th sample data predicted value and actual value, represent the robust size estimation of residual error, can be calculated by following formula: r ~ = med i | r i - med j ( r j ) | , i , j = 1 , 2 , ... n - - - ( 4 ) ;
The then comprehensive weight w of i-th sample data i, can be determined by following formula: utilize formula (1), (3), (5) initialization weights W i;
B1, respectively input nucleus matrix K and output matrix Y to be weighted, after weighting, to obtain input nucleus matrix K wwith output matrix Y w, then PLS1 regression model is set up to the inputoutput data after weighting, revises its score vector simultaneously;
C1, calculate the residual error r of each data sample i, utilize formula (1), (3), (5) to upgrade sample weights W i;
If the relative difference of the double q of calculating in d1 front and back is less than threshold value (as 10 -2), then enter step 5, otherwise return b1;
Step 5, now input data matrix become K w, export data matrix and become Y w, from output matrix Y wthe middle u extracting convergence i, make i=1, K wi=K w, Y wi=Y w;
A2, make Y wiin any one row equal u i;
B2, calculating K wscore vector, t i=K wiu i, t i← t i/ || t i||;
c2、 q i = Y W i T t i ;
D2, calculating Y wiscore vector u i=Y wiq i, u i← u i/ || u i||;
E2, judge u iwhether restrain, if convergence, proceed to step 6, otherwise return a2;
Step 6, calculating K wiload matrix:
Step 7, extract whole pivot, calculate input data matrix K wscore matrix T, input data matrix K wload matrix P, export data matrix Y wscore matrix U, export data matrix Y wload matrix Q, specific as follows:
A3, order K W i + 1 = K W i - t i p i T ; Y W i + 1 = Y W i - u i q i T ;
B3, make i=i+1, repeat step 5, six until extract A pivot, pivot number A can be determined by cross-validation method;
c3、T=[t 1,...,t A],P=[p 1,...,p A],U=[u 1,...,u A],Q=[q 1,...,q A];
Step 8, K w=TP t+ E, Y w=UQ t+ F;
Step 9, to pivot TP trun PCA algorithm:
Step 10, PCA algorithm is run to residual error E:
If exporting data matrix Y is single output variable, then the expression formula of T-KPRM model is as follows:
K W = t y p y T + T o P o T + T r P r T + E r y W = t y + F - - - ( 6 )
Above-mentioned score vector t ythat represent is in T and output y wdirectly related part, namely sets up the most critical variable drawn required for evaluation model, can be used for the foundation of running status evaluated off-line model; T orepresent in T with output y worthogonal part; ; T rit is the part having larger variance in residual error E; E rfinal residual error, namely noise;
When introducing new sample k new, its score matrix will be drawn by following formula:
t y = y W T T ( U T K W T ) - 1 U T k n e w = g y k n e w - - - ( 7 ) ,
Wherein, k newnew samples x newkernel function, can be calculated by following formula:
k new=Φ(X)Φ(x new)=[k(x 1,x new),...k(x n,x new)] T(8),
To k newcarry out equalization can obtain: k &OverBar; n e w = k n e w - K 1 t - 1 n k n e w + 1 n K 1 t - - - ( 9 ) ,
Wherein, 1 t=1/n [11...1] t∈ R n.Above-mentioned score vector t ycan be used for the foundation of running status evaluated off-line model.
T-KPRM, in conjunction with the advantage of both PRM and T-KPLS, can grasp the running status of situ industrial process real-time and accurately.T-KPPM can decompose higher-dimension principal component subspace and the residual error subspace of KPLS further.In principal component subspace, the part irrelevant with output with exporting relevant part is separated.Meanwhile, in residual error subspace, separate there being the part of the part of larger residual error and final noise.Therefore, T-KPRM can extract exactly and export relevant variable information.
A kind of application of the complex industrial process evaluation of running status method based on T-KPRM, specifically refer to the identification utilizing core inclined robust latent variable technology to set up the evaluated off-line model of running status, the on-line evaluation of complex industrial process running status and non-optimal factor, specifically first utilize core inclined robust latent variable technology to set up evaluated off-line model, the evaluated off-line model score vector of corresponding running status grade is wherein, c is the number of running status grade; Then, introduce sliding window technique, calculate the similarity between online data window and corresponding opinion rating, utilize similarity to carry out the on-line evaluation of complex industrial process running status; Euclidean distance between recycling slip data window and optimum opinion rating, calculates the contribution rate of relevant variable, identifies the non-optimal factor of running status.
Carrying out in evaluation of running status process, consider that single sample data is not enough to characterize the running status of production run, but also be easily subject to the interference of various factors, therefore, introducing a width is that the data window of H is as basic analytic unit; Then, the similarity between k time data window and corresponding opinion rating is calculated for carrying out the evaluation of process operation state; Meanwhile, a similarity threshold ε (0.5 < ε < 1) is set, for distinguishing the transition period of running status grade and the corresponding running status grade determined; Suppose the maximal value of similarity larger than threshold epsilon, then can deterministic process state grade be p; When running status is in transition period, time namely from the stage that a upper running status changes to next running status, the value of its similarity presents a rule slowly changed; Suppose that similarity is less than threshold epsilon and its size meets the rule increased progressively in turn, then can think that process operation state is in a transition period; Be enough to two hypothesis if discontented, just can think that running status is disturbed by other factors, result of its evaluation and being consistent before.
The flow process of on-line operation state evaluation is as follows:
The first step: the line window data matrix in structure k moment, X on, k=[x on, k-H+1..., x on, k] t, H is the width of data window;
Second step: standardization X on, k, the evaluated off-line model established before utilization, the line window data matrix after asking for standardization, X &OverBar; o n , k c , c = 1 , 2 , 3 , 4 ;
3rd step: calculate score vector
Wherein, by k o n , k c = &lsqb; k ( x 1 c , x &OverBar; o n , k c ) , k ( x 2 c , x &OverBar; o n , k c ) , ... , k ( x n c c , x &OverBar; o n , k c ) &rsqb; T Average centralization gained;
4th step: calculate the Euclidean distance between online data window and corresponding opinion rating c, wherein be equalization vector; Via T-KPRM method gained, then
5th step: utilize value, ask for the similarity between data window and corresponding opinion rating c:
Suppose d o n , k c &NotEqual; 0 , Then &gamma; o n , k c = 1 / d o n , k c &Sigma; p = 1 4 1 / d o n , k p --- ( 10 ) , Wherein &Sigma; c = 1 4 &gamma; o n , k c = 1 , 0 &le; &gamma; o n , k c &le; 1 ;
6th step: judge running status grade according to similarity, the interpretational criteria of on-line operation state is as follows:
If criterion one then judge that this online data window belongs to some running status grade p determined;
If criterion two does not meet criterion one, and meets formula p = argmax { &gamma; o n , k p | &gamma; o n , k - l +1 p < ... < &gamma; o n , k p , 1 &le; p &le; 4 } , Then judge that this data window is in the transition period of running status, be namely in the stage transformed to next running status from a upper running status, wherein, l is a positive integer, and size is determined by knowhow;
If criterion three is discontented be enough to two criterions, then prove that this data window receives the interference of other factors, the judged result of its running status be consistent before;
By the judgement of above three criterions, just can effectively determine to belong to some running status grades or the transition period be between state grade at the window data in k moment.
Non-optimal running status refers to all running statuses outside removing optimal operational condition, and this is comprising kilter, the transition stage of general state, poor state and state.Generally speaking, the supvr of enterprise wishes that production run runs on best state.But, when commercial production is inevitably in the non-optimal stage, goes to review those and cause running with regard to needing to go for a kind of suitable method the reason being in non-optimal conditions, thus provide foundation effectively for production adjustment.
In the research of the fault diagnosis based on data, the utilization of contribution plot method is commonplace.Particularly be directed to linear model, the method intuitively can find out the correlated variables that those causing trouble occur effectively.But traditional contribution plot method is not also suitable for the model set up by kernel function.Proposed here a kind of recognition methods of the non-optimal factor based on kernel function: first by calculating the partial derivative of Euclidean distance between online data window and optimal operational condition grade, obtain the contribution rate of sample data variable, the variable of larger contribution rate is wherein had to cause producing by being considered to the factor being in non-optimal running status, then to these factors Discern and judge in addition.
Euclidean distance between online data window and optimal operational condition grade is expressed as again
d o n , k 4 = t o n , k 4 T t o n , k 4 = k &OverBar; o n , k 4 T g y 4 T g y 4 k &OverBar; o n , k 4 = t r ( g y 4 k &OverBar; o n , k 4 k &OverBar; o n , k 4 T g y 4 T ) - - - ( 11 ) ,
In formula, represent the off-line model matrix of optimal operational condition, represent the core Mean Matrix of the online data window of corresponding optimal operational condition;
? in formula, the contribution rate of a jth variable be can be calculated by following formula:
Contr j = | &part; d o n , k 4 &part; x &OverBar; o n , k , j 4 | &CenterDot; x &OverBar; o n , k , j 4 = | &part; &part; x &OverBar; o n , k , j 4 t r ( g y 4 k &OverBar; o n , k 4 k &OverBar; o n , k 4 T g y 4 T ) | &CenterDot; x &OverBar; o n , k , j 4 = | t r &lsqb; g y 4 ( &part; k &OverBar; o n , k 4 k &OverBar; o n , k 4 T &part; x &OverBar; o n , k , j 4 ) g y 4 T &rsqb; | &CenterDot; x &OverBar; o n , k , j 4 , j = 1 , 2 , ... , m , - - - ( 12 ) ,
In formula, it is sample data window a jth variable, tr () is expressed as matrix trace;
In order to try to achieve above formula Contr jexpression formula accurately, must draw partial derivative, therefore, need the process carrying out some necessity:
A, nuclear matrix partial derivative be: &part; k ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x i , j - x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) - - - ( 13 ) ,
B, partial derivative is asked to two nuclear matrix simultaneously, can obtain:
&part; k ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = &part; k ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 k ( x q , x &OverBar; o n , k 4 ) + k ( x i , x &OverBar; o n , k 4 ) &part; k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x i , j + x q , j - 2 x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) - - - ( 14 ) ,
C, to calculate in the partial derivative of element that arranges of the i-th row q:
&part; ( k &OverBar; o n , k 4 , k &OverBar; o n , k 4 T ) p q &part; x &OverBar; o n , k , j 4 = &part; ( x p , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( A - a q ) &part; ( x p , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( A - a p ) &part; ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 - 1 n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x p , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 - 1 n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + ( a p + q q - 2 A ) n 4 &Sigma; i = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 + 1 n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 &part; ( x i , x &OverBar; o n , k 4 ) k ( x i &prime; , x &OverBar; o n , k 4 ) &part; x &OverBar; o n , k , j 4 = 1 &sigma; ( x p , j + x q , j - 2 x &OverBar; o n , k , j 4 ) k ( x p , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) + ( A - a q ) &sigma; ( x p , j - x &OverBar; o n , k , j 4 ) k ( x p , x &OverBar; o n , k 4 ) + ( A - a p ) &sigma; ( x q , j - x &OverBar; o n , k , j 4 ) k ( x q , x &OverBar; o n , k 4 ) - 1 &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j + x p . j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x p , x &OverBar; o n , k 4 ) - 1 &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j + x q . j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x q , x &OverBar; o n , k 4 ) + ( a p + a q - 2 A ) &sigma;n 4 &Sigma; i = 1 n 4 ( x i , j - x &OverBar; o n , k 4 ) k ( x i , x &OverBar; o n , k 4 ) + 1 &sigma;n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 ( x i , j + x i &prime; , j - 2 x &OverBar; o n , k , j 4 ) k ( x i , x &OverBar; o n , k 4 ) k ( x i &prime; , x &OverBar; o n , k 4 ) - - - ( 15 ) ,
In formula, a p = 1 n 4 &Sigma; i = 1 n 4 k ( x p , x i ) , A = 1 n 4 2 &Sigma; i = 1 n 4 &Sigma; i &prime; = 1 n 4 k ( x i , x i &prime; ) , N 4it is the sample number of the modeling data of optimal operational condition.
Simulation analysis:
In heavy medium coal preparation process, need constantly to carry out measuring and adjustation to coal preparation technique parameter, to ensure the stable of product quality and quantity, and ensure that the safety of production run is carried out, therefore the measuring and adjustation of the density of heavy-media suspension and rheological characteristics is the emphasis of dense medium coal cleaning process parameter investigating.In actual production, the running status of Dense Media Density observing and controlling process usually can be subject to the impact of the factors such as procedure parameter drift and environmental perturbation, and the running status of off-target.Dense medium is utilized to carry out the sorting of raw coal, just a complicated industry park plan process.In order to grasp the running status of production in time, this method can be used for the dense-medium separation density observing and controlling process operation state on-line evaluation of Kernel-based methods data.
By considering on-the-spot actual production process to the in-depth analysis of dense-medium separation density observing and controlling process, 5 process variable and 1 output variable on-line evaluation for dense-medium separation density observing and controlling process operation state can be chosen.These 5 process variable are respectively: main separation density, main separation liquid level, main separation pressure, Magnetic Materials concentration, coal slurry content; 1 output variable is: grey calibration.Each input/output variable is as shown in table 1:
Table 1 input/output variable table
1) foundation of density observing and controlling process off-line evaluation model
According to the field data of certain coal preparation plant's density observing and controlling process, 9900 groups of samples can be gathered, for modeling and the classification of assessment of running status.According to long-term practical production experience, the density of the heavy medium observing and controlling process is divided into gradual change running status between 4 kinds of state grades and different brackets according to grey calibration (fast grey data), as shown in table 2:
Table 2 process status criteria for classifying
Wherein, modeling data collection (X 1, y 1), (X 2, y 2), (X 3, y 3), (X 4, y 4) respectively corresponding excellent in differ from 4 class running status grades.Meanwhile, the evaluation model of each running status grade of T-KPRM method establishment is utilized c=1,2,3,4.
2) based on the density observing and controlling process on-line operation state evaluation of T-KPRM
Can in actual production process, random acquisition 640 groups of test datas in addition, for running status on-line evaluation.As shown in table 3, test data comprises the excellent middle 3 kinds of gradual change running state data differed between 4 state grades and different brackets.Arranging similarity threshold is ε=0.8, and moving window width is H=4, step-length l=1, and the continuous incremental number of similarity is 3.Production run follows time series, according to difference, in, good, excellent order transition.
Table 3 each stage test sample book group number
As shown in Figure 3, (a), (b), (c) in Fig. 3, (d) four figure represent respectively slip data window and difference, in, the situation of change of similarity between good, excellent one of four states grade.Between the 1st to the 103rd sampling instant, the similarity of the data of time slip-window and grade " poor " is greater than similarity threshold 0.85, can be judged by interpretational criteria one, and now production run operates in " poor " state.In like manner, the 127th to the 355th sampling instant, the 380th to the 509th sampling instant, between the 529th to the 640th sampling instant, the data of online data window respectively with state " in ", " good ", the similarity of " excellent " is all greater than similarity threshold 0.85, then can determine that now production run operates in respectively " in ", " good " and " excellent " state.(e) figure of Fig. 3 is the running status on-line evaluation result of whole observing and controlling process, that the ordinate 1,2,3,4 in figure represents respectively is poor, in, good, excellent one of four states grade.
Removing four determines outside the running status of grade, and remaining sampling instant region is then the transition period between adjacent two steady state (SS)s.Such as, from employing the moment 104 to 106 between, the similarity between line window data and each state grade is all less than threshold value, and with " in " similarity increase progressively gradually.Namely meet judge according to interpretational criteria two, meeting incremental number before and after similarity is 3, then show between sampling instant 104 to 126, the data of time slip-window and the similarity of " poor " grade are all less than similarity threshold, and the similarity between the moment of front and back presents the trend of successively decreasing successively.Production run run now be in " poor " arrive " in " transition period.In like manner, also can be judged as between sampling instant 356 to 379,510 to 528, production run is in respectively " in " arrive the transition period that the transition period of " good " and " good " arrive " excellent ".In (e) figure of Fig. 3, ordinate 1.5,2.5 and 3.5 represents the transition period between running status respectively.
3) evaluation model contrast when very noisy and outlier is had
Consider that industrial field data often can be subject to various interference (as sensor fault, process disturbance etc.) in gatherer process, and cause there is outlier in sample data.The existence of these outlier often causes modeling and analysis methods to lose its due generalization ability, thus affects the precision of model.Based on the complex industrial process evaluation of running status method (T-KPRM) of core inclined robust latent variable technology owing to combining the advantage of PRM and T-KPLS two kinds of algorithms, not only can solve the nonlinear problem of data rapidly, the impact of outlier on evaluation model precision can also be overcome.Although existing method (as T-KPLS, FDA etc.) also may be used for the foundation of evaluation of running status model, but when including outlier in modeling sample data, the precision of its evaluation model will be affected, and cannot carry out evaluation and the identification of running status exactly.
For dense-medium separation density observing and controlling process, random acquisition 603 groups of test datas in addition, for verifying the robustness that T-KPRM method is good.As shown in table 4, still comprise the excellent middle 3 kinds of gradual change running state data differed between 4 state grades and different brackets in these test datas.And four groups that modeling data collection still adopts one to save, be (X respectively 1, y 1), (X 2, y 2), (X 3, y 3), (X 4, y 4).Utilize the evaluation model of each running status grade of T-KPRM method establishment simultaneously c=1,2,3,4.Existing method (T-KPLS) is utilized also to set up the evaluation model of each running status grade c=1,2,3,4.
The test data that table 4 contrasts for method
Here select the mode adding disturbance in modeling data to simulate outlier and very noisy.Consider the collection situation of production data reality, " excellent " is compared with the modeling data of " good " grade " in " and the modeling data of " poor " grade to lack, therefore in the modeling data of " excellent " and " good " grade, the sample data of random choose 5% adds disturbance respectively.The mode that disturbance adds: by selected 5% sample (outlier), be divided into two parts data, and add the disturbance of 30% to the input of a part of data, the output of another part data of reciprocity quantity adds the disturbance of 30%.Meanwhile, " in " and " poor " grade modeling data in the sample data of random choose 10% respectively add disturbance.The mode that disturbance adds: by selected 10% sample (outlier), be divided into two parts data, and add the disturbance of 30% to the input of a part of data, the output of another part data of reciprocity quantity adds the disturbance of 30%.
When modeling data is normal data, when namely not adding disturbance, existing method (T-KPLS) and institute extracting method (T-KPRM) is utilized to carry out density observing and controlling process operation state evaluation, the evaluation result contrast of two kinds of methods, as shown in Figure 4, Figure 5.
When including outlier and very noisy in modeling data, namely according to statement before, add disturbance, existing method (T-KPLS) and institute extracting method (T-KPRM) is utilized to carry out density observing and controlling process operation state evaluation, the evaluation result contrast of two kinds of methods, as shown in Figure 6, Figure 7.
Below the evaluation recognition accuracy of two kinds of methods is contrasted.As shown in table 5, table 6, wherein similarity threshold span is: 0.6 to 0.9.
The existing method of table 5 (T-KPLS) evaluates recognition accuracy
Similarity threshold Difference In Good Excellent
ε≥0.6 21% 13% 59% 12%
ε≥0.7 17% 9% 41% 6%
ε≥0.8 10% 7% 19% 4%
ε≥0.9 4% 2% 11% 3%
Table 6 extracting method (T-KPRM) evaluates recognition accuracy
Similarity threshold Difference In Good Excellent
ε≥0.6 99% 72% 99% 99%
ε≥0.7 99% 55% 98% 96%
ε≥0.8 96% 30% 79% 87%
ε≥0.9 88% 15% 9% 62%
Analyze as can be seen from table, the evaluation recognition accuracy of T-KPRM method obviously will be better than the evaluation recognition accuracy of existing method (T-KPLS method).
Therefore, the comparing result of comprehensive evaluation model figure and recognition accuracy is visible, owing to including given outlier and very noisy in modeling data, existing method (T-KPLS) loses original generalization ability, the evaluation result figure of its four class states set up, all lose evaluation precision, and higher evaluation recognition accuracy cannot be kept, illustrate that the noise resisting ability of original method (T-KPLS) is weak.And the evaluation model that institute extracting method (T-KPRM) sets up, although there is some a little disturbances, but the emulation profile of evaluation result under substantially maintaining noise-free case, and maintain higher evaluation recognition accuracy, illustrate that the noise resisting ability of T-KPRM method is strong, method robustness is also good than existing method (T-KPLS).
4) for the non-optimal factor identification of density observing and controlling process
Non-optimal conditions refers to that industry park plan state is in other steady state (SS)s outside " excellent " state or transition state.When dense-medium separation density observing and controlling process operation is in some non-optimal conditions determined or the transition period being in former and later two steady state (SS)s, non-optimal factor now remains unchanged.That is, the change of non-optimal factor can cause different evaluation results.
To the non-optimal factor identification simulation result of each running status of dense-medium separation density observing and controlling process, as shown in Fig. 8,9,10,11,12,13, what its ordinate of Fig. 8 to Figure 13 represented is contribution rate, 1 in horizontal ordinate, 2,3,4,5 corresponding five variablees respectively: main separation density, main separation liquid level, main separation pressure, Magnetic Materials concentration and coal slurry content.The sampling instant that identifying is got is all random choose from the transition period between each state grade and state grade.Fig. 8 shows that the contribution rate of now main separation density and main separation pressure is higher, and therefore causing the 50th sampling instant to be in non-optimal conditions principal element is main separation density and main separation pressure.As can be seen from Figure 9 main separation pressure and main separation liquid level are the principal elements that the 115th sampling instant is in non-optimal conditions.From Figure 10,11,12,13, in like manner also can show that it is in the principal element of non-optimal conditions, does not repeat them here respectively.
Can be found out by above simulation result, utilize the running status evaluated off-line model that the complex industrial process evaluation of running status method (T-KPRM) based on core inclined robust latent variable technology is set up, dense medium coal separation density observing and controlling process carries out evaluation of running status and the identification of non-optimal factor is of practical significance very much.By means of evaluating and the result of factor identification, site operation personnel just can adjust in time and improve production strategy, enhances productivity.This evaluation of running status method is all applicable to dual medium cyclone dressing and gravity dense medium separation technique.
In sum, the T-KPRM method that the present invention proposes combines the advantage of both PRM and T-KPLS, effectively can not only solve the nonlinear problem of density observing and controlling process data, effectively can also overcome the impact of outlier on evaluation model precision.For complex industrial production run evaluation of running status problem, the present invention proposes the method application, specifically based on a complex industrial process evaluation of running status method of T-KPRM, and the method is applied in dense-medium separation density observing and controlling process, obtains good evaluation effect.The method, by accurately extracting the information of each variable of field data, is set up the evaluation model of historical process data, is then utilized three interpretational criterias to carry out the on-line operation state evaluation of density observing and controlling process.Finally by evaluation result, carry out the identification of running status non-optimal factor, provide decision-making foundation for operating personnel adjust production strategy in real time, thus improve production efficiency and the economic benefit of enterprise.

Claims (2)

1. based on a complex industrial process evaluation of running status method of T-KPRM, it is characterized in that, suppose that process data matrix is X ∈ R n × J, N is sample number, and J is process variable number; Exporting data matrix is Y ∈ R n, comprise output procedure variable; Concrete steps then based on the complex industrial process evaluation of running status method of T-KPRM are as follows:
Step one, zero-mean and unit variance process are carried out to each row of input data matrix X; In like manner, also standardization is carried out to output data matrix Y;
Step 2, by input data matrix X through Nonlinear Mapping Φ: x i∈ R n→ Φ (x i) ∈ F projects to high-dimensional feature space F, and calculate nuclear matrix K:K=Φ in F space tΦ;
Step 3, standardization is carried out to nuclear matrix K;
Step 4, PRM algorithm is run to input nucleus matrix K and output matrix Y, specific as follows:
A1, to establish be the lever weights of i-th sample data, can be represented by following formula:
w i x = f ( | | t i - med L 1 ( T ) | | med i | | t i - med L 1 ( T ) | | , c ) , i 1 , 2 , ... n - - - ( 1 ) ,
And f ( z , c ) = 1 ( 1 + | z / c | ) 2 - - - ( 2 ) ,
Wherein, || || represent Euclidean distance, med represents median, med l1represent L1 median, t ibe the PLS score of i-th sample data, c is constant;
If the residual error weights of i-th sample data can be defined by following formula:
Wherein, r irepresent the residual error between i-th sample data predicted value and actual value, represent the robust size estimation of residual error, can be calculated by following formula: r ~ = med i | r i - med j ( r j ) | , i , j = 1 , 2 , ... n - - - ( 4 ) ;
The then comprehensive weight w of i-th sample data i, can be determined by following formula: utilize formula (1), (3), (5) initialization weights W i;
B1, respectively input nucleus matrix K and output matrix Y to be weighted, after weighting, to obtain input nucleus matrix K wwith output matrix Y w, then PLS1 regression model is set up to the inputoutput data after weighting, revises its score vector simultaneously;
C1, calculate the residual error r of each data sample i, utilize formula (1), (3), (5) to upgrade sample weights W i;
If the relative difference of the double q of calculating in d1 front and back is less than threshold value, then enters step 5, otherwise return b1;
Step 5, now input data matrix become K w, export data matrix and become Y w, from output matrix Y wthe middle u extracting convergence i, make i=1, K wi=K w, Y wi=Y w;
A2, make Y wiin any one row equal u i;
B2, calculating K wscore vector, t i=K wiu i, t i← t i/ || t i||;
c2、 q i = Y W i T t i ;
D2, calculating Y wiscore vector u i=Y wiq i, u i← u i/ || u i||;
E2, judge u iwhether restrain, if convergence, proceed to step 6, otherwise return a2;
Step 6, calculating K wiload matrix:
Step 7, extract whole pivot, calculate input data matrix K wscore matrix T, input data matrix K wload matrix P, export data matrix Y wscore matrix U and export data matrix Y wload matrix Q, specific as follows:
A3, order K W i + 1 = K W i - t i p i T ; Y W i + 1 = Y W i - u i q i T ;
B3, make i=i+1, repeat step 5, six until extract A pivot, pivot number A can be determined by cross-validation method;
c3、T=[t 1,…,t A],P=[p 1,…,p A],U=[u 1,…,u A],Q=[q 1,…,q A];
Step 8, K w=TP t+ E, Y w=UQ t+ F;
Step 9, to pivot TP trun PCA algorithm:
Step 10, PCA algorithm is run to residual error E:
If exporting data matrix Y is single output variable, then the expression formula of T-KPRM model is as follows:
{ K W = t y p y T + T o P o T + T r P r T + E r y W = t y + F - - - ( 6 )
Above-mentioned score vector t ythat represent is in T and output y wdirectly related part, namely sets up the most critical variable drawn required for evaluation model, can be used for the foundation of running status evaluated off-line model; T orepresent in T with output y worthogonal part; ; T rit is the part having larger variance in residual error E; E rfinal residual error, namely noise;
When introducing new sample k new, its score matrix will be drawn by following formula:
t y = y W T T ( U T K W T ) - 1 U T k n e w = g y k n e w - - - ( 7 ) ,
Wherein, k newnew samples x newkernel function, can be calculated by following formula:
k new=Φ(X)Φ(x new)=[k(x 1,x new),…k(x n,x new)] T(8),
To k newcarry out equalization can obtain: k &OverBar; n e w = k n e w - K 1 t - 1 n k n e w + 1 n K 1 t - - - ( 9 ) ,
Wherein, 1 t=1/n [11...1] t∈ R n.
2. the application of a kind of complex industrial process evaluation of running status method based on T-KPRM according to claim 1, it is characterized in that, a kind of application of the complex industrial process evaluation of running status method based on T-KPRM refers to the identification utilizing core inclined robust latent variable technology to set up the evaluated off-line model of running status, the on-line evaluation of complex industrial process running status and non-optimal factor, specifically first utilize core inclined robust latent variable technology to set up evaluated off-line model, the evaluated off-line model score vector of corresponding running status grade is wherein, c is the number of running status grade; Then, introduce sliding window technique, calculate the similarity between online data window and corresponding opinion rating, utilize similarity to carry out the on-line evaluation of industrial process running status; Euclidean distance between recycling slip data window and optimum opinion rating, calculates the contribution rate of relevant variable, identifies the non-optimal factor of running status.
CN201510369102.5A 2015-06-29 2015-06-29 Based on T-KPRM complex industrial process evaluation of running status methods and application Active CN105278520B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510369102.5A CN105278520B (en) 2015-06-29 2015-06-29 Based on T-KPRM complex industrial process evaluation of running status methods and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510369102.5A CN105278520B (en) 2015-06-29 2015-06-29 Based on T-KPRM complex industrial process evaluation of running status methods and application

Publications (2)

Publication Number Publication Date
CN105278520A true CN105278520A (en) 2016-01-27
CN105278520B CN105278520B (en) 2018-09-21

Family

ID=55147700

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510369102.5A Active CN105278520B (en) 2015-06-29 2015-06-29 Based on T-KPRM complex industrial process evaluation of running status methods and application

Country Status (1)

Country Link
CN (1) CN105278520B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106647650A (en) * 2016-09-22 2017-05-10 宁波大学 Distributed industrial process monitoring method based variable weighting PCA (Principal Component Analysis) model
CN106843172A (en) * 2016-12-29 2017-06-13 中国矿业大学 Complex industrial process On-line quality prediction method based on JY KPLS
CN107463093A (en) * 2017-07-13 2017-12-12 东北大学 A kind of blast-melted quality monitoring method based on KPLS robust reconstructed errors
CN108388218A (en) * 2018-02-08 2018-08-10 中国矿业大学 The adaptive batch process optimization method of amendment based on latent variable process migration model
CN108490912A (en) * 2018-03-12 2018-09-04 华中科技大学 A kind of multi-modal process modal identification method based on pivot similarity analysis
CN109276945A (en) * 2018-10-24 2019-01-29 东北大学 A kind of self-healing control method of dense dehydration pan feeding unusual service condition
CN109902872A (en) * 2019-02-28 2019-06-18 北京首钢股份有限公司 A kind of method and apparatus that performance prediction model is combined with research and development of products
CN110232221A (en) * 2019-05-24 2019-09-13 华南理工大学 Dam Crack influence factor dynamic Contribution Rate method
CN111813064A (en) * 2020-07-03 2020-10-23 浙江大学 Industrial process running state online evaluation method based on instant learning thought
CN112329191A (en) * 2019-08-05 2021-02-05 华中科技大学 Modal identification method and transition mode fault detection method in multi-mode process
CN112363462A (en) * 2020-09-29 2021-02-12 中国矿业大学 Static-dynamic cooperative sensing complex industrial process running state evaluation method
CN113848836A (en) * 2021-09-30 2021-12-28 中国矿业大学 ILSTM network-based complex industrial process running state evaluation method
CN114384870A (en) * 2021-11-10 2022-04-22 中国矿业大学 Complex industrial process running state evaluation method based on kernel local linear embedding PLS

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104062953A (en) * 2014-06-12 2014-09-24 东北大学 Gold hydrometallurgy process running state online evaluation method
CN104123451A (en) * 2014-07-16 2014-10-29 河海大学常州校区 Dredging operation yield prediction model building method based on partial least squares regression

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104062953A (en) * 2014-06-12 2014-09-24 东北大学 Gold hydrometallurgy process running state online evaluation method
CN104123451A (en) * 2014-07-16 2014-10-29 河海大学常州校区 Dredging operation yield prediction model building method based on partial least squares regression

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JIA RUNDA等: "Kernal partial robus M-regression as a flexible robust nonlinear modeling technique", 《CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS》 *
吴盈 等: "基于样条变换偏鲁棒M-回归的电站热力过程数据检验", 《中国电机工程学报》 *
王姝: "基于数据的间歇过程故障诊断及预测方法研究", 《中国博士学位论文全文数据库》 *
贾润达: "基于核偏鲁棒M-回归的间歇反映过程混合模型辨识", 《高校化学工程学报》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106647650A (en) * 2016-09-22 2017-05-10 宁波大学 Distributed industrial process monitoring method based variable weighting PCA (Principal Component Analysis) model
CN106843172A (en) * 2016-12-29 2017-06-13 中国矿业大学 Complex industrial process On-line quality prediction method based on JY KPLS
CN106843172B (en) * 2016-12-29 2019-04-09 中国矿业大学 Complex industrial process On-line quality prediction method based on JY-KPLS
CN107463093A (en) * 2017-07-13 2017-12-12 东北大学 A kind of blast-melted quality monitoring method based on KPLS robust reconstructed errors
CN107463093B (en) * 2017-07-13 2019-05-21 东北大学 A kind of blast-melted quality monitoring method based on KPLS robust reconstructed error
CN108388218A (en) * 2018-02-08 2018-08-10 中国矿业大学 The adaptive batch process optimization method of amendment based on latent variable process migration model
CN108490912A (en) * 2018-03-12 2018-09-04 华中科技大学 A kind of multi-modal process modal identification method based on pivot similarity analysis
CN109276945B (en) * 2018-10-24 2020-10-23 东北大学 Self-healing control method for feeding abnormal working condition in thickening dehydration process
CN109276945A (en) * 2018-10-24 2019-01-29 东北大学 A kind of self-healing control method of dense dehydration pan feeding unusual service condition
CN109902872A (en) * 2019-02-28 2019-06-18 北京首钢股份有限公司 A kind of method and apparatus that performance prediction model is combined with research and development of products
CN110232221A (en) * 2019-05-24 2019-09-13 华南理工大学 Dam Crack influence factor dynamic Contribution Rate method
CN112329191A (en) * 2019-08-05 2021-02-05 华中科技大学 Modal identification method and transition mode fault detection method in multi-mode process
CN111813064A (en) * 2020-07-03 2020-10-23 浙江大学 Industrial process running state online evaluation method based on instant learning thought
CN111813064B (en) * 2020-07-03 2021-06-25 浙江大学 Industrial process running state online evaluation method based on instant learning thought
CN112363462A (en) * 2020-09-29 2021-02-12 中国矿业大学 Static-dynamic cooperative sensing complex industrial process running state evaluation method
CN113848836A (en) * 2021-09-30 2021-12-28 中国矿业大学 ILSTM network-based complex industrial process running state evaluation method
CN114384870A (en) * 2021-11-10 2022-04-22 中国矿业大学 Complex industrial process running state evaluation method based on kernel local linear embedding PLS
CN114384870B (en) * 2021-11-10 2024-06-11 中国矿业大学 Complex industrial process running state evaluation method based on nuclear local linear embedded PLS

Also Published As

Publication number Publication date
CN105278520B (en) 2018-09-21

Similar Documents

Publication Publication Date Title
CN105278520A (en) Complex industrial process running state evaluation method and application based on T-KPRM
CN111949700B (en) Intelligent safety guarantee real-time optimization method and system for petrochemical device
CN104778378B (en) A kind of oil gas field the analysis of affecting factors about production decline method
CN106053067A (en) Bearing fault diagnosis method based on quantum genetic algorithm optimized support vector machine
Zhou et al. A novel soft sensor modeling approach based on difference-LSTM for complex industrial process
CN105205327B (en) A kind of ethylene production efficiency dynamic assessment method based on operating mode
CN101382556A (en) Soft measurement method for floating fine coal ash from slurry based on data drive
CN102169077B (en) Hybrid intelligent soft measuring method of overflow granularity index in wet grinding process
CN112529341A (en) Drilling well leakage probability prediction method based on naive Bayesian algorithm
CN106354017A (en) Method for controlling content ranges of components in rare earth extraction and separation process
CN106156434A (en) Sliding window time difference Gaussian process regression modeling method based on the low and deep structure of local time
CN102788696A (en) Evaluation method for health degree of bearing on basis of improved BP (Back Propagation) neural network and fuzzy set theory
CN112363462B (en) Static-dynamic cooperative sensing complex industrial process running state evaluation method
CN107065834A (en) The method for diagnosing faults of concentrator in hydrometallurgy process
Wang et al. Fault detection and classification for complex processes using semi-supervised learning algorithm
CN111079978A (en) Coal and gas outburst prediction method based on logistic regression and reinforcement learning
Dong et al. Quality monitoring and root cause diagnosis for industrial processes based on Lasso-SAE-CCA
CN109885012A (en) A kind of gold hydrometallurgy whole process real-time optimization compensation method
CN116244625A (en) Overflow type mill load indirect forecasting method based on multi-feature fusion neural network
CN104298187A (en) Gold hydrometallurgy full-flow three-layer structure process monitoring method
CN106485326A (en) A kind of hardness detection method in ore reduction production process
CN106647274B (en) Operating condition stable state method of discrimination in a kind of continuous flow procedure
CN104199285A (en) Leaching rate prediction method for wet metallurgy gold cyaniding leaching process
CN106950946A (en) A kind of hydrometallurgy exception control method based on optimization principles
CN115496124A (en) Blast furnace fault diagnosis method based on minimum maximum entropy collaborative training

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant