CN116340384A - Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution - Google Patents

Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution Download PDF

Info

Publication number
CN116340384A
CN116340384A CN202310105269.5A CN202310105269A CN116340384A CN 116340384 A CN116340384 A CN 116340384A CN 202310105269 A CN202310105269 A CN 202310105269A CN 116340384 A CN116340384 A CN 116340384A
Authority
CN
China
Prior art keywords
rule
data
time sequence
wake
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310105269.5A
Other languages
Chinese (zh)
Inventor
韩敏
夏慧娟
梁漪
胡磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University of Technology
Original Assignee
Dalian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University of Technology filed Critical Dalian University of Technology
Priority to CN202310105269.5A priority Critical patent/CN116340384A/en
Publication of CN116340384A publication Critical patent/CN116340384A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • G06F16/24566Recursive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Mathematical Optimization (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Operations Research (AREA)
  • Fuzzy Systems (AREA)
  • Computing Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention belongs to the field of time sequence prediction, and provides a kernel recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution. Firstly, preprocessing acquired data by using a normalization method and phase space reconstruction, and fully mining useful information in the data; then, realizing autonomous learning evolution of a rule base by utilizing the compatibility measurement and wake-up index double rules, and weakening adverse effects of outliers or complex noise; then, the kernel recursive maximum cross-correlation entropy method and the sparsification strategy are combined to update the model parameters, so that a compact dictionary is formed, the calculation complexity is reduced, the dynamic tracking capacity of the model to a time sequence is further enhanced, and the prediction precision is improved; and finally, outputting and predicting the trained model by adopting test data, and verifying the high efficiency of the model. The invention can carry out structural evolution aiming at an unknown complex environment, has stronger autonomy and robustness, and can realize the balance of high prediction precision and low calculation complexity.

Description

Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution
Technical Field
The invention belongs to the field of time sequence prediction, and relates to a method for online prediction of a kernel recursion maximum correlation entropy time sequence based on rule evolution.
Background
The time sequence is a group of data sets arranged according to time sequence, and widely exists in various fields of nature, industrial production, financial technology and the like. Along with the rapid development of sensors and storage devices, the data scale and the update speed of a time sequence are continuously improved, and the model for online prediction is required to show stronger performance on the aspects of nonlinearity, non-stability, non-Gaussian and the like of the time sequence while mining data hidden information in the face of environments with complex data burst and noise. Compared with other online prediction models, the kernel adaptive filter has the advantages of strong generalization capability, simple iterative update, low computational complexity and the like, and can effectively solve the problem of complex nonlinear prediction.
Although the kernel adaptive filter is widely used in the field of time series prediction, the following disadvantages still exist: (1) poor ability to capture time-varying features of a dynamic system. Although Wu et al use the related entropy criteria to replace the traditional mean square error criteria in the paper "Wu Z, shi J, zhang X, et al, kernel recursive maximum correntropy [ J ]. Signal Processing,2015,117:11-16 ], propose a kernel recursive maximum related entropy method, which improves the performance of the model in an outlier or non-gaussian noise environment, because the model only adjusts parameters, there is no structure adaptive evolution, resulting in poor time-varying property effects of model tracking time sequences, resulting in poor prediction accuracy. And (2) the calculation complexity of loading the complete dictionary is high. When the data is processed in each iteration, the model needs to be added with corresponding kernel space to store new data, and the size of the complete dictionary depends on the size of the data sample, which presents challenges for calculation time and memory, so that the model is difficult to be suitable for time series with large data scale.
Therefore, the invention takes a time sequence with larger scale and time-varying characteristics as a research object, and provides a method for online prediction of a nuclear recursion maximum correlation entropy time sequence based on rule evolution, so as to realize self-adaptive evolution of a model structure and self-adaptive update of parameters, further reduce calculation complexity and improve the prediction precision of the time sequence. The invention is funded by the national natural fund project (62173063).
Disclosure of Invention
Aiming at the problems of poor online prediction performance and over-high calculation cost in the prior art, the invention provides a method for online prediction of a kernel recursion maximum correlation entropy time sequence based on rule evolution.
In order to achieve the above purpose, the technical scheme adopted by the invention is as follows:
the method for online predicting the core recursion maximum correlation entropy time sequence based on rule evolution comprises the following specific steps:
step 1: time series data is collected from the real world and normalized.
First, a sample { (x (n), d (n)), n=1, 2, … }, where x (n) represents a model input vector consisting of t-dimensional feature inputs x (n), i.e., x (n) = [ x ] 1 (n),x 2 (n),…,x t (n)]D (n) represents a prediction target, and n represents a time; secondly, considering the dimension difference of the multidimensional features in the input vector, adopting a normalization method to process the data, and further reducing the adverse effect of the data on the accurate prediction of the model, wherein the calculation formula is as follows:
Figure BDA0004086150380000021
wherein x (n) and x' (n) are values before and after normalization of input data, respectively, x min And x max Respectively minimum and maximum values of input data; and similarly, carrying out normalization processing on the predicted target to obtain a normalized value d' (n) of the predicted target.
Step 2: the normalized time series data is subjected to phase space reconstruction, and the purpose of the phase space reconstruction is to deeply mine useful information (correlation, dynamics characteristics and the like) in the time series. Thus, phase space reconstruction is performed on the time series data, and the reconstructed input vector is used for the time series data
Figure BDA0004086150380000029
Expressed as:
Figure BDA0004086150380000022
wherein τ t (n) and m t (n) delay time and embedding dimension of the T-th feature sequence in the normalized input vector x' (n), T is transposed, and the prediction target is
Figure BDA0004086150380000023
Step 3: dividing the data set reconstructed in the step 2, and setting and initializing parameters.
80% of the reconstructed time series data are selected as training sets, the total amount of the training sets is recorded as N, and the rest is used as a test set.
Setting model parameters including kernel width sigma', gaussian kernel size sigma, cluster center learning rate eta, wake-up index learning rate beta, regularization parameter gamma and wake-up threshold value in related entropy
Figure BDA0004086150380000024
And a distance threshold δ.
At the 1 st moment, substituting the 1 st training data into the model, creating a first rule to form a rule base, and initializing key parameters of the model, including a wake-up index a 1 (1) =0, cluster center
Figure BDA0004086150380000025
Dictionary->
Figure BDA0004086150380000026
Expansion coefficient->
Figure BDA0004086150380000027
κ<·,·>Representing a gaussian kernel function, variables b and b * Is defined as:
Figure BDA0004086150380000028
where σ is the gaussian kernel size.
Step 4: and (3) performing iterative training of the model from the 2 nd moment according to the training data set and the parameter setting. In particular to
Step 4.1: loading 1 training data at each moment, and calculating compatibility measurement and wake-up index of the training data under each rule in a rule base;
the model employs compatibility metrics and wake-up index definition rules.
The compatibility measure ρ i Is used for measuring input vector
Figure BDA0004086150380000031
Correlation with the i-th normal clustering center R. When the compatibility metric reaches a maximum, it indicates that the degree of similarity between the current input sample and the rule is the greatest, at which point the rule is the most compatible rule. Input vector->
Figure BDA0004086150380000032
Clustering center R with ith rule i The compatibility metrics between (n) are as follows:
Figure BDA0004086150380000033
wherein ρ is i (n)∈[0,1]T represents the total feature number contained in the input vector, r represents the correlation dependence between two observed variables, calculated as:
Figure BDA0004086150380000034
wherein,,
Figure BDA0004086150380000035
and->
Figure BDA0004086150380000036
Respectively represent the nth characteristic input +.>
Figure BDA0004086150380000037
And the average value of the cluster center R.
The wake-up index a i Can be used as a compatibility measure ρ i To create new rules to reduce the negative impact of outliers. The calculation formula of the wake-up index a under the ith rule at the nth time is expressed as follows:
a i (n)=(1-β)a i (n-1)+β(1-ρ i (n)) (6)
where β represents the learning rate of the wake up index.
Step 4.2: will wake up index a i (n) minimum and wake-up threshold
Figure BDA0004086150380000038
Comparing, judging whether to create new rule, wherein the wake-up threshold value +.>
Figure BDA0004086150380000039
The range of (2) is 0-1, and the following two cases are specifically included:
case 1: when the minimum wake index is greater than the wake threshold, i.e
Figure BDA00040861503800000310
When a new rule is created, the number of rules in the rule base is increased, the new rule is updated to r=r+1, and parameters in the new rule including a cluster center are initialized>
Figure BDA00040861503800000311
Dictionary->
Figure BDA00040861503800000312
Expansion coefficient->
Figure BDA00040861503800000313
Case 2: when the minimum wake index is less than or equal to the wake threshold
Figure BDA00040861503800000314
I.e. < ->
Figure BDA00040861503800000315
When the clustering center is in the same time, the input vector (namely the current time sequence) is brought into the most compatible rule in the rule base, and the clustering center is recursively updated, wherein the calculation formula is as follows:
Figure BDA00040861503800000316
wherein eta epsilon [0,1] represents the learning rate of the cluster center.
And then, in the subsequent parameter updating of the model structure, a related entropy criterion is selected to replace a mean square error criterion in a traditional kernel recursion least square method as a cost function, and the kernel recursion maximum related entropy method is utilized to improve the prediction performance of the model under the condition of non-Gaussian noise or abnormal value. The optimization objective based on the correlation entropy is defined as:
Figure BDA0004086150380000041
wherein, omega is the weight of the filter,
Figure BDA0004086150380000042
indicating the current input at time j>
Figure BDA0004086150380000043
By means of the non-linearly mapped input vector, gamma represents regularization parameters, |·| represents L2 norms. Under the ith rule at the nth moment, obtaining an intermediate variable h by using a gradient descent method i (n)、z i (n) and lambda i (n) calculating intermediate variables according to formula (9), the calculation formula being:
Figure BDA0004086150380000044
wherein c ik (n) is the dictionary set in the ith rule at the nth time, k is the number of input vectors contained in the dictionary set, Q i (n-1) is a matrix variable of the ith rule at the n-1 th moment.
Since the kernel recursive maximum correlation entropy method performs parameter calculation based on sets of historical data, each set forms a dictionary, i.e., C i (n)=[c i1 (n),…,c in′ (n)]Where n' is the number stored in the dictionary set. If the dictionary contains all of the input data, this will result in a significant increase in computational burden. Therefore, in order to reduce the computational complexity, the sparsification processing is performed by adopting a novel criterion, so that only relevant input vectors are reserved in the dictionary, and a compact dictionary is formed. Defining a distance calculation formula as
Figure BDA0004086150380000045
In c ii* =[c 1 ,…,c m ]Representing the ith in the ith rule lower dictionary set c * A plurality of input vectors; thereafter, it is compared to a distance threshold, wherein the threshold ranges from 0 to 1, to determine whether the data is to be incorporated into a dictionary. The method comprises the following two parts:
(1) When (when)
Figure BDA0004086150380000046
At the time, the current input sample +.>
Figure BDA0004086150380000047
The inclusion of a dictionary is to be taken into account,
Figure BDA0004086150380000048
matrix variable Q at this time i (n) and expansion coefficient θ i (n) update formula is as follows:
Figure BDA0004086150380000049
wherein,,
Figure BDA00040861503800000410
e i (n) is a prediction error, expressed as
Figure BDA00040861503800000411
(2) When (when)
Figure BDA00040861503800000412
When the current data is excluded from the dictionary, the dictionary remains unchanged, and the matrix variable Q i (n) and expansion coefficient θ i The calculation form of (n) is as follows:
Figure BDA0004086150380000051
wherein the method comprises the steps of
Figure BDA0004086150380000052
Step 4.3: loading training data, and judging whether training is finished;
if the current moment is smaller than the total quantity of the training set, namely N is smaller than N, returning to the step 4.1, and entering the next moment for iterative updating; otherwise, the model training is completed, a rule base is exported, and the next step is entered.
Step 5: and (3) selecting the most compatible rule in the rule base to output and predict the test data set by using the model trained in the step (4), and then performing inverse normalization calculation on the sample data. Finally, the prediction accuracy of the model is measured using an evaluation index, including Root Mean Square Error (RMSE) and symmetric average absolute percentage error (SMAPE), defined as follows:
Figure BDA0004086150380000053
wherein d (j) and
Figure BDA0004086150380000054
the true value and the predicted value are respectively, and n is the number of samples.
Compared with the prior art, the invention has the following obvious advantages:
(1) The invention can carry out structural evolution aiming at an unknown complex environment, has stronger autonomy and robustness, and is characterized in that the invention constructs a rule base from nothing to nothing according to input data through compatibility measurement and wake-up index, self-adaptively updates a structure, and fully extracts useful information contained in the data, so that the rule quality is ensured;
(2) When data containing abnormal values or non-Gaussian noise is processed, the method can effectively inhibit adverse effects caused by the abnormal values or the non-Gaussian noise by adopting a kernel recursion maximum correlation entropy method, achieves real-time capture of dynamic changes of a time sequence, and generates accurate prediction results;
(3) In addition, the self-adaption and sparsification method of the rule base can ensure the compactness of the dictionary structure, and realize the balance of high prediction precision and low calculation complexity.
Drawings
FIG. 1 is a flow chart of a time series on-line prediction model of the present invention.
Fig. 2 (a) is a graph showing the prediction of Beijing PM2.5 time series according to the present invention.
Fig. 2 (b) is an error curve of the present invention for Beijing PM2.5 time series.
Detailed Description
The invention will be further described with reference to the accompanying drawings and specific examples.
The hardware equipment used by the invention comprises a PC machine.
As shown in fig. 1, the method for online predicting the temporal sequence of the maximum correlation entropy of the kernel recursion based on rule evolution is implemented as follows:
step 1: 8760 groups of PM2.5 pollutants and 4-dimensional capital international airport meteorological data, including air temperature, air pressure, dew point and wind speed, are obtained from 1 month 1 in 2019 to 12 months 31 in 2019, and the data are all collected once per hour. At this time, the input vector of the model is x (n) = [ x ] 1 (n),x 2 (n),…,x 5 (n)]Single-step prediction of Beijing PM2.5 contaminant is performed, and the prediction target is determined to be d (n) =x 1 (n+1); the data set { (x (n), d (n)), n=1, 2, … } is normalized according to the formula (1) to construct an input-output sample set (x '(n), d' (n)), so that the model prediction effect is not good due to the fact that the dimension difference of the multidimensional data set is large.
Step 2: in order to fully extract hidden information in the time sequence, determining that delay time and embedding dimension are tau=1 and m=20 respectively according to a phase space reconstruction theory, selecting 5000 groups of data after reconstruction, wherein input vectors and prediction targets are respectively
Figure BDA0004086150380000061
Step 3: 80% of the Beijing time series after reconstruction is used as a training set, the total data of the training set is 4000 at the moment, and the rest data are used as a test set. At the 1 st moment, the 1 st training data steps into the model, a first rule is created, a rule base is formed, and the model parameters are set as shown in table 1; initializing key parameters including wake index a 1 (1) =0, cluster center
Figure BDA0004086150380000062
Dictionary->
Figure BDA0004086150380000063
And expansion coefficient->
Figure BDA0004086150380000064
TABLE 1 model parameter settings
Figure BDA0004086150380000065
Step 4: and (3) utilizing the training set which arrives in sequence to evolve the rule base from the 2 nd moment, and iterating the training model. In particular to
Step 4.1: respectively calculating compatibility measurement and wake-up index of training data at the current moment and each rule in a rule base according to a formula (4) and a formula (6);
for Beijing 5-dimensional time series data, a vector is input at the nth moment
Figure BDA0004086150380000066
And the ith rule clustering center R i The compatibility metrics of (n) are as follows:
Figure BDA0004086150380000067
wherein the current time is input
Figure BDA0004086150380000071
Clustering center R with ith rule i The correlation dependence r between (n) is calculated according to equation (5).
Step 4.2: comparing wake-up index minimum value a imin (n) and wake threshold
Figure BDA0004086150380000072
Whether to add new rules is discussed, specifically, the following two cases:
case 1: when a is imin When (n) > 1e-5, a new rule is generated. At this time, the number of rules in the rule base is increased, i.e. r=r+1, and the key parameters under the rules are initialized, including the cluster center
Figure BDA0004086150380000073
Dictionary->
Figure BDA0004086150380000074
Expansion coefficient->
Figure BDA0004086150380000075
Case 2: when a is imin When (n) is less than or equal to 1e-5, storing the current time sequence into the most compatible rule base, and performing clustering center R i (n) updating, expressed as
Figure BDA0004086150380000076
Due to the complexity of external environmental noise, the acquired Beijing time series data has non-Gaussian characteristics, so that the final prediction effect of the model is not ideal. In order to solve the problems, an optimization target is established according to a formula (8), a related entropy criterion is essentially adopted to replace a traditional mean square error criterion, model parameter updating is performed based on a kernel recursion maximum related entropy method, and adverse effects of non-Gaussian noise or abnormal values are weakened. The calculation of the intermediate variables is then performed according to equation (9). In addition, the dimension of the kernel matrix in the kernel recursive maximum correlation entropy method depends on the size of the data amount, and it is difficult to avoid generating a large computational and storage burden when processing large-scale data. Therefore, a sparse method is adopted, so that the model only stores relevant input vectors, and a compact dictionary is formed. By comparison of
Figure BDA0004086150380000077
And analyzing whether the current data is added into the dictionary according to the relation between the current data and the distance threshold delta, wherein the relation is specifically divided into the following two parts:
(1) When (when)
Figure BDA0004086150380000078
When the current input data +.>
Figure BDA0004086150380000079
Added to the dictionary, matrix variable Q i (n) and expansion coefficient θ i (n) updating according to formula (10);
(2) When (when)
Figure BDA00040861503800000710
When the current input data +.>
Figure BDA00040861503800000711
Is rejected, dictionary is kept unchanged, matrix variable Q i (n) and expansion coefficient θ i (n) updating according to formula (11);
step 4.3: determining whether the training dataset is loaded, i.e., n < 4000?
If n is less than 4000, indicating that training is not finished, continuing to perform iterative updating at the next moment, and returning to the step 4.1; otherwise, the model training is completed, and the next step is entered.
Step 5: and testing on a trained model by using a test set, calculating the predicted value of the Beijing PM2.5 pollutant, performing inverse normalization treatment on the predicted value, and drawing a predicted curve and an error curve of the Beijing PM2.5 pollutant, which are shown in figure 2. As can be seen from the graph, the method can effectively track the change trend of the time sequence, has smaller prediction error value, and shows the accuracy of model prediction. Finally, RMSE and SMAPE were calculated according to equation (12), as shown in table 2. Compared with a quantization kernel recursion least square method (ANS-QKRRS) and a quantization kernel recursion generalized maximum correlation entropy method (QKRGMC) based on self-adaptive normalized sparsity, the method provided by the invention has the advantages that the obtained evaluation indexes are optimal, and the effectiveness of the method on Beijing PM2.5 prediction is further verified.
TABLE 2 prediction results of various methods
Figure BDA0004086150380000081
Finally, it should be noted that: the above examples are only for the purpose of illustrating the embodiments of the present invention, and it should be understood that the examples are only for the purpose of illustrating the present invention and not for the purpose of limiting the scope of the present invention, and that it is possible for those skilled in the art, after having read the present invention, to make several variations and modifications without departing from the spirit of the invention, and that various equivalent modifications of the present invention fall within the scope of the present invention as defined in the appended claims.

Claims (7)

1. The method for online predicting the core recursion maximum correlation entropy time sequence based on rule evolution is characterized by comprising the following steps of:
step 1: collecting time sequence data and carrying out normalization processing on the time sequence data;
step 2: carrying out phase space reconstruction on the normalized time sequence data, and deep mining useful information in the time sequence;
step 3: dividing the data set reconstructed in the step 2, and setting and initializing parameters;
step 4: according to the training data set and parameter setting, performing iterative training of the model from the 2 nd moment;
step 5: and (3) selecting the most compatible rule in the rule base to output and predict the test data set by using the model trained in the step (4), then performing inverse normalization calculation on the sample data, and finally measuring the model prediction precision by using the evaluation index.
2. The method for online prediction of a core recursive maximum correlation entropy time sequence based on rule evolution according to claim 1, wherein the step 1 is specifically as follows:
building samples { (x (n), d (n)), n=1, 2, … }, for the prediction problem, where x (n) represents a model input vector consisting of t-dimensional feature inputs x (n), i.e., x (n) = [ x ] 1 (n),x 2 (n),…,x t (n)]D (n) represents a prediction target, and n represents a time; secondly, considering the dimensional differences of the multidimensional features in the input vector, and processing the data by adopting a normalization method;
and carrying out normalization processing on the predicted target to obtain a normalized value d' (n) of the predicted target.
3. The method for online prediction of a kernel recursive maximum correlation entropy time sequence based on rule evolution according to claim 2, wherein in the step 1, the calculation formula of the normalization process is as follows:
Figure FDA0004086150370000011
wherein x (n) and x' (n) are values before and after normalization of input data, respectively, x min And x max Respectively, input dataIs a minimum and a maximum of (a).
4. The method for online prediction of a core recursive maximum correlation entropy time sequence based on rule evolution according to claim 1, wherein the step 2 is specifically as follows:
phase space reconstruction is carried out on the normalized time sequence data, and the reconstructed input vector is obtained
Figure FDA0004086150370000014
Expressed as:
Figure FDA0004086150370000012
wherein τ t (n) and m t (n) delay time and embedding dimension of the T-th feature sequence in the normalized input vector x' (n), T is transposed, and the prediction target is
Figure FDA0004086150370000013
5. The method for online prediction of a core recursive maximum correlation entropy time sequence based on rule evolution according to claim 1, wherein the step 3 is specifically as follows:
80% of the reconstructed time sequence data are selected as training sets, the total quantity of the training sets is recorded as N, and the rest is used as a test set;
setting model parameters including kernel width sigma', gaussian kernel size sigma, cluster center learning rate eta, wake-up index learning rate beta, regularization parameter gamma and wake-up threshold value in related entropy
Figure FDA0004086150370000021
A distance threshold delta;
at the 1 st moment, substituting the 1 st training data into the model, creating a first rule, forming a rule base, and initializing key parameters of the modelArrangements comprising wake-up index a 1 (1) =0, cluster center
Figure FDA0004086150370000022
Dictionary->
Figure FDA0004086150370000023
Expansion coefficient->
Figure FDA0004086150370000024
Kappa. Shows Gaussian kernel function, variables b and b * Is defined as:
Figure FDA0004086150370000025
where σ is the gaussian kernel size.
6. The method for online prediction of a core recursive maximum correlation entropy time sequence based on rule evolution according to claim 1, wherein the step 4 is specifically as follows:
step 4.1: loading 1 training data at each moment, and calculating compatibility measurement and wake-up index of the training data under each rule in a rule base;
the model adopts compatibility measurement and wake-up index definition rules;
the compatibility measure ρ i Is used for measuring input vector
Figure FDA0004086150370000026
Correlation with the i-th normal clustering center R; when the compatibility measurement value reaches the maximum, the similarity degree between the current input sample and the rule is indicated to be the maximum, and the rule is the most compatible rule; input vector->
Figure FDA0004086150370000027
Clustering center R with ith rule i The compatibility metrics between (n) are as follows:
Figure FDA0004086150370000028
wherein ρ is i (n)∈[0,1]T represents the total feature number contained in the input vector, r represents the correlation dependence between two observed variables, calculated as:
Figure FDA0004086150370000029
wherein,,
Figure FDA00040861503700000210
and->
Figure FDA00040861503700000211
Respectively represent the nth characteristic input +.>
Figure FDA00040861503700000212
And the average value of the cluster centers R;
the wake-up index a i Can be used as a compatibility measure ρ i To create new rules to reduce negative effects of outliers; the calculation formula of the wake-up index a under the ith rule at the nth time is expressed as follows:
a i (n)=(1-β)a i (n-1)+β(1-ρ i (n)) (6)
wherein, beta represents the learning rate of the wake-up index;
step 4.2: will wake up index a i (n) minimum and wake-up threshold
Figure FDA0004086150370000031
Comparing, judging whether to create new rule, wherein the wake-up threshold value +.>
Figure FDA0004086150370000032
The range of (2) is 0-1, and the following two cases are specifically included:
case 1: when the minimum wake index is greater than the wake threshold, i.e
Figure FDA0004086150370000033
When a new rule is created, the number of rules in the rule base is increased, the new rule is updated to r=r+1, and parameters in the new rule including a cluster center are initialized>
Figure FDA0004086150370000034
Dictionary for dictionary
Figure FDA0004086150370000035
Expansion coefficient->
Figure FDA0004086150370000036
Case 2: when the minimum wake index is less than or equal to the wake threshold
Figure FDA0004086150370000037
I.e. < ->
Figure FDA0004086150370000038
When the clustering center is in the same time, the input vector (namely the current time sequence) is brought into the most compatible rule in the rule base, and the clustering center is recursively updated, wherein the calculation formula is as follows:
Figure FDA0004086150370000039
wherein eta epsilon [0,1] represents the learning rate of the clustering center;
then, in the subsequent parameter updating of the model structure, a related entropy criterion is selected to replace a mean square error criterion in a traditional kernel recursion least square method as a cost function, and the kernel recursion maximum related entropy method is utilized to improve the prediction performance of the model under the condition of non-Gaussian noise or abnormal value; the optimization objective based on the correlation entropy is defined as:
Figure FDA00040861503700000310
wherein, omega is the weight of the filter,
Figure FDA00040861503700000311
indicating the current input at time j>
Figure FDA00040861503700000312
Through the nonlinear mapped input vector, gamma represents regularization parameters, and I.I.I represents L2 norms; under the ith rule at the nth moment, obtaining an intermediate variable h by using a gradient descent method i (n)、z i (n) and lambda i (n) calculating intermediate variables according to formula (9), the calculation formula being:
Figure FDA00040861503700000313
wherein c ik (n) is the dictionary set in the ith rule at the nth time, k is the number of input vectors contained in the dictionary set, Q i (n-1) is a matrix variable of the ith rule at the n-1 th moment;
since the kernel recursive maximum correlation entropy method performs parameter calculation based on sets of historical data, each set forms a dictionary, i.e., C i (n)=[c i1 (n),…,c in′ (n)]Wherein n' is the number stored in the dictionary set; if the dictionary contains all input data, the computational burden will be greatly increased; therefore, in order to reduce the computational complexity, the sparsification processing is performed by adopting a novel criterion, so that only relevant input vectors are reserved in the dictionary, and a compact dictionary is formed; defining a distance calculation formula as
Figure FDA00040861503700000314
In->
Figure FDA00040861503700000315
Representing the ith in the ith rule lower dictionary set c * A plurality of input vectors; then comparing the data with a distance threshold value to determine whether the data is included in a dictionary, wherein the threshold value ranges from 0 to 1; the method comprises the following two parts:
(1) When (when)
Figure FDA0004086150370000041
At the time, the current input sample +.>
Figure FDA0004086150370000042
The inclusion of a dictionary is to be taken into account,
Figure FDA0004086150370000043
matrix variable Q at this time i (n) and expansion coefficient θ i (n) update formula is as follows:
Figure FDA0004086150370000044
wherein,,
Figure FDA0004086150370000045
e i (n) is the prediction error, expressed as +.>
Figure FDA0004086150370000046
(2) When (when)
Figure FDA0004086150370000047
When the current data is excluded from the dictionary, the dictionary remains unchanged, and the matrix variable Q i (n) and expansion coefficient θ i The calculation form of (n) is as follows:
Figure FDA0004086150370000048
wherein the method comprises the steps of
Figure FDA0004086150370000049
Step 4.3: loading training data, and judging whether training is finished;
if the current moment is smaller than the total quantity of the training set, namely N is smaller than N, returning to the step 4.1, and entering the next moment for iterative updating; otherwise, the model training is completed, a rule base is exported, and the next step is entered.
7. The method for online prediction of a kernel recursive maximum correlation entropy time sequence based on rule evolution according to claim 1, wherein in the step 5, the prediction accuracy of the evaluation index metric model is adopted, and the prediction accuracy comprises a root mean square error RMSE and a symmetric average absolute percentage error SMAPE.
CN202310105269.5A 2023-02-13 2023-02-13 Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution Pending CN116340384A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310105269.5A CN116340384A (en) 2023-02-13 2023-02-13 Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310105269.5A CN116340384A (en) 2023-02-13 2023-02-13 Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution

Publications (1)

Publication Number Publication Date
CN116340384A true CN116340384A (en) 2023-06-27

Family

ID=86881286

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310105269.5A Pending CN116340384A (en) 2023-02-13 2023-02-13 Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution

Country Status (1)

Country Link
CN (1) CN116340384A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117041359A (en) * 2023-10-10 2023-11-10 北京安视华业科技有限责任公司 Efficient compression transmission method for information data

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117041359A (en) * 2023-10-10 2023-11-10 北京安视华业科技有限责任公司 Efficient compression transmission method for information data
CN117041359B (en) * 2023-10-10 2023-12-22 北京安视华业科技有限责任公司 Efficient compression transmission method for information data

Similar Documents

Publication Publication Date Title
CN111563706A (en) Multivariable logistics freight volume prediction method based on LSTM network
CN110427654B (en) Landslide prediction model construction method and system based on sensitive state
CN110390561B (en) User-financial product selection tendency high-speed prediction method and device based on momentum acceleration random gradient decline
CN114399032B (en) Method and system for predicting metering error of electric energy meter
CN111784061B (en) Training method, device and equipment for power grid engineering cost prediction model
CN110987436B (en) Bearing fault diagnosis method based on excitation mechanism
CN116340384A (en) Nuclear recursion maximum correlation entropy time sequence on-line prediction method based on rule evolution
CN113344288A (en) Method and device for predicting water level of cascade hydropower station group and computer readable storage medium
CN116187835A (en) Data-driven-based method and system for estimating theoretical line loss interval of transformer area
CN114609994A (en) Fault diagnosis method and device based on multi-granularity regularization rebalance incremental learning
CN114565021A (en) Financial asset pricing method, system and storage medium based on quantum circulation neural network
CN116681945A (en) Small sample class increment recognition method based on reinforcement learning
CN116720620A (en) Grain storage ventilation temperature prediction method based on IPSO algorithm optimization CNN-BiGRU-Attention network model
CN110738363A (en) photovoltaic power generation power prediction model and construction method and application thereof
CN114418234A (en) Power battery manufacturing capacity online prediction method based on reinforcement learning
CN110276478B (en) Short-term wind power prediction method based on segmented ant colony algorithm optimization SVM
CN107704952A (en) A kind of attack of terrorism Forecasting Methodology based on stochastic subspace
CN114819107B (en) Mixed data assimilation method based on deep learning
CN111950696A (en) Short-term power load prediction method based on dimension reduction and improved neural network
CN117131654A (en) Target observation method based on nonlinear optimal disturbance of pre-analysis initial guess condition
CN116307139A (en) Wind power ultra-short-term prediction method for optimizing and improving extreme learning machine
CN116304587A (en) Rolling bearing degradation trend prediction method based on CAE and AGRU
CN112581311B (en) Method and system for predicting long-term output fluctuation characteristics of aggregated multiple wind power plants
CN111641471B (en) Weight design method for prediction in atomic clock signal combination control
CN112766537A (en) Short-term electric load prediction method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination