CN105354644A - Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression - Google Patents

Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression Download PDF

Info

Publication number
CN105354644A
CN105354644A CN201510849963.3A CN201510849963A CN105354644A CN 105354644 A CN105354644 A CN 105354644A CN 201510849963 A CN201510849963 A CN 201510849963A CN 105354644 A CN105354644 A CN 105354644A
Authority
CN
China
Prior art keywords
tau
sequence
time series
alpha
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510849963.3A
Other languages
Chinese (zh)
Inventor
余乐安
杨泽斌
汤铃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Chemical Technology
Original Assignee
Beijing University of Chemical Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Chemical Technology filed Critical Beijing University of Chemical Technology
Priority to CN201510849963.3A priority Critical patent/CN105354644A/en
Publication of CN105354644A publication Critical patent/CN105354644A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Physics & Mathematics (AREA)
  • Operations Research (AREA)
  • Finance (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Accounting & Taxation (AREA)
  • Theoretical Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Technology Law (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention belongs to the field of financial risk management, in particular to a time series probability distribution prediction method based on integrated empirical mode decomposition and nonlinear quantile regression. The method comprises the following steps: firstly, carrying out the integrated empirical mode decomposition on a financial price time series to obtain components with high regularity under different scales; secondly, independently predicting each component by 1-norm support vector machine quantile regression to obtain all quantile prediction results of each component; and thirdly, taking each quantile as a statistics target, independently adding the prediction result of each qnantile of each component, and integrating all prediction results to obtain the preduction result of each quantile so as to obtain the financial time series probability distribution prediction. The method provided by the invention can effectively predict a change probability of financial price, and can be applied to financial risk management and investment practice.

Description

A kind of Financial Time Series Forecasting method based on integrated empirical mode decomposition and 1-norm support vector machine quantile estimate
Technical field
The invention belongs to financial field, be specifically related to the Probability distribution prediction method of the price of various securities, earning rate.
Background technology
Financial time series, mainly comprises the data such as the price of various securities, earning rate.Although price movement has very strong uncertainty, among its change, always contain certain law.Such as, some experienced portfolio investors can rely on technical indicator, realize making prediction to the ups and downs of price, and therefrom make a profit.Find these rules, and these rules are used for the variation of forecast price, for the managing risk helping financial institution or common investor's science, making investment decision has great meaning.
Financial time series often has non-stationary and nonlinear feature, and traditional linear model is difficult to the rule caught wherein.At present conventional Forecast of Nonlinear Time Series model has the model such as neural network and support vector machine, and these models can the nonlinear characteristic of fit time sequence well, and then makes prediction.Although the precision of prediction of these models of mind is than the height of conventional model, the result that predicted is applied to decision-making and also has a certain distance.And these predictions are made prediction to the average of price, and lacked the description of the various possibilities to price movement, this is not easy to investor and carries out risk control.Quantile estimate model exactly can the distribution situation of estimated price variation, and it, by calculating a kind of absolute value residual error minimization function of unsymmetric form, estimates the regression function of each fractile respectively.Quantile estimate has lot of advantages, but is most importantly the overall picture that it more comprehensively can describe the distribution of explained variable condition, instead of only analyzes the average of explained variable.
The present invention, in conjunction with integrated empirical mode decomposition and 1-norm support vector machine quantile estimate model, predicts the variation distribution situation of financial time series.Wherein, integrated empirical mode decomposition is used for original time series being decomposed into regular strong component, to improve precision of prediction; Support vector machine quantile estimate model is used for estimating the predicted value of Nonlinear Time Series each fractile.
Summary of the invention
The object of the invention is to the deficiency existed in the Probability distribution prediction method for original financial time series, the Forecasting Methodology that a kind of precision is higher is provided, carries out risk management and investment decision with ancillary investment person.
The essence of time series forecasting modeling is the relation of history value by analytical sequence and its future value, sets up funtcional relationship.Use x trepresent known raw financial time series, wherein subscript t represents the time, total T phase a: t=1,2 ..., T; As set up anticipation function x t=f (x t-1, x t-2..., x t-l+1, x t-l), (l<T), wherein l represents the lag period of prediction, by autocorrelation analysis, and is determined by Schwarz minimization principle.Be such as 6 by analysis time series lag period l, then use its t-1, t-2 ..., the value of t-6 phase, as independent variable, predicts the value of t phase.Use y irepresent the dependent variable x of i-th training sample l+i, x irepresent the independent variable vector [x of i-th training sample i, x i+1..., x i+l-2, x i+l-1], then original time series { x 1, x 2..., x ttraining set (the x that one group of sample size is T-l can be formed i, y i), i=1,2 ..., T-l+1, T-l.For given training set (x i, y i), (i=1,2 ..., T-l+1, T-l), use 1-norm support vector machine quantile estimate model to set up x i→ y iat fractile τ, the anticipation function f of (τ ∈ (0,1)) τ(x i)=f τ(x i, x i+1..., x i+l-2, x i+l-1).
This invention first step carries out integrated empirical mode decomposition to financial Time Series of Random Macro-price, under obtaining different scale, and regular N+1 stronger Decomposition Sequence; Second step, carries out the model prediction of 1-norm support vector machine quantile estimate to each Decomposition Sequence respectively, obtains 9 fractile τ=0.1 of each sequence, 0.2 ..., 0.9 predict the outcome.3rd step, in units of each decomposed component, based on the following value under each fractile of each Decomposition Sequence of model outside forecast trained.4th step, predicts the outcome each fractile of each sequence and sums up respectively, integrate each fractile and predict the outcome.
The concrete steps of described Forecasting Methodology are as follows:
(1) integrated empirical mode decomposition is carried out to original time series;
Use x trepresent raw financial time series { x 1, x 2..., x t, wherein subscript t represents the time, total T phase a: t=1,2 ..., T; Use integrated empirical mode decomposition algorithm by x tdecompose, obtain N number of intrinsic mode function sequence c j,t, (j=1,2 ..., N) and a residual value sequence r t, decomposition result is formulated as:
x t = &Sigma; j = 1 N c j , t + r t
(2) to each Decomposition Sequence difference sequence prediction function Time Created;
The essence of time series forecasting modeling is the relation of history value by analytical sequence and its future value, sets up funtcional relationship.Use x trepresent raw financial time series { x 1, x 2..., x t, as set up anticipation function x t=f (x t-1, x t-2..., x t-l+1, x t-l), (l<T), wherein l represents the lag period of prediction, by autocorrelation analysis, and is determined by Schwarz minimization principle.Be such as 6 by analysis time series lag period l, then use its t-1, t-2 ..., the value of t-6 phase, as independent variable, predicts the value of t phase.Use y irepresent the dependent variable x of i-th training sample l+i, x irepresent the independent variable vector [x of i-th training sample i, x i+1..., x i+l-2, x i+l-1], then original time series { x 1, x 2..., x ttraining set (the x that one group of sample size is T-l can be formed i, y i), i=1,2 ..., T-l+1, T-l.For given training set (x i, y i), (i=1,2 ..., T-l+1, T-l), use 1-norm support vector machine quantile estimate model to set up x i→ y iat fractile τ, the anticipation function f of (τ ∈ (0,1)) τ(x i)=f τ(x i, x i+1..., x i+l-2, x i+l-1), its concrete grammar is:
For fractile τ ∈ (0,1), the essence of training 1-norm support vector machine quantile estimate model is by Optimal Parameters α i, α i *, (i=1,2 ..., T-l+1, T-l), wherein, α iwith α i *for the Lagrange factor of constraint condition corresponding to i-th sample point predicted value is less than actual value or is greater than actual value.Make following the minimization of object function:
m i n &alpha; i , &alpha; i * , b &Sigma; i = 1 T - l ( &alpha; i + &alpha; i * ) + C &Sigma; i = 1 T - l ( &tau;&xi; i + ( 1 - &tau; ) &xi; i * )
And meet constraint condition:
&Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) + b - y i &le; &xi; i * , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; - &Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) - b + y i &le; &xi; i , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; &xi; i , &xi; i * &GreaterEqual; 0
Wherein, dummy variable ξ i, represent model predication value respectively be less than actual value y ibe greater than actual value y iresidual error, b is intercept to be estimated, C for punishment parameter.K () is Radial basis kernel function, for any one group (t 1, t 2∈ [1,2 ..., T-l+1, T-l]):
K ( x t 1 , x t 2 ) = exp ( - | | x t 1 - x t 2 | | 2 2 &sigma; 2 )
σ is the core width of Radial basis kernel function, t 1, t 2represent the subscript of any two independents variable.
Parameter C and σ in model has material impact to prediction effect, and different time serieses may have different optimal parameter combinations.Optimal parameter is combined through gridding method and chooses: in given range C, σ ∈ [1,2,3,4,5,6,7,8,9,10,20,30,40,50,60,70,80,90,100], have 19 × 19 groups of parameters.To often organizing parameter, using 1-norm Nonlinear Support Vector Machines quantile estimate model to set up anticipation function respectively, and according to anticipation function, the data on training set being predicted.Predicated error between computational prediction value and actual value, chooses one group of minimum parameter combinations of predicated error as final argument.Wherein, for fractile τ, its predicated error is:
E &tau; = 1 T - l &Sigma; i = 1 T - l ( &tau;&xi; i , t r a i n + ( 1 - &tau; ) &xi; i , t r a i n * )
Herein, T-l represents training sample number, ξ i, trainrepresent the actual value y of training set i-th sample ibe greater than predicted value residual error, represent the actual value y of training set i-th sample ibe less than predicted value residual error:
&xi; i , t r a i n = m a x { y i - f &tau; ( x i ) , 0 } &xi; i , t r a i n * = m a x { f &tau; ( x i ) - y i , 0 }
After using gridding method to determine optimal parameter C and σ, this model is the linear programming problem of a band Linear Constraints in essence, classical simplicial method can be used to solve, obtain decision variable α i, α i *, (i=1,2 ..., T-l+1, T-l) and intercept b.For fractile τ, (τ ∈ (0,1)), its final anticipation function is as follows:
f &tau; ( x ) = &Sigma; i = 1 T - l ( &alpha; i - &alpha; i * ) K ( x i , x ) + b
Use 1-norm support vector machine quantile estimate model to decomposing N+1 the sequence obtained, each sequence sets up 9 component figure place anticipation functions, i.e. τ=0.1, and 0.2 ..., 0.9 totally 9 fractiles, obtain 9 × (N+1) individual anticipation functions altogether.Use f τ, j, (j=1,2 ..., N) represent the jth intrinsic mode function sequence c of fractile τ j,tanticipation function, f τ, rrepresent residual error r tanticipation function.
(3) outside forecast is carried out based on the model trained.
Use training set to train this model, obtain the anticipation function of 9 fractiles of N+1 Decomposition Sequence respectively, totally 9 × (N+1) is individual.Based on the history value of these anticipation functions and each Decomposition Sequence, can obtain each intrinsic mode function sequence and the residual sequence predicted value in the T+1 phase, its anticipation function is as follows:
c ^ &tau; , j , T + 1 = f &tau; , j ( c &tau; , j , T , c &tau; , j , T - 1 , c &tau; , j , T - 2 , . . . , c &tau; , j , T - l + 1 ) , j = 1,2 , . . . , N , &tau; = 0.1,0.2 , . . . , 0.9
r ^ &tau; , T + 1 = f &tau; , r ( r &tau; , T , r &tau; , T - 1 , r &tau; , T - 2 , . . . , r &tau; , T - l + 1 ) , &tau; = 0.1,0.2 , . . . , 0.9
(4) finally integrated to predicting the outcome of Decomposition Sequence.
Finally, being predicted the outcome by each component of each fractile, it is integrated to sum up, and obtains final the predicting the outcome of all fractiles:
x ^ &tau; , T + 1 = &Sigma; j = 1 N c ^ &tau; , j , T + 1 + r ^ &tau; , T + 1 , j = 1,2 , . . . , N , &tau; = 0.1,0.2 , . . . , 0.9
Relative to traditional Time Series Forecasting Methods, integrated Empirical mode decomposition is being introduced in the forecast of distribution of Nonlinear Time Series by advantage of the present invention.First data are decomposed, obtain the component of the better high and low frequency of predictability; So each component is carried out respectively the prediction of 1-norm support vector machine quantile estimate, obtain the forecast of distribution result of each component; Finally, then according to fractile, important predicting the outcome is carried out integrated, obtain the forecast of distribution result of original time series.
Forecasting Methodology provided by the invention can predict the situation of the complete probability distribution of time series, and has higher precision of prediction, has a wide range of applications at financial risk management and investment decision field.
Accompanying drawing explanation
Fig. 1 is this patent Forecasting Methodology implementation framework figure
Fig. 2 is that Index of Shanghai Stock Exchange monthly set of prices in March ,-2014 in January, 2000 becomes empirical mode decomposition component
Fig. 3 be the present invention to Index of Shanghai Stock Exchange from totally 30 phases in March ,-2014 in October, 2011,10%, 20% ..., 90% fractile predicts the outcome
Embodiment
The present invention is the financial time series distribution forecasting method based on integrated empirical mode decomposition and support vector machine quantile estimate, and its framework as shown in Figure 1.In order to the use of the method is described, for the actual effect of monthly closing price data (the totally 171 phases) test model in Index of Shanghai Stock Exchange in January, 2000 in March, 2014.Wherein, front 141 issues are according to being used for training pattern parameter (known sequential, maximum phase T=141), and rear 30 phases are for testing actual outside forecast effect (test set quantity T test=30).
Concrete steps are:
The first step: given original time series x toriginal series is carried out integrated empirical mode decomposition, is specifically implemented as follows.
Because white noise is random, be repeated below (1)-the decomposable process of (6) 100 times, use k=1 each time, 2,3 ..., 100 represent.
(1) at primordial time series data x ton to add a standard deviation be the white noise of 0.2, obtain and make r t k = x t k ;
(2) sequence is found in all local maximums and local minimum;
(3) by these local maximums and minimum value, its coenvelope line of Cubic Spline Functions Fitting and lower envelope line is used respectively with
(4) the average packet winding thread of upper and lower envelope is calculated
(5) from sequence middle separation obtain if meet two conditions: the number of extreme point is equal with the number of zero crossing or to differ be 1; At any some place, the average envelope generated by local maximum envelope and local minimum envelope is 0, then become new mode sequence, and make if do not satisfy condition, then as the residual error of decomposing;
(6) repeat (2)-(6) process, until find out all N number of intrinsic mode function sequences and residual error, decomposition result can be expressed as
(7) Decomposition Sequence 100 times obtained and residual error average respectively, obtain with final integrated empirical mode decomposition result is expressed as:
x t = &Sigma; j = 1 N c j , t + r t .
The last decomposition result of Index of Shanghai Stock Exchange as shown in Figure 2.
Second step: to using 1-norm support vector machine quantile estimate model to decomposing N+1 the sequence obtained, each sequence sets up 9 component figure place anticipation functions, i.e. τ=0.1,0.2,, 0.9 totally 9 fractiles, obtain 9 × (N+1) individual anticipation functions altogether.Use f τ, j, (j=1,2 ..., N) represent the jth intrinsic mode function sequence c of fractile τ j,tanticipation function, f τ, rrepresent the anticipation function of residual error.Its specific implementation method is as follows:
(1) according to seasonal effect in time series autocorrelation analysis, lag period l (l<T) is determined by Schwarz minimization principle (Bayesian Information amount).
With original time series x tfor example, set up its p rank autoregressive model (AR (p)):
x t=β 01x t-12x t-2+...+β px t-pt
Try to achieve its maximum likelihood function value L, then the Bayesian Information amount (BIC) of this model is:
BIC p=-2ln(L)+ln(T-p)*(p+1)
In order to determine best lag period l, first set up p=1,2 ..., 20 totally 20 rank autoregressive models, and calculate its corresponding BIC pvalue; Then, { BIC is found out 1, BIC 2..., BIC 20minimum one of intermediate value, its exponent number p is as the best lag period:
l=arcmin{BIC 1,BIC 2,...,BIC 20}
Such as, in Index of Shanghai Stock Exchange prediction case, the best lag period is set as 6.
(2) in order to improve the training speed of model, need training sample normalization.For decomposing N+1 the sequence obtained, using following formula respectively, making it normalize to [0,1] interval.
c j , t s c a l e = c j , t - m i n { c j , 1 , c j , 2 , ... , c j , T } m a x { c j , 1 , c j , 2 , ... , c j , T } - min { c j , 1 , c j , 2 , ... , c j , T } , j = 1,2 , . . . , N
r t s c a l e = r t - m i n { r 1 , r 2 , ... , r T } m a x { r 1 , r 2 , ... , r T } - min { r 1 , r 2 , ... , r T } , j = 1,2 , . . . , N
(3) training data generates.
The essence of time series forecasting modeling is the relation of history value by analytical sequence and its future value, sets up funtcional relationship.In this example, need N+1 component, 9 fractiles set up 9 × (N+1) individual anticipation functions.And for the different fractile functions of same Decomposition Sequence, its training set used is identical, therefore need to generate N+1 group training set.
Below with original series x tfor example illustrates the generation method of training sample: use y irepresent the dependent variable x of i-th training sample l+i, x irepresent the independent variable vector [x of i-th training sample i, x i+1..., x i+l-2, x i+l-1], then original time series { x 1, x 2..., x ttraining set (the x that one group of sample size is T-l can be formed i, y i), i=1,2 ..., T-l+1, T-l.For given training set (x i, y i), (i=1,2 ..., T-l+1, T-l), use 1-norm support vector machine quantile estimate model to set up x i→ y iat fractile τ, the anticipation function f of (τ ∈ (0,1)) τ(x i)=f τ(x i, x i+1..., x i+l-2, x i+l-1).
In the case of Index of Shanghai Stock Exchange time series forecasting, the data of front 141 phases are used as training data, can generate 141-6=135 training sample, as follows:
x 1([x 1,x 2,x 3,x 4,x 5,x 6])→y 1(x 7)
x 2([x 2,x 3,x 4,x 5,x 6,x 7])→y 2(x 8)
x 3([x 3,x 4,x 5,x 6,x 7,x 8])→y 3(x 9)
x 134([x 134,x 135,x 136,x 137,x 138,x 139])→y 134(x 140)
x 135([x 135,x 136,x 137,x 138,x 139,x 140])→y 135(x 141)
In like manner, test set 30 samples are as follows:
x 136([x 136,x 137,x 138,x 139,x 140,x 141])→y 136(x 142)
x 137([x 137,x 138,x 139,x 140,x 141,x 142])→y 137(x 143)
x 138([x 138,x 139,x 140,x 141,x 142,x 143])→y 138(x 144)
x 164([x 164,x 165,x 166,x 167,x 168,x 169])→y 164(x 170)
x 165([x 165,x 166,x 167,x 168,x 169,x 170])→y 165(x 171)
(4), after using said method to generate respective training set (altogether N+1) and test set (altogether N+1) to each Decomposition Sequence, each fractile of 1-norm Nonlinear Support Vector Machines quantile estimate model to each Decomposition Sequence is used to set up 9 × (N+1) individual anticipation functions.Below with original series x tfor example illustrates the generation method of anticipation function:
For fractile τ ∈ (0,1) and training set (x i, y i), i=1,2 ..., T-l+1, T-l, the essence of training 1-norm support vector machine quantile estimate model is by Optimal Parameters α i, α i *, (i=1,2 ..., T-l+1, T-l), wherein, α iwith α i *for the Lagrange factor of constraint condition corresponding to i-th sample point predicted value is less than actual value or is greater than actual value.Make following the minimization of object function:
m i n &alpha; i , &alpha; i * , b &Sigma; i = 1 T - l ( &alpha; i + &alpha; i * ) + C &Sigma; i = 1 T - l ( &tau;&xi; i + ( 1 - &tau; ) &xi; i * )
And meet constraint condition:
&Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) + b - y i &le; &xi; i * , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; - &Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) - b + y i &le; &xi; i , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; &xi; i , &xi; i * &GreaterEqual; 0
Wherein, dummy variable ξ i, represent model predication value respectively be less than actual value y ibe greater than actual value y iresidual error, b is intercept to be estimated, C for punishment parameter.K () is Radial basis kernel function, for any one (t 1, t 2∈ [1,2 ..., T-l+1, T-l]):
K ( x t 1 , x t 2 ) = exp ( - | | x t 1 - x t 2 | | 2 2 &sigma; 2 )
σ is the core width of Radial basis kernel function, t 1, t 2represent the subscript of any two independents variable.In the case of Index of Shanghai Stock Exchange time series forecasting, such as, for two independent variable x 7with x 10, σ=1, the value of its Radial basis kernel function is:
K ( x 7 , x 10 ) = exp ( - | | x 7 - x 10 | | 2 2 &times; 1 ) = exp ( - 1 2 &times; | | &lsqb; 0.0969 ; 0.1336 ; 0.1511 ; 0.1585 ; 0.1704 ; 0.1772 ] - [ 0.1585 ; 0.1704 ; 0.1772 ; 0.1967 ; 0.1963 ; 0.1736 &rsqb; | | 2 ) = exp { - 1 2 &times; &lsqb; ( 0.0969 - 0.1585 ) 2 + ( 0.1336 - 0.1704 ) 2 + ... + ( 0.1772 - 0.1736 ) 2 &rsqb; } = exp { - 1 2 &times; 0.0080 } = 0.9960
Parameter C and σ in model has material impact to prediction effect, and different time serieses may have different optimal parameter combinations.Optimal parameter is combined through gridding method and chooses: in given range C, σ ∈ [1,2,3,4,5,6,7,8,9,10,20,30,40,50,60,70,80,90,100], have 19 × 19 groups of parameters.To often organizing parameter, using 1-norm Nonlinear Support Vector Machines quantile estimate model to set up anticipation function respectively, and according to anticipation function, the data on training set being predicted.Predicated error between computational prediction value and actual value, chooses one group of minimum parameter combinations of predicated error as final argument.Wherein, for fractile τ, its predicated error is:
E &tau; = 1 T - l &Sigma; i = 1 T - l ( &tau;&xi; i , t r a i n + ( 1 - &tau; ) &xi; i , t r a i n * )
Herein, T-l represents training sample number, ξ i, trainrepresent the actual value y of training set i-th sample ibe greater than predicted value residual error, represent the actual value y of training set i-th sample ibe less than predicted value residual error:
&xi; i , t r a i n = m a x { y i - f &tau; ( x i ) , 0 } &xi; i , t r a i n * = m a x { f &tau; ( x i ) - y i , 0 }
Illustrate, for fractile τ=0.5, the final predicated error E of each combination of C and σ τrow are in Table 1 (for original Index of Shanghai Stock Exchange sequence x tthe example that is predicted as be described)
The original Index of Shanghai Stock Exchange sequence x of table 1 ttime prediction error value of each parameter combinations on training set in fractile τ=0.5
Wherein, when parameter C and σ is 90 and 1, obtain minimum predicated error, therefore for fractile τ=0.5, get C=90, σ=1.
This model is the linear programming problem of a band Linear Constraints in essence, classical simplicial method can be used to solve, obtain decision variable α i, α i *, (i=1,2 ..., T-l+1, T-l) and intercept b.For fractile τ, (τ ∈ (0,1)), its final anticipation function is as follows:
f &tau; ( x ) = &Sigma; i = 1 T - l ( &alpha; i - &alpha; i * ) K ( x i , x ) + b
Such as, the predicted value of outside forecast testing period first sample in fractile τ=0.5, substitutes into following formula:
x ^ &tau; = 0.5 , T + 1 = f &tau; = 0.5 ( x T - l + 1 ) = &Sigma; i = 1 T - l ( &alpha; - &alpha; i * ) K ( x i , x T - l + 1 ) + b
In order to test the forecast result of this method, the precision of prediction of testing model in test set 30 samples.Choose and do not carry out integrated empirical mode decomposition, only for original series x tcarry out respectively 1-norm support vector machine quantile estimate prediction and linear quantile estimate predict that two kinds of methods are tested in contrast.
In contrast method, the prediction of linear quantile estimate and 1-norm support vector machine quantile estimate forecast model carry out predicting the process of modeling and this method similar, all want the autocorrelation analysis of elapsed time sequence, samples normalization and training pattern to set up anticipation function.Be that prediction modeling is carried out to original Index of Shanghai Stock Exchange sequence unlike contrast method, instead of modeling is carried out to each decomposed component through integrated empirical mode decomposition.In addition, the setting of independent 1-norm support vector machine quantile estimate forecast model is identical with choice of parameters and this method.
Because this method can decomposite multiple decomposed component, using during 1-norm support vector machine quantile estimate model modeling each decomposed component and will carry out as the screening in table 1 parameter C, σ in normal situation.For the purpose of simplifying the description, when testing this method below, no longer screen parameter C, σ, the 1-norm support vector machine quantile estimate prediction model parameters of each decomposed component is unified is set to C=10, σ=2.As can be seen from last result, even if without Model Parameter Optimization, the result that this method draws all significantly will be better than contrast method.
The standard evaluated is the asymmetric tolerances function of fractile, selects the evaluation method used during parameter similar to gridding method before:
E &tau; , t e s t = 1 T t e s t &Sigma; i = 1 T t e s t ( &tau;&xi; T - l + i , t e s t + ( 1 - &tau; ) &xi; T - l + i , t e s t * )
&xi; T - l + i , t e s t = m a x { y T - l + i - f &tau; ( x T - l + i ) , 0 } &xi; T - l + i , t e s t * = m a x { f &tau; ( x T - l + i ) - y T - l + i , 0 }
Wherein, ξ t-l+i, testfor test set i-th sample actual value y t-l+ilower than predicted value f τ(x t-l+i) residual error, test set i-th sample is actual value y t-l+ihigher than predicted value f τ(x t-l+i) residual error.T testrepresent test set sample size, in the prediction case of Index of Shanghai Stock Exchange, T test=30.
Such as, during τ=0.3,
E &tau; = 0.3 , t e s t = 1 30 &Sigma; t = 142 171 ( 0.3 &times; &xi; t , r e a l + ( 1 - 0.3 ) &xi; t , r e a l * ) = 1 30 &times; 0.3 &times; max { 2468.3 - 2270.6 , 0 } + 1 30 &times; 0.7 &times; max { 2270.6 - 2468.3 , 0 } + 1 30 &times; 0.3 &times; max { 2333.4 - 2295.5 , 0 } + 1 30 &times; 0.7 &times; max { 2295.5 - 2333.4 , 0 } + ... + 1 30 &times; 0.3 &times; max { 2056.3 - 1982.2 , 0 } + 1 30 &times; 0.7 &times; max { 1982.2 - 2056.3 , 0 } + 1 30 &times; 0.3 &times; max { 2033.3 - 1970.8 , 0 } + 1 30 &times; 0.7 &times; max { 1970.8 - 2033.3 , 0 } = 23.53
Again such as, during τ=0.6,
E &tau; = 0.6 , t e s t = 1 30 &Sigma; t = 142 171 ( 0.6 &times; &xi; t , r e a l + ( 1 - 0.6 ) &xi; t , r e a l * ) = 1 30 &times; 0.6 &times; max { 2468.3 - 2381.4 , 0 } + 1 30 &times; 0.4 &times; max { 2381.4 - 2468.3 , 0 } + 1 30 &times; 0.6 &times; max { 2333.4 - 2275.5 , 0 } + 1 30 &times; 0.4 &times; max { 2275.5 - 2333.4 , 0 } + ... + 1 30 &times; 0.6 &times; max { 2056.3 - 2069.4 , 0 } + 1 30 &times; 0.4 &times; max { 2069.4 - 2056.3 , 0 } + 1 30 &times; 0.6 &times; max { 2033.3 - 2113.3 , 0 } + 1 30 &times; 0.4 &times; max { 2113.3 - 2033.3 , 0 } = 24.78
Table 2 the inventive method and other two kinds of classic methods each fractile predicated error in test set contrast
As can be seen from Table 2, these three kinds of models have obvious good and bad level.Wherein, the effect of 1-norm support vector machine Quantile Regression is better than linear quantile estimate model, and this shows that the method can the nonlinear characteristic of matching Index of Shanghai Stock Exchange.The most important thing is, no matter from deviation or the mean deviation of each fractile, the measurement error based on integrated empirical mode decomposition 1-norm support vector machine Quantile Regression of the present invention is all better than single model, this demonstrates superiority of the present invention.

Claims (1)

1., based on a Financial Time Series Forecasting method for integrated empirical mode decomposition and 1-norm support vector machine quantile estimate, it is characterized in that, the method comprises following operation steps:
(1) Time Series;
Use x trepresent raw financial time series { x 1, x 2..., x t, wherein subscript t represents the time, total T phase a: t=1,2 ..., T; Use integrated empirical mode decomposition algorithm by x tdecompose, obtain N number of intrinsic mode function sequence c j,t, (j=1,2 ..., N) and a residual value sequence r t, decomposition result is formulated as:
x t = &Sigma; j = 1 N c j , t + r t
(2) to each Decomposition Sequence difference sequence prediction function Time Created;
The essence of time series forecasting modeling is the relation of history value by analytical sequence and its future value, sets up funtcional relationship; Use x trepresent raw financial time series { x 1, x 2..., x t, wherein subscript t represents the time, total T phase a: t=1,2 ..., T; As set up anticipation function x t=f (x t-1, x t-2..., x t-l+1, x t-l), (l<T), wherein l represents the lag period of prediction, by auto-correlation and partial Correlation Analysis, and is determined by Schwarz minimization principle; Use y irepresent the dependent variable x of i-th training sample l+i, x irepresent the independent variable vector [x of i-th training sample i, x i+1..., x i+l-2, x i+l-1], then original time series { x 1, x 2..., x tform the training set (x that one group of sample size is T-l i, y i), i=1,2 ..., T-l+1, T-l; For given training set (x i, y i), (i=1,2 ..., T-l+1, T-l), use 1-norm support vector machine quantile estimate model to set up x i→ y iat fractile τ, the anticipation function f of (τ ∈ (0,1)) τ(x i)=f τ(x i, x i+1..., x i+l-2, x i+l-1), its concrete grammar is:
For fractile τ ∈ (0,1), the essence of training 1-norm support vector machine quantile estimate model is by Optimal Parameters α i, α i *, (i=1,2 ..., T-l+1, T-l), wherein, α iwith α i *for the Lagrange factor of constraint condition corresponding to i-th sample point predicted value is less than actual value or is greater than actual value; Make following the minimization of object function:
m i n &alpha; i , &alpha; i * , b &Sigma; i = 1 T - l ( &alpha; i + &alpha; i * ) + C &Sigma; i = 1 T - l ( &tau;&xi; i + ( 1 - &tau; ) &xi; i * )
And meet constraint condition:
&Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) + b - y i &le; &xi; i * , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; - &Sigma; s = 1 T - l ( &alpha; s - &alpha; s * ) K ( x s , x i ) - b + y i &le; &xi; i , i = 1 , 2 , ... , T - l ; s = 1 , 2 , ... , T - l , s &NotEqual; i ; &xi; i , &xi; i * &GreaterEqual; 0
Wherein, dummy variable ξ i, ξ i *represent model predication value respectively be less than actual value y ibe greater than actual value y iresidual error, b is intercept to be estimated, C for punishment parameter; K () is Radial basis kernel function, for any one (t 1, t 2∈ [1,2 ..., T-l+1, T-l]):
K ( x t 1 , x t 2 ) = exp ( - | | x t 1 - x t 2 | | 2 2 &sigma; 2 )
σ is the core width of Radial basis kernel function, t 1, t 2represent the subscript of any two independents variable;
Parameter C and σ in model has material impact to prediction effect, and different time serieses may have different optimal parameter combinations; Optimal parameter is combined through gridding method and chooses: in given range C, σ ∈ [1,2,3,4,5,6,7,8,9,10,20,30,40,50,60,70,80,90,100], have 19 × 19 groups of parameters; To often organizing parameter, using 1-norm Nonlinear Support Vector Machines quantile estimate model to set up anticipation function respectively, and according to anticipation function, the data on training set being predicted; Predicated error between computational prediction value and actual value, chooses one group of minimum parameter combinations of predicated error as final argument; Wherein, for fractile τ, its predicated error is:
E &tau; = 1 T - l &Sigma; i = 1 T - l ( &tau;&xi; i , t r a i n + ( 1 - &tau; ) &xi; i , t r a i n * )
Herein, T-l represents training sample number, ξ i, trainrepresent the actual value y of training set i-th sample ibe greater than predicted value residual error, represent the actual value y of training set i-th sample ibe less than predicted value residual error:
&xi; i , t r a i n = m a x { y i - f &tau; ( x i ) , 0 } &xi; i , t r a i n * = m a x { f &tau; ( x i ) - y i , 0 }
After using gridding method to determine optimal parameter C and σ, this model is the linear programming problem of a band Linear Constraints in essence, uses classical simplicial method to solve, obtains decision variable α i, α i *, (i=1,2 ..., T-l+1, T-l) and intercept b; For fractile τ, (τ ∈ (0,1)), its final anticipation function is as follows:
f &tau; ( x ) = &Sigma; i = 1 T - l ( &alpha; i - &alpha; i * ) K ( x i , x ) + b
Use 1-norm support vector machine quantile estimate model to decomposing N+1 the sequence obtained, each sequence sets up 9 component figure place anticipation functions, i.e. τ=0.1, and 0.2 ..., 0.9 totally 9 fractiles, obtain 9 × (N+1) individual anticipation functions altogether; Use f τ, j, (j=1,2 ..., N) represent the jth intrinsic mode sequence c of fractile τ j,tanticipation function, f τ, rrepresent residual error r tanticipation function;
(3) outside forecast is carried out based on the model trained;
Use training set to train this model, obtain the anticipation function of 9 fractiles of N+1 Decomposition Sequence respectively, totally 9 × (N+1) is individual; Based on the history value of these anticipation functions and each Decomposition Sequence, obtain each intrinsic mode function sequence and the residual sequence predicted value in the T+1 phase, its anticipation function is as follows:
c ^ &tau; , j , T + 1 = f &tau; , j ( c &tau; , j , T , c &tau; , j , T - 1 , c &tau; , j , T - 2 , . . . , c &tau; , j , T - l + 1 ) , j = 1,2 , . . . , N , &tau; = 0.1,0.2 , . . . , 0.9
r ^ &tau; , T + 1 = f &tau; , r ( r &tau; , T , r &tau; , T - 1 , r &tau; , T - 2 , . . . , r &tau; , T - l + 1 ) , &tau; = 0.1,0.2 , . . . , 0.9
(4) finally integrated to predicting the outcome of Decomposition Sequence;
Finally, being predicted the outcome by each component of each fractile, it is integrated to sum up, and obtains final the predicting the outcome of all fractiles:
x ^ &tau; , T + 1 = &Sigma; j = 1 N c ^ &tau; , j , T + 1 + r ^ &tau; , T + 1 , j = 1,2 , . . . , N , &tau; = 0.1,0.2 , . . . , 0.9 .
CN201510849963.3A 2015-11-29 2015-11-29 Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression Pending CN105354644A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510849963.3A CN105354644A (en) 2015-11-29 2015-11-29 Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510849963.3A CN105354644A (en) 2015-11-29 2015-11-29 Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression

Publications (1)

Publication Number Publication Date
CN105354644A true CN105354644A (en) 2016-02-24

Family

ID=55330611

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510849963.3A Pending CN105354644A (en) 2015-11-29 2015-11-29 Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression

Country Status (1)

Country Link
CN (1) CN105354644A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109242569A (en) * 2018-09-13 2019-01-18 西安建筑科技大学 A kind of molybdenum concentrate Long-term Market price analysis and prediction technique and system
CN109492707A (en) * 2018-11-28 2019-03-19 武汉轻工大学 Construction method, device, equipment and the storage medium of spectrum analysis model
CN110084399A (en) * 2019-03-21 2019-08-02 山东省疾病预防控制中心 A kind of periodical Time Series Forecasting Methods based on support vector machines
FR3099615A1 (en) 2019-07-30 2021-02-05 Advestis INTERPRETABLE PREDICTION PROCESS BY LEARNING USING TIME SERIES

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109242569A (en) * 2018-09-13 2019-01-18 西安建筑科技大学 A kind of molybdenum concentrate Long-term Market price analysis and prediction technique and system
CN109492707A (en) * 2018-11-28 2019-03-19 武汉轻工大学 Construction method, device, equipment and the storage medium of spectrum analysis model
CN109492707B (en) * 2018-11-28 2020-10-23 武汉轻工大学 Method, device and equipment for constructing spectral analysis model and storage medium
CN110084399A (en) * 2019-03-21 2019-08-02 山东省疾病预防控制中心 A kind of periodical Time Series Forecasting Methods based on support vector machines
CN110084399B (en) * 2019-03-21 2023-04-07 山东省疾病预防控制中心 Periodic time sequence prediction method based on support vector machine
FR3099615A1 (en) 2019-07-30 2021-02-05 Advestis INTERPRETABLE PREDICTION PROCESS BY LEARNING USING TIME SERIES

Similar Documents

Publication Publication Date Title
Choi et al. Short-term load forecasting based on ResNet and LSTM
CN111563706A (en) Multivariable logistics freight volume prediction method based on LSTM network
CN101480143B (en) Method for predicating single yield of crops in irrigated area
CN107102969A (en) The Forecasting Methodology and system of a kind of time series data
CN106021377A (en) Information processing method and device implemented by computer
CN104899451A (en) Time sequence prediction model selecting method based on statistical signal process
CN110443417A (en) Multiple-model integration load forecasting method based on wavelet transformation
Kempa et al. Investigation of bagging ensembles of genetic neural networks and fuzzy systems for real estate appraisal
CN109767312A (en) A kind of training of credit evaluation model, appraisal procedure and device
CN105354644A (en) Financial time series prediction method based on integrated empirical mode decomposition and 1-norm support vector machine quantile regression
CN105976077A (en) Power transmission and transformation project cost dynamic control target calculating system and calculating method
CN106778838A (en) A kind of method for predicting air quality
CN110619427A (en) Traffic index prediction method and device based on sequence-to-sequence learning model
Mrowczynska et al. Application of artificial intelligence in prediction of road freight transportation
Akpinar et al. Forecasting natural gas consumption with hybrid neural networks—Artificial bee colony
Batarseh et al. Application of machine learning in forecasting international trade trends
CN112418485A (en) Household load prediction method and system based on load characteristics and power consumption behavior mode
Hryhorkiv et al. Forecasting financial time sesries using combined ARIMA-ANN algorithm
CN107316096A (en) A kind of track traffic one-ticket pass passenger amount of entering the station Forecasting Methodology
Mey et al. Prediction of energy consumption for variable customer portfolios including aleatoric uncertainty estimation
CN104217296A (en) Listed company performance comprehensive evaluation method
CN103605493A (en) Parallel sorting learning method and system based on graphics processing unit
CN105894138A (en) Optimum weighted composite prediction method for shipment amount of manufacturing industry
CN109920489A (en) It is a kind of that model and method for building up are hydrocracked based on Lasso-CCF-CNN
CN107704944B (en) Construction method of stock market fluctuation interval prediction model based on information theory learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160224

RJ01 Rejection of invention patent application after publication