CN113627687A - Water supply amount prediction method based on ARIMA-LSTM combined model - Google Patents

Water supply amount prediction method based on ARIMA-LSTM combined model Download PDF

Info

Publication number
CN113627687A
CN113627687A CN202111029223.7A CN202111029223A CN113627687A CN 113627687 A CN113627687 A CN 113627687A CN 202111029223 A CN202111029223 A CN 202111029223A CN 113627687 A CN113627687 A CN 113627687A
Authority
CN
China
Prior art keywords
model
prediction
arima
lstm
water supply
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111029223.7A
Other languages
Chinese (zh)
Inventor
秦贤海
杜坤
董云程
宋志刚
彭彦焜
吴以文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kunming University of Science and Technology
Original Assignee
Kunming University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kunming University of Science and Technology filed Critical Kunming University of Science and Technology
Priority to CN202111029223.7A priority Critical patent/CN113627687A/en
Publication of CN113627687A publication Critical patent/CN113627687A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Economics (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Primary Health Care (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a water supply amount prediction method based on an ARIMA-LSTM combined model. Firstly, combining an ARIMA model and an LSTM model by using a series connection mode, wherein the LSTM model is used for error correction of the ARIMA model, namely predicting a nonlinear part of the ARIMA model, thereby obtaining a final prediction result of the ARIMA model. And finally, respectively carrying out parallel weighted combination on the final prediction result of the ARIMA model and the prediction result of the LSTM model by using fixed weight to obtain the prediction result of the ARIMA-LSTM combined model. The method greatly improves the prediction precision, reduces the prediction error, has good prediction effect under the condition of more data sets, and can provide decision basis and technical support for urban water supply prediction.

Description

Water supply amount prediction method based on ARIMA-LSTM combined model
Technical Field
The invention belongs to the field of water supply prediction, and particularly relates to a water supply prediction method based on an ARIMA-LSTM combined model.
Background
The water supply prediction is used as the basis of decision planning of water supply units and provides prerequisites for the operation, management and optimization of water supply systems involved in the decision planning. The existing prediction method and model can provide great help for water demand prediction, but the application of the prediction method and model to water demand prediction of water supply units is full of various problems. For example: the accuracy of the existing data, numerous variables affecting the water demand prediction, the prediction range involved, the diversity of the prediction period, and the like all affect the reliability of the water demand prediction result. Water supply at a given time in the future is often an important and close link to water supply in the past, and therefore a great deal of research has taken advantage of this important link to make accurate and reliable predictions of municipal water supply.
In the context of water supply prediction, many researchers have proposed various prediction methods, which can be roughly classified into conventional methods and novel methods. Early studies attempted to solve the municipal water supply prediction problem using traditional statistical models, such as linear regression models and time series models. In the water supply time series model, a general model is an integrated autoregressive moving average (ARIMA) model, a seasonal autoregressive moving average (SARIMA) model, or the like. Since such linear models are easy to understand and implement, they have been the focus of research and have been widely used in practice.
However, in practice the variation in water supply is a combination of linear and non-linear and is both regular and random, making water supply prediction a challenging task. The nonlinear model can have better processing capability on the nonlinear part in the water supply time sequence, wherein a learning algorithm (such as machine learning and artificial intelligence) belongs to a nonlinear method, advanced data analysis is used, so that the learning algorithm model can effectively learn valuable information from water supply data and realize high-precision prediction. Long-short term memory neural networks (LSTM) are a popular machine learning method commonly used for water supply prediction, and the effectiveness of this method has been verified in many studies.
However, the single prediction model has a certain limitation on the prediction accuracy, and because the single model only extracts and predicts part of effective information in the water supply data from the information utilization perspective, the rule of the prediction data sequence can be described from only one information level. In view of the limitations of single models, researchers propose a combined predictive model concept. The combined model, namely different prediction models are combined properly, and the advantages of the respective models are utilized to extract different characteristic information of the predicted water supply data, so that the water supply data is comprehensively and fully utilized to a certain extent.
In conclusion, the method develops the urban short-term water supply prediction research based on the ARIMA-LSTM combined model.
Disclosure of Invention
The invention provides a water supply amount prediction method based on an ARIMA-LSTM combined model, which greatly improves the prediction precision, reduces the prediction error, has good prediction effect under the condition of more data sets, and can provide decision basis and technical support for urban water supply amount prediction.
A water supply amount prediction method based on an ARIMA-LSTM combined model. The method comprises the following steps:
(1) and (4) preprocessing the original water supply amount data. Preprocessing the original water supply time sequence based on a data preprocessing technology: carrying out stabilization and missing value treatment on the water supply time sequence;
(2) prediction by an ARIMA model. Predicting the linear part of the time sequence by using an ARIMA model;
(3) prediction by LSTM model. The LSTM model is used for error correction of the ARIMA model, namely, the nonlinear part of the ARIMA model is predicted, so that the final prediction result of the ARIMA model is obtained;
(4) ARIMA-LSTM combined model prediction. And respectively carrying out parallel weighted combination on the final prediction result of the ARIMA model and the prediction result of the LSTM model by using fixed weight to obtain the prediction result of the ARIMA-LSTM combined model.
In the step (1), firstly, decomposing the time sequence according to the characteristics of the original time sequence; secondly, the non-stationary part of the original time sequence is stabilized by using a traditional difference algorithm and a seasonal difference algorithm, and finally, the missing data value is further processed by using an interpolation method.
The specific steps of constructing the ARIMA model in the step (2) are as follows, 1) judging the stationarity of a time sequence; 2) scaling the model; 3) estimating model parameters; 4) further optimizing and selecting the model; 5) significance (white noise) test of the model; 6) the proposed model is predicted.
The specific steps of constructing the LSTM model in the step (3) are as follows, 1) dividing a verification set; 2) preprocessing data; 3) setting parameters; 4) training a model; 5) evaluating the model; 6) and (5) optimal model prediction.
And (4) the weighting coefficient is obtained by using a minimum criterion of combined prediction error. The specific solving method is as follows: assuming that n different single prediction models are used for prediction analysis of urban water supply, the combined prediction model expression is as follows:
Figure BDA0003245142440000041
in the formula (f)tA prediction value for the combined prediction model; lambda [ alpha ]iIs the weighting coefficient of the ith prediction model,
Figure BDA0003245142440000042
xitis the predicted value of the ith model at the moment t.
The prediction error of the combined prediction model can be expressed as:
Figure BDA0003245142440000043
in the formula (2), etA prediction error value for the combined prediction model; y istIs the measured value of the water supply amount; e.g. of the typeitThe prediction error value of the ith prediction model at the time t.
The square value of the prediction error of the combined prediction model is:
Figure BDA0003245142440000044
the sum of the squares of the prediction errors of the combined prediction model is set as:
Figure BDA0003245142440000045
if the unit column vector is In=[1,1,…,1]TConstraint condition of weighting coefficient
Figure BDA0003245142440000046
Can be written as:
Figure BDA0003245142440000047
if the prediction error variance of the combined prediction model is to be minimized, then the minimum value is determined under the constraint condition:
Figure BDA0003245142440000051
the two different models (ARIMA corrected model and LSTM model) are weighted and combined, so the weighted coefficients and the equation for the sum of the squares of the prediction errors for the optimal combined prediction method are as follows:
1. solving formula of weighting coefficient:
Figure BDA0003245142440000052
Figure BDA0003245142440000053
2. the prediction error sum of squares calculation formula:
Figure BDA0003245142440000054
the invention has the beneficial effects that: and obtaining a combined prediction model by carrying out parallel weighting on the ARIMA model and the LSTM model. The method can effectively improve the prediction precision, reduce the prediction error and have good prediction effect under the condition of more data sets, thereby providing decision basis and technical support for urban water supply prediction.
Drawings
FIG. 1 is a flow chart of ARIMA-LSTM combined model
FIG. 2 is a diagram of ARIMA-LSTM combined model prediction results
FIG. 3 is a diagram of ARIMA-LSTM combined model prediction relative error
Detailed Description
For a more clear understanding of the present invention, the following detailed description of the invention is given in conjunction with the actual cases.
As shown in FIG. 1, the invention relates to a water supply amount prediction method based on an ARIMA-LSTM combined model, which comprises the following steps:
(1) and (4) preprocessing the original water supply amount data. Preprocessing the original water supply time sequence based on a data preprocessing technology: and (4) carrying out stabilization and missing value treatment on the water supply time series. Firstly, decomposing the original time sequence according to the characteristics of the original time sequence; secondly, the non-stationary part of the original time sequence is stabilized by using a traditional difference algorithm and a seasonal difference algorithm, and finally, the missing data value is further processed by using an interpolation method.
(2) Prediction by an ARIMA model. Linear portions of the time series were predicted using the ARIMA model. The specific steps of constructing the ARIMA model comprise 1) judging the stationarity of a time sequence; 2) scaling the model; 3) estimating model parameters; 4) further optimizing and selecting the model; 5) significance (white noise) test of the model; 6) the proposed model is predicted.
(3) Prediction by LSTM model. The LSTM model is used for error correction of the ARIMA model, namely, the nonlinear part of the ARIMA model is predicted, so that the final prediction result of the ARIMA model is obtained. The specific steps of constructing the LSTM model comprise 1) dividing a verification set; 2) preprocessing data; 3) setting parameters; 4) training a model; 5) evaluating the model; 6) and (5) optimal model prediction.
(4) ARIMA-LSTM combined model prediction. And respectively carrying out parallel weighted combination on the final prediction result of the ARIMA model and the prediction result of the LSTM model by using fixed weight to obtain the prediction result of the ARIMA-LSTM combined model. Wherein the weighting coefficients are derived from a combined prediction error minimization criterion. The specific solving method is as follows: assuming that n different single prediction models are used for prediction analysis of urban water supply, the combined prediction model expression is as follows:
Figure BDA0003245142440000071
in the formula (f)tA prediction value for the combined prediction model; lambda [ alpha ]iIs the weighting coefficient of the ith prediction model,
Figure BDA0003245142440000072
xitis the predicted value of the ith model at the moment t.
The prediction error of the combined prediction model can be expressed as:
Figure BDA0003245142440000073
in the formula (2), etA prediction error value for the combined prediction model; y istIs the measured value of the water supply amount; e.g. of the typeitThe prediction error value of the ith prediction model at the time t.
The square value of the prediction error of the combined prediction model is:
Figure BDA0003245142440000074
the sum of the squares of the prediction errors of the combined prediction model is set as:
Figure BDA0003245142440000075
if the unit column vector is In=[1,1,…,1]TConstraint condition of weighting coefficient
Figure BDA0003245142440000076
Can be written as:
Figure BDA0003245142440000077
if the prediction error variance of the combined prediction model is to be minimized, then the minimum value is determined under the constraint condition:
Figure BDA0003245142440000078
the two different models (ARIMA corrected model and LSTM model) are weighted and combined, so the weighted coefficients and the equation for the sum of the squares of the prediction errors for the optimal combined prediction method are as follows:
1. solving formula of weighting coefficient:
Figure BDA0003245142440000081
Figure BDA0003245142440000082
2. the prediction error sum of squares calculation formula:
Figure BDA0003245142440000083

Claims (5)

1. a method for predicting water supply based on ARIMA-LSTM model combination, the method comprising the steps of:
(1) and (4) preprocessing the original water supply amount data. Preprocessing the original water supply time sequence based on a data preprocessing technology: carrying out stabilization and missing value treatment on the water supply time sequence;
(2) prediction by an ARIMA model. Predicting the linear part of the time sequence by using an ARIMA model;
(3) prediction by LSTM model. The LSTM model is used for error correction of the ARIMA model, namely, the nonlinear part of the ARIMA model is predicted, so that the final prediction result of the ARIMA model is obtained;
(4) ARIMA-LSTM combined model prediction. And respectively carrying out parallel weighted combination on the final prediction result of the ARIMA model and the prediction result of the LSTM model by using fixed weight to obtain the prediction result of the ARIMA-LSTM combined model.
2. The method for predicting water supply amount based on ARIMA-LSTM combination model as set forth in claim 1, wherein the step (1) is performed by first decomposing the time series according to the characteristics of the time series; secondly, the non-stationary part of the original time sequence is stabilized by using a traditional difference algorithm and a seasonal difference algorithm, and finally, the missing data value is further processed by using an interpolation method.
3. The method for predicting water supply based on the ARIMA-LSTM combined model as claimed in claim 1, wherein the ARIMA model is constructed in the step (2) by the specific steps of 1) judging the time sequence stationarity; 2) scaling the model; 3) estimating model parameters; 4) further optimizing and selecting the model; 5) significance (white noise) test of the model; 6) the proposed model is predicted.
4. The method for predicting water supply based on the ARIMA-LSTM combined model as claimed in claim 1, wherein the LSTM model is constructed in the step (3) by the specific steps of 1) dividing a validation set; 2) preprocessing data; 3) setting parameters; 4) training a model; 5) evaluating the model; 6) and (5) optimal model prediction.
5. The ARIMA-LSTM combination model-based water supply prediction method as set forth in claim 1, wherein the weighting coefficients in step (4) are determined using a combination prediction error minimization criterion. The specific solving method is as follows: assuming that n different single prediction models are used for prediction analysis of urban water supply, the combined prediction model expression is as follows:
Figure FDA0003245142430000021
in the formula (f)tA prediction value for the combined prediction model; lambda [ alpha ]iIs the weighting coefficient of the ith prediction model,
Figure FDA0003245142430000022
xitis the predicted value of the ith model at the moment t.
The prediction error of the combined prediction model can be expressed as:
Figure FDA0003245142430000023
in the formula (2), etA prediction error value for the combined prediction model; y istIs the measured value of the water supply amount; e.g. of the typeitThe prediction error value of the ith prediction model at the time t.
The square value of the prediction error of the combined prediction model is:
Figure FDA0003245142430000024
the sum of the squares of the prediction errors of the combined prediction model is set as:
Figure FDA0003245142430000025
if the unit column vector is In=[1,1,…,1]TConstraint condition of weighting coefficient
Figure FDA0003245142430000026
Can be written as:
Figure FDA0003245142430000031
if the prediction error variance of the combined prediction model is to be minimized, then the minimum value is determined under the constraint condition:
Figure FDA0003245142430000032
the two different models (ARIMA corrected model and LSTM model) are weighted and combined, so the weighted coefficients and the equation for the sum of the squares of the prediction errors for the optimal combined prediction method are as follows:
1. solving formula of weighting coefficient:
Figure FDA0003245142430000033
Figure FDA0003245142430000034
2. the prediction error sum of squares calculation formula:
Figure FDA0003245142430000035
CN202111029223.7A 2021-09-03 2021-09-03 Water supply amount prediction method based on ARIMA-LSTM combined model Pending CN113627687A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111029223.7A CN113627687A (en) 2021-09-03 2021-09-03 Water supply amount prediction method based on ARIMA-LSTM combined model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111029223.7A CN113627687A (en) 2021-09-03 2021-09-03 Water supply amount prediction method based on ARIMA-LSTM combined model

Publications (1)

Publication Number Publication Date
CN113627687A true CN113627687A (en) 2021-11-09

Family

ID=78389016

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111029223.7A Pending CN113627687A (en) 2021-09-03 2021-09-03 Water supply amount prediction method based on ARIMA-LSTM combined model

Country Status (1)

Country Link
CN (1) CN113627687A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117670000A (en) * 2024-02-01 2024-03-08 四川省机械研究设计院(集团)有限公司 Pump station water supply quantity prediction method based on combined prediction model

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117670000A (en) * 2024-02-01 2024-03-08 四川省机械研究设计院(集团)有限公司 Pump station water supply quantity prediction method based on combined prediction model
CN117670000B (en) * 2024-02-01 2024-04-12 四川省机械研究设计院(集团)有限公司 Pump station water supply quantity prediction method based on combined prediction model

Similar Documents

Publication Publication Date Title
CN111027772B (en) Multi-factor short-term load prediction method based on PCA-DBILSTM
CN116757534B (en) Intelligent refrigerator reliability analysis method based on neural training network
CN112149879B (en) New energy medium-and-long-term electric quantity prediction method considering macroscopic volatility classification
McAdam et al. Forecasting inflation with thick models and neural networks
CN110705743A (en) New energy consumption electric quantity prediction method based on long-term and short-term memory neural network
CN110942194A (en) Wind power prediction error interval evaluation method based on TCN
CN109635245A (en) A kind of robust width learning system
CN112085254A (en) Prediction method and model based on multi-fractal cooperative measurement gating cycle unit
CN113393057A (en) Wheat yield integrated prediction method based on deep fusion machine learning model
CN112365056A (en) Electrical load joint prediction method and device, terminal and storage medium
CN115470862A (en) Dynamic self-adaptive load prediction model combination method
CN107871157B (en) Data prediction method, system and related device based on BP and PSO
CN114971090A (en) Electric heating load prediction method, system, equipment and medium
JPH04372046A (en) Method and device for predicting demand amount
CN116227180A (en) Data-driven-based intelligent decision-making method for unit combination
CN113627687A (en) Water supply amount prediction method based on ARIMA-LSTM combined model
CN111292121A (en) Garden load prediction method and system based on garden image
CN114581141A (en) Short-term load prediction method based on feature selection and LSSVR
CN113177355A (en) Power load prediction method
CN113762591A (en) Short-term electric quantity prediction method and system based on GRU and multi-core SVM counterstudy
CN109829115B (en) Search engine keyword optimization method
CN113674014B (en) Method and system for predicting daily line loss rate of distribution line
CN116341753A (en) Method for predicting regional carbon dioxide emission by using machine learning
CN114925931A (en) Platform area load prediction method and system
CN112446550B (en) Short-term building load probability density prediction method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination