CN115495991A - Rainfall interval prediction method based on time convolution network - Google Patents

Rainfall interval prediction method based on time convolution network Download PDF

Info

Publication number
CN115495991A
CN115495991A CN202211197406.4A CN202211197406A CN115495991A CN 115495991 A CN115495991 A CN 115495991A CN 202211197406 A CN202211197406 A CN 202211197406A CN 115495991 A CN115495991 A CN 115495991A
Authority
CN
China
Prior art keywords
precipitation
data
time
interval
convolution network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211197406.4A
Other languages
Chinese (zh)
Inventor
冯钧
邵萍萍
王文鹏
丁昱凯
严乐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN202211197406.4A priority Critical patent/CN115495991A/en
Publication of CN115495991A publication Critical patent/CN115495991A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/02Reliability analysis or reliability optimisation; Failure analysis, e.g. worst case scenario performance, failure mode and effects analysis [FMEA]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a rainfall interval prediction method based on a time convolution network, which comprises the following steps of collecting rainfall data and meteorological factor data to form an initial time sequence data set, and preprocessing the data; analyzing the characteristics of meteorological factors, constructing a characteristic extraction algorithm, acquiring the incidence relation between the meteorological factors and precipitation, and screening out a time sequence of the meteorological factors which have great influence on precipitation by adopting a maximum information coefficient-asynchronous principal component analysis algorithm; and (3) constructing a time convolution network model for predicting the drainage surface rainfall by combining the drainage surface rainfall, training the model by using data in the training set, then adjusting the model parameter evaluation model to converge the model, and finally predicting by using the test set data. According to the method, the capability of TCN for capturing effective long-time sequence features is utilized, the optimized LUBE coverage width evaluation standard is used as one of training loss objective functions to generate probability interval prediction of future observation results, and the precision of rainfall prediction is effectively improved.

Description

Rainfall interval prediction method based on time convolution network
Technical Field
The invention belongs to the technical field of precipitation forecast, and particularly relates to a precipitation interval prediction method based on a time convolution network.
Background
The medium-long term precipitation prediction is an important basis for the precise management of the water resources in the drainage basin. The data-driven model is one of the important ways to develop the prediction of the mid-long term precipitation. The beginning, duration and ending time of a rainy season are determined by the monthly rainfall, and the monthly rainfall provides more accurate annual rainfall distribution data than the seasonal rainfall, so that rainfall prediction needs to be researched on a monthly time scale, however, the sample capacity of monthly-scale rainfall statistical data is limited, the dimensionality of available forecasting factors is higher, high-correlation meteorological cause elements are screened out under the condition of limited sample capacity, and the construction of a robust data driving model is the key for successful medium-and long-term rainfall prediction.
In precipitation prediction, the models commonly used are mainly based on physical (process) driving and on data driving. The physical driving model is based on a hydrological conceptual model, the method analyzes all factors of precipitation formation, a general physical equation suitable for a certain territory is established to simulate the precipitation process, and the model parameters can be calibrated only by relying on experience and manual interaction and continuous iteration. The accuracy of model prediction depends on the knowledge and experience of the modeler and the completeness of the data. The parameters contained in the model have physical significance and good explanatory property, but the difficulty of parameter calibration is increased by the complex influence factors of medium-term and long-term precipitation. In recent years, data driving models such as linear regression, neural networks and support vector machines are applied to precipitation forecasting, the models regard hydrological processes as black boxes, and modeling is achieved by establishing mapping relations between input and output samples without considering physical mechanisms inside the system. The intelligent model has a certain limitation on the forecasting performance due to the fact that accurate precipitation and influence factors (precipitation and related meteorological factors) thereof cannot be obtained. In precipitation prediction, each meteorological factor has uncertainty, and the factors are correlated with each other, so that the physical driving model and the data driving model have respective defects.
Disclosure of Invention
The purpose of the invention is as follows: the invention aims to provide a rainfall interval prediction method based on a time convolution network, which is suitable for the prediction of the rainfall interval in a medium-term and long-term period and has higher prediction precision.
The technical scheme is as follows: the invention provides a rainfall interval prediction method based on a time convolution network, which comprises the following steps:
(1) Collecting precipitation data and meteorological factor data to form an initial time sequence data set, and preprocessing the data;
(2) Analyzing the characteristics of meteorological factors, constructing a characteristic extraction algorithm, acquiring the incidence relation between the meteorological factors and precipitation, and screening out a time sequence of the meteorological factors which have great influence on precipitation by adopting a maximum information coefficient-asynchronous principal component analysis algorithm;
(3) Constructing a precipitation interval prediction model based on a time convolution network, dividing screened data into a training set and a test set according to a reservation method, then sampling by a sliding window, and preparing a precipitation interval prediction model input data group based on the time convolution network;
(4) According to the input historical precipitation and highly-correlated historical meteorological factor sequence, a TCN time convolution network model is selected to construct a precipitation interval prediction model based on a time convolution network, and an LUBE interval prediction method is introduced to output an interval prediction value so as to train the precipitation interval prediction model based on the time convolution network and complete comprehensive modeling of a precipitation process;
(5) And inputting new precipitation amount to the trained precipitation interval prediction model based on the time convolution network to complete the prediction of the precipitation interval.
Further, the preprocessing of the data in the step (1) includes data cleaning and missing completion of the data, wherein the missing completion of the data adopts a linear interpolation processing mode in a time dimension.
Further, the time series implementation process of screening out the meteorological factors having a large influence on the precipitation amount by adopting the maximum information coefficient-asynchronous principal component analysis algorithm in the step (2) is as follows:
construction of interpolation timing by DTWStep correlation time series: organizing a data set X m×n Wherein m in each column represents the length of a univariate time sequence, and n represents the number of attributes in each row; normalizing each time series in the data matrix X; normalizing the time series Xi using the Z-score method to give normalized X' i Has zero mean and one standard variance, namely X' ih N (0, 1); calculating normalized X 'from DTW' i And X j ' obtaining an optimal deformation path
Figure BDA0003870812510000021
All elements of the optimal deformation path constitute elements of the interpolated time series, respectively: x ″ i ={p 1 (1),p 2 (1),…,p k (1) And X ″) j ={p 1 (2),p 2 (2),…,p k (2) All normalized time series should be extended to time series reflecting the corresponding asynchronous correlation; obtain a new interpolated time series, i.e., X = X "; x "is performed on 2 new time-series scatter plots, set D i Dividing by X Y, and dividing the elements by X ″) i The value is divided into X ″) i In each grid, dividing the grid into Y grids according to Y values; calculating the frequency distribution D | G of the set D of points falling on a given grid G, reasonably selecting the upper boundary B (n) of the grid G, B (n) being a function of the sample size and representing the constraint of the total number B of squares of the grid G smaller than B (n), B (n) = n 0.6 (ii) a Calculating different grids G to determine different probability distributions, namely I (D | G), and representing mutual information of the point sets based on the distribution D | G; on the basis of X ″) i Finding the maximum mutual information maxI (D | G) in all the possible distributed D | G mutual information of the XY grid G, and recording the maxI (D | G) as I '(D, m, n) to obtain a characteristic matrix I' (D) of the two-dimensional data set D;
and effectively measuring the correlation between the forecast factor and the actual precipitation sequence through the maximum information coefficient, screening out a factor with strong correlation with the actual precipitation, recording the factor as a high information quantity factor, and removing the factor containing more overlapped information from the high information quantity factor.
Further, the TCN time convolution network model is selected and used to construct the precipitation interval prediction model based on the time convolution network in the step (4), and the implementation process is as follows:
in the case of a univariate sequence, the complete dilation causal convolution operation f on successive layers for a given one-dimensional sequence of precipitation inputs X and a filter function ω of size k: {0, \8230;, k-1}, the definition on the sequence s is expressed as follows:
Figure BDA0003870812510000031
where s is an element of the sequence and d is an expansion parameter, according to the network depth d =2 i Exponential increase, d =2 i I-level for the network; and s-d · i describes the past direction; d is referred to as the dilated convolution operation to distinguish the normal convolution operation.
Further, the implementation process of introducing the LUBE interval prediction method to output the predicted value of the interval in the step (4) is as follows:
the evaluation index of the LUBE interval prediction method comprises the evaluation of PICP from the probability that the actual observation value is between the upper bound and the lower bound of the prediction interval; PINAW is evaluated from the width between the upper and lower bounds of the prediction interval:
Figure BDA0003870812510000032
Figure BDA0003870812510000033
wherein, U (x) k ) For the upper boundary value of the interval prediction, L (x) k ) A lower boundary value for interval prediction, n is the number of samples of the measured value, a is the range of the target variable, i.e. the difference between the maximum and minimum values; then the CWC measuring the width and coverage probability can be defined as:
Figure BDA0003870812510000034
adding penalty parameters, considering width, covering probability and average deviation indexes are defined as:
Figure BDA0003870812510000041
wherein the parameter tau linearly amplifies the PINAW, the parameter
Figure BDA0003870812510000042
Exponentially amplifying PICPs and e -η(PICP-μ) The difference between if PCWC is too small, the epsilon and tau hyper parameters are used to avoid vanishing.
Has the advantages that: compared with the prior art, the invention has the following beneficial effects: according to the method, the capability of TCN for capturing effective long-time sequence features is utilized, the optimized LUBE coverage width evaluation standard is used as one of training loss objective functions to generate probability interval prediction of future observation results, and the precision of rainfall prediction is effectively improved; preprocessing input data, constructing an interpolation time sequence through DTW, introducing an MIC matrix into an APCA method, performing feature screening, further excavating asynchronous correlation of meteorological factors and rainfall, and being more suitable for feature selection of high-dimensional meteorological factors; the method is more suitable for forecasting the precipitation interval in the middle and long term, has higher forecasting precision, is superior to models such as the traditional support vector machine and the like, and greatly improves the forecasting precision.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of causal dilation convolution in a TCN time convolution network model in an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
As shown in figure 1, the invention provides a rainfall interval prediction method based on a time convolution network, which comprises the steps of firstly, collecting hydrological data of a research basin, and then storing the data into a historical database; secondly, preprocessing hydrologic historical data such as missing completion, data anomaly correction and data normalization, and then dividing a training set and a test set after feature screening; constructing a rainfall interval prediction model based on a time convolution network, training the model by using data in a training set, then adjusting a model parameter evaluation model to converge the model, and finally predicting by using test set data; the method specifically comprises the following steps:
step 1: precipitation data and meteorological factor data are collected to form an initial time sequence data set, data are cleaned, and missing data are supplemented.
Collecting hydrological data with the data granularity of 1 item/month in 1979-2019 of a target basin (Qijiang basin), wherein each piece of data comprises precipitation in the basin, an atmospheric circulation factor, a sea temperature factor and other meteorological influence factors, and storing the data into a historical database; and taking out the data from the historical database and carrying out data preprocessing, including data missing completion. The data missing completion method adopts a linear interpolation method on a time dimension, namely, a missing value is determined according to the time values before and after the missing part.
And 2, step: analyzing the meteorological factor characteristics, constructing a characteristic extraction algorithm of input data, adopting maximum information coefficient-asynchronous principal component analysis to screen out a time sequence of the meteorological factor which has a large influence on precipitation, and acquiring the incidence relation between the meteorological factor and precipitation.
Constructing an interpolation time sequence and an asynchronous correlation time sequence through DTW; the correlation relation between the forecasting factors and the actual precipitation sequence is effectively measured through the maximum information coefficient, and the factors with strong correlation with the actual precipitation are screened out, wherein the factors are high information quantity factors generally and have obvious influence on forecasting variables; removing factors containing more overlapped information from the high information quantity factors, and the specific process is as follows:
organizing a data set (or data matrix) X m×n Wherein, each column m represents the length of the univariate time sequence, and n represents the number of attributes (or variables) of each row; each time series in the data matrix X is normalized. Normalizing the time series Xi using the Z-score method to give normalized X' i Has zero mean and one standard variance, namely X' ih N (0, 1). An interpolated time series is obtained. Calculating normalized X 'from DTW' i And X j ', obtaining an optimal deformation path
Figure BDA0003870812510000052
All elements of the optimal deformation path constitute elements of the interpolated time series, respectively: x ″) i ={p 1 (1),p 2 (1),…,p k (1) } and X ″ ", respectively j ={p 1 (2),p 2 (2),…,p k (2) All normalized time series should be extended to time series reflecting the corresponding asynchronous correlations. A new interpolated time series is obtained, i.e. X = X ". X "is performed on 2 new time-series scatter plots (set D) i Dividing by X Y, and dividing the elements by X ″) i The values are divided into X ″) i In each cell, the cells are divided into Y cells by Y value. Calculating the frequency distribution D | G of the set D of points falling on a given grid G, reasonably selecting the upper boundary B (n) of the grid G, B (n) being a function of the sample size and representing the constraint of the total number B of squares of the grid G smaller than B (n), B (n) = n 0.6 Computing different grids G determines different probability distributions, I (D | G), representing mutual information of the point sets based on the distributions D | G. Based on X ″ i Finding the maximum mutual information maxI (D | G) in all the possible distributed D | G mutual information of the XY grid G, and recording the maxI (D | G) as I '(D, m, n) to obtain a characteristic matrix I' (D) of the two-dimensional data set D; finding out the maximum from the results obtained from the above results, that is, the maximum information coefficient.
Figure BDA0003870812510000051
Respectively calculating the maximum mutual information coefficient MIC (X') between each weather forecast factor and the actually measured precipitation i And Y) are arranged in sequence from large to small, and a plurality of factors with the top rank are screened out to form a new factor set X' = { X1, X2, \8230;, xr }, wherein r is less than or equal to n. Separately calculating MIC values MIC (X ″) between the factors in the factor set X i ,X″ j ) (i, j is less than or equal to n) to form an MIC characteristic matrix. Calculating the eigenvalues of the MIC eigenmatrix and arranging the λ in descending order 1 ≥λ 2 ≥…≥λ m Is more than or equal to 0. Respectively determining corresponding characteristic values lambda i Characteristic vector e of i (i =1,2, \8230;, m), wherein | | | e i ||=1。
Calculating the principal component contribution rate and the accumulated contribution rate of the characteristic value, wherein the calculation formula is as follows:
Figure BDA0003870812510000061
Figure BDA0003870812510000062
wherein L is 1 Is a principal component contribution rate, L 2 In order to obtain the cumulative contribution rate, i =1,2, \8230, m, q principal components corresponding to q feature values with the cumulative contribution rate of 80% -95% are generally taken, and the final forecast factor set X "= { X ″", is obtained 1 ,X″ 2 ,…,X″ q Due to q<m, so a reduction in dimensionality can be achieved.
And step 3: and constructing a rainfall interval prediction model based on a time convolution network, dividing the screened data into a training set and a test set according to a reservation method, then sampling by a sliding window, and preparing a rainfall interval prediction model input data group based on the time convolution network.
The data after feature screening is divided into two mutually exclusive sets by a retention method: the method comprises the following steps of training set and testing set, wherein the proportion of the training set to the testing set is 8, the training set is data in 1979-2011, and the testing set is data in 2011-2019. And preparing input data of the model by adopting a sliding window sampling technology.
And 4, step 4: according to the input historical precipitation and highly-correlated historical meteorological factor sequences, a TCN time convolution network model is selected to construct a precipitation interval prediction model based on a time convolution network, and an interval prediction value is output by introducing a LUBE interval prediction method, so that training of the precipitation interval prediction model based on the time convolution network is realized, and comprehensive modeling of a precipitation process is completed.
And constructing a TCN time convolution network model which is formed by connecting causal expansion volumes and residual errors. Causal dilated convolutions are shown in FIG. 2: the causal convolution means that the output at the time t can only obtain convolution from the input which is not later than t, the causal convolution plays a role of a filter in time, and unlike the standard convolution, the output at the time t has no influence on future values, so that the causal relationship between historical data and future predicted values can be better captured, the TCN uses a 1DFCN (one-dimensional full convolution network) architecture, the length of each hidden layer is the same as that of an input layer, and zero padding is used to ensure that subsequent layers have the same length. The longer prediction period increases the difficulty of predicting the target value, the extended causal convolution allows for skipping the input values of a particular step, and the difference between the extended convolution and the conventional convolution is that it allows the input of the convolution to be sampled at intervals. Such convolution increases the input range of the network, accessing longer input subsequences. The concrete structure is as follows:
in the case of a univariate sequence, the complete dilation causal convolution operation f on the concealment layer for a given one-dimensional sequence of precipitation inputs X and a filter function ω of size k: {0, \8230;, k-1}, the definition on the sequence s is expressed as follows:
Figure BDA0003870812510000071
where s is an element of the sequence, d is an expansion parameter, and s-d · i describes the past direction. D is referred to as the dilated convolution operation to distinguish the normal convolution operation. The normal convolution operator (×) is a specific version of the dilation convolution (when d = 1). The receptive field of causal networks is increased in two ways: one is to increase the filter size k; the second is to increase the expansion factor d. In this study, d is according to the network depth d =2 i Exponentially increased to ensure that long-term history can be effectively covered; specifically, d =2 i For the i-level of the network.
A LUBE (lower and upper bound estimation) interval prediction method is introduced into an output layer, and uncertainty of a prediction result is quantized to generate probability interval prediction of a future observation result. The evaluation standard (PCWC) of the coverage width of the LUBE is optimized, and the precision of rainfall prediction is effectively improved, and the method specifically comprises the following steps:
the number of evaluation indexes of the LUBE interval prediction method is mainly 2, and the PICP is evaluated from the probability that an actual observation value is between the upper bound and the lower bound of a prediction interval; PINAW is evaluated from the width between the upper and lower bounds of the prediction interval:
Figure BDA0003870812510000072
Figure BDA0003870812510000073
wherein, U (x) k ) For the upper boundary value of the interval prediction, L (x) k ) For the lower boundary value of the interval prediction, n is the number of samples of the measured value, and a is the range of the target variable, i.e. the difference between the maximum and minimum values. Then the CWC measuring the width and coverage probability can be defined as:
Figure BDA0003870812510000074
in the rainfall prediction, under the influence of high-dimensional meteorological factors, the accuracy of a prediction result is often greatly influenced by characteristics, and the balance between the section coverage rate and the section width of the rainfall prediction is difficult to adjust by phi. Adding penalty parameters on the basis, and considering the width, the coverage probability and the average deviation index can be defined as:
Figure BDA0003870812510000075
wherein the parameter tau linearly amplifies the PINAW, the parameter
Figure BDA0003870812510000081
Exponentially amplifying PICPs and e -η(PICP-μ) The difference between if PCWC is too small, 8 and τ superparameters are used to avoid vanishing.
And 5: and inputting new precipitation amount to the trained precipitation interval prediction model based on the time convolution network to complete the prediction of the precipitation interval.
The optimized PCWC indexes are introduced to carry out interval prediction training on the model, in addition, 2 evaluation standards are used for evaluating the performance of the model, namely the root mean square error RMSE and the Nash efficiency coefficient NSE, and the model evaluation standard formula is as follows:
root mean square error
Figure BDA0003870812510000082
Coefficient of nash efficiency:
Figure BDA0003870812510000083
wherein, y i Is the observed value at the time of the ith time,
Figure BDA0003870812510000084
is a predicted value at the i-th time,
Figure BDA0003870812510000085
is the average of the observations and N represents the number of observations during the test.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, many modifications and adaptations can be made without departing from the principle of the present invention, and such modifications and adaptations should also be considered to be within the scope of the present invention.

Claims (5)

1. A rainfall interval prediction method based on a time convolution network is characterized by comprising the following steps:
(1) Collecting precipitation data and meteorological factor data to form an initial time sequence data set, and preprocessing the data;
(2) Analyzing meteorological factor characteristics, constructing a characteristic extraction algorithm, acquiring the incidence relation between meteorological factors and precipitation, and screening out a time sequence of the meteorological factors which have great influence on the precipitation by adopting a maximum information coefficient-asynchronous principal component analysis algorithm;
(3) Constructing a precipitation interval prediction model based on a time convolution network, dividing screened data into a training set and a test set according to a reservation method, then sampling by a sliding window, and preparing a precipitation interval prediction model input data group based on the time convolution network;
(4) According to the input historical precipitation and highly-correlated historical meteorological factor sequence, a TCN time convolution network model is selected to construct a precipitation interval prediction model based on a time convolution network, and an LUBE interval prediction method is introduced to output an interval prediction value so as to train the precipitation interval prediction model based on the time convolution network and complete comprehensive modeling of a precipitation process;
(5) And inputting new precipitation amount to the trained precipitation interval prediction model based on the time convolution network to complete the prediction of the precipitation interval.
2. The method for predicting the precipitation interval based on the time convolution network as claimed in claim 1, wherein the preprocessing of the data in the step (1) includes data cleaning and data missing completion, wherein the data missing completion adopts a linear interpolation processing mode in a time dimension.
3. The rainfall interval prediction method based on the time convolution network of claim 1, wherein the time series implementation process for screening out the meteorological factors having a large influence on the rainfall amount by using the maximum information coefficient-asynchronous principal component analysis algorithm in the step (2) is as follows:
constructing an interpolation time sequence through DTW, wherein the time sequence of asynchronous correlation comprises the following steps: organizing a data set X m×n Wherein, each column m represents the length of the univariate time sequence, and n represents the number of the attributes of each row; normalizing each time series in the data matrix X; normalizing the time series Xi using the Z-score method to give normalized X' i Has zero mean and one standard variance, namely X' ih N (0, 1); calculating normalized X 'from DTW' i And X' j Obtaining the optimal deformation path
Figure FDA0003870812500000011
All elements of the optimal deformation path constitute elements of the interpolated time series, respectively: x' i ={p 1 (1),p 2 (1),…,p k (1) And X " j ={p 1 (2),p 2 (2),…,p k (2) All normalized time series should be extended to time series reflecting the corresponding asynchronous correlations; obtain a new interpolated time series, i.e., X = X "; x over 2 new time-series scatter plots, set D " i Dividing by X Y, and dividing the elements by X " i Value division into X' i In each grid, dividing the grid into Y grids according to Y values; calculating a frequency distribution D | G of the set D of points falling on a given grid G, the upper boundary B (n) of the grid G being chosen appropriately, B (n) being a function of the sample size, the constraint representing the total number B of squares of the grid G being less than B (n), B (n) = n 0.6 (ii) a Calculating different grids G to determine different probability distributions, namely I (D | G), and representing mutual information of the point sets based on the distribution D | G; based on X' i Finding the largest mutual information maxI (D | G) in all the possible mutual information of the distribution D | G of the XY grid G and recording the maxI (D | G) as I '(D, m, n) to obtain a feature matrix I' (D) of the two-dimensional data set D;
and effectively measuring the correlation between the forecast factor and the actual precipitation sequence through the maximum information coefficient, screening out a factor with strong correlation with the actual precipitation, recording the factor as a high information quantity factor, and removing the factor containing more overlapped information from the high information quantity factor.
4. The method for predicting the precipitation interval based on the time convolution network is characterized in that in the step (4), the TCN time convolution network model is selected to construct the precipitation interval prediction model based on the time convolution network, and the implementation process comprises the following steps:
in the case of a univariate sequence, the complete dilation causal convolution operation f on successive layers for a given one-dimensional sequence of precipitation inputs X and a filter function ω of size k: {0, \8230;, k-1}, the definition on the sequence s is expressed as follows:
Figure FDA0003870812500000021
where s is an element of the sequence and d is an expansion parameter, according to the network depth d =2 i Exponential increase, d =2 i I-level for the network; and s-d · i describes the past direction; d is referred to as the dilated convolution operation to distinguish the normal convolution operation.
5. The method for predicting the precipitation interval based on the time convolutional network of claim 1, wherein the step (4) of introducing the LUBE interval prediction method to output the interval prediction value is implemented as follows:
the evaluation index of the LUBE interval prediction method comprises the evaluation of PICP from the probability that the actual observation value is between the upper bound and the lower bound of the prediction interval; PINAW is evaluated from the width between the upper and lower bounds of the prediction interval:
Figure FDA0003870812500000022
Figure FDA0003870812500000023
wherein, U (x) k ) For the upper boundary value of the interval prediction, L (x) k ) A lower boundary value for interval prediction, n is the number of samples of the measured value, a is the range of the target variable, i.e. the difference between the maximum and minimum values; the CWC measuring the width and coverage probability can be defined as:
Figure FDA0003870812500000031
adding penalty parameters, considering width, covering probability and average deviation indexes are defined as:
Figure FDA0003870812500000032
wherein the parameter tau linearly amplifies the PINAW, the parameter
Figure FDA0003870812500000033
Exponentially amplifying PICPs and e -η(PICP-μ) The difference between if PCWC is too small, then the epsilon and tau hyper-parameters are used to avoid vanishing.
CN202211197406.4A 2022-09-29 2022-09-29 Rainfall interval prediction method based on time convolution network Pending CN115495991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211197406.4A CN115495991A (en) 2022-09-29 2022-09-29 Rainfall interval prediction method based on time convolution network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211197406.4A CN115495991A (en) 2022-09-29 2022-09-29 Rainfall interval prediction method based on time convolution network

Publications (1)

Publication Number Publication Date
CN115495991A true CN115495991A (en) 2022-12-20

Family

ID=84472247

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211197406.4A Pending CN115495991A (en) 2022-09-29 2022-09-29 Rainfall interval prediction method based on time convolution network

Country Status (1)

Country Link
CN (1) CN115495991A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116579509A (en) * 2023-07-14 2023-08-11 天津福天科技有限公司 Photovoltaic power generation prediction method based on virtual reality
CN117093873A (en) * 2023-10-19 2023-11-21 国网浙江省电力有限公司丽水供电公司 Hydropower station storage capacity assessment method and system based on natural water inflow prediction
CN117273200A (en) * 2023-08-31 2023-12-22 淮阴工学院 Runoff interval forecasting method based on convolution optimization algorithm and Pyraformer neural network
CN117574960A (en) * 2023-12-14 2024-02-20 江苏省气象台 Multi-factor plum rainfall prediction method based on self-adaptive graph structure and reinforcement integration
CN117633449A (en) * 2024-01-25 2024-03-01 南京信息工程大学 DE-DOA improved RRDBNet precipitation data downscaling method based on Spark-Cassandra framework
CN118094127A (en) * 2024-04-29 2024-05-28 南京信息工程大学 Precipitation prediction method based on high altitude data and GM gray iteration

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116579509A (en) * 2023-07-14 2023-08-11 天津福天科技有限公司 Photovoltaic power generation prediction method based on virtual reality
CN117273200A (en) * 2023-08-31 2023-12-22 淮阴工学院 Runoff interval forecasting method based on convolution optimization algorithm and Pyraformer neural network
CN117093873A (en) * 2023-10-19 2023-11-21 国网浙江省电力有限公司丽水供电公司 Hydropower station storage capacity assessment method and system based on natural water inflow prediction
CN117093873B (en) * 2023-10-19 2024-01-30 国网浙江省电力有限公司丽水供电公司 Hydropower station storage capacity assessment method and system based on natural water inflow prediction
CN117574960A (en) * 2023-12-14 2024-02-20 江苏省气象台 Multi-factor plum rainfall prediction method based on self-adaptive graph structure and reinforcement integration
CN117633449A (en) * 2024-01-25 2024-03-01 南京信息工程大学 DE-DOA improved RRDBNet precipitation data downscaling method based on Spark-Cassandra framework
CN117633449B (en) * 2024-01-25 2024-04-12 南京信息工程大学 DE-DOA improved RRDBNet precipitation data downscaling method based on Spark-Cassandra framework
CN118094127A (en) * 2024-04-29 2024-05-28 南京信息工程大学 Precipitation prediction method based on high altitude data and GM gray iteration
CN118094127B (en) * 2024-04-29 2024-07-02 南京信息工程大学 Precipitation prediction method based on high altitude data and GM gray iteration

Similar Documents

Publication Publication Date Title
CN110084367B (en) Soil moisture content prediction method based on LSTM deep learning model
CN115495991A (en) Rainfall interval prediction method based on time convolution network
CN110648014B (en) Regional wind power prediction method and system based on space-time quantile regression
CN112712209B (en) Reservoir warehousing flow prediction method and device, computer equipment and storage medium
CN111665575B (en) Medium-and-long-term rainfall grading coupling forecasting method and system based on statistical power
CN115423163A (en) Method and device for predicting short-term flood events of drainage basin and terminal equipment
CN116128141B (en) Storm surge prediction method and device, storage medium and electronic equipment
CN112396152A (en) Flood forecasting method based on CS-LSTM
CN112766454A (en) Air PM2.5 concentration prediction method, system, terminal and storage medium
CN117909888B (en) Intelligent artificial intelligence climate prediction method
CN114357737B (en) Agent optimization calibration method for time-varying parameters of large-scale hydrologic model
CN115456245A (en) Prediction method for dissolved oxygen in tidal river network area
CN114692981A (en) Medium-and-long-term runoff forecasting method and system based on Seq2Seq model
CN117520784A (en) Groundwater level multi-step prediction method based on convolution attention long-short-term neural network
CN112215495B (en) Pollution source contribution calculation method based on long-time and short-time memory neural network
CN116779172A (en) Lung cancer disease burden risk early warning method based on ensemble learning
CN116306832A (en) Multimode generation antagonistic neural network modeling method and device for multidimensional sequence data
CN116720080A (en) Homologous meteorological element fusion inspection method
CN115860165A (en) Neural network basin rainfall runoff forecasting method and system considering initial loss
CN110009132A (en) A kind of short-term electric load fining prediction technique based on LSTM deep neural network
CN113487069B (en) Regional flood disaster risk assessment method based on GRACE daily degradation scale and novel DWSDI index
CN113255207A (en) Iterative multi-output-Markov chain-based multi-step prediction method for argon fraction variable of air separation system
CN117784290B (en) Sudden drought early warning method and system based on Bayesian neural network
CN118010103B (en) Intelligent monitoring method and system for equal-thickness cement soil stirring wall in severe cold environment
CN117933946B (en) Rural business management method based on big data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination