CN112232577B - Power load probability prediction system and method for multi-core smart meter

Power load probability prediction system and method for multi-core smart meter

Info

Publication number
CN112232577B
CN112232577B (application CN202011147997.5A)
Authority
CN
China
Prior art keywords
mixed
density function
probability density
data
probability
Prior art date
Legal status: Active
Application number
CN202011147997.5A
Other languages
Chinese (zh)
Other versions
CN112232577A (en)
Inventor
吴晓政
姚诚
周立
毛子春
周念成
王强钢
Current Assignee
Zhejiang Bada Electronic Instruments Co ltd
Chongqing University
Original Assignee
Zhejiang Bada Electronic Instruments Co ltd
Chongqing University
Priority date
Filing date
Publication date
Application filed by Zhejiang Bada Electronic Instruments Co ltd and Chongqing University
Priority to CN202011147997.5A
Publication of CN112232577A
Application granted
Publication of CN112232577B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q 10/00 Administration; Management
    • G06Q 10/04 Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/044 Recurrent networks, e.g. Hopfield networks
    • G06N 3/045 Combinations of networks
    • G06N 3/048 Activation functions
    • G06N 3/08 Learning methods
    • G06Q 50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q 50/06 Energy or water supply
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04 INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04S SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S 10/00 Systems supporting electrical power generation, transmission or distribution
    • Y04S 10/50 Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Primary Health Care (AREA)
  • Water Supply & Treatment (AREA)
  • Public Health (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a power load probability prediction system and method for a multi-core smart meter. A sample set is constructed with a convolutional neural network CNN and a gated recurrent unit (GRU) recurrent neural network; the mixture density network is trained with the training set to optimize the network parameters; the trained mixture density network is verified with a verification set; and the verified mixture density network is used for power load probability prediction. The mixture density network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence. The mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture density function from them, and takes the mixture density function as the probability density distribution of the electrical load. The invention has high training efficiency, learns the uncertainty of the samples well, can cope with highly fluctuating time series, and improves prediction accuracy.

Description

Power load probability prediction system and method for multi-core smart meter
Technical Field
The invention relates to the field of power load prediction, in particular to a method for predicting a power load probability density function.
Background
Against the background of the construction of the electric power Internet of Things, the power system can collect, record and store massive load data through the wide deployment of smart meters. A multi-core smart meter is a smart meter that can realize non-intrusive load identification and has edge computing capability, so load prediction methods based on data processing and analysis benefit from the richer data foundation such meters provide. Load prediction accuracy is affected by many factors such as seasonal dynamics and regional power generation capacity and therefore carries a certain degree of uncertainty. How to build an efficient and accurate load prediction model from the massive data collected by multi-core smart meters, and how to describe the uncertainty in load prediction, have become research hotspots in recent years.
Load prediction methods can be broadly divided into time series prediction methods, regression analysis prediction methods and artificial intelligence prediction methods. Because the factors influencing load prediction are numerous, bursty and uncertain, time series and regression analysis methods achieve relatively low prediction accuracy and robustness. Compared with these methods, artificial intelligence methods represented by neural network and support vector machine prediction perform better and have developed rapidly in recent years.
At present, in order to improve the accuracy of the prediction result, several prediction models are often used to predict simultaneously, and their results are then combined (for example by statistical analysis) to obtain the final prediction. Such a structure is complicated: multiple prediction models are trained in parallel, a large amount of training resources are consumed, and the training efficiency is low.
Disclosure of Invention
Aiming at the above shortcomings of the prior art, the invention provides a power load probability prediction method for a multi-core smart meter, which solves the technical problem that, in the prior art, the final prediction result has to be constructed from the prediction results of several models in order to improve accuracy.
To solve the above technical problem, the invention provides a power load probability prediction method for a multi-core smart meter, which comprises the following steps:
Constructing a sample set: acquiring historical load data under different environmental factors, and constructing a sample set from the historical load data and the environmental factor data; the sample set is divided into a training set and a verification set.
Constructing a mixture density network: the network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence.
The fully connected layer outputs the processed input data to the mixture parameter output layer; the fully connected layer comprises 3 groups of output neurons, and each group comprises N output neurons.
The mixture parameter output layer comprises three parallel mixture parameter neurons; each mixture parameter neuron receives the N output data of one group of output neurons of the fully connected layer and fits N corresponding mixture parameters from them. The three parallel mixture parameter neurons are a mixing weight neuron, a probability density function variance neuron and a probability density function mean neuron.
The mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture probability density function from them, and takes the mixture probability density function as the probability density distribution of the electrical load.
The mixture density network is trained with the training set to optimize the network parameters, and the trained network is verified with the verification set. Historical load data and the corresponding environmental factor data are processed into the same data structure as the sample data and then input into the verified mixture density network for power load probability prediction.
Further, a softmax activation function is used to fit the mixing weights. Let the n-th input datum of the mixing weight neuron be z_n; then softmax(z_n) is assigned to the n-th mixing weight output by the mixing weight neuron, and is calculated as follows:
softmax(z_n) = exp(z_n) / Σ_{j=1}^{N} exp(z_j)
where z_j denotes the j-th input datum of the mixing weight neuron, j = {1, 2, ..., N}, n = {1, 2, ..., N}.
Further, an ELU activation function is used to fit the probability density function variances and the probability density function means. Let the n-th input datum of the probability density function variance neuron be a′_n and the n-th input datum of the probability density function mean neuron be a″_n; then f_ELU(a′_n) is assigned to the n-th probability density function variance output by the probability density function variance neuron, and f_ELU(a″_n) is assigned to the n-th probability density function mean output by the probability density function mean neuron. f_ELU(a′_n) and f_ELU(a″_n) are calculated as follows:
f_ELU(a) = a if a > 0, and f_ELU(a) = e^a - 1 if a ≤ 0
where a represents the input datum of the ELU activation function.
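Purely as an illustration, a minimal NumPy sketch of how these two activations could map the raw fully connected outputs to the N mixture parameters; the function and variable names are mine, and the +1 shift that keeps variances positive is an added assumption, not something stated in the patent:

```python
import numpy as np

def softmax(z):
    """Mixing weights: exponentiate and normalise the N raw outputs."""
    e = np.exp(z - z.max())          # subtract max for numerical stability
    return e / e.sum()

def elu(a, alpha=1.0):
    """Standard ELU: identity for a > 0, alpha*(exp(a) - 1) otherwise."""
    return np.where(a > 0, a, alpha * (np.exp(a) - 1.0))

# Raw outputs of the three groups of fully connected neurons (N = 3 here, toy values).
z  = np.array([0.2, 1.1, -0.5])      # feeds the mixing weight neuron
a1 = np.array([0.3, -0.2, 0.8])      # feeds the variance neuron
a2 = np.array([1.5, 0.7, -0.1])      # feeds the mean neuron

weights = softmax(z)                  # mixing weights, sum to 1
sigmas  = elu(a1) + 1.0 + 1e-6        # assumption: shift keeps variances strictly positive
mus     = elu(a2)                     # component means

print(weights, sigmas, mus)
```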
Further, the mixture probability density function is constructed as a weighted sum of several Gaussian probability distributions:
p(y_t | x) = Σ_{n=1}^{N} α_n(x, t) · N(y_t; μ_n(x, t), σ_n(x, t))
where p(y_t | x) represents the probability distribution of the power load y_t at time t given the input data x; α_n(x, t) represents the weight of the n-th Gaussian distribution at time t, i.e. the n-th mixing weight output by the mixing weight neuron; σ_n(x, t) represents the variance of the n-th Gaussian distribution at time t, i.e. the n-th probability density function variance output by the probability density function variance neuron; μ_n(x, t) represents the mean of the n-th Gaussian distribution at time t, i.e. the n-th probability density function mean output by the probability density function mean neuron; n = {1, 2, ..., N}; and N(y_t; μ, σ) denotes the Gaussian probability density function with mean μ and variance σ.
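As an illustration only, a NumPy sketch of evaluating such a weighted Gaussian mixture at a candidate load value; the parameter values and the kW unit are placeholders, not taken from the patent, and the sigmas are treated here as component standard deviations:

```python
import numpy as np

def mixture_pdf(y, weights, mus, sigmas):
    """Evaluate the weighted Gaussian mixture density p(y | x) at the load value y."""
    gauss = np.exp(-(y - mus) ** 2 / (2.0 * sigmas ** 2)) / (sigmas * np.sqrt(2.0 * np.pi))
    return float(np.sum(weights * gauss))

# Toy parameters for N = 3 components (placeholder values).
weights = np.array([0.5, 0.3, 0.2])
mus     = np.array([10.0, 12.0, 15.0])   # assumed unit: kW
sigmas  = np.array([1.0, 2.0, 1.5])

# Density of the load taking the value 11 kW at time t.
print(mixture_pdf(11.0, weights, mus, sigmas))
```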
Further, the fast gradient sign method (FGSM) is used to generate an adversarial training sample set, adversarial training is performed on the mixture density network together with the training set, and the network parameters are optimized by minimizing a loss function.
The adversarial sample is calculated as follows:
x″ = x′ + ε·sign(-∇_x log p(y_t | x′))
where x′ represents a sample in the training set; x″ represents the adversarial sample; ε represents the perturbation added during training; and p(y_t | x′) represents the probability distribution of the power load y_t at time t under the sample x′.
The loss function f_loss is calculated as follows:
f_loss = -log p(y_t | x′) - λ·log p(y_t | x″) + γ·Σ_{k=1}^{K} θ_k²
where λ represents the weight of the adversarial sample; θ_k represents a learning parameter of the mixture parameter output layer and K the number of learning parameters; γ represents a regularization hyperparameter; and p(y_t | x″) represents the probability distribution of the power load y_t at time t under the adversarial sample x″.
Further, the sample set is constructed with the following method:
First, a convolutional neural network CNN is used to extract spatial features from the environmental factor data, where the spatial features include the seasonal features, weather features and regional power generation capacity features that can influence load variation.
Then, the historical load data and the spatial features are assembled into feature vectors, which are passed to the recurrent neural network GRU; the recurrent neural network GRU extracts the temporal features of the feature vectors according to the short-term and long-term dependencies between the feature vectors at adjacent time steps, and the sample data are constructed on the basis of these feature vectors; the temporal feature refers to the amount of past information that the sample at each time step needs to retain, as illustrated by the sketch below.
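Purely as an illustration of this data flow (array shapes, sizes and the stand-in projection are assumptions, not taken from the patent), a NumPy sketch of assembling per-time-step feature vectors from extracted spatial features and the historical load before they are fed to the GRU:

```python
import numpy as np

T, F_env, F_spatial = 96, 8, 16          # time steps, raw environmental factors, CNN features (assumed)

env_factors  = np.random.rand(T, F_env)  # environmental factor data (placeholder)
history_load = np.random.rand(T, 1)      # historical load series (placeholder)

def cnn_spatial_features(env):
    """Stand-in for the CNN feature extractor: a fixed random projection with a ReLU-like clip."""
    rng = np.random.default_rng(0)
    W = rng.standard_normal((env.shape[1], F_spatial))
    return np.maximum(env @ W, 0.0)

spatial = cnn_spatial_features(env_factors)                          # (T, F_spatial)
feature_vectors = np.concatenate([history_load, spatial], axis=1)    # (T, 1 + F_spatial)

# The GRU then consumes this sequence of feature vectors time step by time step.
print(feature_vectors.shape)
```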
The invention also provides a power load probability prediction system for a multi-core smart meter, which comprises a convolutional neural network CNN, a recurrent neural network GRU and a mixture density network.
The convolutional neural network extracts spatial features from the historical environmental factor data, assembles the spatial features and the corresponding historical load data into feature vectors, and sends the feature vectors to the recurrent neural network GRU; the spatial features include the seasonal features, weather features and regional power generation features that can influence load variation.
The recurrent neural network GRU extracts the temporal features of the feature vectors according to the short-term and long-term dependencies between feature vectors at adjacent time steps, and constructs the sample data on the basis of these feature vectors; the temporal feature refers to the amount of past information that the sample at each time step needs to retain.
The mixture density network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence.
The fully connected layer outputs the processed input data to the mixture parameter output layer; the fully connected layer comprises 3 groups of output neurons, and each group comprises N output neurons.
The mixture parameter output layer comprises three parallel mixture parameter neurons; each mixture parameter neuron receives the N output data of one group of output neurons of the fully connected layer and fits N corresponding mixture parameters from them; the three parallel mixture parameter neurons are a mixing weight neuron, a probability density function variance neuron and a probability density function mean neuron.
The mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture probability density function from them, and takes the mixture probability density function as the probability density distribution of the electrical load.
Further, the convolutional neural network CNN comprises a convolution layer and a pooling layer; the convolutional neural network CNN is connected with the recurrent neural network GRU through a flatten layer, and the flatten layer reduces the dimension of the spatial features, combines them with the historical load data into feature vectors, and outputs the feature vectors to the recurrent neural network GRU; the recurrent neural network GRU comprises two cascaded GRU layers corresponding to an update gate and a reset gate respectively; the mixture density network has 4 levels of fully connected layers, forming a deep mixture density network.
Further, the convolutional neural network CNN uses ReLU as the activation function; the pooling layer discards 20% of the data (dropout) before outputting to the flatten layer; the two GRU layers of the recurrent neural network GRU each discard 50% of the data before outputting; and the first two of the 4 fully connected layers each discard 25% of the data before outputting. A sketch of this architecture is given below.
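To make the layer stack concrete, a minimal PyTorch sketch of one way such a CNN-GRU-mixture-density model could be wired up; all layer sizes, kernel widths and the shift that keeps variances positive are my assumptions, and only the layer types, activations and dropout rates follow the description above:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CnnGruMdn(nn.Module):
    def __init__(self, n_features=17, n_components=3, hidden=64):
        super().__init__()
        # CNN part: convolution + pooling, ReLU activation, 20% dropout after pooling.
        self.conv = nn.Conv1d(n_features, 32, kernel_size=3, padding=1)
        self.pool = nn.MaxPool1d(2)
        self.drop_cnn = nn.Dropout(0.2)
        # Two cascaded GRU layers, each followed by 50% dropout.
        self.gru = nn.GRU(32, hidden, num_layers=2, batch_first=True, dropout=0.5)
        self.drop_gru = nn.Dropout(0.5)
        # Four fully connected layers; 25% dropout after the first two.
        self.fc1 = nn.Linear(hidden, 128)
        self.fc2 = nn.Linear(128, 128)
        self.fc3 = nn.Linear(128, 64)
        self.fc4 = nn.Linear(64, 3 * n_components)   # 3 groups of N output neurons
        self.n = n_components

    def forward(self, x):                             # x: (batch, time, n_features)
        h = F.relu(self.conv(x.transpose(1, 2)))      # (batch, 32, time)
        h = self.drop_cnn(self.pool(h)).transpose(1, 2)
        h, _ = self.gru(h)
        h = self.drop_gru(h[:, -1, :])                # last time step
        h = F.dropout(F.relu(self.fc1(h)), 0.25, self.training)
        h = F.dropout(F.relu(self.fc2(h)), 0.25, self.training)
        h = F.relu(self.fc3(h))
        z, a1, a2 = self.fc4(h).chunk(3, dim=-1)
        weights = F.softmax(z, dim=-1)                # mixing weights
        sigmas = F.elu(a1) + 1.0 + 1e-6               # assumed shift to keep variances positive
        mus = F.elu(a2)                               # component means
        return weights, sigmas, mus

model = CnnGruMdn()
w, s, m = model(torch.randn(4, 96, 17))               # 4 samples, 96 time steps, 17 features
print(w.shape, s.shape, m.shape)
```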
Compared with the prior art, the invention has the following advantages:
1. The invention trains only a single mixture density network on the samples, avoiding the need of the prior art to train several prediction models at the same time, which greatly improves training efficiency. Meanwhile, because the mixture probability density function is obtained by weighting several probability density distributions, the uncertainty of the samples can be learned well, highly fluctuating time series can be handled, and the requirements of load probability prediction are met.
2. The load probability prediction method uses the CNN to convolve and pool the data and extracts spatial-dimension feature vectors from the massive data by spatial down-sampling, which improves the model's ability to learn from a large amount of load data.
3. The load probability prediction method uses the GRU to mine the long-term and short-term dependence characteristics of the sample data and to extract the correlation characteristics between time series values; compared with traditional recurrent neural network (RNN) models such as long short-term memory (LSTM), it has fewer parameters and trains faster.
4. The load probability prediction method trains the deep mixture density network with adversarial training, and adversarial samples can be generated quickly with FGSM. Neural networks are usually trained with a maximum-likelihood loss function, but this can cause the mixture density network to overfit; adversarial training not only acts as regularization to prevent overfitting, but also reduces computational complexity and improves the model's resistance to perturbations, thereby improving its robustness.
5. In the load probability prediction method, the network is verified with the verification set data during deep mixture density network training, so that the network performance can be verified and optimized, the final short-term deviation is partially compensated, and the training set data size is effectively expanded.
Drawings
FIG. 1 is a block diagram of the electrical load probability prediction system in this embodiment;
FIG. 2 is a schematic diagram of the time series cross-validation method used in the mixture density network training process.
Detailed Description
Referring to FIG. 1, a power load probability prediction system for a multi-core smart meter comprises a convolutional neural network CNN, a recurrent neural network GRU and a mixture density network. The convolutional neural network CNN comprises a convolution layer and a pooling layer and uses ReLU as the activation function; the pooling layer discards 20% of the data (dropout) before outputting to the flatten layer, which prevents overfitting during training; the CNN is connected with the recurrent neural network GRU through the flatten layer, which reduces the dimension of the spatial features, combines them with the historical load data into feature vectors, and outputs the feature vectors to the recurrent neural network GRU.
The convolutional neural network extracts spatial features from the historical environmental factor data, assembles the spatial features and the corresponding historical load data into feature vectors, and sends the feature vectors to the recurrent neural network GRU; the spatial features include the seasonal features, weather features and regional power generation features that can affect load fluctuations.
The recurrent neural network GRU extracts the temporal features of the feature vectors according to the short-term and long-term dependencies between feature vectors at adjacent time steps, and constructs the sample data on the basis of these feature vectors; the temporal feature refers to the amount of past information that the sample at each time step needs to retain.
The recurrent neural network GRU consists of two GRU layers, each of which discards 50% of the data (dropout) to prevent overfitting. The update gate u(t) and reset gate r(t) are computed as follows:
u(t) = f[ω_u x(t) + R_u h(t-1) + b_u]
r(t) = f[ω_r x(t) + R_r h(t-1) + b_r]
where x(t) is the input vector at the current time, u(t) and r(t) are the states of the update gate and the reset gate respectively, ω_u and ω_r are the weights of the GRU update gate and reset gate, R_u and R_r are the recurrent parameters of the update gate and reset gate, b_u and b_r are the biases of the update gate and reset gate, h(t) and h(t-1) are the state memory variables at the current time and the previous time respectively, and h̃(t) is the candidate state at the current time. The update gate determines how much of the state variable at the previous time is carried into the current state, and the reset gate controls how much information from the previous time is written into the current candidate state. The previous-time information is retained through the (1 - u(t)) term, the current candidate state information is retained through the u(t) term, and the two are added to obtain the output state vector:
h(t) = (1 - u(t)) ⊙ h(t-1) + u(t) ⊙ h̃(t)
In this way the temporal features hidden in the data are introduced into the prediction model.
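A small NumPy sketch of one GRU step following these equations; the sigmoid/tanh choices for the gate nonlinearity and the candidate-state formula are the standard GRU conventions and are assumed here where the text leaves them implicit:

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

def gru_step(x_t, h_prev, W_u, R_u, b_u, W_r, R_r, b_r, W_h, R_h, b_h):
    """One GRU time step: update gate, reset gate, candidate state, new state."""
    u = sigmoid(W_u @ x_t + R_u @ h_prev + b_u)              # update gate u(t)
    r = sigmoid(W_r @ x_t + R_r @ h_prev + b_r)              # reset gate r(t)
    h_cand = np.tanh(W_h @ x_t + R_h @ (r * h_prev) + b_h)   # candidate state (standard form, assumed)
    return (1.0 - u) * h_prev + u * h_cand                   # weighting convention as described above

# Toy dimensions: 4-dimensional input, 3-dimensional hidden state.
rng = np.random.default_rng(1)
dims = lambda *s: rng.standard_normal(s)
h = gru_step(dims(4), np.zeros(3),
             dims(3, 4), dims(3, 3), np.zeros(3),
             dims(3, 4), dims(3, 3), np.zeros(3),
             dims(3, 4), dims(3, 3), np.zeros(3))
print(h)
```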
The mixture density network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence; the fully connected layer outputs the processed input data to the mixture parameter output layer; the fully connected layer comprises 3 groups of output neurons, and each group comprises N output neurons.
The mixture density network has 4 levels of fully connected layers, forming a deep mixture density network. The first two of the 4 fully connected layers each discard 25% of the data (dropout) before outputting, to prevent overfitting.
The mixture parameter output layer comprises three parallel mixture parameter neurons; each mixture parameter neuron receives the N output data of one group of output neurons of the fully connected layer and fits N corresponding mixture parameters from them; the three parallel mixture parameter neurons are a mixing weight neuron, a probability density function variance neuron and a probability density function mean neuron.
A softmax activation function is used to fit the mixing weights. Let the n-th input datum of the mixing weight neuron be z_n; then softmax(z_n) is assigned to the n-th mixing weight output by the mixing weight neuron, and is calculated as follows:
softmax(z_n) = exp(z_n) / Σ_{j=1}^{N} exp(z_j)
where z_j denotes the j-th input datum of the mixing weight neuron, j = {1, 2, ..., N}, n = {1, 2, ..., N}.
An ELU activation function is used to fit the probability density function variances and the probability density function means. Let the n-th input datum of the probability density function variance neuron be a′_n and the n-th input datum of the probability density function mean neuron be a″_n; then f_ELU(a′_n) is assigned to the n-th probability density function variance output by the probability density function variance neuron, and f_ELU(a″_n) is assigned to the n-th probability density function mean output by the probability density function mean neuron. f_ELU(a′_n) and f_ELU(a″_n) are calculated as follows:
f_ELU(a) = a if a > 0, and f_ELU(a) = e^a - 1 if a ≤ 0
where a represents the input datum of the ELU activation function.
The mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture probability density function from them, and takes the mixture probability density function as the probability density distribution of the electrical load.
The mixture probability density function is constructed as a weighted sum of several Gaussian probability distributions:
p(y_t | x) = Σ_{n=1}^{N} α_n(x, t) · N(y_t; μ_n(x, t), σ_n(x, t))
where p(y_t | x) represents the probability distribution of the power load y_t at time t given the input data x; α_n(x, t) represents the weight of the n-th Gaussian distribution at time t, i.e. the n-th mixing weight output by the mixing weight neuron; σ_n(x, t) represents the variance of the n-th Gaussian distribution at time t, i.e. the n-th probability density function variance output by the probability density function variance neuron; μ_n(x, t) represents the mean of the n-th Gaussian distribution at time t, i.e. the n-th probability density function mean output by the probability density function mean neuron; n = {1, 2, ..., N}; and N(y_t; μ, σ) denotes the Gaussian probability density function with mean μ and variance σ.
In this embodiment, the convolutional neural network CNN and the recurrent neural network GRU of the electrical load probability prediction system are used to construct the samples, and the mixture density network is then trained. After training is completed, historical load data and the corresponding environmental factor data are input; the convolutional neural network CNN and the recurrent neural network GRU process them into the same data structure as the sample data, which is then input into the verified mixture density network for electrical load probability prediction. For example, to predict the electrical load probability at a future time t, the historical load data before time t and the corresponding environmental factor data are input, and the electrical load probability distribution at the future time t is obtained; the probability of each possible load value at time t can then be calculated from this distribution. Furthermore, the expectation of the probability distribution can be taken as a deterministic point prediction of the electrical load, as in the sketch below.
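For instance, a short NumPy calculation of the mixture expectation used as a point forecast; the component values are placeholders, not from the patent:

```python
import numpy as np

# Mixture parameters returned by the network for a future time t (placeholder values).
weights = np.array([0.5, 0.3, 0.2])
mus     = np.array([10.0, 12.0, 15.0])

# Expectation of a Gaussian mixture: the weighted sum of the component means.
point_forecast = float(np.sum(weights * mus))
print(point_forecast)   # 11.6
```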
In this embodiment, the mixture density network is trained with an adversarial training method: the fast gradient sign method (FGSM) is used to generate an adversarial training sample set, adversarial training is performed on the mixture density network together with the training set, and the network parameters are optimized by minimizing a loss function.
The adversarial sample is calculated as follows:
x″ = x′ + ε·sign(-∇_x log p(y_t | x′))
where x′ represents a sample in the training set; x″ represents the adversarial sample; ε represents the perturbation added during training; and p(y_t | x′) represents the probability distribution of the power load y_t at time t under the sample x′.
The loss function f_loss is calculated as follows:
f_loss = -log p(y_t | x′) - λ·log p(y_t | x″) + γ·Σ_{k=1}^{K} θ_k²
where λ represents the weight of the adversarial sample; θ_k represents a learning parameter of the mixture parameter output layer and K the number of learning parameters; γ represents a regularization hyperparameter; and p(y_t | x″) represents the probability distribution of the power load y_t at time t under the adversarial sample x″.
The adversarial samples are trained together with the original data, and the loss generated by the adversarial samples is taken as part of the original loss; that is, the loss of the model is increased without modifying the original model structure, which produces a regularization effect. The third term on the right-hand side of the loss function is the L2 regularization of the learning parameters; it is added to the loss function to keep the training of the mixture network consistent. A sketch of this adversarial training step is given below.
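A hedged PyTorch sketch of one FGSM-style adversarial training step for such a model; it assumes the CnnGruMdn sketch above (including its fc4 output layer), treats the returned sigmas as standard deviations, and the values of eps, lam and gamma are illustrative, not from the patent:

```python
import torch

def adversarial_training_step(model, x, y_t, eps=0.01, lam=0.5, gamma=1e-4):
    """One FGSM adversarial training step for a mixture density model.

    model(x) is assumed to return (weights, sigmas, mus); all hyperparameter
    values here are placeholders."""
    def nll(inp):
        w, s, m = model(inp)
        comp = torch.exp(-(y_t.unsqueeze(-1) - m) ** 2 / (2 * s ** 2)) / (s * (2 * torch.pi) ** 0.5)
        return -torch.log((w * comp).sum(dim=-1) + 1e-12).mean()

    # FGSM: perturb the input in the direction that increases the negative log-likelihood.
    x_req = x.clone().detach().requires_grad_(True)
    grad = torch.autograd.grad(nll(x_req), x_req)[0]
    x_adv = (x + eps * grad.sign()).detach()

    # Combined loss: clean NLL + weighted adversarial NLL + L2 penalty on the output-layer parameters.
    l2 = sum((p ** 2).sum() for p in model.fc4.parameters())
    return nll(x) + lam * nll(x_adv) + gamma * l2

# Usage with the CnnGruMdn sketch above (shapes are assumptions):
# loss = adversarial_training_step(model, torch.randn(4, 96, 17), torch.randn(4)); loss.backward()
```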
Referring to FIG. 2, a time series cross-validation method is used during training of the deep mixture density network to verify with the verification set data. The network parameters are first learned from the training set data and verified on the first validation fold. The fold used for verification is then merged into the training set, the network is retrained, and the next fold is used for verification. This process continues until the last fold of the verification set. Besides performing verification, time series cross-validation also expands the effective size of the training set; in addition, the network is verified on data from a time period close to the data that will be available at prediction time, which partially compensates for the final short-term deviation. A sketch of the expanding-window folds is given below.
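A plain-Python sketch of generating such expanding-window folds over a load time series; the fold sizes are arbitrary illustrative choices:

```python
def expanding_window_folds(n_samples, n_folds=5, initial_train=100):
    """Yield (train_indices, validation_indices) pairs: each fold's validation
    block is appended to the training set for the next fold."""
    fold_size = (n_samples - initial_train) // n_folds
    for k in range(n_folds):
        train_end = initial_train + k * fold_size
        val_end = min(train_end + fold_size, n_samples)
        yield list(range(0, train_end)), list(range(train_end, val_end))

for train_idx, val_idx in expanding_window_folds(600):
    print(len(train_idx), "training samples ->", len(val_idx), "validation samples")
```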

Claims (7)

1. A method for predicting probability of power load for a multi-core smart meter, comprising the steps of:
constructing a sample set: acquiring historical load data under different environmental factors, and constructing a sample set from the historical load data and the environmental factor data; the sample set is divided into a training set and a verification set; the sample set is constructed with the following method: first, a convolutional neural network CNN is used to extract spatial features from the environmental factor data, where the spatial features include the seasonal features, weather features and regional power generation capacity features that can influence load variation; then, the historical load data and the spatial features are assembled into feature vectors, which are passed to the recurrent neural network GRU; the recurrent neural network GRU extracts the temporal features of the feature vectors according to the short-term and long-term dependencies between the feature vectors at adjacent time steps, and the sample data are constructed on the basis of these feature vectors; the temporal feature refers to the amount of past information that the sample at each time step needs to retain;
the convolutional neural network CNN comprises a convolution layer and a pooling layer; the convolutional neural network CNN is connected with the recurrent neural network GRU through a flatten layer, and the flatten layer reduces the dimension of the spatial features, combines them with the historical load data into feature vectors, and outputs the feature vectors to the recurrent neural network GRU; the recurrent neural network GRU comprises two cascaded GRU layers corresponding to an update gate and a reset gate respectively; the mixture density network has 4 levels of fully connected layers, thereby forming a deep mixture density network;
constructing a mixture density network: the network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence;
the fully connected layer outputs the processed input data to the mixture parameter output layer; the fully connected layer comprises 3 groups of output neurons, and each group comprises N output neurons;
the mixture parameter output layer comprises three parallel mixture parameter neurons; each mixture parameter neuron receives the N output data of one group of output neurons of the fully connected layer and fits N corresponding mixture parameters from them; the three parallel mixture parameter neurons are a mixing weight neuron, a probability density function variance neuron and a probability density function mean neuron;
the mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture probability density function from them, and takes the mixture probability density function as the probability density distribution of the electrical load; the mixture probability density function is constructed as a weighted sum of several Gaussian probability distributions:
p(y_t | x) = Σ_{n=1}^{N} α_n(x, t) · N(y_t; μ_n(x, t), σ_n(x, t))
where p(y_t | x) represents the probability distribution of the power load y_t at time t given the input data x; α_n(x, t) represents the weight of the n-th Gaussian distribution at time t, i.e. the n-th mixing weight output by the mixing weight neuron; σ_n(x, t) represents the variance of the n-th Gaussian distribution at time t, i.e. the n-th probability density function variance output by the probability density function variance neuron; μ_n(x, t) represents the mean of the n-th Gaussian distribution at time t, i.e. the n-th probability density function mean output by the probability density function mean neuron; n = {1, 2, ..., N}; and N(y_t; μ, σ) denotes the Gaussian probability density function with mean μ and variance σ;
training the mixture density network with the training set to optimize the network parameters; verifying the trained mixture density network with the verification set; and, after the historical load data and the corresponding environmental factor data are processed into the same data structure as the sample data, inputting them into the verified mixture density network for power load probability prediction.
2. The power load probability prediction method for a multi-core smart meter according to claim 1, wherein a softmax activation function is used to fit the mixing weights; let the n-th input datum of the mixing weight neuron be z_n; then softmax(z_n) is assigned to the n-th mixing weight output by the mixing weight neuron, and is calculated as follows:
softmax(z_n) = exp(z_n) / Σ_{j=1}^{N} exp(z_j)
where z_j denotes the j-th input datum of the mixing weight neuron, j = {1, 2, ..., N}, n = {1, 2, ..., N}.
3. The power load probability prediction method for a multi-core smart meter according to claim 1, wherein an ELU activation function is used to fit the probability density function variances and the probability density function means; let the n-th input datum of the probability density function variance neuron be a′_n and the n-th input datum of the probability density function mean neuron be a″_n; then f_ELU(a′_n) is assigned to the n-th probability density function variance output by the probability density function variance neuron, and f_ELU(a″_n) is assigned to the n-th probability density function mean output by the probability density function mean neuron; f_ELU(a′_n) and f_ELU(a″_n) are calculated as follows:
f_ELU(a) = a if a > 0, and f_ELU(a) = e^a - 1 if a ≤ 0
where a represents the input datum of the ELU activation function.
4. The power load probability prediction method for a multi-core smart meter according to claim 1, wherein the fast gradient sign method (FGSM) is used to generate an adversarial training sample set, adversarial training is performed on the mixture density network together with the training set, and the network parameters are optimized by minimizing a loss function;
the adversarial sample is calculated as follows:
x″ = x′ + ε·sign(-∇_x log p(y_t | x′))
where x′ represents a sample in the training set; x″ represents the adversarial sample; ε represents the perturbation added during training; and p(y_t | x′) represents the probability distribution of the power load y_t at time t under the sample x′;
the loss function f_loss is calculated as follows:
f_loss = -log p(y_t | x′) - λ·log p(y_t | x″) + γ·Σ_{k=1}^{K} θ_k²
where λ represents the weight of the adversarial sample; θ_k represents a learning parameter of the mixture parameter output layer and K the number of learning parameters; γ represents a regularization hyperparameter; and p(y_t | x″) represents the probability distribution of the power load y_t at time t under the adversarial sample x″.
5. The power load probability prediction method for a multi-core smart meter according to claim 1, wherein time series cross-validation with the verification set data is used to verify the network during training of the mixture density network.
6. A power load probability prediction system for a multi-core smart meter, characterized by comprising a convolutional neural network CNN, a recurrent neural network GRU and a mixture density network;
the convolutional neural network extracts spatial features from the historical environmental factor data, assembles the spatial features and the corresponding historical load data into feature vectors, and sends the feature vectors to the recurrent neural network GRU; the spatial features include the seasonal features, weather features and regional power generation features that can influence load variation;
the recurrent neural network GRU extracts the temporal features of the feature vectors according to the short-term and long-term dependencies between feature vectors at adjacent time steps, and constructs the sample data on the basis of these feature vectors; the temporal feature refers to the amount of past information that the sample at each time step needs to retain;
the mixture density network comprises a fully connected layer, a mixture parameter output layer and a mixture probability density function output layer connected in sequence;
the fully connected layer outputs the processed input data to the mixture parameter output layer; the fully connected layer comprises 3 groups of output neurons, and each group comprises N output neurons;
the convolutional neural network CNN comprises a convolution layer and a pooling layer; the convolutional neural network CNN is connected with the recurrent neural network GRU through a flatten layer, and the flatten layer reduces the dimension of the spatial features, combines them with the historical load data into feature vectors, and outputs the feature vectors to the recurrent neural network GRU; the recurrent neural network GRU comprises two cascaded GRU layers corresponding to an update gate and a reset gate respectively; the mixture density network has 4 levels of fully connected layers, thereby forming a deep mixture density network;
the mixture parameter output layer comprises three parallel mixture parameter neurons; each mixture parameter neuron receives the N output data of one group of output neurons of the fully connected layer and fits N corresponding mixture parameters from them; the three parallel mixture parameter neurons are a mixing weight neuron, a probability density function variance neuron and a probability density function mean neuron;
the mixture probability density function output layer obtains the mixing weights, probability density function variances and probability density function means output by the mixture parameter output layer, constructs a mixture probability density function from them, and takes the mixture probability density function as the probability density distribution of the electrical load; the mixture probability density function is constructed as a weighted sum of several Gaussian probability distributions:
p(y_t | x) = Σ_{n=1}^{N} α_n(x, t) · N(y_t; μ_n(x, t), σ_n(x, t))
where p(y_t | x) represents the probability distribution of the power load y_t at time t given the input data x; α_n(x, t) represents the weight of the n-th Gaussian distribution at time t, i.e. the n-th mixing weight output by the mixing weight neuron; σ_n(x, t) represents the variance of the n-th Gaussian distribution at time t, i.e. the n-th probability density function variance output by the probability density function variance neuron; μ_n(x, t) represents the mean of the n-th Gaussian distribution at time t, i.e. the n-th probability density function mean output by the probability density function mean neuron; n = {1, 2, ..., N}; and N(y_t; μ, σ) denotes the Gaussian probability density function with mean μ and variance σ.
7. The power load probability prediction system for a multi-core smart meter according to claim 6, wherein the convolutional neural network CNN uses ReLU as the activation function; the pooling layer discards 20% of the data before outputting to the flatten layer; the two GRU layers of the recurrent neural network GRU each discard 50% of the data before outputting; and the first two of the 4 fully connected layers each discard 25% of the data before outputting.
CN202011147997.5A 2020-10-23 2020-10-23 Power load probability prediction system and method for multi-core smart meter Active CN112232577B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011147997.5A CN112232577B (en) 2020-10-23 2020-10-23 Power load probability prediction system and method for multi-core smart meter

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011147997.5A CN112232577B (en) 2020-10-23 2020-10-23 Power load probability prediction system and method for multi-core smart meter

Publications (2)

Publication Number Publication Date
CN112232577A CN112232577A (en) 2021-01-15
CN112232577B true CN112232577B (en) 2024-04-16

Family

ID=74109008

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011147997.5A Active CN112232577B (en) 2020-10-23 2020-10-23 Power load probability prediction system and method for multi-core smart meter

Country Status (1)

Country Link
CN (1) CN112232577B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112906793B (en) * 2021-02-22 2023-12-22 深圳市市政设计研究院有限公司 Monitoring data repairing method and system for bridge health monitoring system
CN113837894B (en) * 2021-08-06 2023-12-19 国网江苏省电力有限公司南京供电分公司 Non-invasive resident user load decomposition method based on residual convolution module
CN113837499A (en) * 2021-11-24 2021-12-24 中国电建集团江西省电力设计院有限公司 Ultra-short-term wind power prediction method and system
CN114358599A (en) * 2022-01-05 2022-04-15 国网江苏省电力有限公司营销服务中心 Task time sequence distribution method of living and distribution field detection system based on mixed density deep learning
CN114298444B (en) * 2022-03-09 2022-05-27 广东电网有限责任公司佛山供电局 Wind speed probability prediction method, device, equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107578124A (en) * 2017-08-28 2018-01-12 国网山东省电力公司电力科学研究院 The Short-Term Load Forecasting Method of GRU neutral nets is improved based on multilayer
CN109711620A (en) * 2018-12-26 2019-05-03 浙江大学 A kind of Short-Term Load Forecasting Method based on GRU neural network and transfer learning
CN111738512A (en) * 2020-06-22 2020-10-02 昆明理工大学 Short-term power load prediction method based on CNN-IPSO-GRU hybrid model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107578124A (en) * 2017-08-28 2018-01-12 国网山东省电力公司电力科学研究院 The Short-Term Load Forecasting Method of GRU neutral nets is improved based on multilayer
CN109711620A (en) * 2018-12-26 2019-05-03 浙江大学 A kind of Short-Term Load Forecasting Method based on GRU neural network and transfer learning
CN111738512A (en) * 2020-06-22 2020-10-02 昆明理工大学 Short-term power load prediction method based on CNN-IPSO-GRU hybrid model

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Short-term bus load probability density prediction based on CNN-GRU quantile regression; 臧海祥, 刘冲冲, 滕俊, 孔伯骏, 孙国强, 卫志农; 智慧电力 (Smart Power), No. 08; full text *
Non-intrusive load identification method based on convolutional neural network; 唐璐, 颜钟宗, 温和, 唐立军; 云南电力技术 (Yunnan Electric Power Technology), No. 02; full text *
Short-term load forecasting based on hybrid neural network deep learning; 陈国涛, 滕欢; 水电能源科学 (Water Resources and Power), No. 04; full text *

Also Published As

Publication number Publication date
CN112232577A (en) 2021-01-15

Similar Documents

Publication Publication Date Title
CN112232577B (en) Power load probability prediction system and method for multi-core smart meter
Xuan et al. Multi-model fusion short-term load forecasting based on random forest feature selection and hybrid neural network
Zhu et al. Bayesian transfer learning with active querying for intelligent cross-machine fault prognosis under limited data
CN113053115B (en) Traffic prediction method based on multi-scale graph convolution network model
Wang et al. The study and application of a novel hybrid forecasting model–A case study of wind speed forecasting in China
Xiao et al. An improved combination approach based on Adaboost algorithm for wind speed time series forecasting
CN111191835B (en) IES incomplete data load prediction method and system based on C-GAN migration learning
Su et al. Short-term traffic flow prediction based on incremental support vector regression
CN111861013B (en) Power load prediction method and device
CN112906989B (en) Regional sea surface temperature prediction method based on CNN-LSTM
CN113128113A (en) Poor information building load prediction method based on deep learning and transfer learning
CN113591728A (en) Electric energy quality disturbance classification method based on integrated deep learning
CN114462718A (en) CNN-GRU wind power prediction method based on time sliding window
CN116345555A (en) CNN-ISCA-LSTM model-based short-term photovoltaic power generation power prediction method
CN114548591A (en) Time sequence data prediction method and system based on hybrid deep learning model and Stacking
Kumar et al. Wind speed prediction using deep learning-LSTM and GRU
CN113762591B (en) Short-term electric quantity prediction method and system based on GRU and multi-core SVM countermeasure learning
CN113469013A (en) Motor fault prediction method and system based on transfer learning and time sequence
CN111783688B (en) Remote sensing image scene classification method based on convolutional neural network
CN113033898A (en) Electrical load prediction method and system based on K-means clustering and BI-LSTM neural network
CN115481788B (en) Phase change energy storage system load prediction method and system
CN117132132A (en) Photovoltaic power generation power prediction method based on meteorological data
CN117394350A (en) Short-term power load prediction method, device, equipment and medium
CN117421571A (en) Topology real-time identification method and system based on power distribution network
Gao et al. MR-DARTS: Restricted connectivity differentiable architecture search in multi-path search space

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant