CN111369120A - Non-invasive load monitoring method based on equipment transfer learning - Google Patents

Non-invasive load monitoring method based on equipment transfer learning

Info

Publication number
CN111369120A
CN111369120A (application number CN202010122993.5A)
Authority
CN
China
Prior art keywords
model
data
target
training
sequence
Prior art date
Legal status
Pending
Application number
CN202010122993.5A
Other languages
Chinese (zh)
Inventor
邵振国 (Shao Zhenguo)
张承圣 (Zhang Chengsheng)
邓宏杰 (Deng Hongjie)
黄耿业 (Huang Gengye)
陈飞雄 (Chen Feixiong)
张嫣 (Zhang Yan)
Current Assignee
Fuzhou University
Original Assignee
Fuzhou University
Priority date
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN202010122993.5A priority Critical patent/CN111369120A/en
Publication of CN111369120A publication Critical patent/CN111369120A/en

Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06Q — INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q 10/00 — Administration; Management
    • G06Q 10/06 — Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q 10/063 — Operations research, analysis or management
    • G06Q 10/0635 — Risk analysis of enterprise or organisation activities
    • G — PHYSICS
    • G01 — MEASURING; TESTING
    • G01R — MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
    • G01R 21/00 — Arrangements for measuring electric power or power factor
    • G01R 21/133 — Arrangements for measuring electric power or power factor by using digital technique
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 — Machine learning
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 — Computing arrangements based on biological models
    • G06N 3/02 — Neural networks
    • G06N 3/04 — Architecture, e.g. interconnection topology
    • G06N 3/045 — Combinations of networks
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06Q — INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q 50/00 — Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q 50/06 — Energy or water supply


Abstract

The invention relates to a non-invasive load monitoring method based on equipment transfer learning. The method collects load amplitude data of various household electrical appliances together with total household load amplitude data and establishes an original data set; normalizes the data of the original data set; builds an initialized sequence-to-point CNN model to construct a device selection model for transfer learning, and selects the initial data and target data for the transfer; trains and validates the initialized sequence-to-point CNN model with the initial data to form a transfer-learning initial model; fine-tunes the fully connected layer of the initial model with the target data to form a transfer-learning target model; and inputs the normalized total household load amplitude data into the transfer-learning target model to obtain the power amplitude of the target equipment, thereby realizing non-invasive load monitoring of the target electrical equipment. The invention reduces model training time as well as the economic and time costs of non-invasive load monitoring.

Description

Non-invasive load monitoring method based on equipment transfer learning
Technical Field
The invention relates to the field of power metering methods, in particular to a non-invasive load monitoring method based on equipment transfer learning.
Background
Detailed monitoring of consumer power loads is mainly used to acquire information such as the power consumption and usage behavior of each load device on a consumer's premises. Traditional intrusive load monitoring requires installing sensing, measurement and data transmission devices on every electrical device inside the load; the economic investment is large, management and maintenance are complicated, and large-scale deployment is impractical. Non-Intrusive Load Monitoring (NILM) can refine monitoring down to the individual components of the total load using only the user's aggregate power consumption, without installing a large number of monitoring devices, thereby obtaining the power consumed and the running state of each appliance; it offers strong operability, low implementation cost, high user acceptance, high reliability and ease of wide deployment. Deep neural network methods achieve the highest accuracy for NILM, but training a model from scratch each time is very time-consuming, and models trained in data-poor regions perform badly, so a suitable method is needed to solve these problems.
The existing NILM methods are mainly of two types: one is unsupervised learning without a training process and the other is supervised learning with a priori training process.
The unsupervised learning methods, represented by hidden Markov models, consider the usage patterns and working processes of devices and dispense with the step of labeling data for each appliance. However, lacking prior supervision, they easily fall into local optima when the number of appliances is large, and identification accuracy drops sharply.
Supervised learning algorithms must be trained on known data and achieve load identification by learning the load signatures of electrical devices. Compared with unsupervised methods, supervised methods represented by deep neural networks offer higher accuracy and can cope with complex situations such as similar load signatures and simultaneous events; however, training a model usually takes a long time, models do not transfer between regions, and there is no good solution for data-poor regions.
Disclosure of Invention
In view of this, the present invention provides a non-intrusive load monitoring method based on device transfer learning, which employs a sequence-to-point input/output mode and can provide load information to the user more efficiently.
The invention is realized by adopting the following scheme: a non-intrusive load monitoring method based on equipment transfer learning comprises the following steps:
step S1: acquiring load power data of each household appliance and total household load power data, and establishing an original data set; the storage form is as follows:
each household appliance is A, B, … and L, the total household load is U, and the corresponding load data in the data set is
A: a(0), a(1), …, a(n)
B: b(0), b(1), …, b(n)
⋮
L: l(0), l(1), …, l(n)
U: u(0), u(1), …, u(n)

where 0, 1, …, n are the sampling point numbers; a(n), b(n), …, l(n), u(n) are the power amplitudes at a given sampling point; a(n), b(n), …, l(n) denote the sets of load sampling-point magnitudes of the corresponding devices; u(n) denotes the set of total household load sampling-point magnitudes;
step S2: normalizing the data of all the stored data sets;
step S3: randomly select a segment of load sampling-point amplitude data recorded while each appliance among A, B, …, L was running, forming the data sets A_e(n), B_e(n), …, L_e(n); let the total household load sampling-point amplitude sets over the corresponding time periods be U_a(n), U_b(n), …, U_l(n); build an initialized sequence-to-point CNN model to construct the device selection model for transfer learning. A convolution kernel whose weights change while the device selection model is being built is defined as an active convolution kernel; the number of active convolution kernels in the model is observed, and the load sampling-point amplitude data of the appliance with the largest number of active convolution kernels is selected as the initial data set O_m(n) for transfer learning, while the load sampling-point amplitude data of the appliance to be monitored, i.e. the target electrical equipment, serves as the target data set P_m(n). The two data sets O_m(n) and P_m(n) are each randomly divided into a training set, a validation set and a test set in the ratio 7:2:1; the training, validation and test sets of the initial data set are O_mt(n), O_mv(n) and O_mtt(n), and those of the target data set are P_mt(n), P_mv(n) and P_mtt(n);
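As an illustration of the 7:2:1 division described in step S3, a minimal sketch follows; `split_721` is a hypothetical helper name, not part of the patent, and a fixed seed is used only for reproducibility.

```python
import random

def split_721(samples, seed=0):
    """Randomly split a list of samples into training, validation and
    test subsets with the 7:2:1 ratio of step S3."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n = len(shuffled)
    n_train = int(n * 0.7)
    n_val = int(n * 0.2)
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # remaining ~10%
    return train, val, test
```

The same helper would be applied independently to O_m(n) and P_m(n).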
step S4: using U_m(n) and O_mt(n) as samples, train the initialized sequence-to-point CNN model set up in step S3;
step S5: using U_m(n) and O_mv(n) as samples, carry out a validation process that stops the training of step S4 to prevent overfitting; at this point the construction of the transfer-learning initial model is finished;
step S6: using U_m(n) and P_mt(n) as samples, carry out a transfer fine-tuning process: fine-tune the fully connected layer of the transfer-learning initial model to form the transfer-learning target model, completing the transfer learning; input the normalized total household load sampling-point amplitude data into the transfer-learning target model to obtain the power amplitude of the target device, realizing non-invasive load monitoring of the target electrical equipment.
Further, the normalization process uses the following formula:
x_m(i) = x(i) − x̄

where x_m(i) is the power amplitude of each sampling point of the target device or household total load after normalization; i = 0, 1, …, n is the sampling point number; x(i) is the power amplitude of each sampling point before normalization; and x̄ is the average value of the target device or household total load. The normalized values x_m(i) form new device load data sets A_m(n), B_m(n), …, L_m(n) and a total household load data set U_m(n).
Further, the specific construction process of constructing the device selection model for transfer learning is as follows:
An initialized sequence-to-point CNN model is built with the following structure:
Input layer: a window of W samples;
Convolutional layer 1: 30 convolution kernels of size 10; stride 1; activation is the rectified linear unit (ReLU);
Convolutional layer 2: 30 convolution kernels of size 8; stride 1; ReLU activation;
Convolutional layer 3: 40 convolution kernels of size 6; stride 1; ReLU activation;
Convolutional layer 4: 50 convolution kernels of size 5; stride 1; ReLU activation;
Convolutional layer 5: 50 convolution kernels of size 5; stride 1; ReLU activation;
Fully connected layer: a dense layer of 1024 neural network units, outputting a window of 1 sample;
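The layer stack above can be checked numerically. The sketch below computes the sequence length after each convolutional layer, assuming "valid" convolution with stride 1; the text specifies kernel sizes and stride but not padding, so the padding choice is an assumption.

```python
def conv_output_lengths(window_w, kernel_sizes=(10, 8, 6, 5, 5)):
    """Sequence length after each of the five convolutional layers,
    assuming 'valid' convolution with stride 1 (padding unspecified in text)."""
    lengths = []
    length = window_w
    for k in kernel_sizes:
        length = length - k + 1  # valid convolution, stride 1
        lengths.append(length)
    return lengths
```

For a window of W = 100 samples this gives lengths 91, 84, 79, 75 and 71 after the five layers.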
Take U_a(n), U_b(n), …, U_l(n) as the input sequences and A_e(n), B_e(n), …, L_e(n) as the target sequences; set the input window sequence length to W, i.e. the input window covers u(i) to u(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. a_e(τ), b_e(τ), …, l_e(τ), where τ = i + W/2;
Perform one round of training on the initialized sequence-to-point CNN model: feed the input-sequence data into the model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε(n); propagate ε(n) back through the network to obtain in turn the errors of all weights in the fully connected layer and the convolutional layers, then update the weights of the convolution kernels and the fully connected layer units, completing one round of training;
After one round of training, the generated transfer-learning device selection model has the expression:

a_e(τ) = f(u_a(i), u_a(i+W−1), ε(n)), and likewise for b_e(τ), …, l_e(τ).
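The sequence-to-point pairing described above (an input window u(i) to u(i+W−1) mapped to the midpoint target at τ = i + W/2) can be sketched as follows; `seq2point_pairs` is an illustrative helper name, not one used in the patent.

```python
def seq2point_pairs(aggregate, target, window_w):
    """Build sequence-to-point training pairs: each input window
    aggregate[i : i + W] is paired with the midpoint value of the
    target sequence, target[i + W // 2]."""
    pairs = []
    for i in range(len(aggregate) - window_w + 1):
        window = aggregate[i:i + window_w]
        midpoint = target[i + window_w // 2]
        pairs.append((window, midpoint))
    return pairs
```

Unlike sequence-to-sequence training, each window contributes exactly one scalar target, which is what allows the single-sample output layer described above.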
Further, the specific content of step S4 is: take U_m(n) as the input sequence and O_mt(n) as the target sequence; set the input window sequence length to W, i.e. the input window covers u_m(i) to u_m(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. o_mt(τ), where τ = i + W/2;
Train the initialized sequence-to-point CNN model: feed the input-sequence data into the model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε_1(n); propagate ε_1(n) back through the network to obtain in turn the errors of all weights in the fully connected layer and the convolutional layers, then update the weights, completing one round of training; repeat this process for the next round. The training process uses the ADAM optimizer to optimize the model.
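Step S4 optimizes the model with the ADAM optimizer. A minimal pure-Python sketch of one standard ADAM update (with bias correction) is shown below; in practice a deep learning framework's built-in optimizer would be used, and the helper names here are illustrative.

```python
import math

def adam_state(n):
    """Fresh optimizer state for n parameters: first/second moments and step count."""
    return {"m": [0.0] * n, "v": [0.0] * n, "t": 0}

def adam_step(theta, grad, state, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    """One ADAM update over a flat list of parameters."""
    state["t"] += 1
    t = state["t"]
    new_theta = []
    for j, (p, g) in enumerate(zip(theta, grad)):
        state["m"][j] = b1 * state["m"][j] + (1 - b1) * g        # first moment
        state["v"][j] = b2 * state["v"][j] + (1 - b2) * g * g    # second moment
        m_hat = state["m"][j] / (1 - b1 ** t)                    # bias correction
        v_hat = state["v"][j] / (1 - b2 ** t)
        new_theta.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
    return new_theta
```

On the first step the bias-corrected update reduces to roughly lr · sign(gradient), which is why ADAM makes steady initial progress regardless of gradient scale.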
Further, the specific content of step S5 is:
The training loss Lpt and the validation loss Lpv of the verification process are defined as follows:

Lpt(θ) = Σ_{τ ∈ training set} [o_mt(τ) − f(u_m(i), u_m(i+W−1); θ)]²

Lpv(θ) = Σ_{τ ∈ validation set} [o_mv(τ) − f(u_m(i), u_m(i+W−1); θ)]²
wherein θ represents a parameter of the network; adopting an early stopping criterion to train the model to stop after the number of rounds meeting the early stopping criterion condition; the early stop criteria are:
Lpv ≥ Lpt
at this time, the training process is stopped, the initial model of the transfer learning is constructed, and the expression is as follows:
o_mt(τ) = f(u_m(i), u_m(i+W−1), ε_1(n)).
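The early-stop criterion Lpv ≥ Lpt of step S5 can be sketched as a simple loop over per-epoch loss pairs; the function name and loss bookkeeping here are illustrative, not from the patent.

```python
def train_with_early_stopping(losses):
    """Iterate over (training_loss, validation_loss) pairs, one per epoch,
    and return the number of epochs run before the early-stop criterion
    Lpv >= Lpt is first met (or all epochs if it never triggers)."""
    for epoch, (lpt, lpv) in enumerate(losses, start=1):
        if lpv >= lpt:
            return epoch  # validation loss caught up with training loss
    return len(losses)
```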
further, the specific content of step S6 is:
Take U_m(n) as the input sequence and P_mt(n) as the target sequence; set the input window sequence length to W, i.e. the input window covers u_m(i) to u_m(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. p_mt(τ), where τ = i + W/2;
Train the transfer-learning initial model while keeping the convolutional layer weights unchanged, as follows:
Feed the input-sequence data into the transfer-learning initial model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε_2(n); propagate ε_2(n) back through the network to obtain the weight errors of the fully connected layer only, then update the weights of the fully connected layer units, completing one round of training; repeat this process for the next round. The training process uses the ADAM optimizer to optimize the model.
The training loss Lpt1 and the validation loss Lpv1 of the transfer fine-tuning process are defined as follows:

Lpt1(θ) = Σ_{τ ∈ training set} [p_mt(τ) − f(u_m(i), u_m(i+W−1); θ)]²

Lpv1(θ) = Σ_{τ ∈ validation set} [p_mv(τ) − f(u_m(i), u_m(i+W−1); θ)]²
wherein θ represents a parameter of the network; adopting an early stopping criterion to train the model to stop after the number of rounds meeting the early stopping criterion condition; the early stop criteria are:
Lpv1 ≥ Lpt1
At this point training stops, the transfer fine-tuning process ends, and the construction of the transfer-learning target model is complete, with the expression:
p_mt(τ) = f(u_m(i), u_m(i+W−1), ε_2(n)).
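The fine-tuning of step S6 keeps the convolutional weights fixed and updates only the fully connected layer. A schematic sketch with named parameters follows; the `conv*`/`fc*` naming scheme is an assumption made for illustration, not from the patent.

```python
def finetune_step(params, grads, frozen_prefixes=("conv",), lr=0.01):
    """One fine-tuning step: parameters whose names start with a frozen
    prefix (the convolutional layers) are left unchanged; only the
    remaining (fully connected) parameters take a gradient step."""
    updated = {}
    for name, value in params.items():
        if any(name.startswith(p) for p in frozen_prefixes):
            updated[name] = value  # convolutional weights stay fixed
        else:
            updated[name] = value - lr * grads[name]
    return updated
```

Freezing the convolutional feature extractor is what lets the small target data set P_mt(n) adapt the model without overfitting.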
compared with the prior art, the invention has the following beneficial effects:
Compared with existing neural-network-based non-invasive load monitoring methods, the invention adopts a sequence-to-point input/output mode, provides load information to users more efficiently, and simplifies the algorithm flow; through transfer learning, it improves the generality of the non-invasive load monitoring model, overcomes the problems caused by too small a sample size in the known data set, and reduces the time needed to construct the target model as well as the economic and time costs.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention.
Detailed Description
The invention is further explained below with reference to the drawings and the embodiments.
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.
As shown in fig. 1, the present embodiment provides a non-intrusive load monitoring method based on device transfer learning, including the following steps:
step S1: acquiring load power data of each household appliance and total household load power data, and establishing an original data set; the storage form is as follows:
each household appliance is A, B, … and L, the total household load is U, and the corresponding load data in the data set is
A: a(0), a(1), …, a(n)
B: b(0), b(1), …, b(n)
⋮
L: l(0), l(1), …, l(n)
U: u(0), u(1), …, u(n)

where 0, 1, …, n are the sampling point numbers; a(n), b(n), …, l(n), u(n) are the power amplitudes at a given sampling point; a(n), b(n), …, l(n) denote the sets of load sampling-point magnitudes of the corresponding devices; u(n) denotes the set of total household load sampling-point magnitudes;
step S2: normalizing the data of all the stored data sets;
step S3: randomly select a segment of load sampling-point amplitude data recorded while each appliance among A, B, …, L was running, forming the data sets A_e(n), B_e(n), …, L_e(n); let the total household load sampling-point amplitude sets over the corresponding time periods be U_a(n), U_b(n), …, U_l(n); build an initialized sequence-to-point CNN model to construct the device selection model for transfer learning. A convolution kernel whose weights change while the device selection model is being built is defined as an active convolution kernel; the number of active convolution kernels in the model is observed, and the load sampling-point amplitude data of the appliance with the largest number of active convolution kernels is selected as the initial data set O_m(n) for transfer learning, while the load sampling-point amplitude data of the appliance to be monitored, i.e. the target electrical equipment, serves as the target data set P_m(n). The two data sets O_m(n) and P_m(n) are each randomly divided into a training set, a validation set and a test set in the ratio 7:2:1; the training, validation and test sets of the initial data set are O_mt(n), O_mv(n) and O_mtt(n), and those of the target data set are P_mt(n), P_mv(n) and P_mtt(n);
step S4: using U_m(n) and O_mt(n) as samples, train the initialized sequence-to-point CNN model set up in step S3;
step S5: using U_m(n) and O_mv(n) as samples, carry out a validation process that stops the training of step S4 to prevent overfitting; at this point the construction of the transfer-learning initial model is finished;
step S6: using U_m(n) and P_mt(n) as samples, carry out a transfer fine-tuning process: fine-tune the fully connected layer of the transfer-learning initial model to form the transfer-learning target model, completing the transfer learning; input the normalized total household load sampling-point amplitude data into the transfer-learning target model to obtain the power amplitude of the target device, realizing non-invasive load monitoring of the target electrical equipment.
In this embodiment, the formula adopted by the normalization process is as follows:
x_m(i) = x(i) − x̄

where x_m(i) is the power amplitude of each sampling point of the target device or household total load after normalization; i = 0, 1, …, n is the sampling point number; x(i) is the power amplitude of each sampling point before normalization; and x̄ is the average value of the target device or household total load. The normalized values x_m(i) form new device load data sets A_m(n), B_m(n), …, L_m(n) and a total household load data set U_m(n).
In this embodiment, the specific construction process of constructing the device selection model for transfer learning is as follows:
An initialized sequence-to-point CNN model is built with the following structure:
Input layer: a window of W samples;
Convolutional layer 1: 30 convolution kernels of size 10; stride 1; activation is the rectified linear unit (ReLU);
Convolutional layer 2: 30 convolution kernels of size 8; stride 1; ReLU activation;
Convolutional layer 3: 40 convolution kernels of size 6; stride 1; ReLU activation;
Convolutional layer 4: 50 convolution kernels of size 5; stride 1; ReLU activation;
Convolutional layer 5: 50 convolution kernels of size 5; stride 1; ReLU activation;
Fully connected layer: a dense layer of 1024 neural network units, outputting a window of 1 sample;
Take U_a(n), U_b(n), …, U_l(n) as the input sequences and A_e(n), B_e(n), …, L_e(n) as the target sequences; set the input window sequence length to W, i.e. the input window covers u(i) to u(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. a_e(τ), b_e(τ), …, l_e(τ), where τ = i + W/2;
Perform one round of training on the initialized sequence-to-point CNN model: feed the input-sequence data into the model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε(n); propagate ε(n) back through the network to obtain in turn the errors of all weights in the fully connected layer and the convolutional layers, then update the weights of the convolution kernels and the fully connected layer units, completing one round of training;
After one round of training, the generated transfer-learning device selection model has the expression:

a_e(τ) = f(u_a(i), u_a(i+W−1), ε(n)), and likewise for b_e(τ), …, l_e(τ).
in this embodiment, the specific content of step S4 is:
Take U_m(n) as the input sequence and O_mt(n) as the target sequence; set the input window sequence length to W, i.e. the input window covers u_m(i) to u_m(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. o_mt(τ), where τ = i + W/2;
Train the initialized sequence-to-point CNN model: feed the input-sequence data into the model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε_1(n); propagate ε_1(n) back through the network to obtain the errors of all weights in the fully connected layer and the convolutional layers, then update the weights of the model's convolution kernels and the fully connected layer units, completing one round of training; repeat this process for the next round. The training process uses the ADAM optimizer to optimize the model.
In this embodiment, the specific content of step S5 is:
The training loss Lpt and the validation loss Lpv of the verification process are defined as follows:

Lpt(θ) = Σ_{τ ∈ training set} [o_mt(τ) − f(u_m(i), u_m(i+W−1); θ)]²

Lpv(θ) = Σ_{τ ∈ validation set} [o_mv(τ) − f(u_m(i), u_m(i+W−1); θ)]²
wherein θ represents a parameter of the network; adopting an early stopping criterion to train the model to stop after the number of rounds meeting the early stopping criterion condition; the early stop criteria are:
Lpv ≥ Lpt
at this time, the training process is stopped, the initial model of the transfer learning is constructed, and the expression is as follows:
o_mt(τ) = f(u_m(i), u_m(i+W−1), ε_1(n)).
in this embodiment, the specific content of step S6 is:
Take U_m(n) as the input sequence and P_mt(n) as the target sequence; set the input window sequence length to W, i.e. the input window covers u_m(i) to u_m(i+W−1); set the target value under the input window to the midpoint value of the corresponding target sequence, i.e. p_mt(τ), where τ = i + W/2;
Train the transfer-learning initial model while keeping the convolutional layer weights unchanged, as follows:
Feed the input-sequence data into the transfer-learning initial model and propagate it forward through the convolutional layers and the fully connected layer to obtain an output value; compare the output value with the target value to generate an error set ε_2(n); propagate ε_2(n) back through the network to obtain the weight errors of the fully connected layer only, then update the weights of the fully connected layer units, completing one round of training; repeat this process for the next round. The training process uses the ADAM optimizer to optimize the model.
The training loss Lpt1 and the validation loss Lpv1 of the transfer fine-tuning process are defined as follows:

Lpt1(θ) = Σ_{τ ∈ training set} [p_mt(τ) − f(u_m(i), u_m(i+W−1); θ)]²

Lpv1(θ) = Σ_{τ ∈ validation set} [p_mv(τ) − f(u_m(i), u_m(i+W−1); θ)]²
wherein θ represents a parameter of the network; adopting an early stopping criterion to train the model to stop after the number of rounds meeting the early stopping criterion condition; the early stop criteria are:
Lpv1 ≥ Lpt1
At this point training stops, the transfer fine-tuning process ends, and the construction of the transfer-learning target model is complete, with the expression:
p_mt(τ) = f(u_m(i), u_m(i+W−1), ε_2(n)).
Preferably, in the present embodiment, U_m(n) and P_mtt(n) are used as samples to establish three indexes for testing the accuracy of the target model.
The first index is the mean absolute error (MAE):

MAE = (1/T) Σ_{τ=1}^{T} |p̂_mtt(τ) − p_mtt(τ)|

where p̂_mtt(τ) is the predicted value, p_mtt(τ) the actual value, and T the total prediction time; this index evaluates the absolute error between predicted and actual values.
the second indicator is total energy error (SAE), which is given by the following equation:
SAE = |r̂ − r| / r

where r̂ denotes the predicted total energy consumption of the device and r the actual total energy consumption; this index evaluates the relative error between the predicted and actual total energy consumption;
the third indicator is the daily energy error (EpD), which is given by the formula:
EpD = (1/D) Σ_{d=1}^{D} |ê(d) − e(d)|

where ê(d) denotes the predicted total energy consumption of the device in day d, e(d) the actual total energy consumption in that day, and D the total number of days tested; this index evaluates the absolute error between the predicted and actual total energy consumption within one day;
the accuracy of the model is judged by the weighted average of the three indices:

M = (α·MAE + β·SAE + γ·EpD) / (α + β + γ)

where α, β and γ are the weights of the three indices; the larger M is, the lower the model accuracy, and the smaller M is, the higher the accuracy.
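Under the definitions above, the three indices can be computed directly in NumPy. The day-wise reshaping, the equal default weights, and the weighted-average combination are illustrative assumptions, not fixed by the patent.

```python
import numpy as np

def mae(pred, actual):
    """Mean absolute error over the predicted horizon T."""
    return np.mean(np.abs(pred - actual))

def sae(pred, actual):
    """Total energy error: relative error of predicted vs. actual
    total energy consumption."""
    r_hat, r = pred.sum(), actual.sum()
    return abs(r_hat - r) / r

def epd(pred, actual, samples_per_day):
    """Energy-per-day error: mean absolute error of daily totals."""
    days_p = pred.reshape(-1, samples_per_day).sum(axis=1)
    days_a = actual.reshape(-1, samples_per_day).sum(axis=1)
    return np.mean(np.abs(days_p - days_a))

def combined_score(pred, actual, samples_per_day,
                   alpha=1.0, beta=1.0, gamma=1.0):
    """Weighted average of the three indices; a larger value means
    lower model accuracy.  The averaging form is an assumption."""
    return (alpha * mae(pred, actual)
            + beta * sae(pred, actual)
            + gamma * epd(pred, actual, samples_per_day)) / (alpha + beta + gamma)
```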
Preferably, this embodiment provides the following improvements:
(1) The traditional sequence-to-sequence input/output mode based on bidirectional recurrent neural networks is improved into a sequence-to-point input/output mode based on convolutional neural network (CNN) deep learning.
(2) The device transfer-learning method of steps S3-S6 is provided: for different electrical devices in the same household, the load data of one device is used to train and validate the initialized sequence-to-point CNN model, yielding the transfer-learning initial model; the fully connected layer of the initial model is then fine-tuned with the load data of the target device, so that the model can be applied to other target devices.
In this embodiment, non-intrusive load monitoring is realized by a transfer-learning method based on sequence-to-point CNN deep learning. One advantage of this monitoring method is that it does not require the time-consuming training of many separate models; through transfer learning, a single initial model can be applied widely across target scenarios.
In deep learning, insufficient training data often reduces accuracy and leads to poor final test performance. A second advantage of the proposed method is that it effectively overcomes this defect: a known data set is used as the sample for constructing the initial model, which resolves the low disaggregation accuracy caused by the small sample size of the known data set in the target domain.
The conventional sequence-to-sequence mode outputs a target-device load sequence for an input segment of the total load sequence. In that mode the monitoring data volume is large, the transmission is complex, the accuracy is limited, and the resulting load information is of little practical use to the user. The sequence-to-point mode of this embodiment instead outputs the midpoint value of the target-device load sequence for an input segment of the total load sequence, which greatly simplifies the processing flow, improves monitoring accuracy, and provides more practical information to the user.
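The sequence-to-point pairing described here — a window of total load in, the midpoint value of the appliance load out — can be sketched as follows; the array values are hypothetical.

```python
import numpy as np

def seq2point_samples(mains, appliance, W):
    """Build sequence-to-point training pairs: each input is a
    length-W window of the total (mains) load and the target is the
    appliance load at the window midpoint tau = i + W//2."""
    inputs, targets = [], []
    for i in range(len(mains) - W + 1):
        inputs.append(mains[i:i + W])
        targets.append(appliance[i + W // 2])
    return np.asarray(inputs), np.asarray(targets)

mains = np.arange(10.0)            # hypothetical total-load samples
appliance = np.arange(10.0) * 0.5  # hypothetical appliance samples
X, y = seq2point_samples(mains, appliance, W=5)
# 6 windows of length 5; the first target is appliance[2]
assert X.shape == (6, 5) and y[0] == appliance[2]
```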
In summary, compared with existing neural-network-based non-intrusive load monitoring methods, the equipment-transfer-learning method provided by the invention adopts a sequence-to-point input/output mode, delivering load information to the user more efficiently and simplifying the algorithm flow; transfer learning improves the universality of the non-intrusive load monitoring model, resolves the problems caused by the small sample size of the known data set, and at the same time reduces the time needed to construct the target model, lowering both economic and time costs. The proposed method is therefore advantageous.
The above description is only a preferred embodiment of the present invention; all equivalent changes and modifications made within the scope of the claims of the present invention shall be covered by the present invention.

Claims (6)

1. A non-intrusive load monitoring method based on equipment transfer learning, characterized by comprising the following steps:
step S1: acquiring load power data of each household appliance and total household load power data, and establishing an original data set; the storage form is as follows:
each household appliance is denoted A, B, …, L and the total household load U; the corresponding load data stored in the data set are

a(0), a(1), …, a(n)
b(0), b(1), …, b(n)
…
l(0), l(1), …, l(n)
u(0), u(1), …, u(n)

where 0, 1, …, n are the sampling-point numbers; a(n), b(n), …, l(n), u(n) are the power amplitudes at the specific sampling points; a(n), b(n), …, l(n) denote the sets of load sampling-point amplitudes of the corresponding appliances; and u(n) denotes the set of total household-load sampling-point amplitudes;
step S2: normalizing the data of all the stored data sets;
step S3: randomly selecting, for each appliance among the household devices A, B, …, L, a segment of load sampling-point amplitude data recorded during its operation, forming the data sets Ae(n), Be(n), …, Le(n); letting the corresponding sets of total household-load sampling-point amplitudes over the same periods be Ua(n), Ub(n), …, Ul(n); constructing the device-selection model for transfer learning by building an initialized sequence-to-point CNN model; defining a convolution kernel whose weights change during construction of the device-selection model as an active convolution kernel, counting the active convolution kernels in the model, and selecting the load sampling-point amplitude data of the appliance with the largest number of active convolution kernels as the initial data set Om(n) for transfer learning; taking the load sampling-point amplitude data of the appliance to be monitored, i.e. the target appliance, as the target data set Pm(n) for transfer learning; randomly dividing each of the two data sets Om(n) and Pm(n) into a training set, a validation set and a test set in the ratio 7:2:1; the resulting training, validation and test sets of the initial data set are Omt(n), Omv(n) and Omtt(n), and those of the target data set are Pmt(n), Pmv(n) and Pmtt(n);
step S4: training the initialized sequence-to-point CNN model built in step S3, with Um(n) and Omt(n) as samples;
step S5: with Um(n) and Omv(n) as samples, performing a validation process that stops the training of step S4 to prevent overfitting; at this point the construction of the transfer-learning initial model is finished;
step S6: with Um(n) and Pmt(n) as samples, performing a migration fine-tuning process that fine-tunes the fully connected layer of the transfer-learning initial model to form the transfer-learning target model, at which point the transfer learning is complete; inputting the normalized total household-load sampling-point amplitude data into the transfer-learning target model to obtain the power amplitude of the target device, thereby realizing non-intrusive load monitoring of the target appliance.
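The 7:2:1 random split of step S3 can be sketched as follows; the function name and seed are illustrative, not from the patent.

```python
import numpy as np

def split_7_2_1(data, seed=0):
    """Randomly split a load-data sequence into training, validation
    and test subsets in the ratio 7:2:1 (as in step S3)."""
    idx = np.random.default_rng(seed).permutation(len(data))
    n_train = int(0.7 * len(data))
    n_val = int(0.2 * len(data))
    train = data[idx[:n_train]]
    val = data[idx[n_train:n_train + n_val]]
    test = data[idx[n_train + n_val:]]
    return train, val, test

data = np.arange(100)   # hypothetical load samples
train, val, test = split_7_2_1(data)
assert len(train) == 70 and len(val) == 20 and len(test) == 10
```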
2. The non-intrusive load monitoring method based on equipment transfer learning according to claim 1, characterized in that: the normalization process uses the following formula:
xm(i) = x(i) − x̄

where xm(i) denotes the power amplitude of each sampling point after normalization of the target-device or total-household load; i = 0, 1, …, n is the sampling-point number; x(i) denotes the power amplitude of each sampling point before normalization; and x̄ denotes the mean value of the target-device or total-household load; the normalized xm(i) form a new set of appliance load data Am(n), Bm(n), …, Lm(n) and a set of total household-load data Um(n).
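A minimal sketch of the normalization of claim 2; since the claim's equation survives only as an image and defines only the series mean, plain mean-centring is assumed here.

```python
import numpy as np

def normalize(x):
    """Mean-centre a load sequence.  The exact normalization formula
    in the claim is given only as an equation image; because the claim
    defines only the mean of the series, subtracting the mean is an
    assumption made for this sketch."""
    return x - x.mean()

u = np.array([100.0, 150.0, 200.0, 150.0])  # hypothetical mains samples (W)
un = normalize(u)
assert un.sum() == 0.0   # centred series sums to zero
```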
3. The non-intrusive load monitoring method based on equipment transfer learning according to claim 1, characterized in that: the specific construction process of the device-selection model for transfer learning is as follows:
an initialized sequence-to-point CNN model is built with the following structure:
input layer: W samples are input as one window;
convolutional layer 1: 30 convolution kernels, each of size 10; stride 1; activation function: linear rectification function ReLU;
convolutional layer 2: 30 convolution kernels, each of size 8; stride 1; activation function: ReLU;
convolutional layer 3: 40 convolution kernels, each of size 6; stride 1; activation function: ReLU;
convolutional layer 4: 50 convolution kernels, each of size 5; stride 1; activation function: ReLU;
convolutional layer 5: 50 convolution kernels, each of size 5; stride 1; activation function: ReLU;
fully connected layer: a dense-layer structure of 1024 neural-network units, outputting 1 sample as a window;
taking Ua(n), Ub(n), …, Ul(n) as input sequences and Ae(n), Be(n), …, Le(n) as target sequences; setting the input-window sequence length to W, i.e. an input window is the sequence u(i) to u(i+W−1); setting the target value under the input window to the midpoint value of the corresponding target sequence, i.e. ae(τ), be(τ), …, le(τ), where τ = i + W/2;
performing one round of training on the initialized sequence-to-point CNN model: inputting the input-sequence data into the model and propagating it forward through the convolutional layers and the fully connected layer to obtain an output value; comparing the output value with the target value to generate an error set ε(n); back-propagating ε(n) through the network to obtain in turn the errors of all weights in the fully connected and convolutional layers, and updating the weights of the convolution kernels and fully connected neural-network units accordingly, completing one round of training;
after one round of training is completed, the generated device-selection model for transfer learning is expressed as:

ae(τ) = f(ua(i), …, ua(i+W−1), ε(n)), be(τ) = f(ub(i), …, ub(i+W−1), ε(n)), …, le(τ) = f(ul(i), …, ul(i+W−1), ε(n)).
4. the non-intrusive load monitoring method based on equipment transfer learning according to claim 1, characterized in that: the specific content of step S4 is:
taking Um(n) as the input sequence and Omt(n) as the target sequence; setting the input-window sequence length to W, i.e. an input window is the sequence um(i) to um(i+W−1); setting the target value under the input window to the midpoint value of the corresponding target sequence, i.e. omt(τ), where τ = i + W/2;
training the initialized sequence-to-point CNN model: inputting the input-sequence data into the model and propagating it forward through the convolutional layers and the fully connected layer to obtain an output value; comparing the output value with the target value to generate an error set ε1(n); back-propagating ε1(n) through the network to obtain the errors of all weights in the fully connected and convolutional layers, and updating the weights of the model's convolution kernels and fully connected neural-network units accordingly, completing one round of training; repeating the training process for the next round; the training process uses the ADAM optimizer to optimize the model.
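Claim 4 names the ADAM optimizer; a textbook sketch of its update rule, written out for a single weight array in NumPy (not the patent's implementation):

```python
import numpy as np

def adam_update(w, grad, state, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    """One ADAM step: exponential moving averages of the gradient (m)
    and its square (v), with bias correction, drive the weight update."""
    state["t"] += 1
    state["m"] = b1 * state["m"] + (1 - b1) * grad
    state["v"] = b2 * state["v"] + (1 - b2) * grad ** 2
    m_hat = state["m"] / (1 - b1 ** state["t"])
    v_hat = state["v"] / (1 - b2 ** state["t"])
    return w - lr * m_hat / (np.sqrt(v_hat) + eps)

w = np.array([1.0])
state = {"t": 0, "m": np.zeros(1), "v": np.zeros(1)}
w2 = adam_update(w, np.array([0.5]), state)
assert w2[0] < w[0]   # a positive gradient moves the weight down
```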
5. The non-intrusive load monitoring method based on equipment transfer learning according to claim 1, characterized in that: the specific content of step S5 is:
the training loss value Lpt and validation loss value Lpv of the validation process are defined as follows:
Figure FDA0002393291190000051
Figure FDA0002393291190000052
where θ denotes the network parameters; an early-stopping criterion is adopted so that training stops after the number of rounds that satisfies the criterion; the early-stopping criterion is:
Lpv≥Lpt
at this time, the training process is stopped, the initial model of the transfer learning is constructed, and the expression is as follows:
omt(τ) = f(um(i), …, um(i+W−1), ε1(n)).
6. the non-intrusive load monitoring method based on equipment transfer learning according to claim 1, characterized in that: the specific content of step S6 is:
taking Um(n) as the input sequence and Pmt(n) as the target sequence; setting the input-window sequence length to W, i.e. an input window is the sequence um(i) to um(i+W−1); setting the target value under the input window to the midpoint value of the corresponding target sequence, i.e. pmt(τ), where τ = i + W/2;
training the transfer-learning initial model while keeping the convolutional-layer weights unchanged; the process is as follows:
inputting the input-sequence data into the transfer-learning initial model and propagating it forward through the convolutional layers and the fully connected layer to obtain an output value; comparing the output value with the target value to generate an error set ε2(n); back-propagating ε2(n) through the network to obtain the weight errors of the fully connected layer only, and updating the weights of the fully connected neural-network units accordingly, at which point one round of training is completed; repeating the training process for the next round; the training process uses the ADAM optimizer to optimize the model.
the training loss value Lpt1 and validation loss value Lpv1 of the migration fine-tuning process are defined as follows:
Figure FDA0002393291190000061
Figure FDA0002393291190000062
where θ denotes the network parameters; an early-stopping criterion is adopted so that training stops after the number of rounds that satisfies the criterion; the early-stopping criterion is:
Lpv1≥Lpt1
training then stops, the migration fine-tuning process ends, and the construction of the transfer-learning target model is complete, with the following expression:
pmt(τ) = f(um(i), …, um(i+W−1), ε2(n)).
CN202010122993.5A 2020-02-27 2020-02-27 Non-invasive load monitoring method based on equipment transfer learning Pending CN111369120A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010122993.5A CN111369120A (en) 2020-02-27 2020-02-27 Non-invasive load monitoring method based on equipment transfer learning


Publications (1)

Publication Number Publication Date
CN111369120A true CN111369120A (en) 2020-07-03

Family

ID=71211576

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010122993.5A Pending CN111369120A (en) 2020-02-27 2020-02-27 Non-invasive load monitoring method based on equipment transfer learning

Country Status (1)

Country Link
CN (1) CN111369120A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111753968A (en) * 2020-07-07 2020-10-09 苏州联电能源发展有限公司 Non-invasive load monitoring intelligent ammeter and electricity decomposition method
CN112736907A (en) * 2020-12-28 2021-04-30 国网电子商务有限公司 Non-invasive household power load decomposition characteristic selection method and device
CN113205177A (en) * 2021-04-25 2021-08-03 广西大学 Electric power terminal identification method based on incremental collaborative attention mobile convolution
CN113970667A (en) * 2021-10-10 2022-01-25 上海梦象智能科技有限公司 Non-invasive load monitoring method based on midpoint value of prediction window
CN114925115A (en) * 2022-05-27 2022-08-19 天津大学 Light generalized load decomposition method based on depth separable convolution
CN115204529A (en) * 2022-09-15 2022-10-18 之江实验室 Non-invasive load monitoring method and device based on time attention mechanism

Citations (3)

Publication number Priority date Publication date Assignee Title
CN108899892A (en) * 2018-06-08 2018-11-27 上海电力学院 One kind being based on CNN non-intrusion type power load decomposition method
CN110264041A (en) * 2019-05-24 2019-09-20 澳门大学 A kind of non-intrusion type load decomposition and monitoring method based on difference
CN110376457A (en) * 2019-06-28 2019-10-25 同济大学 Non-intrusion type load monitoring method and device based on semi-supervised learning algorithm


Non-Patent Citations (3)

Title
CHAOYUN ZHANG ET AL.: "sequence-to-point learning with neural networks for non-intrusive load monitoring", 《ARXIV》 *
MICHELE D’INCECCO ET AL.: "Transfer Learning for Non-Intrusive Load Monitoring", 《 IEEE TRANSACTIONS ON SMART GRID》 *
陈小闵: "非侵入式负荷分解技术及应用研究", 《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》 *

Cited By (10)

Publication number Priority date Publication date Assignee Title
CN111753968A (en) * 2020-07-07 2020-10-09 苏州联电能源发展有限公司 Non-invasive load monitoring intelligent ammeter and electricity decomposition method
CN111753968B (en) * 2020-07-07 2023-08-08 苏州联电能源发展有限公司 Non-invasive load monitoring intelligent ammeter and electric quantity decomposition method
CN112736907A (en) * 2020-12-28 2021-04-30 国网电子商务有限公司 Non-invasive household power load decomposition characteristic selection method and device
CN113205177A (en) * 2021-04-25 2021-08-03 广西大学 Electric power terminal identification method based on incremental collaborative attention mobile convolution
CN113205177B (en) * 2021-04-25 2022-03-25 广西大学 Electric power terminal identification method based on incremental collaborative attention mobile convolution
CN113970667A (en) * 2021-10-10 2022-01-25 上海梦象智能科技有限公司 Non-invasive load monitoring method based on midpoint value of prediction window
CN113970667B (en) * 2021-10-10 2024-04-05 上海梦象智能科技有限公司 Non-invasive load monitoring method based on predicted window midpoint value
CN114925115A (en) * 2022-05-27 2022-08-19 天津大学 Light generalized load decomposition method based on depth separable convolution
CN115204529A (en) * 2022-09-15 2022-10-18 之江实验室 Non-invasive load monitoring method and device based on time attention mechanism
CN115204529B (en) * 2022-09-15 2022-12-20 之江实验室 Non-invasive load monitoring method and device based on time attention mechanism


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
