AU2016325186A1 - Bus load forecasting method - Google Patents

Bus load forecasting method Download PDF

Info

Publication number
AU2016325186A1
AU2016325186A1 AU2016325186A AU2016325186A AU2016325186A1 AU 2016325186 A1 AU2016325186 A1 AU 2016325186A1 AU 2016325186 A AU2016325186 A AU 2016325186A AU 2016325186 A AU2016325186 A AU 2016325186A AU 2016325186 A1 AU2016325186 A1 AU 2016325186A1
Authority
AU
Australia
Prior art keywords
load
bus
data
class
forecasting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2016325186A
Inventor
Zhengrui BAO
Wei Deng
Zhaohui Du
Junli FAN
Hanyong HAO
Man Leng
Qiang MENG
Sining Wang
Yirong WANG
Runze Wu
Wenzhao Wu
Songnan Yang
Chunguang Zhang
Jian Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING GUODIANTONG Corp
State Grid Corp of China SGCC
State Grid Information and Telecommunication Co Ltd
North China Electric Power University
Jincheng Power Supply Co of State Grid Shanxi Electric Power Co Ltd
Original Assignee
BEIJING GUODIANTONG CORP
State Grid Corp of China SGCC
State Grid Information and Telecommunication Co Ltd
North China Electric Power University
Jincheng Power Supply Co of State Grid Shanxi Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING GUODIANTONG CORP, State Grid Corp of China SGCC, State Grid Information and Telecommunication Co Ltd, North China Electric Power University, Jincheng Power Supply Co of State Grid Shanxi Electric Power Co Ltd filed Critical BEIJING GUODIANTONG CORP
Publication of AU2016325186A1 publication Critical patent/AU2016325186A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention discloses a method of bus load forecasting, the method comprises the following steps: using the horizontal comparison method to correct the abnormal value in the historical load data, 5 and using the gray correlation projection method to determine the key influencing factors of the bus load; using an improved K-means clustering method to classify the load curve with similar characteristics into one class, obtaining several typical load patterns, constructing the random forest classification model, and establishing a mapping relationship between influencing factors and clustering results; for each class of load pattern, using multiple linear regression method to train several forecasting models; using the random 10 forest classification model to determine the class of the day under test, and selecting the matching regression model to realize the load forecasting. The present invention adopts the data mining method to analyze the change rule of the bus load and to set up forecasting model base to realize the model matching in combination with the class of day under test, which improves the accuracy and real-time performance of short-term bus load forecasting, and provides more accurate decision support for power grid planning 15 and real-time scheduling.

Description

The present invention discloses a method of bus load forecasting, the method comprises the following steps: using the horizontal comparison method to correct the abnormal value in the historical load data, and using the gray correlation projection method to determine the key influencing factors of the bus load; using an improved K-means clustering method to classify the load curve with similar characteristics into one class, obtaining several typical load patterns, constructing the random forest classification model, and establishing a mapping relationship between influencing factors and clustering results; for each class of load pattern, using multiple linear regression method to train several forecasting models; using the random forest classification model to determine the class of the day under test, and selecting the matching regression model to realize the load forecasting. The present invention adopts the data mining method to analyze the change rule of the bus load and to set up forecasting model base to realize the model matching in combination with the class of day under test, which improves the accuracy and real-time performance of short-term bus load forecasting, and provides more accurate decision support for power grid planning and real-time scheduling.
DESCRIPTION
Bus Load Forecasting Method
CROSSREFERENCE TO RELATED APPLICATIONS [0001] This disclosure claims the priority of Chinese patent application No. 2016108123390, filed on September 9, 2016, and the entire contents thereof are incorporated herein by reference.
Technical Field [0002] The present invention relates to the field of power system engineering, and more particularly to a method of bus load forecasting.
Background [0003] Bus load forecasting is an important part of power system planning and the basis of economic operation of power system. Its forecasting results can better realize decentralized load management, which directly affects the analysis results of subsequent power grid safety check, which is very important for dynamic state estimation, reactive power optimization, local control of power plant, and reduction of power generation cost. Bus load is used as the bus-supplied load of substation, due to its own characteristics, such as the small base, the not strong stability, the not obvious change trend, and the excessive high frequency fluctuating components, which make it difficult to improve the forecasting accuracy. However, short-term bus load forecasting generally refers to real-time forecasting. It requires not only high forecasting accuracy but also fast calculation speed. Due to the characteristics of bus load itself and the inherent flaws of traditional forecasting methods, the research on system load forecasting is less.
[0004] Bus load forecasting generally uses regression analysis, neural network and support vector regression and other methods. Different forecasting methods extract historical load information from different perspectives, and have different requirements on the size and complexity and the like of sample data. When dealing with bus load forecasting problems considering multiple influencing factors, the regression analysis method is simple to implement, but it is not appropriate to deal directly with the bus load with strong fluctuation. Neural network and support vector regression have strong ability of non-linear fitting, which require not so strong the smoothness of the sample as the regression analysis method. However, when the input data dimension is too large and the sample size is large, the training of the model is easy to fall into local optimum, and the complexity is relatively higher, which reduces the forecasting accuracy and computational efficiency. And with the rise of large data technology and the rapid development of sensing technology, has formed a large power data, the internal characteristics of bus load become more complex, which is not conducive to the establishment of forecasting models. Therefore, the existing bus load forecasting algorithm cannot achieve a satisfactory forecasting effect.
Summary [0005] In view of the above, it is an object of the present invention to provide a method of bus load forecasting which can analyze the change rule of the bus load and improve the forecasting accuracy of the bus load while satisfying the real-time requirement.
[0006] Based on the above objects, the present invention provides a method of bus load forecasting, comprising:
inputting key influencing factor values of the bus under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;
selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test; selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
[0007] Optionally, the classification model is constructed by:
using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data;
using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;
selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors;
clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;
according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0008] Optionally, the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises:
cleaning the historical load data of the bus under test by the horizontal comparison method to identify the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:
where ^e change rate of the bus load under test at time t of d day, and L(d,t) js tjje hus load value under test at time t of d day;
whether the sampling point data is abnormal data depends on:
L(d.t)-L(d.t-1) t=1,
L(d,t —1)
L(d.t)-IXd-l,q)
L(d,q)
Ja(d,t)<a^iax (d,t), the load at time t is normal value va(d,t)>a^iax (d,t),theloadat time t is abnormal value and
a.
(d,t) fL(d-i,t)-L(d — i,t —1)-, · 3 n max{-ι/d r n-}1 = X2’ ·
L(d — i,t -1) rL(d-i,t)-L(d-i-l,q). . . _ max--—---—1 = 1,2, 1 L(d—i,q) ' ,n,t = 1,2,,...,q .,n,t = 0 where amax maxjmum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, — 1 — 1) js the sampling point data at time (t-1) of (d-i) day, (q+1) sampling points per day; using the last m days of data to correct the abnormal data;
comparing the load value at the current time with the load value at the previous time in chronological order, correcting the abnormal data once the abnormal data detected, and using the corrected data as the comparison value of load value at the next time, completing the load data 15 correction.
[0009] Optionally, the step of correcting the abnormal data by the preset algorithm comprises:
correcting the selected abnormal data according to the following algorithm, the specific algorithm formula is:
L(d,t) = ^Iftd - l,t) + - 2,t) + ... + 2mL(d - m,t) e(0,l)
ΣΛ=1
J=1 λ
where m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, — ΠΊΤ ) js the bus load value under test at time t of (d-m) day.
[0010] Optionally, the step of using the gray correlation projection method to calculate the correlation 5 degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises:
First, determining a reference sequence and a comparison sequence, where the reference sequence is the corrected load data: X° ; the comparison sequence is the load-related influencing factors: Ν — 1 i > xi..J J — 1,2,...,n . obtaining the sequence matrix of the reference sequence and the comparison sequence:
Xo <XO1 · • %h?
X1 = ; ·
x„ s%i · • %m^
where m represents that each sequence has m elements, and n is the number of comparison sequences; Xo is the load sequence or reference sequence corresponding to the corrected load data, X; is the i-th comparison sequence;
normalizing the matrix X as follows:
eo = max : 1.2....,::% 111111 i 1.2...., m%
-(1-0.1) + 0.1 where mm and max are the minimum and maximum values of the row elements of
X..
1 respectively, and 1J is the normalized value of the element Xy of the matrix X, the value of 20 limited to 0.1 to 1;
the normalized matrix X is as follows:
is
-Ό1
Venl e0m e , nm J p £.1=12 where 0 and 1 ’ ’ ’ sequence;
are normalized reference sequence and normalized comparison the correlation coefficient -°'v 7 of the k-th element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:
mirr min, I — e0, I +z?max, max, I — e0, I
4„,(k) =-k-J P' ”, 01 , i = 1,2,...,n, j = 1,2,...,
1¾ -eok +/)iimx, max lee -e0J I min. min. I e. — en I where 1 J υ UJ is the minimum difference between the two levels, max. max. I e.. — e„. I . , . .... . , , n . . ., ..
1 j u xj is the maximum diiierence between two levels; 1S the identification coefficient;
the gray correlation coefficient judgment matrix F is calculated as follows:
£oo(D · ·· ^oo(j) · ·· 4, (rn)
^(1) · £n(j) · ·· £n (m)
^oi(l) · · ^oi(j) · ·· 20l(m)
£o„(D · '· £o„(j) · ·· ^on(m)
^oo(J) where
1 .. 1 .. 1
^(1) · · £n(j) · · ^01 (m)
^oi(l) ·· • 20l(m)
£o„(D · ·· £o„(j) · ·· ^on(m)
is the correlation coefficient between the normalized reference sequence e0 and its own
A (j) j-th element, and the value of >θθ v J 7 is 1;
using the entropy weight method to give weights to column vectors of matrix E, except for the first row elements, the formula is as follows:
E- =ln(m) i=1
ΣΡϋΡϋ ln(m) j
ΣΧ /ΣΧ)111^ ZZeij)’ j =1,2,...m i=l —Ew; =-j = l,2,...,m '-Σε j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the 15 element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector
W = (w1?..., Wj,..., wm) the weighted gray correlation coefficient judgment matrix F is as follows:
where e(b
W1 w/oiQ) ... Wj · ··· w^01(j) · ·· Wm ·· wX(m)
w/oiQ) ' ··· w^Oi(j) ·· '· wjoi(m)
W/onQ) · ·· w^On(j) · ·· wJOn(m)
Wj^oi(j) is the weighted correlation cot
calculating the gray correlation projection value, the formula is as follows:
ZwA(jWj j=! _
I m I m i = l,2, where D; is the correlation projection value between the comparison sequence X; and the bus load under test, D; is between 0 and 1;
selecting the M factors which are the top of the projection values from large to small as the key influencing factors.
[0011] Optionally, the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:
using the corrected load data as a sample set, setting the sample set as iN’···’xi’···’Nn) , load
X=fx X X ) C°=ic°c0 C° C°1 sequence as' ΐΛο’-’ it’···’ in J, the initial centroid set as ’’ 2'’ j’’’ k J, selecting one sample randomly as the first centroid, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:
<ϊ = Σ(\-<ΐ) t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c° sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;
selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined;
calculating the Euclidean distance of each sample to all centroids separately, selecting the centroid with the smallest Euclidean distance from the current sample as the class to which the current sample belongs:
n 2 d(k,cj) = Σ<Χ-c‘t) , j =1,2,...,k,l =0,1,...,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration;
updating the centroid of each class after all samples classified; let the number of samples of class j be mj and the sample set be , recalculating the centroid vector of class j as follows:
Cj = (CjO’Cjl’-’Cjt’---’Cjn) »
where calculation formula of (t+l)-th component is:
XjteSj Xit , j =1,2, ,k determining whether or not the preset termination condition is reached, continuing the iteration calculation if not reached, until the historical load of the bus under test is classified as a typical load pattern of class k.
[0012] Optionally, the step of determining whether or not the preset termination condition is reached comprises:
after the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:
Dis' =11 c* -c'f1 i = 1 2 k J ’·>····> , Disk represents the centroid distance interval of class j after the completion of 1-th iteration;
selecting the maximum value max{Dis. ), p- ^g maximum value is less than the difference tolerance , the algorithm terminates, otherwise reclassifying the samples to continue the iteration updating.
[0013] Optionally, the step of using the random forest algorithm to construct the classification model comprises:
' I ’O · _ /—y sampling with replacement s training sample sets 1 , 1 — EA···,s of the same size randomly from the historical load data sets;
sampling N influencing factors randomly as the characteristic attribute of each training sample set; wherein N<M;
training s sample sets TS; separately to generate the corresponding decision tree Tree;; remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process;
except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) = 1-£β 'i,d i=l
Figure AU2016325186A1_D0001
for each node, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:
Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node;
continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis; combining the s decision trees in order to obtain the random forest classification model;
when a test sample input, using each decision tree to classify the test sample, obtaining s classification results and selecting the class with the highest proportion as the class of test sample.
[0014] Optionally, establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
[0015] Optionally, the regression equation is:
f 00 = b0 +b^i + b2xi2 + - · +t»An
Figure AU2016325186A1_D0002
X γ γ is the input sequence of sample k ™ is the value of the n-th influencing factor, ’i is the true value of the bus load corresponding to the output;
the input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
X =
' 1 X11 X12 ·' Xln
_Yi ’ bo
1 X21 ^2 ’
, Y = y2 , B = bl
1 xmi Xm2 · Xmn_ _ Yn _ _b„_
using the least squares method to determine the estimated parameter value of the regression equation:
(XTX)-1XTY b„ [0016] As can be seen from the above, the bus load forecasting method provided by the present invention determines the key influencing factors of the load data by the gray correlation projection method. By means of K-means clustering and random forest classification method in combination with regression analysis method, and based on the characteristics of the sequence to establish statistical forecasting model separately, to mine the inherent rule of bus load data, to decompose the changing complex historical load into several classes of typical load patterns, to targeted train forecasting model, thus to complete the forecasting by selecting matching models according to the class of day under test. The method of bus load forecasting not only can improve the accuracy of the bus load forecasting, but also meet the requirement of real-time forecasting, so that the forecasting result is more stable and reliable.
Brief Description of the Drawings [0017] Figure 1 is a flow chart of a method of bus load forecasting according to one embodiment of the present invention;
[0018] Figure 2 is a flow chart of a method of bus load forecasting according to another embodiment of the present invention;
[0019] Figure 3 is a flow chart of the gray correlation projection method according to one embodiment of the present invention;
[0020] Figure 4 is a flow chart of the preset K-means clustering algorithm according to one embodiment of the present invention;
[0021] Figure 5 is a graph showing a result of clustering of load data in a bus load forecasting method according to the present invention;
[0022] Figure 6 is a flow chart of the random forest algorithm according to one embodiment of the present invention;
[0023] Figure 7 is a thumbnail view of the decision tree generation in the random forest algorithm according to the present invention;
[0024] Figure 8 is a graph showing a result of bus load forecasting in the bus load forecasting method according to the present invention.
Detailed Description [0025] In order to make the objects, technical solutions and advantages of the present invention become more clear and easy to understand, the present invention will be described in further detail with reference to specific embodiments in conjunction with the accompanying drawings.
[0026] It should be noted that all the expressions first and second used in the embodiments of the present invention are for the purpose of distinguishing two different entities having the same name or different parameters, and the use of first and second is for convenience of description only, and should not be construed as limiting the embodiments of the present invention, which will not be further described in detail in this regard in the following embodiments.
[0027] In order to overcome the shortcomings of short-term bus load forecasting in the prior art, the present invention adopts the data mining method in large data technology to cluster for several bus load patterns, and establishes the statistics forecasting models based on the characteristic rules of various load sequences in combination with regression analysis method, so as to select the corresponding model according to the class of day under test to complete the forecast, that is, to achieve model matching. Compared with the traditional method establishing the forecasting model directly, this method can better mine the change rule of bus load and improve the accuracy of the bus load forecasting by the time-domain decomposition of the historical load sequence; and the forecasting model base can be directly obtained once the forecasting model training is completed. The load of bus at all times of one day could be forecasted directly, it is not necessary to retrain the model before every forecasting to satisfy the real-time requirement of short-term forecasting.
[0028] With reference to Figure 1, in one embodiment of the present invention, a method of bus load forecasting is provided, comprising:
[0029] In step 101, inputting key influencing factor values of the bus under test of day under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;
[0030] In step 102, selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test; selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
[0031] In this way, by classifying the model classes to which day under test belongs firstly and then forecasting them using the forecasting model, the bus load can be forecasted more accurately, and the efficiency of bus load forecasting can be further improved, especially for the short-term bus load io forecasting. That is, the method of bus load forecasting according to the present invention realizes the efficient and accurate forecasting of the bus load based on the model matching.
[0032] The pre-constructed classification model is one of the key steps of the present invention, and therefore, in an alternative embodiment of the present invention, there is also provided a construction method of a classification model, comprising:
using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data; using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;
selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors; clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;
according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0033] Of course, the algorithm selected in the above-described construction method is merely an alternative to the embodiment of the present invention, and other suitable algorithms can be used for the construction of the model. But also does not limit the order between the above steps, only need to make the logical relationship with the normal algorithm steps can be.
[0034] With reference to Figure 2, a flow chart of a method of bus load forecasting according to another embodiment of the present invention is provided. The method of bus load forecasting comprising:
[0035] In step 201, obtaining and storing historical load data of the bus under test and load-related influencing factor data; wherein the influencing factors including factors such as meteorological information, day class and the like capable of affecting bus load.
[0036] In step 202, using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data; that is, preprocessing the historical load data of the bus under test, using the horizontal comparison method to process the historical load data of the bus under test to identify the abnormal data in load sequence and to correct the identified abnormal data.
[0037] In step 203, using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors; by determining the key influencing factors of the bus load, making the forecasting model has reasonable input dimension.
[0038] In step 204, clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm, distributing the initial cluster centroids evenly in the sample space, and classifying the historical load data with similar change characteristics into one class, to obtain several typical load patterns. Distributing the initial cluster centroids evenly in the sample space according to the preset K-means clustering algorithm improves the convergence speed and clustering accuracy of the algorithm.
[0039] In step 205, according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0040] In step 206, based on the result of clustering analysis of the historical load data of the bus under test, selecting the corresponding historical load data and influencing factor values for different typical load patterns, training the corresponding historical load data and influencing factor values to obtain different multiple regression forecasting models.
[0041] In step 207, inputting key influencing factor values of the bus under test of day under test as characteristic vectors into the classification model to obtain the class of day under test, selecting the corresponding multiple regression forecasting model according to the class of day under test to forecast the bus load under test.
[0042] It can be seen from the above embodiment that the bus load forecasting method provided by the present invention corrects the abnormal value in the historical load sequence of bus under test and determines the key influence factors of the bus load by the gray correlation projection method; uses the improved K-means algorithm to clustering analyze the corrected historical data, and uses the random forest algorithm to construct the classification model to establish the mapping relationship between the clustering results and key influencing factors; trains several multiple regression models for each class of load pattern; inputs the influencing factors of day under test into the random forest classification model to obtain the class of day under test, to select the matching regression model to complete the forecasting. The method of bus load forecasting not only improves the accuracy of bus load forecasting, but also meets the requirement of real-time forecasting, so that the forecasting result is more stable and reliable.
[0043] In some alternative embodiments of the present invention, the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises: cleaning the historical load data of the bus under test by the horizontal comparison method to identify and correct the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:
[ΐχά,Ο-Ι/ά,ΐ-Ι) [ L(d,t —1) ’ | L(d,t)-L(d-l,q) |
L(d,q) where gjj^gg rate of ^us |oaj un(jer test at time t of d day, and L(d,t) js the ^us load value under test at time t of d day;
whether the sampling point data is abnormal data depends on:
J a (d ,t)<aJJiax (d,t), the load at time t is normal value va(d,t)>a^iax (d,t),theloadat time t is abnormal value and n
max (d,t)
L(d — i, t) — L(d — i, t — 1) . 1 o max{-:-}, ϊ = 1,2, max{
L(d-i,t-l)
L(d - i, t) - L(d - i -1, q) L(d -i, q) }, i = l,2, ,n,t = 1,2,,...,q .,n,t = 0 where amax ;s tjje maximum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, Ifod — i,t — 1) js samppng point data at time (t-1) of (d-i) day, (q+1) sampling points per day;
correcting the identified abnormal data according to the following formula:
L(d,t) = Λ, L(d — 1, t) + d — 2,t) + ... + 2mL(d — m,t) < λ. =β(ϊ-βγ~\ e(o,i)
ΣΤ-1 +1 λ
where m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, mO is the bus load value under test at time t of (d-m) day, that is, using the last m days of data as corrected data.
[0044] The whole process is carried out in chronological order, and only the load value at the previous time is compared. When the anomaly data is detected, it is corrected immediately and is used as the comparison value of the next time data to complete the correction of all historical load data.
[0045] In some alternative embodiments of the present invention, with reference to Figure 3, the step of using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises: using the gray correlation projection method to calculate the correlation degree between each influencing factor and the bus load. Classifying the calculated gray correlation projection values and selecting the M factors with high value as the key influencing factors. The specific calculation steps are as follows:
[0046] In step 301, first, determining a reference sequence and a comparison sequence, where the reference sequence is the corrected load data: iNn, Νκ,···, Nm ) ; the comparison sequence is the influencing factors (such as meteorological data, day class): iNi’Nmi 1 — 1,2,...,n .
obtaining the sequence matrix of the reference sequence and the comparison sequence:
Xo <XO1 · • *0?
= ; ·
X„_ Λ. · • Xnmy
where m represents that each sequence has m elements, n is the number of comparison sequence. Xo represents the reference sequence, that is, the load sequence; X; is i-th comparison sequence, that is, influencing factor sequence.
[0047] In step 302, normalizing the matrix X as follows:
. — min eu = j= 1,2,....m Xj max.
,. — mm.
j-1,2,...,m Xj
-(1-0.1) + 0.1 mm. , 7 x. x. max. . 7 x.
where J 9 is the minimum value of the row elements of 1J, and J « is the maximum value of the row elements of eX, the value of 1J is limited to 0.1 to 1.
The normalized matrix X is as follows:
e; 1J is the normalized value of the element χΗ of the matrix <e e A C01 · · · C0m where 0 and sequence.
ei9i = 1,2,...,n are normalized reference sequence and normalized comparison [0048] In step 303, calculating the correlation coefficient, the correlation coefficient -(li v ' of the k-th 15 element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:
min, min, I — e0, I +/?maxj max, I — e0, I 4„,(k) =--3 , ,J P, 3 , °3 , i = 1,2,...,n, j = 1,2,...,
1¾ -eok l+pmax.maxj lee -e0J I , min. min. I e-- en. I . , . . , , ,, where J J 1 is the minimum difference between the two levels, max-max , I e-— en, I . , . ,.-. , , , n . , ., .
j u is the maximum difference between two levels; f is the identification coefficient, preferably, the value of P is 0.5.
The gray correlation coefficient judgment matrix F is calculated as follows:
F =
X(D ··· ZooO · ·· Zoo (rn)> ( 1 • 1 ·· ' 1 Ί
Z01(D ·· £n(j) · ·· Zoi(m) ZoiQ) · • Z01(j) · • Zoi(m)
^oi(l) ·· Zoi(m) ZOi(D · • ZOl(j) ·· • Zoi(m)
Jond) ·· ZonG) · ·· Zon(m) y Jo„(D ' '· Zon(J) · ·· ZOn(mb
Λ (j) where js the correlation coefficient between the normalized reference sequence e0 and its own / (j) j-th element, and the value of >θθν J7 is 1.
[0049] In step 304, using the entropy weight method to give weights to column vectors of matrix E, except 5 for the first row elements, the formula is as follows:
E. =ln(m) ti
ΣΡίΛΡίι ln(m) i=1
ΣΧ ZZeij)’ j =1,2,...m l —Ew; =-j =l,2,...,m πι-ΣΕ, j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector
W =
[0050] In step 305, a weighted gray correlation judgment matrix F is obtained from step 304:
W1 ·· Wj · ·· Wm
w/oi(l) · ·· w^01(j) · ·· wX(m)
F' =
w/oiQ) · ·· w^Oi(j) · • wjoi(m)
,w/on(!) · ·· w^On(j) · • wJOn(m)?
wi4i(j)
where J 77 Ul \ J z is the weighted correlation coefficient between the sequence e; and the j-th element of
e0[0051] In step 306, calculating the gray correlation projection value, the formula is as follows:
D.
m
Σ wj^oi( j) · j=!
ill! I 111
Σ» j=i j=i where D; is the correlation projection value between the comparison sequence X; and the bus load, D; is between 0 and 1, the closer the value is to 1, the greater correlation degree; the closer the value is to 0, the weaker the correlation degree. Compared with the gray correlation, gray correlation projection value can be more comprehensive to reflect the degree of similarity of the development trends of two objects.
[0052] In step 307, selecting M factors with the largest projection values as the key influencing factors, the values of M could be set on demand, and could classify all gray correlation projection values firstly before the selecting process, selecting the top of M factors from large to small as the key influencing factors, it is possible without classifying.
[0053] In some alternative embodiments of the present invention, the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:
using the corrected historical load data as the sample set, and using the improved K-means algorithm to cluster the load characteristics of bus. Specific calculation process is as follows:
[0054] In step 401, the selecting of the initial centroid. In the traditional algorithms, the initial clustering centroid is selected randomly, and the improved algorithm proposed by the present invention distributes the initial centroid evenly in the sample space with distance as the criterion, and effectively reduces the influence of the initial centroid on the clustering result. Setting the sample set as iN’···’ N ’···’ xm , load sequence as X' iNo’···’ Nt’···’ Nn) , the initial centroid set as sample is selected randomly as the first centroid.
Cu={c[\c°...,c)
X’} . One [0055] In step 402, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:
^ = Σ(^-υθ) t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c° sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;
selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined. Optionally, k is 6.
[0056] In step 403, determining whether k centroids have been selected, if does, means that the selecting process has been completed and can proceed to the next step. If the number of centroids has not yet reached k, then continue the selecting process until reached.
[0057] In step 404, let obtained k centroids mutually correspond to different classes of load patterns separately; in other words, establishing the corresponding relationship, and the number of iterations this moment is denoted as 0.
[0058] In step 405, sample assignment or sample determination. Calculating the Euclidean distance of each sample to all centroids separately, selecting the load class corresponding to the centroid with the smallest Euclidean distance from the sample, assigning the sample to the load class. The calculation formula is as follows:
ά<Χφ=Σ<Χ-4) ’ J=12’ k,l =0,1,..,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration.
[0059] In step 406, centroid updating: Calculating centroid vectors of each class, updating the centroid of each class after all samples assigned; let the number of samples of class j be and the sample set be recalculating the centroid vector of class j as follows:
Cj = (CjO’Cjl’-’Cjt’---’Cjn) »
where (t+l)-th component is calculated as follows:
Figure AU2016325186A1_D0003
j =1,2, ,k [0060] In step 407, determining whether or not the termination condition is reached. In the present invention, in addition to selecting and setting the maximum number of iterations as the termination condition, it is also determined whether or not the algorithm can be terminated by defining the difference tolerance & . After the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:
018,=110,-0^11, j=l,2,...,k ! , .... . , c , . c , J J j ·' , DiSj represents the centroid distance interval ot class j alter the completion of 1-th iteration; each assignment updated, there are k calculations. Selecting the maximum valuemax{kfisj b J 1,2,..., k va|ue js |ess than the predefined difference tolerance, the algorithm terminates, otherwise proceeds to step 405 to continue the iteration updating.
[0061] This process classifies the bus historical load into load pattern of class k, the change trend of bus load in each load pattern is more obvious and more regular, which lays the foundation for constructing model and improving forecasting accuracy.
[0062] In step 408, obtaining k clustering results.
[0063] In a further embodiment of the present invention, the step of using the random forest algorithm to construct the classification model further comprises:
sampling with replacement s training sample sets i, 1 — Tz’···’sof S size randomly from the historical load data set S;
sampling N influencing factors randomly as the characteristic attribute of each training sample set according to M key influencing factors; wherein N<M;
training s sample sets TS; separately to generate the corresponding decision tree Tree;. Remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process. Except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) = 1-^ p*d i=l where d is the current node, ^’d ;s tjje proportion of target class i, k is the number of target class. The closer the Gini index is to 0, the higher the purity of the classification, the better the effect.
[0064] For each node, considering all cases where each characteristic attribute to node splitting, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:
Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node.
[0065] Continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis. When a test sample is input, each decision tree is used to classify the test sample, s classification results are obtained and the class with the highest proportion is selected as the class of test sample.
[0066] Combining the s decision trees in order to obtain the random forest classification model, thus to establish the mapping relationship between the influencing factors and the clustering results, and complete the determination of the class of day under test in real time, laying the foundation for the forecasting model matching.
[0067] In some alternative embodiments of the present invention, the step of obtaining different multiple regression forecasting models from the training further comprises:
[0068] Considering the clustering result of bus load and the real-time of short-term load forecasting, the present invention uses the multiple linear regression models for forecasting. To use bus load data of each class and key influencing factors as training samples to construct several forecasting models. Establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
[0069] Specifically, the i-th training sample in each class of sample set is X = i = l,2,...,m is ’ Yi) , where
Xi , m is the number of samples in each class of load sample set;
x x y is the input sequence of sample i, in is the value of the n-th influencing factor, is the true value of the bus load corresponding to the output.
The regression equation is:
f 00 = b0 +b^i + b2xl2 + · · · +bA where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, is the regression forecasting value.
[0070] The input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
X =
' 1 X11 X12 ' Xln
_Yi ’ bo
1 X21 ^2 ’
, Y = y2 , B = bl
1 xmi Xm2 · Xmn_ _ Yn _ _b„_
using the least squares method to determine the estimated parameter value of the regression equation:
bi (XTX)-1XTY inputting the key influencing factors at each time of day under test into the random forest classification model to obtain the class of day under test, to select the corresponding multiple regression forecasting model to complete the load forecasting.
[0071] The method of bus load forecasting of the present invention has the following effects: through the clustering analysis of the historical bus load, the establishment of statistical model and the model matching of the day under test, the change rule of the bus load is effectively mined, and more accurate forecasting results are obtained; meanwhile it is not necessary to train the model before every forecasting to further satisfy the real-time requirement of bus load forecasting.
[0072] In another embodiment of the present invention, a 110 kV bus active load of a city power grid company is used as a forecasting example.
[0073] In the first stage, the historical load data preprocessing section. The load value 24 hours of llOkV bus of a substation is selected as the forecasting object and the load data of one year before the forecasting day is used as training sample. The data format is shown in Table 1:
[0074] Table 1 Bus load data table
Date The bus load value at each sampling time (Unit: MW)
00:00 01:00 02:00 03:00 .. .. 20:00 21:00 22:00 23:00
01/01 52.6 49.5 46.7 44.7 .. .. 49.9 46.1 46.8 46.2
12/31 52.0 48.5 42.9 41.3 .. .. 69.0 65.3 60.5 59.5
[0075] Each row of data in Table 1 represents a 24-dimensional load sample sequence. There may be several abnormal values in historical load data due to instability of the acquisition systems, namely, abnormal data, which will seriously affect the forecasting accuracy.
[0076] The horizontal comparison method is used to correct abnormal data. Sampling cycle set to 1 hour, a total of 24 data points per day, the change rate of the load at the adjacent time is sequentially calculated in chronological order:
a(d,t) ! L(d,t)-L(d,t~23) |
L(d,t-23) <! L(d,t)-L(d-1,23)! , L(d,23) t = 1,2, , t = 0 ,23 [0077] The maximum value of the change rate of load within 7 days before the sampling point is 15 calculated as the criterion of whether or not the sampling time data is an abnormal value. The judgment formula is as follows:
[0078] If the load data is abnormal value, then the data at the same time of last 3 days is corrected through the correction formula, the correction formula is as follows:
20 L(d,t) = d - l,t) + 2,L(d - 2,t) + 2,L(d - 3,t) _ θ . . , ’ A 0.26 an(|^3 θ·24 , the correction effect is the best.
[0080] In the second stage, the determination stage of key influencing factors. With reference to Figure 3, a specific implementation process of the gray correlation projection method is shown. Based on the meteorological data of the city where the city power grid company is located, the influencing factors are chosen as follows: maximum temperature (Tmax), minimum temperature (Tmin), real-time temperature (RT), Average wind speed (AW), relative humidity (RH), average precipitation (AP), day type (DT), and season type (ST), specific parameters as shown in Table 2:
[0081] Table 2 Influencing factors data
Sampling time Influencing factors
Tmax Tmin RT AW RH AP DT ST
01/01 00:00 10 -2 2 2.6 51 1.8 1 3
01/01 01:00 10 -2 0 2.6 51 6.5 1 3
06/13 14:00 31 18 30 0.8 64 25.7 2 4
10/01 00:00 25 10 15 1.2 59 19.6 3 1
12/31 23:00 8 -6 -1 2.3 48 0.9 1 3
[0082] In Table 2, the column data corresponding to each influencing factor is the comparison sequence X;, and the load data at each sampling time is the reference sequence Xo. The day type is assigned {working day, weekend, holiday} = {1, 2, 3}, the season type is assigned {summer, winter, spring, autumn} = {4, 3, 2,1}, other influencing factors data is actually measured value.
[0083] The comparison sequence and the reference sequence are normalized by the formula, and the corresponding values are limited to 0.1 to 1 to obtain the normalized sequence matrix E:
<e e A C01 · · · C0m yem ''nm J [0084] Where nm represents the normalized value of element xnm in the sequence matrix X. The number of comparison sequence n=8, m=8760 is the number of elements per sequence, ie, 24 sampling points per day for 365 days.
[0085] The gray correlation coefficient between the comparison sequence and the reference sequence is obtained by the formula of the correlation coefficient, and the gray correlation coefficient judgment matrix F is obtained. The entropy weight method is used to give weights to column vectors of matrix E, except for the first row elements, and the weight vector (wx,W2,..., W8760) correlation judgment matrix F is obtained in combination with matrix F:
where is obtained. The gray
W1 · WJ · ' ' W8760
W1 ^01 (D · ·· wA(J) · - W8760^01 (876θ)
·· w^Oi(j) ·· · w876/0i(8760)
W^OnQ) ·· • w^On(j) · ·· w876(/0n(8760)
4i(j) i is the correlation coefficient between the comparison sequence e; and the j-th element of
Wreference sequence e0. J is the weight of column vector j, where coefficient between the sequence e; and the j-th element of e0.
is the weighted correlation [0086] The gray correlation projection value D; between the reference sequence X; and the comparison sequence Xo is calculated as follows:
8760
D; =
Σ wj^oi( J) · j=l
8760 /8760 , 1 = 1,2,...,8 |Σ>ΑΟ>}24Σ>7 [0087] Through the above calculation, the gray correlation projection value of 8 influencing factors is obtained as shown in Table 3:
[0088] Table 3 the gray correlation projection values of influencing factors
Tmax Tmin RT AW RH AP DT ST
0.87 0.64 0.72 0.28 0.66 0.32 0.55 0.43
[0089] Average wind speed and average precipitation are weakly correlation factors, and the day type and season type are moderate correlation factors.
[0090] Classifying and selecting the maximum temperature, real-time temperature, relative humidity, minimum temperature, and day type as the key influencing factors for the trend of bus load under test.
[0091] In the third stage, the historical load clustering stage of the bus under test. With reference to Figure 4, a specific implementation process of the improved K-means algorithm is shown.
[0092] The determination of initial clustering centroid: There are a total of 365 sample sequences in the c° historical load sample set; one sample is selected randomly as the first initial centroid vector 1 , the c°
Euclidean distance d to 1 for each of the remaining samples is separately calculated as follows:
t=0
Y C where il represents the load value at time (t+1) in the i-th sample sequence, ll represents the value of the (t + l)-th element in the initial centroid vector of class 1.
[0093] The sample with the largest d is selected as the second initial centroid vector 2 , then the Euclidean c° distance to 2 is calculated separately for each of the remaining samples, then the sample with the c° largest d as 3 is selected, and so on until k centroids are determined. In the present embodiment, k = 6, 10 the bus load sequence is classified into 6 typical load patterns.
[0094] After 6 initial centroids are determined, the iterative algorithm is started and the samples are assigned. The Euclidean distances of each sample to all centroids are calculated, and each sample will be assigned to the class represented by the nearest centroid:
d(Xi,c‘) = E(N-c‘t) , j =1,2,...,6,1=0,1,...,1000 t=0
V, · where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration.
[0095] The updating of the centroid vector: After each sample assignment, the centroid of load sample of each class needs to be updated. If the number of samples of class j is mj , the sample set ts , and then c1 c1 the elements Jtin the centroid vector J are updated as:
Σ.
_
Jl
Hlj
Nt , j =1,2,
Determine whether the termination condition of algorithm is reached. In addition to the maximum number of iterations, define difference tolerance & to determine whether the algorithm terminates. After the updating of centroid, the distance before and after the updating of each class centroid vector is calculated:
Dis1; =11 C1; - C1;’1 j =1,2,...,6,1=1,2,...1000
Selecting the maximum valuemaxDiSj } va|ue js |ess difference tolerance^ 0.01 the algorithm terminates, otherwise continuing the iteration updating.
[0096] The above clustering process classifies the bus historical load into 6 load patterns, and the clustering results are shown in Figure 5. Class 1 consists of 82 days, mainly on rest days, Class 2, Class 5 mainly consist of 112 and 89 working days; The difference between these two load values is mainly due to seasonal factors; Class 3 consists of 25 days, the load is much higher than other classes due to extreme weather; Class 6 consists of 31 days, the load curve fluctuates greatly.
[0097] In the fourth stage, the construction stage of random forest classification model. With reference to Figure 6, a specific implementation process of the random forest algorithm is shown.
' I ’O · _
Sampling with replacement 100 training sample sets 1 , 1 — CA---,sof S size randomly from the historical sample set S. The data of training sample set ί is shown in Table 4:
[0098] Table 4 training sample set
time Tmax RT RH Tmin DT Clustering result
01/01 00:00 10 2 51 -2 3 Class 5
02/05 09:00 5 3 64 -5 1 Class 2
[0099] 3 influencing factors are sampled randomly as the characteristic attribute for each training sample set.
[0100] 100 sample sets TS; are trained separately to generate the corresponding decision tree Tree;. The characteristic attribute of the decision tree Tree; is remained unchanged throughout the forest growth process. Except for the leaf nodes, each decision tree uses the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) =l-£p2d i=l where d is the current node, is the proportion of target class i. The closer the Gini index is to 0, the higher the purity of the classification, the better the effect.
[0101] For each node, consider all the cases of node splitting with 3 characteristic attributes, and select the attribute with the smallest Gini index as the splitting attribute a. The Gini index of current node d split by splitting attribute a is:
25 Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node.
[0102] Continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of 100 trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis. When a test sample is input, each decision tree is used to classify the test sample, 100 classification results are obtained and the class with the highest proportion is selected as the class of test sample.
[0103] Combine the 100 decision trees in order to obtain the random forest classification model, so as to establish the mapping relationship between the influencing factors and the clustering results. Figure 7 is a thumbnail view of the decision tree generation in the random forest classification model.
[0104] In the fifth stage, the training stage of the forecasting model. According to the clustering results, the bus load data and the key influencing factors in each load pattern are integrated into the training sample set, as shown in Table 5:
[0105] Table 5 training sample set
Sampling time Sample sequence Key Influencing factors Corresponding load
Tmax RT RH Tmtn DT
06/13 14:00 XI 31 30 64 18 2 54.2
10/07 19:00 Xm 26 20 58 9 3 63.6
[0106] Where m Is the number of samples in each class of sample set; sample sequence i = l,2,...,mwh_ % ; »
X;
m .Where is the influencing factor sequence of sample “u yi; is the true value of the bus load corresponding to the output.
[0107] Each class of load pattern needs to construct the forecasting model based on its own training sample set, and finally obtains 6 regression equations. The multiple regression equation is given by:
f (X) = b0 + + b2?q2 + · · · + b5 xl5 where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, ^'^X'^is the regression forecasting value. The regression constant terms and coefficients for each regression equation are calculated from equation (22). The results are shown in Table 6:
[0108] Table 6 regression coefficient
bo bi b2 b3 b4 b5
Equation 1 4.9 0.64 0.51 0.36 0.32 0.23
Equation 2 -8.5 0.57 0.60 0.41 0.26 0.18
Equation 3 6.5 0.48 0.66 0.35 0.43 0.20
Equation 4 -3.8 0.68 0.52 0.37 0.35 0.29
Equation 5 3.2 0.51 0.52 0.36 0.30 0.27
Equation 6 1.8 0.71 0.55 0.34 0.39 0.22
[0109] In the sixth stage, the stage of model matching and forecasting: The load data of the bus on June 21, 2016 is selected as the test sample, and the weather data and day type information of the day are as follows:
Time RT RH Time RT RH Time RT RH Tmax Tmin DT
00:00 23 52 08:00 24 53 16:00 30 37 30 18 1
01:00 22 56 09:00 26 48 17:00 30 40
02:00 21 59 10:00 28 44 18:00 29 43
03:00 20 65 11:00 29 35 19:00 28 46
04:00 20 64 12:00 30 36 20:00 26 51
05:00 19 68 13:00 30 37 21:00 25 55
06:00 18 70 14:00 30 38 22:00 24 57
07:00 21 62 15:00 30 37 23:00 23 58
0110] Then, the corresponding influencing : 'actors ol 24 hours of the c ay are input into the random forest
classification model separately, and result that the day under test belongs to load pattern of class 2 is obtained, and the model 2 (regression equation 2) is selected to complete the load forecasting, the results are shown in Figure 8. The maximum relative error of the forecasting result obtained by the method of the present invention is 2.89%, the minimum relative error is 0.32%, and the average relative error is only 1.29%; while the maximum relative error of the traditional multiple regression forecasting method is
3.06%, the minimum relative error is 0.28%, and the average relative error is 1.38%. Therefore, the accuracy of the forecasting method adopted in the present invention is obviously improved, and has wide application prospect.
[0111] It should be understood by those skilled in the art that the foregoing discussion of any embodiment is intended to be illustrative and not intended to suggest that the scope of the present disclosure, including the claims, be limited to such examples; and in the context of the present invention, the technical features of the various embodiments may also be combined, the steps may be implemented in any order, and there are many other variations of the various aspects of the present invention as described above, which are not provided in the details for the sake of brevity.
[0112] In addition, to simplify the description and discussion, and in order not to obscure the present invention, well-known power/ground connections to integrated circuit (IC) chips and other components may or may not be shown in the provided drawings. In addition, devices may be shown in block diagram form in order to avoid obscuring the present invention, and this also takes into account the fact that the details regarding the embodiments of these block diagram devices are highly dependent on the platform on which the present invention will be implemented (that is, these details should be fully within the purview of those skilled in the art). In the event that specific details (e.g., circuits) have been set forth to describe exemplary embodiments of the present invention, it will be apparent to those skilled in the art that the present invention may be implemented without these specific details or with variations in these specific details. Accordingly, these descriptions are to be regarded in an illustrative rather than a restrictive sense.
[0113] Although the present invention has been described in connection with specific embodiments thereof, many alternatives, modifications and variations of these embodiments will be apparent to those skilled in the art from the foregoing description. For example, other memory architectures (e.g., dynamic RAM (DRAM)) may use the discussed embodiments.
[0114] It is intended that the embodiments of the present invention cover all such alternatives, modifications and variations that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, equivalents, improvements, and the like within the spirit and principles of the present invention are intended to be embraced within the scope of the present invention.

Claims (10)

  1. What is claimed is:
    1. A method of bus load forecasting, comprising:
    inputting key influencing factor values of the bus under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;
    selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test;
    selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
  2. 2. The method of bus load forecasting according to claim 1, wherein the classification model is constructed by:
    using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data;
    using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;
    selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors;
    clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;
    according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
  3. 3. The method of bus load forecasting according to claim 2, wherein the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises:
    cleaning the historical load data of the bus under test by the horizontal comparison method to identify the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:
    L(d.t)-L(d.t-1) t=1,
    L(d,t —1) |L(d.t)-L(d-l,q)|
    L(d,q) where tjje change rate of the bus load under test at time t of d day, and L(d,t) -s ^us load value under test at time t of d day;
    whether the sampling point data is abnormal data depends on:
    (d,t),theloadat time tis normal value ,a(d,t)>a1’jiax (d,t),theloadat time tis abnormal value and
    OL (d,t) i t)-Ud , = 1>2
    L(d — i,t -1) max ud-ro-ud-'-tq) ' L(d—i,q) ' ,n,t = 1,2,,...,q .,n,t = 0
    10 where amax μ tjje maximum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, ~ k> is the sampling point data at time (t-1) of (d-i) day, (q+1) sampling points per day; using the last m days of data to correct the abnormal data;
    comparing the load value at the current time with the load value at the previous time in chronological
    15 order, correcting the abnormal data once the abnormal data detected, and using the corrected data as the comparison value of load value at the next time, completing the load data correction.
  4. 4. The method of bus load forecasting according to claim 2, wherein the step of correcting the abnormal data by the preset algorithm comprises:
    20 correcting the selected abnormal data according to the following algorithm, the specific algorithm formula is:
    L(d,t) = 2jL(d — l,t) + 22L(d — 2,t) + ... + 2mL(d — m,t) < λ.=β(ΐ-βγ~\ e(o,i) m
    ΣΛ=ι j=i λ
    where m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, — ΠΊΑ ) js the bus load value under test at time t of (d-m) day.
  5. 5 5. The method of bus load forecasting according to claim 2, wherein the step of using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises:
    first, determining a reference sequence and a comparison sequence, where the reference sequence is the
    10 corrected load data: X° i-Nn’Nk’···’Arm) ; the comparison sequence is the load-related influencing factors: X > N2’---’ Nm) 1 — 1,2,...,n . obtaining the sequence matrix of the reference sequence and the comparison sequence:
    Xo _ <XO1 · X, = ; · X„_ Λι ·
    where m represents that each sequence has m elements, and n is the number of comparison sequences; Xo 15 is the load sequence or reference sequence corresponding to the corrected load data, X; is the i-th comparison sequence;
    normalizing the matrix X as follows:
    ea = max i 1.2.....,,, Xii 111111 i 1.2....,mNj
    -(1-0.1) + 0.1 where mm and max are the minimum and maximum values of the row elements of
    X··
    20 1J respectively, and 1J is the normalized value of the element Xy of the matrix X, the value of limited to 0.1 to 1;
    the normalized matrix X is as follows:
    is
    -Ό1
    Venl e0m e , nm / p £1=12 where 0 and ' ’ ’ ’ sequence;
    are normalized reference sequence and normalized comparison the correlation coefficient of the k-th element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:
    mirr min, I — e0, I +z?max, max, I — e0, I
  6. 6,(k) =--j .j oj e-' J OJ , i = 1,2,...,n, j = 1,2,...,
    1¾ -eok l+pmax,max, le„ -e0J I , min. min. I e. - en. I . , . . , , ,, where J J 1 is the minimum difference between the two levels, max. max. I e.. — e„. I . , . ,... , 1 j u is the maximum dinerence between two levels; v 1S the identification coefficient;
    the gray correlation coefficient judgment matrix F is calculated as follows:
    F =
    £oo(D · ·· ^oo(j) · ·· +oo<m) r i • 1 ·· ' 1 Ί ^oi CD · ^oi(j) · ·· £oi (m) ^oid) · • ^oi(j) · • ^01 (m) CD ^(1) · £o„(D · ·· £o„(j) · ·· ^on(m) y Jo„(D · ·· £o„(j) ·
    / (i) where >θθv J7 is the correlation coefficient between the normalized reference sequence e0 and its own / (i) j-th element, and the value of -(K) J 7 is I;
    using the entropy weight method to give weights to column vectors of matrix E, except for the first row elements, the formula is as follows:
    E: =--— ΣΡϋΡίι =—Γ7—Σ<Αι /ΣΧ)111^ ZZeiP’ j =l’2,...m J ln(m) y 1-E Wj = ln(m) i=1 J—, j =l,2,...,m m_EEi j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector W = (w1,...,wj,...,wm) the weighted gray correlation coefficient judgment matrix is as follows:
    W1 w/oiQ) ... Wj · ··· w^01(j) · ·· Wm ·· wX(m) w/oib) ' ··· w^Oi(j) ·· '· wjoi(m) W/onQ) · ·· w^On(j) · ·· wJOn(m) Wj^oi(j) is the weighted correlation cot
    where e(b calculating the gray correlation projection value, the formula is as follows:
    D.
    EWi^( j) · Wii i=l
    E(wA(j))2^£(wi)2 where D; is the correlation projection value between the comparison sequence X; and the bus load under test, D; is between 0 and 1, the closer the value is to 1, the greater correlation degree; the closer the value is to 0, the weaker the correlation degree;
    selecting the M factors which are the top of the projection values from large to small as the key 10 influencing factors.
    6. The method of bus load forecasting according to claim 2, wherein the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:
    using the corrected load data as a sample set, setting the sample set as ίΚ’···’ K’···’ xm} , load x=fx x x ) C°=ic°c° c° c°) sequence as ' ΐΛο’-’ it’···’ in J, the initial centroid set as 1 ’’ 1' G’ k J, selecting one
    15 sample randomly as the first centroid, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:
    t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c°
    20 sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;
    selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined;
    calculating the Euclidean distance of each sample to all centroids separately, selecting the centroid with the smallest Euclidean distance from the current sample as the class to which the current sample belongs:
    n 2 ά<Λ,φ = Σ<χ-φ , j =1,2,...,k,l =0,1,...,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration;
    updating the centroid of each class after all samples classified; let the number of samples of class j be mj and the sample set be , recalculating the centroid vector of class j as follows:
    Cj = (CjO’Cjl’-’Cjt’---’Cjn) where calculation formula of (t+l)-th component is:
    Σ.
    sAt j =1,2, ,k determining whether or not the preset termination condition is reached, continuing the iteration calculation if not reached, until the historical load of the bus under test is classified as a typical load pattern of class k.
  7. 7. The method of bus load forecasting according to claim 6, wherein the step of determining whether or not the preset termination condition is reached comprises:
    after the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:
    Dis1. =11 c1. — c1.-1 II, j=l,2,...,k ! , .... . , £ , . £ , J J j ·' , DiSj represents the centroid distance interval of class j after the completion of 1-th iteration;
    selecting the maximum value max{Dis. }, maxjmum va|ue js |ess than the difference tolerance , the algorithm terminates, otherwise reclassifying the samples to continue the iteration updating.
  8. 8. The method of bus load forecasting according to claim 2, wherein the step of using the random forest algorithm to construct the classification model comprises:
    ' I 'O · _ -| sampling with replacement s training sample sets 1 , 1 — TA···,s of the same size randomly from the historical load data sets;
    sampling N influencing factors randomly as the characteristic attribute of each training sample set; wherein N<M;
    training s sample sets TS; separately to generate the corresponding decision tree Tree;; remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process;
    except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:
    Gini(d) = 1-£r 'i,d i=l for each node, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:
    Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node;
    continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis; combining the s decision trees in order to obtain the random forest classification model;
    when a test sample input, using each decision tree to classify the test sample, obtaining s classification results and selecting the class with the highest proportion as the class of test sample.
  9. 9. The method of bus load forecasting according to claim 1, comprising establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
  10. 10. The method of bus load forecasting according to claim 9, wherein the regression equation is:
    f 00 = b0 +b^i +b2^2 + · · · +bA where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, Un) is the regression forecasting value; the i-th training sample in each class of sample set is^1 >x,, Y,where ^={^1,^2,...,xj i = l,2,...,m , m is the number of samples in each class of load sample set; i x y.
    ', in is the value of the n-th influencing factor, J1 is the true value is the input sequence of sample 1, in ii of the bus load corresponding to the output;
    the input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
    X11 X12 ·' Xln ^2 ’ ' xmi Xm2 · ·· Xmn
    _Yi ’ ’V , Y = y2 , B = bl _ Yn _ A_
    using the least squares method to determine the estimated parameter value of the regression equation:
    (XTX)-1XTY
    DRAWINGS
    Input key influencing factor values oflhe bus under test as eharaetcristie vectors into a pre-eonstrueted classification model to obtain the class of day under test
    Select a corresponding multiple regression forecasting model front the multiple regression forecasting models obtained from the pre-training according to the class of the day under test and forecast the bus load under test
    101
    102
    FIG.l
    Start
    201
    Obtain and store historical load data of the bus under test and load-related influencing
    Use the horizontal comparison method to correct abnormal data in the historical load
    202
    Use the gray correlation projection method to determine key influencing factors
    203
    Clustering analyzing the historical load data of bus under test according to the preset
    204
    Use the random forest algorithm to construct the classification model, and establish
    205
    Select the corresponding historical load data and influencing factor values for different t\picul load patterns, and train them to obtain different multiple regression forecasting models
    206
    Input key influencing factor values of the bus of day under test into the classification ni«.»del to obtain the class of day uniler test, ;in<.l select the corresponding multiple..................
    207
    End
    FIG.2 ί
    Start
    301
    Use the corrected historical load data as a reference sequence, and use the influencing factors as a comparison sequence
    302
    Normalize the elements of the reference sequence and the comparison sequence to obtain ; the normalized sequence matrix E
    Calculate the correlation coefficient between the reference sequence and corresponding i element of each comparison sequence to obtain the gray correlation coefficient judgment ' matrix F
    - 303
    304
    Use the entropy weight method to give weights to column vectors of matrix E
    305
    Obtain a weighted gray correlation judgment matrix from the matrix F and weight vector [ calculation
    306
    Calculate the gray correlation projection value of each influencing factor on historical load
    Select preset number of factors with the largest projection values as the key influencing factors
    - 307
    End
    FIG.3
    401
    Start
    402
    - 403
    404
    Obtain 6 initial clustering centroids, each clustering centroid corresponds
    Determine whether 1 is greater than a preset number threshold or whether the maximum interval before and after each class of centroid updating is less than the difference tolerance
    406
    408
    End
    FIG.4
    FIG.5 sample set S
    Sample with replacement training sample set TS1
    Sample with replacement training sample set TS100
    Sample 3 influencing factors randomly as the characteristic attribute
    Sample 3 influencing factors randomly as the characteristic attribute calculate Gini index 'or current each node
    Mark all child nodes that belong to the same class as leaf nodes (except for the leaf nodes), and select the splitting^ ttrihnte
    Use the splitting attribute to binary split the current node into sub-nodes
    Treel generate decision tree Treel
    Mark all child nodes that belong to the same class as leaf nodes calculate Gini index for current each node (except for the leaf nodes), and select the spl itt ing^at tribute
    Use the splitting attribute to binary split the current node into sub-nodes random forest classification model
    FIG.6
    Treel
    TreelOO
    FIG.7
    FIG.8
AU2016325186A 2016-09-09 2016-11-30 Bus load forecasting method Abandoned AU2016325186A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN2016108123390 2016-09-09
CN201610812339.0A CN106485262B (en) 2016-09-09 2016-09-09 Bus load prediction method
PCT/CN2016/108051 WO2018045642A1 (en) 2016-09-09 2016-11-30 A bus bar load forecasting method

Publications (1)

Publication Number Publication Date
AU2016325186A1 true AU2016325186A1 (en) 2018-03-29

Family

ID=58273572

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2016325186A Abandoned AU2016325186A1 (en) 2016-09-09 2016-11-30 Bus load forecasting method

Country Status (3)

Country Link
CN (1) CN106485262B (en)
AU (1) AU2016325186A1 (en)
WO (1) WO2018045642A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109345011A (en) * 2018-09-19 2019-02-15 中冶赛迪重庆信息技术有限公司 A kind of Air-conditioning Load Prediction method and system returning forest based on depth
CN109947079A (en) * 2019-03-20 2019-06-28 阿里巴巴集团控股有限公司 Region method for detecting abnormality and edge calculations equipment based on edge calculations
CN110826774A (en) * 2019-10-18 2020-02-21 广州供电局有限公司 Bus load prediction method and device, computer equipment and storage medium
CN111105098A (en) * 2019-12-25 2020-05-05 北京华电天仁电力控制技术有限公司 Single user algorithm self-matching load prediction method and system
CN112289031A (en) * 2020-11-03 2021-01-29 蚌埠学院 Method and device for detecting and alarming abnormal conditions in bus driving process
CN114819380A (en) * 2022-05-12 2022-07-29 福州大学 Power grid bus load prediction method based on model fusion
CN114819380B (en) * 2022-05-12 2024-06-28 福州大学 Power grid bus load prediction method based on model fusion

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107256415B (en) * 2017-08-04 2020-12-11 国网经济技术研究院有限公司 Calculation method and calculation system for operation mode scene of power system
CN107800140B (en) * 2017-10-18 2021-08-13 天津大学 Large user power supply access decision method considering load characteristics
CN108334900B (en) * 2018-01-29 2021-08-13 上海电气分布式能源科技有限公司 Generation method and system of classification model of power battery, and classification method and system
CN108539738B (en) * 2018-05-10 2020-04-21 国网山东省电力公司电力科学研究院 Short-term load prediction method based on gradient lifting decision tree
CN110766032A (en) * 2018-07-27 2020-02-07 国网江西省电力有限公司九江供电分公司 Power distribution network data clustering integration method based on hierarchical progressive strategy
CN109214458B (en) * 2018-09-19 2021-08-13 合肥工业大学 Urban load quantification method based on historical data
CN109597967B (en) * 2018-11-20 2023-08-25 江苏云上电力科技有限公司 Power system distribution substation load data anomaly detection and repair method
CN109636010B (en) * 2018-11-23 2022-12-02 国网湖北省电力有限公司 Provincial power grid short-term load prediction method and system based on correlation factor matrix
CN109636035A (en) * 2018-12-12 2019-04-16 北京天诚同创电气有限公司 Load forecasting model creation method and device, Methods of electric load forecasting and device
CN111461378A (en) * 2019-01-18 2020-07-28 电力规划总院有限公司 Power grid load prediction method and device
CN109886465B (en) * 2019-01-20 2022-03-18 东北电力大学 Power distribution network load prediction method based on intelligent electric meter user cluster analysis
CN110175166B (en) * 2019-04-18 2023-04-07 红云红河烟草(集团)有限责任公司 Method for confirming quality abnormity of tobacco wrapping workshop
CN110570025A (en) * 2019-08-20 2019-12-13 精硕科技(北京)股份有限公司 prediction method, device and equipment for real reading rate of WeChat seal
CN110489893B (en) * 2019-08-23 2023-02-03 国网山东省电力公司莱芜供电公司 Variable weight-based bus load prediction method and system
CN110598125A (en) * 2019-08-26 2019-12-20 精硕科技(北京)股份有限公司 Method and device for evaluating key opinion leader investment and storage medium
CN110648248B (en) * 2019-09-05 2023-04-07 广东电网有限责任公司 Control method, device and equipment for power station
CN110689195A (en) * 2019-09-26 2020-01-14 云南电网有限责任公司电力科学研究院 Power daily load prediction method
CN110807508B (en) * 2019-10-31 2023-06-09 国网辽宁省电力有限公司经济技术研究院 Bus peak load prediction method considering complex weather influence
CN110852628B (en) * 2019-11-13 2023-09-12 国网江西省电力有限公司经济技术研究院 Rural medium-long term load prediction method considering development mode influence
CN112862142A (en) * 2019-11-28 2021-05-28 新奥数能科技有限公司 Load and price prediction and correction method
CN112882822B (en) * 2019-11-29 2024-03-01 阿里巴巴集团控股有限公司 Method, apparatus, device and storage medium for generating load prediction model
CN111210059A (en) * 2019-12-26 2020-05-29 国网北京市电力公司 Bus daily maximum load processing method and device
CN111476397B (en) * 2020-03-05 2022-10-25 国电南瑞科技股份有限公司 Bus load prediction method and system applied to medium and long term electric quantity safety check
CN111505406B (en) * 2020-03-16 2022-03-01 剑科云智(深圳)科技有限公司 Power distribution cabinet and wire monitoring method
CN111581883B (en) * 2020-05-09 2022-09-23 国网上海市电力公司 Method for calculating and predicting load on automation device
CN111626360B (en) * 2020-05-28 2023-11-17 新奥新智科技有限公司 Method, apparatus, device and storage medium for detecting boiler fault type
CN112070307B (en) * 2020-09-10 2023-01-20 云南电网有限责任公司电力科学研究院 Method and device for predicting energy source load in region
CN112330009B (en) * 2020-11-02 2023-09-01 广东电网有限责任公司 Power grid load prediction method
CN112365039B (en) * 2020-11-03 2023-11-24 国网冀北电力有限公司计量中心 Method and system for predicting monthly electricity consumption based on longitudinal and transverse correlation
CN112327046B (en) * 2020-11-09 2021-09-07 北华航天工业学院 Non-invasive load monitoring method based on fuzzy clustering and support vector regression
CN112446509B (en) * 2020-11-10 2023-05-26 中国电子科技集团公司第三十八研究所 Prediction maintenance method for complex electronic equipment
CN112487612B (en) * 2020-11-12 2022-08-30 云南电网有限责任公司 Bus load combination prediction method and system based on distribution factor and information entropy
CN112529262A (en) * 2020-11-27 2021-03-19 北京京能高安屯燃气热电有限责任公司 Short-term power prediction method, device, computer equipment and storage medium
CN112633738A (en) * 2020-12-30 2021-04-09 安徽广志电气有限公司 Distribution control cabinet terminal load distribution method
CN113283774A (en) * 2021-06-07 2021-08-20 润电能源科学技术有限公司 Deep peak regulation method and device for heating unit, electronic equipment and storage medium
CN113393028B (en) * 2021-06-10 2022-08-19 国网江苏省电力有限公司南通供电分公司 Load prediction method based on data mining technology
CN113781254A (en) * 2021-08-04 2021-12-10 广西电网有限责任公司 Abnormal electricity consumption detection algorithm based on weekly electricity consumption correlation coefficient
CN113689053B (en) * 2021-09-09 2024-03-29 国网安徽省电力有限公司电力科学研究院 Strong convection weather overhead line power failure prediction method based on random forest
CN113782187B (en) * 2021-09-10 2023-06-27 深圳平安智慧医健科技有限公司 Index data processing method, related equipment and medium
CN113884807B (en) * 2021-09-24 2023-10-20 广西电网有限责任公司电力科学研究院 Power distribution network fault prediction method based on random forest and multi-layer architecture clustering
CN113779112A (en) * 2021-09-27 2021-12-10 南京乐帆电气科技有限公司 Electric energy quality analysis system based on spatial information and big data mapping algorithm
CN113888356A (en) * 2021-10-19 2022-01-04 山东大学 Building energy consumption mode analysis and judgment method and system
CN114692386B (en) * 2022-01-26 2024-02-06 天津大学 Operation state abnormity monitoring method for industrial bus
CN114739977A (en) * 2022-04-13 2022-07-12 重庆大学 Method and system for extracting oil paper insulation aging spectral characteristics based on random forest method
CN115085196B (en) * 2022-08-19 2022-12-23 国网信息通信产业集团有限公司 Power load predicted value determination method, device, equipment and computer readable medium
CN117977587B (en) * 2024-04-02 2024-06-07 南京鼎研电力科技有限公司 Power load prediction system and method based on deep neural network

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9235208B2 (en) * 2011-07-19 2016-01-12 GE Intelligent Platforms, Inc System of sequential kernel regression modeling for forecasting financial data
CN103106329B (en) * 2012-11-19 2016-01-13 华北电力大学 A kind of training sample constructed in groups method for SVR short-term load forecasting
CN104123595B (en) * 2014-07-22 2018-09-07 国家电网公司 A kind of distribution network load prediction technique and system
CN104463445A (en) * 2014-11-27 2015-03-25 国网浙江省电力公司嘉兴供电公司 Load forecasting method based on experience day local correction multipoint extrapolation method
CN104881706B (en) * 2014-12-31 2018-05-25 天津弘源慧能科技有限公司 A kind of power-system short-term load forecasting method based on big data technology
CN105303262A (en) * 2015-11-12 2016-02-03 河海大学 Short period load prediction method based on kernel principle component analysis and random forest
CN105631532A (en) * 2015-12-07 2016-06-01 江苏省电力公司检修分公司 Power system load prediction method using fuzzy decision-based neural network model
CN105469123A (en) * 2015-12-30 2016-04-06 华东理工大学 Missing data completion method based on k plane regression
CN105631483B (en) * 2016-03-08 2019-03-12 国家电网公司 A kind of short-term electro-load forecast method and device
CN105608512A (en) * 2016-03-24 2016-05-25 东南大学 Short-term load forecasting method

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109345011A (en) * 2018-09-19 2019-02-15 中冶赛迪重庆信息技术有限公司 A kind of Air-conditioning Load Prediction method and system returning forest based on depth
CN109947079A (en) * 2019-03-20 2019-06-28 阿里巴巴集团控股有限公司 Region method for detecting abnormality and edge calculations equipment based on edge calculations
CN110826774A (en) * 2019-10-18 2020-02-21 广州供电局有限公司 Bus load prediction method and device, computer equipment and storage medium
CN110826774B (en) * 2019-10-18 2022-03-22 广东电网有限责任公司广州供电局 Bus load prediction method and device, computer equipment and storage medium
CN111105098A (en) * 2019-12-25 2020-05-05 北京华电天仁电力控制技术有限公司 Single user algorithm self-matching load prediction method and system
CN111105098B (en) * 2019-12-25 2023-11-03 国能信控互联技术有限公司 Load prediction method and system for self-matching of single user algorithm
CN112289031A (en) * 2020-11-03 2021-01-29 蚌埠学院 Method and device for detecting and alarming abnormal conditions in bus driving process
CN114819380A (en) * 2022-05-12 2022-07-29 福州大学 Power grid bus load prediction method based on model fusion
CN114819380B (en) * 2022-05-12 2024-06-28 福州大学 Power grid bus load prediction method based on model fusion

Also Published As

Publication number Publication date
WO2018045642A1 (en) 2018-03-15
CN106485262B (en) 2020-02-07
CN106485262A (en) 2017-03-08

Similar Documents

Publication Publication Date Title
AU2016325186A1 (en) Bus load forecasting method
CN107516170B (en) Difference self-healing control method based on equipment failure probability and power grid operation risk
CN114792156B (en) Photovoltaic output power prediction method and system based on curve characteristic index clustering
CN106327357B (en) A kind of load identification method based on improvement probabilistic neural network
CN111091233A (en) Wind power plant short-term wind power prediction modeling method based on wavelet analysis and multi-model AdaBoost depth network
CN108898251A (en) Consider the marine wind electric field power forecasting method of meteorological similitude and power swing
CN109086928A (en) Photovoltaic plant realtime power prediction technique based on SAGA-FCM-LSSVM model
CN111860692B (en) Abnormal data detection method based on K-media in Internet of things environment
CN110717610A (en) Wind power prediction method based on data mining
CN111461921B (en) Load modeling typical user database updating method based on machine learning
CN110942205A (en) Short-term photovoltaic power generation power prediction method based on HIMVO-SVM
CN115907195A (en) Photovoltaic power generation power prediction method, system, electronic device and medium
Peng et al. A very short term wind power prediction approach based on Multilayer Restricted Boltzmann Machine
CN110674864A (en) Wind power abnormal data identification method with synchronous phasor measurement device
CN114611757A (en) Electric power system short-term load prediction method based on genetic algorithm and improved depth residual error network
CN113095547A (en) Short-term wind power prediction method based on GRA-LSTM-ICE model
CN113270898A (en) Distributed photovoltaic optimization configuration method based on two-stage planning
CN112115984A (en) Tea garden abnormal data correction method and system based on deep learning and storage medium
CN110349050B (en) Intelligent electricity stealing criterion method and device based on power grid parameter key feature extraction
CN112734073A (en) Photovoltaic power generation short-term prediction method based on long and short-term memory network
CN114818827A (en) Non-invasive load decomposition method based on seq2point network
CN114881176A (en) Non-invasive load identification method based on self-adaptive optimization random forest
CN113780643A (en) Photovoltaic power station short-term output prediction method based on case reasoning
CN109543976A (en) A kind of Wind turbines harmonic emissions modal identification method using gauss hybrid models
Hanmin et al. An incremental elm method for hourly load forecasting

Legal Events

Date Code Title Description
MK5 Application lapsed section 142(2)(e) - patent request and compl. specification not accepted