AU2016325186A1 - Bus load forecasting method - Google Patents
Bus load forecasting method Download PDFInfo
- Publication number
- AU2016325186A1 AU2016325186A1 AU2016325186A AU2016325186A AU2016325186A1 AU 2016325186 A1 AU2016325186 A1 AU 2016325186A1 AU 2016325186 A AU2016325186 A AU 2016325186A AU 2016325186 A AU2016325186 A AU 2016325186A AU 2016325186 A1 AU2016325186 A1 AU 2016325186A1
- Authority
- AU
- Australia
- Prior art keywords
- load
- bus
- data
- class
- forecasting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000013277 forecasting method Methods 0.000 title description 10
- 238000000034 method Methods 0.000 claims abstract description 99
- 238000012360 testing method Methods 0.000 claims abstract description 96
- 230000002159 abnormal effect Effects 0.000 claims abstract description 45
- 238000007637 random forest analysis Methods 0.000 claims abstract description 26
- 238000013145 classification model Methods 0.000 claims abstract description 23
- 230000008859 change Effects 0.000 claims abstract description 21
- 238000003064 k means clustering Methods 0.000 claims abstract description 12
- 238000013507 mapping Methods 0.000 claims abstract description 12
- 238000012417 linear regression Methods 0.000 claims abstract description 6
- 238000004422 calculation algorithm Methods 0.000 claims description 53
- 239000011159 matrix material Substances 0.000 claims description 41
- 238000012549 training Methods 0.000 claims description 37
- 239000013598 vector Substances 0.000 claims description 35
- 238000003066 decision tree Methods 0.000 claims description 27
- 238000005070 sampling Methods 0.000 claims description 23
- 238000004458 analytical method Methods 0.000 claims description 17
- 230000008569 process Effects 0.000 claims description 15
- 238000004364 calculation method Methods 0.000 claims description 11
- 238000012937 correction Methods 0.000 claims description 6
- 238000013138 pruning Methods 0.000 claims description 4
- 238000004140 cleaning Methods 0.000 claims description 3
- 238000009499 grossing Methods 0.000 claims description 3
- 239000004261 Ascorbyl stearate Substances 0.000 claims 1
- 238000007418 data mining Methods 0.000 abstract description 3
- 238000013439 planning Methods 0.000 abstract description 3
- 101710108562 Reverse transcriptase/ribonuclease H Proteins 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000000611 regression analysis Methods 0.000 description 5
- 238000010276 construction Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000010248 power generation Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
- G06N20/20—Ensemble learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24133—Distances to prototypes
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention discloses a method of bus load forecasting, the method comprises the following steps: using the horizontal comparison method to correct the abnormal value in the historical load data, 5 and using the gray correlation projection method to determine the key influencing factors of the bus load; using an improved K-means clustering method to classify the load curve with similar characteristics into one class, obtaining several typical load patterns, constructing the random forest classification model, and establishing a mapping relationship between influencing factors and clustering results; for each class of load pattern, using multiple linear regression method to train several forecasting models; using the random 10 forest classification model to determine the class of the day under test, and selecting the matching regression model to realize the load forecasting. The present invention adopts the data mining method to analyze the change rule of the bus load and to set up forecasting model base to realize the model matching in combination with the class of day under test, which improves the accuracy and real-time performance of short-term bus load forecasting, and provides more accurate decision support for power grid planning 15 and real-time scheduling.
Description
The present invention discloses a method of bus load forecasting, the method comprises the following steps: using the horizontal comparison method to correct the abnormal value in the historical load data, and using the gray correlation projection method to determine the key influencing factors of the bus load; using an improved K-means clustering method to classify the load curve with similar characteristics into one class, obtaining several typical load patterns, constructing the random forest classification model, and establishing a mapping relationship between influencing factors and clustering results; for each class of load pattern, using multiple linear regression method to train several forecasting models; using the random forest classification model to determine the class of the day under test, and selecting the matching regression model to realize the load forecasting. The present invention adopts the data mining method to analyze the change rule of the bus load and to set up forecasting model base to realize the model matching in combination with the class of day under test, which improves the accuracy and real-time performance of short-term bus load forecasting, and provides more accurate decision support for power grid planning and real-time scheduling.
DESCRIPTION
Bus Load Forecasting Method
CROSSREFERENCE TO RELATED APPLICATIONS [0001] This disclosure claims the priority of Chinese patent application No. 2016108123390, filed on September 9, 2016, and the entire contents thereof are incorporated herein by reference.
Technical Field [0002] The present invention relates to the field of power system engineering, and more particularly to a method of bus load forecasting.
Background [0003] Bus load forecasting is an important part of power system planning and the basis of economic operation of power system. Its forecasting results can better realize decentralized load management, which directly affects the analysis results of subsequent power grid safety check, which is very important for dynamic state estimation, reactive power optimization, local control of power plant, and reduction of power generation cost. Bus load is used as the bus-supplied load of substation, due to its own characteristics, such as the small base, the not strong stability, the not obvious change trend, and the excessive high frequency fluctuating components, which make it difficult to improve the forecasting accuracy. However, short-term bus load forecasting generally refers to real-time forecasting. It requires not only high forecasting accuracy but also fast calculation speed. Due to the characteristics of bus load itself and the inherent flaws of traditional forecasting methods, the research on system load forecasting is less.
[0004] Bus load forecasting generally uses regression analysis, neural network and support vector regression and other methods. Different forecasting methods extract historical load information from different perspectives, and have different requirements on the size and complexity and the like of sample data. When dealing with bus load forecasting problems considering multiple influencing factors, the regression analysis method is simple to implement, but it is not appropriate to deal directly with the bus load with strong fluctuation. Neural network and support vector regression have strong ability of non-linear fitting, which require not so strong the smoothness of the sample as the regression analysis method. However, when the input data dimension is too large and the sample size is large, the training of the model is easy to fall into local optimum, and the complexity is relatively higher, which reduces the forecasting accuracy and computational efficiency. And with the rise of large data technology and the rapid development of sensing technology, has formed a large power data, the internal characteristics of bus load become more complex, which is not conducive to the establishment of forecasting models. Therefore, the existing bus load forecasting algorithm cannot achieve a satisfactory forecasting effect.
Summary [0005] In view of the above, it is an object of the present invention to provide a method of bus load forecasting which can analyze the change rule of the bus load and improve the forecasting accuracy of the bus load while satisfying the real-time requirement.
[0006] Based on the above objects, the present invention provides a method of bus load forecasting, comprising:
inputting key influencing factor values of the bus under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;
selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test; selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
[0007] Optionally, the classification model is constructed by:
using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data;
using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;
selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors;
clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;
according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0008] Optionally, the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises:
cleaning the historical load data of the bus under test by the horizontal comparison method to identify the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:
where ^e change rate of the bus load under test at time t of d day, and L(d,t) js tjje hus load value under test at time t of d day;
whether the sampling point data is abnormal data depends on:
L(d.t)-L(d.t-1) t=1,
L(d,t —1)
L(d.t)-IXd-l,q)
L(d,q)
Ja(d,t)<a^iax (d,t), the load at time t is normal value va(d,t)>a^iax (d,t),theloadat time t is abnormal value and
a.
(d,t) fL(d-i,t)-L(d — i,t —1)-, · 3 n max{-ι/d r n-} ’ 1 = X’2’ ·
L(d — i,t -1) rL(d-i,t)-L(d-i-l,q). . . _ max--—---—1 = 1,2, 1 L(d—i,q) ' ,n,t = 1,2,,...,q .,n,t = 0 where amax maxjmum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, — 1 — 1) js the sampling point data at time (t-1) of (d-i) day, (q+1) sampling points per day; using the last m days of data to correct the abnormal data;
comparing the load value at the current time with the load value at the previous time in chronological order, correcting the abnormal data once the abnormal data detected, and using the corrected data as the comparison value of load value at the next time, completing the load data 15 correction.
[0009] Optionally, the step of correcting the abnormal data by the preset algorithm comprises:
correcting the selected abnormal data according to the following algorithm, the specific algorithm formula is:
L(d,t) = ^Iftd - l,t) + - 2,t) + ... + 2mL(d - m,t) e(0,l)
ΣΛ=1
J=1 λ
where m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, — ΠΊΤ ) js the bus load value under test at time t of (d-m) day.
[0010] Optionally, the step of using the gray correlation projection method to calculate the correlation 5 degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises:
First, determining a reference sequence and a comparison sequence, where the reference sequence is the corrected load data: X° — ; the comparison sequence is the load-related influencing factors: Ν — 1 i > xi..J J — 1,2,...,n . obtaining the sequence matrix of the reference sequence and the comparison sequence:
Xo | <XO1 · | • %h? | |
X1 | = | ; · | |
x„ | s%i · | • %m^ |
where m represents that each sequence has m elements, and n is the number of comparison sequences; Xo is the load sequence or reference sequence corresponding to the corrected load data, X; is the i-th comparison sequence;
normalizing the matrix X as follows:
eo = max : 1.2....,::% 111111 i 1.2...., m%
-(1-0.1) + 0.1 where mm and max are the minimum and maximum values of the row elements of
X..
1 respectively, and 1J is the normalized value of the element Xy of the matrix X, the value of 20 limited to 0.1 to 1;
the normalized matrix X is as follows:
is
-Ό1
Venl e0m e , nm J p £.1=12 where 0 and 1 ’ ’ ’ sequence;
are normalized reference sequence and normalized comparison the correlation coefficient -°'v 7 of the k-th element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:
mirr min, I — e0, I +z?max, max, I — e0, I
4„,(k) =-k-J P' ”, 01 , i = 1,2,...,n, j = 1,2,...,
1¾ -eok +/)iimx, max lee -e0J I min. min. I e. — en I where 1 J υ UJ is the minimum difference between the two levels, max. max. I e.. — e„. I . , . .... . , , n . . ., ..
1 j u xj is the maximum diiierence between two levels; 1S the identification coefficient;
the gray correlation coefficient judgment matrix F is calculated as follows:
£oo(D · | ·· ^oo(j) · | ·· 4, (rn) |
^(1) | · £n(j) · | ·· £n (m) |
^oi(l) · | · ^oi(j) · | ·· 20l(m) |
£o„(D · | '· £o„(j) · | ·· ^on(m) |
^oo(J) where
1 | .. 1 .. | 1 |
^(1) · | · £n(j) · | · ^01 (m) |
^oi(l) ·· | • 20l(m) | |
£o„(D · | ·· £o„(j) · | ·· ^on(m) |
is the correlation coefficient between the normalized reference sequence e0 and its own
A (j) j-th element, and the value of >θθ v J 7 is 1;
using the entropy weight method to give weights to column vectors of matrix E, except for the first row elements, the formula is as follows:
E- =ln(m) i=1
ΣΡϋ1ηΡϋ ln(m) j
ΣΧ /ΣΧ)111^ ZZeij)’ j =1,2,...m i=l —Ew; =-j = l,2,...,m '-Σε j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the 15 element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector
W = (w1?..., Wj,..., wm) the weighted gray correlation coefficient judgment matrix F is as follows:
where e(b
W1 w/oiQ) | ... Wj · ··· w^01(j) · | ·· Wm ·· wX(m) |
w/oiQ) ' | ··· w^Oi(j) ·· | '· wjoi(m) |
W/onQ) · | ·· w^On(j) · | ·· wJOn(m) |
Wj^oi(j) | is the weighted correlation cot |
calculating the gray correlation projection value, the formula is as follows:
ZwA(jWj j=! _
I m I m i = l,2, where D; is the correlation projection value between the comparison sequence X; and the bus load under test, D; is between 0 and 1;
selecting the M factors which are the top of the projection values from large to small as the key influencing factors.
[0011] Optionally, the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:
using the corrected load data as a sample set, setting the sample set as iN’···’xi’···’Nn) , load
X=fx X X ) C°=ic°c0 C° C°1 sequence as' ΐΛο’-’ it’···’ in J, the initial centroid set as ’’ 2'’ j’’’ k J, selecting one sample randomly as the first centroid, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:
<ϊ = Σ(\-<ΐ) t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c° sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;
selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined;
calculating the Euclidean distance of each sample to all centroids separately, selecting the centroid with the smallest Euclidean distance from the current sample as the class to which the current sample belongs:
n 2 d(k,cj) = Σ<Χ-c‘t) , j =1,2,...,k,l =0,1,...,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration;
updating the centroid of each class after all samples classified; let the number of samples of class j be mj and the sample set be , recalculating the centroid vector of class j as follows:
Cj = (CjO’Cjl’-’Cjt’---’Cjn) »
where calculation formula of (t+l)-th component is:
XjteSj Xit , j =1,2, ,k determining whether or not the preset termination condition is reached, continuing the iteration calculation if not reached, until the historical load of the bus under test is classified as a typical load pattern of class k.
[0012] Optionally, the step of determining whether or not the preset termination condition is reached comprises:
after the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:
Dis' =11 c* -c'f1 i = 1 2 k J ’·>····> , Disk represents the centroid distance interval of class j after the completion of 1-th iteration;
selecting the maximum value max{Dis. ), p- ^g maximum value is less than the difference tolerance , the algorithm terminates, otherwise reclassifying the samples to continue the iteration updating.
[0013] Optionally, the step of using the random forest algorithm to construct the classification model comprises:
' I ’O · _ /—y sampling with replacement s training sample sets 1 , 1 — EA···,s of the same size randomly from the historical load data sets;
sampling N influencing factors randomly as the characteristic attribute of each training sample set; wherein N<M;
training s sample sets TS; separately to generate the corresponding decision tree Tree;; remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process;
except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) = 1-£β 'i,d i=l
for each node, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:
Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node;
continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis; combining the s decision trees in order to obtain the random forest classification model;
when a test sample input, using each decision tree to classify the test sample, obtaining s classification results and selecting the class with the highest proportion as the class of test sample.
[0014] Optionally, establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
[0015] Optionally, the regression equation is:
f 00 = b0 +b^i + b2xi2 + - · +t»An
X γ γ is the input sequence of sample k ™ is the value of the n-th influencing factor, ’i is the true value of the bus load corresponding to the output;
the input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
X =
' 1 | X11 | X12 ·' | • Xln | ||||
_Yi ’ | bo | ||||||
1 | X21 | ^2 ’ | • | ||||
, Y = | y2 | , B = | bl | ||||
1 | xmi | Xm2 · | • Xmn_ | _ Yn _ | _b„_ |
using the least squares method to determine the estimated parameter value of the regression equation:
(XTX)-1XTY b„ [0016] As can be seen from the above, the bus load forecasting method provided by the present invention determines the key influencing factors of the load data by the gray correlation projection method. By means of K-means clustering and random forest classification method in combination with regression analysis method, and based on the characteristics of the sequence to establish statistical forecasting model separately, to mine the inherent rule of bus load data, to decompose the changing complex historical load into several classes of typical load patterns, to targeted train forecasting model, thus to complete the forecasting by selecting matching models according to the class of day under test. The method of bus load forecasting not only can improve the accuracy of the bus load forecasting, but also meet the requirement of real-time forecasting, so that the forecasting result is more stable and reliable.
Brief Description of the Drawings [0017] Figure 1 is a flow chart of a method of bus load forecasting according to one embodiment of the present invention;
[0018] Figure 2 is a flow chart of a method of bus load forecasting according to another embodiment of the present invention;
[0019] Figure 3 is a flow chart of the gray correlation projection method according to one embodiment of the present invention;
[0020] Figure 4 is a flow chart of the preset K-means clustering algorithm according to one embodiment of the present invention;
[0021] Figure 5 is a graph showing a result of clustering of load data in a bus load forecasting method according to the present invention;
[0022] Figure 6 is a flow chart of the random forest algorithm according to one embodiment of the present invention;
[0023] Figure 7 is a thumbnail view of the decision tree generation in the random forest algorithm according to the present invention;
[0024] Figure 8 is a graph showing a result of bus load forecasting in the bus load forecasting method according to the present invention.
Detailed Description [0025] In order to make the objects, technical solutions and advantages of the present invention become more clear and easy to understand, the present invention will be described in further detail with reference to specific embodiments in conjunction with the accompanying drawings.
[0026] It should be noted that all the expressions first and second used in the embodiments of the present invention are for the purpose of distinguishing two different entities having the same name or different parameters, and the use of first and second is for convenience of description only, and should not be construed as limiting the embodiments of the present invention, which will not be further described in detail in this regard in the following embodiments.
[0027] In order to overcome the shortcomings of short-term bus load forecasting in the prior art, the present invention adopts the data mining method in large data technology to cluster for several bus load patterns, and establishes the statistics forecasting models based on the characteristic rules of various load sequences in combination with regression analysis method, so as to select the corresponding model according to the class of day under test to complete the forecast, that is, to achieve model matching. Compared with the traditional method establishing the forecasting model directly, this method can better mine the change rule of bus load and improve the accuracy of the bus load forecasting by the time-domain decomposition of the historical load sequence; and the forecasting model base can be directly obtained once the forecasting model training is completed. The load of bus at all times of one day could be forecasted directly, it is not necessary to retrain the model before every forecasting to satisfy the real-time requirement of short-term forecasting.
[0028] With reference to Figure 1, in one embodiment of the present invention, a method of bus load forecasting is provided, comprising:
[0029] In step 101, inputting key influencing factor values of the bus under test of day under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;
[0030] In step 102, selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test; selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
[0031] In this way, by classifying the model classes to which day under test belongs firstly and then forecasting them using the forecasting model, the bus load can be forecasted more accurately, and the efficiency of bus load forecasting can be further improved, especially for the short-term bus load io forecasting. That is, the method of bus load forecasting according to the present invention realizes the efficient and accurate forecasting of the bus load based on the model matching.
[0032] The pre-constructed classification model is one of the key steps of the present invention, and therefore, in an alternative embodiment of the present invention, there is also provided a construction method of a classification model, comprising:
using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data; using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;
selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors; clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;
according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0033] Of course, the algorithm selected in the above-described construction method is merely an alternative to the embodiment of the present invention, and other suitable algorithms can be used for the construction of the model. But also does not limit the order between the above steps, only need to make the logical relationship with the normal algorithm steps can be.
[0034] With reference to Figure 2, a flow chart of a method of bus load forecasting according to another embodiment of the present invention is provided. The method of bus load forecasting comprising:
[0035] In step 201, obtaining and storing historical load data of the bus under test and load-related influencing factor data; wherein the influencing factors including factors such as meteorological information, day class and the like capable of affecting bus load.
[0036] In step 202, using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data; that is, preprocessing the historical load data of the bus under test, using the horizontal comparison method to process the historical load data of the bus under test to identify the abnormal data in load sequence and to correct the identified abnormal data.
[0037] In step 203, using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors; by determining the key influencing factors of the bus load, making the forecasting model has reasonable input dimension.
[0038] In step 204, clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm, distributing the initial cluster centroids evenly in the sample space, and classifying the historical load data with similar change characteristics into one class, to obtain several typical load patterns. Distributing the initial cluster centroids evenly in the sample space according to the preset K-means clustering algorithm improves the convergence speed and clustering accuracy of the algorithm.
[0039] In step 205, according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
[0040] In step 206, based on the result of clustering analysis of the historical load data of the bus under test, selecting the corresponding historical load data and influencing factor values for different typical load patterns, training the corresponding historical load data and influencing factor values to obtain different multiple regression forecasting models.
[0041] In step 207, inputting key influencing factor values of the bus under test of day under test as characteristic vectors into the classification model to obtain the class of day under test, selecting the corresponding multiple regression forecasting model according to the class of day under test to forecast the bus load under test.
[0042] It can be seen from the above embodiment that the bus load forecasting method provided by the present invention corrects the abnormal value in the historical load sequence of bus under test and determines the key influence factors of the bus load by the gray correlation projection method; uses the improved K-means algorithm to clustering analyze the corrected historical data, and uses the random forest algorithm to construct the classification model to establish the mapping relationship between the clustering results and key influencing factors; trains several multiple regression models for each class of load pattern; inputs the influencing factors of day under test into the random forest classification model to obtain the class of day under test, to select the matching regression model to complete the forecasting. The method of bus load forecasting not only improves the accuracy of bus load forecasting, but also meets the requirement of real-time forecasting, so that the forecasting result is more stable and reliable.
[0043] In some alternative embodiments of the present invention, the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises: cleaning the historical load data of the bus under test by the horizontal comparison method to identify and correct the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:
[ΐχά,Ο-Ι/ά,ΐ-Ι) [ L(d,t —1) ’ | L(d,t)-L(d-l,q) |
L(d,q) where gjj^gg rate of ^us |oaj un(jer test at time t of d day, and L(d,t) js the ^us load value under test at time t of d day;
whether the sampling point data is abnormal data depends on:
J a (d ,t)<aJJiax (d,t), the load at time t is normal value va(d,t)>a^iax (d,t),theloadat time t is abnormal value and n
max (d,t)
L(d — i, t) — L(d — i, t — 1) . 1 o max{-:-}, ϊ = 1,2, max{
L(d-i,t-l)
L(d - i, t) - L(d - i -1, q) L(d -i, q) }, i = l,2, ,n,t = 1,2,,...,q .,n,t = 0 where amax ;s tjje maximum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, Ifod — i,t — 1) js samppng point data at time (t-1) of (d-i) day, (q+1) sampling points per day;
correcting the identified abnormal data according to the following formula:
L(d,t) = Λ, L(d — 1, t) + d — 2,t) + ... + 2mL(d — m,t) < λ. =β(ϊ-βγ~\ e(o,i)
ΣΤ-1 +1 λ
where m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, — mO is the bus load value under test at time t of (d-m) day, that is, using the last m days of data as corrected data.
[0044] The whole process is carried out in chronological order, and only the load value at the previous time is compared. When the anomaly data is detected, it is corrected immediately and is used as the comparison value of the next time data to complete the correction of all historical load data.
[0045] In some alternative embodiments of the present invention, with reference to Figure 3, the step of using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises: using the gray correlation projection method to calculate the correlation degree between each influencing factor and the bus load. Classifying the calculated gray correlation projection values and selecting the M factors with high value as the key influencing factors. The specific calculation steps are as follows:
[0046] In step 301, first, determining a reference sequence and a comparison sequence, where the reference sequence is the corrected load data: — iNn, Νκ,···, Nm ) ; the comparison sequence is the influencing factors (such as meteorological data, day class): — iNi’Nmi 1 — 1,2,...,n .
obtaining the sequence matrix of the reference sequence and the comparison sequence:
Xo | <XO1 · | • *0? | |
= | ; · | ||
X„_ | Λ. · | • Xnmy |
where m represents that each sequence has m elements, n is the number of comparison sequence. Xo represents the reference sequence, that is, the load sequence; X; is i-th comparison sequence, that is, influencing factor sequence.
[0047] In step 302, normalizing the matrix X as follows:
. — min eu = j= 1,2,....m Xj max.
,. — mm.
j-1,2,...,m Xj
-(1-0.1) + 0.1 mm. , 7 x. x. max. . 7 x.
where J 9 is the minimum value of the row elements of 1J, and J « is the maximum value of the row elements of eX, the value of 1J is limited to 0.1 to 1.
The normalized matrix X is as follows:
e; 1J is the normalized value of the element χΗ of the matrix <e e A C01 · · · C0m where 0 and sequence.
ei9i = 1,2,...,n are normalized reference sequence and normalized comparison [0048] In step 303, calculating the correlation coefficient, the correlation coefficient -(li v ' of the k-th 15 element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:
min, min, I — e0, I +/?maxj max, I — e0, I 4„,(k) =--3 , ,J P, 3 , °3 , i = 1,2,...,n, j = 1,2,...,
1¾ -eok l+pmax.maxj lee -e0J I , min. min. I e-- en. I . , . . , , ,, where J J 1 is the minimum difference between the two levels, max-max , I e-— en, I . , . ,.-. , , , n . , ., .
j u is the maximum difference between two levels; f is the identification coefficient, preferably, the value of P is 0.5.
The gray correlation coefficient judgment matrix F is calculated as follows:
F =
X(D | ··· ZooO · | ·· Zoo (rn)> | ( 1 | • 1 ·· | ' 1 Ί | |
Z01(D | ·· £n(j) · | ·· Zoi(m) | ZoiQ) · | • Z01(j) · | • Zoi(m) | |
^oi(l) | ·· Zoi(m) | — | ZOi(D · | • ZOl(j) ·· | • Zoi(m) | |
Jond) | ·· ZonG) · | ·· Zon(m) y | Jo„(D ' | '· Zon(J) · | ·· ZOn(mb |
Λ (j) where js the correlation coefficient between the normalized reference sequence e0 and its own / (j) j-th element, and the value of >θθν J7 is 1.
[0049] In step 304, using the entropy weight method to give weights to column vectors of matrix E, except 5 for the first row elements, the formula is as follows:
E. =ln(m) ti
ΣΡίΛΡίι ln(m) i=1
ΣΧ ZZeij)’ j =1,2,...m l —Ew; =-j =l,2,...,m πι-ΣΕ, j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector
W = | ||||
[0050] In step 305, | a weighted gray correlation judgment matrix F is obtained from step 304: | |||
W1 | ·· Wj · | ·· Wm | ||
w/oi(l) · | ·· w^01(j) · | ·· wX(m) | ||
F' = | ||||
w/oiQ) · | ·· w^Oi(j) · | • wjoi(m) | ||
,w/on(!) · | ·· w^On(j) · | • wJOn(m)? | ||
wi4i(j) | ||||
where | J 77 Ul \ J z | is the weighted correlation coefficient between the sequence e; and the j-th element of |
e0[0051] In step 306, calculating the gray correlation projection value, the formula is as follows:
D.
m
Σ wj^oi( j) · j=!
ill! I 111
Σ» j=i j=i where D; is the correlation projection value between the comparison sequence X; and the bus load, D; is between 0 and 1, the closer the value is to 1, the greater correlation degree; the closer the value is to 0, the weaker the correlation degree. Compared with the gray correlation, gray correlation projection value can be more comprehensive to reflect the degree of similarity of the development trends of two objects.
[0052] In step 307, selecting M factors with the largest projection values as the key influencing factors, the values of M could be set on demand, and could classify all gray correlation projection values firstly before the selecting process, selecting the top of M factors from large to small as the key influencing factors, it is possible without classifying.
[0053] In some alternative embodiments of the present invention, the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:
using the corrected historical load data as the sample set, and using the improved K-means algorithm to cluster the load characteristics of bus. Specific calculation process is as follows:
[0054] In step 401, the selecting of the initial centroid. In the traditional algorithms, the initial clustering centroid is selected randomly, and the improved algorithm proposed by the present invention distributes the initial centroid evenly in the sample space with distance as the criterion, and effectively reduces the influence of the initial centroid on the clustering result. Setting the sample set as — iN’···’ N ’···’ xm , load sequence as X' iNo’···’ Nt’···’ Nn) , the initial centroid set as sample is selected randomly as the first centroid.
Cu={c[\c°...,c)
X’} . One [0055] In step 402, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:
^ = Σ(^-υθ) t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c° sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;
selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined. Optionally, k is 6.
[0056] In step 403, determining whether k centroids have been selected, if does, means that the selecting process has been completed and can proceed to the next step. If the number of centroids has not yet reached k, then continue the selecting process until reached.
[0057] In step 404, let obtained k centroids mutually correspond to different classes of load patterns separately; in other words, establishing the corresponding relationship, and the number of iterations this moment is denoted as 0.
[0058] In step 405, sample assignment or sample determination. Calculating the Euclidean distance of each sample to all centroids separately, selecting the load class corresponding to the centroid with the smallest Euclidean distance from the sample, assigning the sample to the load class. The calculation formula is as follows:
ά<Χφ=Σ<Χ-4) ’ J=1’2’ k,l =0,1,..,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration.
[0059] In step 406, centroid updating: Calculating centroid vectors of each class, updating the centroid of each class after all samples assigned; let the number of samples of class j be and the sample set be recalculating the centroid vector of class j as follows:
Cj = (CjO’Cjl’-’Cjt’---’Cjn) »
where (t+l)-th component is calculated as follows:
j =1,2, ,k [0060] In step 407, determining whether or not the termination condition is reached. In the present invention, in addition to selecting and setting the maximum number of iterations as the termination condition, it is also determined whether or not the algorithm can be terminated by defining the difference tolerance & . After the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:
018,=110,-0^11, j=l,2,...,k ! , .... . , c , . c , J J j ·' , DiSj represents the centroid distance interval ot class j alter the completion of 1-th iteration; each assignment updated, there are k calculations. Selecting the maximum valuemax{kfisj b J 1,2,..., k va|ue js |ess than the predefined difference tolerance, the algorithm terminates, otherwise proceeds to step 405 to continue the iteration updating.
[0061] This process classifies the bus historical load into load pattern of class k, the change trend of bus load in each load pattern is more obvious and more regular, which lays the foundation for constructing model and improving forecasting accuracy.
[0062] In step 408, obtaining k clustering results.
[0063] In a further embodiment of the present invention, the step of using the random forest algorithm to construct the classification model further comprises:
sampling with replacement s training sample sets i, 1 — Tz’···’sof S size randomly from the historical load data set S;
sampling N influencing factors randomly as the characteristic attribute of each training sample set according to M key influencing factors; wherein N<M;
training s sample sets TS; separately to generate the corresponding decision tree Tree;. Remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process. Except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) = 1-^ p*d i=l where d is the current node, ^’d ;s tjje proportion of target class i, k is the number of target class. The closer the Gini index is to 0, the higher the purity of the classification, the better the effect.
[0064] For each node, considering all cases where each characteristic attribute to node splitting, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:
Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node.
[0065] Continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis. When a test sample is input, each decision tree is used to classify the test sample, s classification results are obtained and the class with the highest proportion is selected as the class of test sample.
[0066] Combining the s decision trees in order to obtain the random forest classification model, thus to establish the mapping relationship between the influencing factors and the clustering results, and complete the determination of the class of day under test in real time, laying the foundation for the forecasting model matching.
[0067] In some alternative embodiments of the present invention, the step of obtaining different multiple regression forecasting models from the training further comprises:
[0068] Considering the clustering result of bus load and the real-time of short-term load forecasting, the present invention uses the multiple linear regression models for forecasting. To use bus load data of each class and key influencing factors as training samples to construct several forecasting models. Establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
[0069] Specifically, the i-th training sample in each class of sample set is X = i = l,2,...,m is ’ Yi) , where
Xi , m is the number of samples in each class of load sample set;
x x y is the input sequence of sample i, in is the value of the n-th influencing factor, is the true value of the bus load corresponding to the output.
The regression equation is:
f 00 = b0 +b^i + b2xl2 + · · · +bA where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, is the regression forecasting value.
[0070] The input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
X =
' 1 | X11 | X12 ' | • Xln | ||||
_Yi ’ | bo | ||||||
1 | X21 | ^2 ’ | • | ||||
, Y = | y2 | , B = | bl | ||||
1 | xmi | Xm2 · | • Xmn_ | _ Yn _ | _b„_ |
using the least squares method to determine the estimated parameter value of the regression equation:
bi (XTX)-1XTY inputting the key influencing factors at each time of day under test into the random forest classification model to obtain the class of day under test, to select the corresponding multiple regression forecasting model to complete the load forecasting.
[0071] The method of bus load forecasting of the present invention has the following effects: through the clustering analysis of the historical bus load, the establishment of statistical model and the model matching of the day under test, the change rule of the bus load is effectively mined, and more accurate forecasting results are obtained; meanwhile it is not necessary to train the model before every forecasting to further satisfy the real-time requirement of bus load forecasting.
[0072] In another embodiment of the present invention, a 110 kV bus active load of a city power grid company is used as a forecasting example.
[0073] In the first stage, the historical load data preprocessing section. The load value 24 hours of llOkV bus of a substation is selected as the forecasting object and the load data of one year before the forecasting day is used as training sample. The data format is shown in Table 1:
[0074] Table 1 Bus load data table
Date | The bus load value at each sampling time (Unit: MW) | |||||||
00:00 | 01:00 | 02:00 | 03:00 .. | .. 20:00 | 21:00 22:00 | 23:00 | ||
01/01 | 52.6 | 49.5 | 46.7 | 44.7 .. | .. 49.9 | 46.1 | 46.8 | 46.2 |
12/31 | 52.0 | 48.5 | 42.9 | 41.3 .. | .. 69.0 | 65.3 | 60.5 | 59.5 |
[0075] Each row of data in Table 1 represents a 24-dimensional load sample sequence. There may be several abnormal values in historical load data due to instability of the acquisition systems, namely, abnormal data, which will seriously affect the forecasting accuracy.
[0076] The horizontal comparison method is used to correct abnormal data. Sampling cycle set to 1 hour, a total of 24 data points per day, the change rate of the load at the adjacent time is sequentially calculated in chronological order:
a(d,t) ! L(d,t)-L(d,t~23) |
L(d,t-23) <! L(d,t)-L(d-1,23)! , L(d,23) t = 1,2, , t = 0 ,23 [0077] The maximum value of the change rate of load within 7 days before the sampling point is 15 calculated as the criterion of whether or not the sampling time data is an abnormal value. The judgment formula is as follows:
[0078] If the load data is abnormal value, then the data at the same time of last 3 days is corrected through the correction formula, the correction formula is as follows:
20 L(d,t) = d - l,t) + 2,L(d - 2,t) + 2,L(d - 3,t) _ θ . . , ’ A 0.26 an(|^3 θ·24 , the correction effect is the best.
[0080] In the second stage, the determination stage of key influencing factors. With reference to Figure 3, a specific implementation process of the gray correlation projection method is shown. Based on the meteorological data of the city where the city power grid company is located, the influencing factors are chosen as follows: maximum temperature (Tmax), minimum temperature (Tmin), real-time temperature (RT), Average wind speed (AW), relative humidity (RH), average precipitation (AP), day type (DT), and season type (ST), specific parameters as shown in Table 2:
[0081] Table 2 Influencing factors data
Sampling time | Influencing factors | |||||||
Tmax | Tmin | RT | AW | RH | AP | DT | ST | |
01/01 00:00 | 10 | -2 | 2 | 2.6 | 51 | 1.8 | 1 | 3 |
01/01 01:00 | 10 | -2 | 0 | 2.6 | 51 | 6.5 | 1 | 3 |
06/13 14:00 | 31 | 18 | 30 | 0.8 | 64 | 25.7 | 2 | 4 |
10/01 00:00 | 25 | 10 | 15 | 1.2 | 59 | 19.6 | 3 | 1 |
12/31 23:00 | 8 | -6 | -1 | 2.3 | 48 | 0.9 | 1 | 3 |
[0082] In Table 2, the column data corresponding to each influencing factor is the comparison sequence X;, and the load data at each sampling time is the reference sequence Xo. The day type is assigned {working day, weekend, holiday} = {1, 2, 3}, the season type is assigned {summer, winter, spring, autumn} = {4, 3, 2,1}, other influencing factors data is actually measured value.
[0083] The comparison sequence and the reference sequence are normalized by the formula, and the corresponding values are limited to 0.1 to 1 to obtain the normalized sequence matrix E:
<e e A C01 · · · C0m yem ''nm J [0084] Where nm represents the normalized value of element xnm in the sequence matrix X. The number of comparison sequence n=8, m=8760 is the number of elements per sequence, ie, 24 sampling points per day for 365 days.
[0085] The gray correlation coefficient between the comparison sequence and the reference sequence is obtained by the formula of the correlation coefficient, and the gray correlation coefficient judgment matrix F is obtained. The entropy weight method is used to give weights to column vectors of matrix E, except for the first row elements, and the weight vector (wx,W2,..., W8760) correlation judgment matrix F is obtained in combination with matrix F:
where is obtained. The gray
W1 · | WJ · | ' ' W8760 |
W1 ^01 (D · | ·· wA(J) · | - W8760^01 (876θ) |
·· w^Oi(j) ·· | · w876/0i(8760) | |
W^OnQ) ·· | • w^On(j) · | ·· w876(/0n(8760) |
4i(j) i is the correlation coefficient between the comparison sequence e; and the j-th element of
Wreference sequence e0. J is the weight of column vector j, where coefficient between the sequence e; and the j-th element of e0.
is the weighted correlation [0086] The gray correlation projection value D; between the reference sequence X; and the comparison sequence Xo is calculated as follows:
8760
D; =
Σ wj^oi( J) · j=l
8760 /8760 , 1 = 1,2,...,8 |Σ>ΑΟ>}24Σ>7 [0087] Through the above calculation, the gray correlation projection value of 8 influencing factors is obtained as shown in Table 3:
[0088] Table 3 the gray correlation projection values of influencing factors
Tmax | Tmin | RT | AW | RH | AP | DT | ST |
0.87 | 0.64 | 0.72 | 0.28 | 0.66 | 0.32 | 0.55 | 0.43 |
[0089] Average wind speed and average precipitation are weakly correlation factors, and the day type and season type are moderate correlation factors.
[0090] Classifying and selecting the maximum temperature, real-time temperature, relative humidity, minimum temperature, and day type as the key influencing factors for the trend of bus load under test.
[0091] In the third stage, the historical load clustering stage of the bus under test. With reference to Figure 4, a specific implementation process of the improved K-means algorithm is shown.
[0092] The determination of initial clustering centroid: There are a total of 365 sample sequences in the c° historical load sample set; one sample is selected randomly as the first initial centroid vector 1 , the c°
Euclidean distance d to 1 for each of the remaining samples is separately calculated as follows:
t=0
Y C where il represents the load value at time (t+1) in the i-th sample sequence, ll represents the value of the (t + l)-th element in the initial centroid vector of class 1.
[0093] The sample with the largest d is selected as the second initial centroid vector 2 , then the Euclidean c° distance to 2 is calculated separately for each of the remaining samples, then the sample with the c° largest d as 3 is selected, and so on until k centroids are determined. In the present embodiment, k = 6, 10 the bus load sequence is classified into 6 typical load patterns.
[0094] After 6 initial centroids are determined, the iterative algorithm is started and the samples are assigned. The Euclidean distances of each sample to all centroids are calculated, and each sample will be assigned to the class represented by the nearest centroid:
d(Xi,c‘) = E(N-c‘t) , j =1,2,...,6,1=0,1,...,1000 t=0
V, · where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration.
[0095] The updating of the centroid vector: After each sample assignment, the centroid of load sample of each class needs to be updated. If the number of samples of class j is mj , the sample set ts , and then c1 c1 the elements Jtin the centroid vector J are updated as:
Σ.
_
Jl
Hlj
Nt , j =1,2,
Determine whether the termination condition of algorithm is reached. In addition to the maximum number of iterations, define difference tolerance & to determine whether the algorithm terminates. After the updating of centroid, the distance before and after the updating of each class centroid vector is calculated:
Dis1; =11 C1; - C1;’1 j =1,2,...,6,1=1,2,...1000
Selecting the maximum valuemaxDiSj } va|ue js |ess difference tolerance^ — 0.01 the algorithm terminates, otherwise continuing the iteration updating.
[0096] The above clustering process classifies the bus historical load into 6 load patterns, and the clustering results are shown in Figure 5. Class 1 consists of 82 days, mainly on rest days, Class 2, Class 5 mainly consist of 112 and 89 working days; The difference between these two load values is mainly due to seasonal factors; Class 3 consists of 25 days, the load is much higher than other classes due to extreme weather; Class 6 consists of 31 days, the load curve fluctuates greatly.
[0097] In the fourth stage, the construction stage of random forest classification model. With reference to Figure 6, a specific implementation process of the random forest algorithm is shown.
' I ’O · _
Sampling with replacement 100 training sample sets 1 , 1 — CA---,sof S size randomly from the historical sample set S. The data of training sample set ί is shown in Table 4:
[0098] Table 4 training sample set
time | Tmax | RT | RH | Tmin | DT | Clustering result |
01/01 00:00 | 10 | 2 | 51 | -2 | 3 | Class 5 |
02/05 09:00 | 5 | 3 | 64 | -5 | 1 | Class 2 |
[0099] 3 influencing factors are sampled randomly as the characteristic attribute for each training sample set.
[0100] 100 sample sets TS; are trained separately to generate the corresponding decision tree Tree;. The characteristic attribute of the decision tree Tree; is remained unchanged throughout the forest growth process. Except for the leaf nodes, each decision tree uses the Gini index of the CART algorithm as the basis for node splitting:
Gini(d) =l-£p2d i=l where d is the current node, is the proportion of target class i. The closer the Gini index is to 0, the higher the purity of the classification, the better the effect.
[0101] For each node, consider all the cases of node splitting with 3 characteristic attributes, and select the attribute with the smallest Gini index as the splitting attribute a. The Gini index of current node d split by splitting attribute a is:
25 Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node.
[0102] Continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of 100 trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis. When a test sample is input, each decision tree is used to classify the test sample, 100 classification results are obtained and the class with the highest proportion is selected as the class of test sample.
[0103] Combine the 100 decision trees in order to obtain the random forest classification model, so as to establish the mapping relationship between the influencing factors and the clustering results. Figure 7 is a thumbnail view of the decision tree generation in the random forest classification model.
[0104] In the fifth stage, the training stage of the forecasting model. According to the clustering results, the bus load data and the key influencing factors in each load pattern are integrated into the training sample set, as shown in Table 5:
[0105] Table 5 training sample set
Sampling time | Sample sequence | Key Influencing factors | Corresponding load | ||||
Tmax | RT | RH | Tmtn | DT | |||
06/13 14:00 | XI | 31 | 30 | 64 | 18 | 2 | 54.2 |
10/07 19:00 | Xm | 26 | 20 | 58 | 9 | 3 | 63.6 |
[0106] Where m Is the number of samples in each class of sample set; sample sequence i = l,2,...,mwh_ % ; »
X;
m .Where is the influencing factor sequence of sample “u yi; is the true value of the bus load corresponding to the output.
[0107] Each class of load pattern needs to construct the forecasting model based on its own training sample set, and finally obtains 6 regression equations. The multiple regression equation is given by:
f (X) = b0 + + b2?q2 + · · · + b5 xl5 where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, ^'^X'^is the regression forecasting value. The regression constant terms and coefficients for each regression equation are calculated from equation (22). The results are shown in Table 6:
[0108] Table 6 regression coefficient
bo | bi | b2 | b3 | b4 | b5 | |
Equation 1 | 4.9 | 0.64 | 0.51 | 0.36 | 0.32 | 0.23 |
Equation 2 | -8.5 | 0.57 | 0.60 | 0.41 | 0.26 | 0.18 |
Equation 3 | 6.5 | 0.48 | 0.66 | 0.35 | 0.43 | 0.20 |
Equation 4 | -3.8 | 0.68 | 0.52 | 0.37 | 0.35 | 0.29 |
Equation 5 | 3.2 | 0.51 | 0.52 | 0.36 | 0.30 | 0.27 |
Equation 6 | 1.8 | 0.71 | 0.55 | 0.34 | 0.39 | 0.22 |
[0109] In the sixth stage, the stage of model matching and forecasting: The load data of the bus on June 21, 2016 is selected as the test sample, and the weather data and day type information of the day are as follows:
Time | RT | RH | Time | RT | RH | Time | RT | RH | Tmax | Tmin | DT |
00:00 | 23 | 52 | 08:00 | 24 | 53 | 16:00 | 30 | 37 | 30 | 18 | 1 |
01:00 | 22 | 56 | 09:00 | 26 | 48 | 17:00 | 30 | 40 | |||
02:00 | 21 | 59 | 10:00 | 28 | 44 | 18:00 | 29 | 43 | |||
03:00 | 20 | 65 | 11:00 | 29 | 35 | 19:00 | 28 | 46 | |||
04:00 | 20 | 64 | 12:00 | 30 | 36 | 20:00 | 26 | 51 | |||
05:00 | 19 | 68 | 13:00 | 30 | 37 | 21:00 | 25 | 55 | |||
06:00 | 18 | 70 | 14:00 | 30 | 38 | 22:00 | 24 | 57 | |||
07:00 | 21 | 62 | 15:00 | 30 | 37 | 23:00 | 23 | 58 | |||
0110] Then, the corresponding influencing : | 'actors ol | 24 hours of the c | ay are input into the random forest |
classification model separately, and result that the day under test belongs to load pattern of class 2 is obtained, and the model 2 (regression equation 2) is selected to complete the load forecasting, the results are shown in Figure 8. The maximum relative error of the forecasting result obtained by the method of the present invention is 2.89%, the minimum relative error is 0.32%, and the average relative error is only 1.29%; while the maximum relative error of the traditional multiple regression forecasting method is
3.06%, the minimum relative error is 0.28%, and the average relative error is 1.38%. Therefore, the accuracy of the forecasting method adopted in the present invention is obviously improved, and has wide application prospect.
[0111] It should be understood by those skilled in the art that the foregoing discussion of any embodiment is intended to be illustrative and not intended to suggest that the scope of the present disclosure, including the claims, be limited to such examples; and in the context of the present invention, the technical features of the various embodiments may also be combined, the steps may be implemented in any order, and there are many other variations of the various aspects of the present invention as described above, which are not provided in the details for the sake of brevity.
[0112] In addition, to simplify the description and discussion, and in order not to obscure the present invention, well-known power/ground connections to integrated circuit (IC) chips and other components may or may not be shown in the provided drawings. In addition, devices may be shown in block diagram form in order to avoid obscuring the present invention, and this also takes into account the fact that the details regarding the embodiments of these block diagram devices are highly dependent on the platform on which the present invention will be implemented (that is, these details should be fully within the purview of those skilled in the art). In the event that specific details (e.g., circuits) have been set forth to describe exemplary embodiments of the present invention, it will be apparent to those skilled in the art that the present invention may be implemented without these specific details or with variations in these specific details. Accordingly, these descriptions are to be regarded in an illustrative rather than a restrictive sense.
[0113] Although the present invention has been described in connection with specific embodiments thereof, many alternatives, modifications and variations of these embodiments will be apparent to those skilled in the art from the foregoing description. For example, other memory architectures (e.g., dynamic RAM (DRAM)) may use the discussed embodiments.
[0114] It is intended that the embodiments of the present invention cover all such alternatives, modifications and variations that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, equivalents, improvements, and the like within the spirit and principles of the present invention are intended to be embraced within the scope of the present invention.
Claims (10)
- What is claimed is:1. A method of bus load forecasting, comprising:inputting key influencing factor values of the bus under test as characteristic vectors into a pre-constructed classification model to obtain the class of day under test; the pre-constructed classification model is a result of clustering analysis based on the key influencing factors and the historical load data of the bus under test; establishing a mapping relationship between the key influencing factors and bus load;selecting a corresponding multiple regression forecasting model from the multiple regression forecasting models obtained from the pre-training according to the class of day under test; forecasting the bus load under test, wherein the multiple regression forecasting models obtained from pre-training are a result of clustering analysis based on the historical load data of the bus under test;selecting the corresponding historical load data and influencing factor values for different typical load patterns as a forecasting model obtained from sample data training.
- 2. The method of bus load forecasting according to claim 1, wherein the classification model is constructed by:using the horizontal comparison method to select abnormal data from the historical bus load data of the bus under test, and correcting the abnormal data according to a preset algorithm to obtain the corrected load data;using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data in combination with the corrected load data and the load-related influencing factors;selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors;clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm to obtain different typical load patterns;according to the key influencing factors and the result of clustering analysis, using the random forest algorithm to construct the classification model, and establishing the mapping relationship between the key influencing factors and various bus loads.
- 3. The method of bus load forecasting according to claim 2, wherein the step of selecting the abnormal data from the historical bus load data of the bus under test by the horizontal comparison method comprises:cleaning the historical load data of the bus under test by the horizontal comparison method to identify the abnormal data; wherein, the horizontal comparison method using the change rate of the adjacent point load as a criterion for determining whether or not abnormal data is present, the change rate of the load is calculated as follows:L(d.t)-L(d.t-1) t=1,L(d,t —1) |L(d.t)-L(d-l,q)|L(d,q) where tjje change rate of the bus load under test at time t of d day, and L(d,t) -s ^us load value under test at time t of d day;whether the sampling point data is abnormal data depends on:(d,t),theloadat time tis normal value ,a(d,t)>a1’jiax (d,t),theloadat time tis abnormal value andOL (d,t) i t)-Ud , = 1>2L(d — i,t -1) max ud-ro-ud-'-tq) ' L(d—i,q) ' ,n,t = 1,2,,...,q .,n,t = 010 where amax μ tjje maximum value of change rate of load at time t of d day during the first n days, i represents the number of days from the current d day, — ~ k> is the sampling point data at time (t-1) of (d-i) day, (q+1) sampling points per day; using the last m days of data to correct the abnormal data;comparing the load value at the current time with the load value at the previous time in chronological15 order, correcting the abnormal data once the abnormal data detected, and using the corrected data as the comparison value of load value at the next time, completing the load data correction.
- 4. The method of bus load forecasting according to claim 2, wherein the step of correcting the abnormal data by the preset algorithm comprises:20 correcting the selected abnormal data according to the following algorithm, the specific algorithm formula is:L(d,t) = 2jL(d — l,t) + 22L(d — 2,t) + ... + 2mL(d — m,t) < λ.=β(ΐ-βγ~\ e(o,i) mΣΛ=ι j=i λwhere m is the weight coefficient, which is used to represent the influencing degree of the load at time t of (d-m) day on the load at time t of d day; P is the smoothing coefficient, — ΠΊΑ ) js the bus load value under test at time t of (d-m) day.
- 5 5. The method of bus load forecasting according to claim 2, wherein the step of using the gray correlation projection method to calculate the correlation degree between the influencing factors and the historical load data and selecting a preset number of influencing factors corresponding to the correlation degree from large to small as the key influencing factors comprises:first, determining a reference sequence and a comparison sequence, where the reference sequence is the10 corrected load data: X° — i-Nn’Nk’···’Arm) ; the comparison sequence is the load-related influencing factors: X — > N2’---’ Nm) 1 — 1,2,...,n . obtaining the sequence matrix of the reference sequence and the comparison sequence:
Xo _ <XO1 · • X, = ; · X„_ Λι · where m represents that each sequence has m elements, and n is the number of comparison sequences; Xo 15 is the load sequence or reference sequence corresponding to the corrected load data, X; is the i-th comparison sequence;normalizing the matrix X as follows:ea = max i 1.2.....,,, Xii 111111 i 1.2....,mNj-(1-0.1) + 0.1 where mm and max are the minimum and maximum values of the row elements ofX··20 1J respectively, and 1J is the normalized value of the element Xy of the matrix X, the value of limited to 0.1 to 1;the normalized matrix X is as follows:is-Ό1Venl e0m e , nm / p £1=12 where 0 and ' ’ ’ ’ sequence;are normalized reference sequence and normalized comparison the correlation coefficient of the k-th element of normalized comparison sequence e; and normalized reference sequence e0 is calculated as follows:mirr min, I — e0, I +z?max, max, I — e0, I - 6,(k) =--j .j oj e-' J OJ , i = 1,2,...,n, j = 1,2,...,1¾ -eok l+pmax,max, le„ -e0J I , min. min. I e. - en. I . , . . , , ,, where J J 1 is the minimum difference between the two levels, max. max. I e.. — e„. I . , . ,... , 1 j u is the maximum dinerence between two levels; v 1S the identification coefficient;the gray correlation coefficient judgment matrix F is calculated as follows:F =
£oo(D · ·· ^oo(j) · ·· +oo<m) r i • 1 ·· ' 1 Ί ^oi CD · ^oi(j) · ·· £oi (m) ^oid) · • ^oi(j) · • ^01 (m) CD — ^(1) · £o„(D · ·· £o„(j) · ·· ^on(m) y Jo„(D · ·· £o„(j) · / (i) where >θθv J7 is the correlation coefficient between the normalized reference sequence e0 and its own / (i) j-th element, and the value of -(K) J 7 is I;using the entropy weight method to give weights to column vectors of matrix E, except for the first row elements, the formula is as follows:E: =--— ΣΡϋ1ηΡίι =—Γ7—Σ<Αι /ΣΧ)111^ ZZeiP’ j =l’2,...m J ln(m) y 1-E Wj = ln(m) i=1 J—, j =l,2,...,m m_EEi j=l where Ej is the information entropy of the data of the j -th column, Py is the proportion of the value of the element ey in the column elements, Wj is the weight of the j-th column data, and the weight vector W = (w1,...,wj,...,wm) the weighted gray correlation coefficient judgment matrix is as follows:W1 w/oiQ) ... Wj · ··· w^01(j) · ·· Wm ·· wX(m) w/oib) ' ··· w^Oi(j) ·· '· wjoi(m) W/onQ) · ·· w^On(j) · ·· wJOn(m) Wj^oi(j) is the weighted correlation cot where e(b calculating the gray correlation projection value, the formula is as follows:D.EWi^( j) · Wii i=lE(wA(j))2^£(wi)2 where D; is the correlation projection value between the comparison sequence X; and the bus load under test, D; is between 0 and 1, the closer the value is to 1, the greater correlation degree; the closer the value is to 0, the weaker the correlation degree;selecting the M factors which are the top of the projection values from large to small as the key 10 influencing factors.6. The method of bus load forecasting according to claim 2, wherein the step of clustering analyzing the historical load data of bus under test according to the preset K-means clustering algorithm comprises:using the corrected load data as a sample set, setting the sample set as — ίΚ’···’ K’···’ xm} , load x=fx x x ) C°=ic°c° c° c°) sequence as ' ΐΛο’-’ it’···’ in J, the initial centroid set as 1 ’’ 1' G’ k J, selecting one15 sample randomly as the first centroid, calculating the Euclidean distance to the first centroid separately for each of the remaining samples:t=0 where m is the number of load sequences in the sample set, n represents that each load sequence has (n+1) data points, k is the number of centroids; X'1 represents the load value at time (t+1) in the i-th load c°20 sequence, 11 represents the value of the (t + l)-th element in the first centroid vector;selecting the sample with the largest Euclidean distance as the second centroid, then calculating the Euclidean distance to the second centroid separately for each of the remaining samples, then selecting the sample with the largest Euclidean distance as the third centroid, and so on until k centroids are determined;calculating the Euclidean distance of each sample to all centroids separately, selecting the centroid with the smallest Euclidean distance from the current sample as the class to which the current sample belongs:n 2 ά<Λ,φ = Σ<χ-φ , j =1,2,...,k,l =0,1,...,h t=0 where 1 is the number of iterations, x; is the i-th sample, and J is the centroid vector of samples of class j before the (l+l)-th iteration;updating the centroid of each class after all samples classified; let the number of samples of class j be mj and the sample set be , recalculating the centroid vector of class j as follows:Cj = (CjO’Cjl’-’Cjt’---’Cjn) where calculation formula of (t+l)-th component is:Σ.sAt j =1,2, ,k determining whether or not the preset termination condition is reached, continuing the iteration calculation if not reached, until the historical load of the bus under test is classified as a typical load pattern of class k. - 7. The method of bus load forecasting according to claim 6, wherein the step of determining whether or not the preset termination condition is reached comprises:after the completion of each iteration, calculating the distance interval between the various centroids before and after updating according to the following formula:Dis1. =11 c1. — c1.-1 II, j=l,2,...,k ! , .... . , £ , . £ , J J j ·' , DiSj represents the centroid distance interval of class j after the completion of 1-th iteration;selecting the maximum value max{Dis. }, maxjmum va|ue js |ess than the difference tolerance , the algorithm terminates, otherwise reclassifying the samples to continue the iteration updating.
- 8. The method of bus load forecasting according to claim 2, wherein the step of using the random forest algorithm to construct the classification model comprises:' I 'O · _ -| sampling with replacement s training sample sets 1 , 1 — TA···,s of the same size randomly from the historical load data sets;sampling N influencing factors randomly as the characteristic attribute of each training sample set; wherein N<M;training s sample sets TS; separately to generate the corresponding decision tree Tree;; remaining the characteristic attribute of the decision tree unchanged throughout the forest growth process;except for the leaf nodes, each decision tree using the Gini index of the CART algorithm as the basis for node splitting:Gini(d) = 1-£r 'i,d i=l for each node, selecting the attribute with the smallest Gini index as the splitting attribute a; the Gini index of current node d split by splitting attribute a is:Gini(a,d) = pLGini(dL) + pRGini(dR) where dL and dR are the left and right child nodes of node d, and PL and PR are the proportion of the left and right child nodes in the parent node;continuing to split from top to bottom according to the above rules, until all nodes are split or marked as leaf nodes, that is, the decision tree growth is complete, there are a total of s trees, none of the trees needs pruning operation, leaf nodes of each tree correspond to some clustering result of clustering analysis; combining the s decision trees in order to obtain the random forest classification model;when a test sample input, using each decision tree to classify the test sample, obtaining s classification results and selecting the class with the highest proportion as the class of test sample.
- 9. The method of bus load forecasting according to claim 1, comprising establishing the multiple regression model by the multiple linear regression method, and determining the regression parameters by the least square method.
- 10. The method of bus load forecasting according to claim 9, wherein the regression equation is:f 00 = b0 +b^i +b2^2 + · · · +bA where b0 is the regression constant term, bpb2,...,bn -s regressjon coefficient, Un) is the regression forecasting value; the i-th training sample in each class of sample set is^1 >x,, Y,where ^={^1,^2,...,xj i = l,2,...,m , m is the number of samples in each class of load sample set; i x y.', in is the value of the n-th influencing factor, J1 is the true value is the input sequence of sample 1, in ii of the bus load corresponding to the output;the input matrix X, the output matrix Y, and the coefficient matrix B are as follows:
X11 X12 ·' • Xln ^2 ’ ' '· xmi Xm2 · ·· Xmn _Yi ’ ’V , Y = y2 , B = bl _ Yn _ A_ using the least squares method to determine the estimated parameter value of the regression equation:(XTX)-1XTYDRAWINGSInput key influencing factor values oflhe bus under test as eharaetcristie vectors into a pre-eonstrueted classification model to obtain the class of day under testSelect a corresponding multiple regression forecasting model front the multiple regression forecasting models obtained from the pre-training according to the class of the day under test and forecast the bus load under test101102FIG.lStart201Obtain and store historical load data of the bus under test and load-related influencingUse the horizontal comparison method to correct abnormal data in the historical load202Use the gray correlation projection method to determine key influencing factors203Clustering analyzing the historical load data of bus under test according to the preset204Use the random forest algorithm to construct the classification model, and establish205Select the corresponding historical load data and influencing factor values for different t\picul load patterns, and train them to obtain different multiple regression forecasting models206Input key influencing factor values of the bus of day under test into the classification ni«.»del to obtain the class of day uniler test, ;in<.l select the corresponding multiple..................207EndFIG.2 ίStart301Use the corrected historical load data as a reference sequence, and use the influencing factors as a comparison sequence302Normalize the elements of the reference sequence and the comparison sequence to obtain ; the normalized sequence matrix ECalculate the correlation coefficient between the reference sequence and corresponding i element of each comparison sequence to obtain the gray correlation coefficient judgment ' matrix F- 303304Use the entropy weight method to give weights to column vectors of matrix E305Obtain a weighted gray correlation judgment matrix from the matrix F and weight vector [ calculation306Calculate the gray correlation projection value of each influencing factor on historical loadSelect preset number of factors with the largest projection values as the key influencing factors- 307EndFIG.3401Start402- 403404Obtain 6 initial clustering centroids, each clustering centroid correspondsDetermine whether 1 is greater than a preset number threshold or whether the maximum interval before and after each class of centroid updating is less than the difference tolerance406408EndFIG.4FIG.5 sample set SSample with replacement training sample set TS1Sample with replacement training sample set TS100Sample 3 influencing factors randomly as the characteristic attributeSample 3 influencing factors randomly as the characteristic attribute calculate Gini index 'or current each nodeMark all child nodes that belong to the same class as leaf nodes (except for the leaf nodes), and select the splitting^ ttrihnteUse the splitting attribute to binary split the current node into sub-nodesTreel generate decision tree TreelMark all child nodes that belong to the same class as leaf nodes calculate Gini index for current each node (except for the leaf nodes), and select the spl itt ing^at tributeUse the splitting attribute to binary split the current node into sub-nodes random forest classification modelFIG.6TreelTreelOOFIG.7FIG.8
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2016108123390 | 2016-09-09 | ||
CN201610812339.0A CN106485262B (en) | 2016-09-09 | 2016-09-09 | Bus load prediction method |
PCT/CN2016/108051 WO2018045642A1 (en) | 2016-09-09 | 2016-11-30 | A bus bar load forecasting method |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2016325186A1 true AU2016325186A1 (en) | 2018-03-29 |
Family
ID=58273572
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2016325186A Abandoned AU2016325186A1 (en) | 2016-09-09 | 2016-11-30 | Bus load forecasting method |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN106485262B (en) |
AU (1) | AU2016325186A1 (en) |
WO (1) | WO2018045642A1 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109345011A (en) * | 2018-09-19 | 2019-02-15 | 中冶赛迪重庆信息技术有限公司 | A kind of Air-conditioning Load Prediction method and system returning forest based on depth |
CN109947079A (en) * | 2019-03-20 | 2019-06-28 | 阿里巴巴集团控股有限公司 | Region method for detecting abnormality and edge calculations equipment based on edge calculations |
CN110826774A (en) * | 2019-10-18 | 2020-02-21 | 广州供电局有限公司 | Bus load prediction method and device, computer equipment and storage medium |
CN111105098A (en) * | 2019-12-25 | 2020-05-05 | 北京华电天仁电力控制技术有限公司 | Single user algorithm self-matching load prediction method and system |
CN112289031A (en) * | 2020-11-03 | 2021-01-29 | 蚌埠学院 | Method and device for detecting and alarming abnormal conditions in bus driving process |
CN114819380A (en) * | 2022-05-12 | 2022-07-29 | 福州大学 | Power grid bus load prediction method based on model fusion |
CN114819380B (en) * | 2022-05-12 | 2024-06-28 | 福州大学 | Power grid bus load prediction method based on model fusion |
Families Citing this family (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107256415B (en) * | 2017-08-04 | 2020-12-11 | 国网经济技术研究院有限公司 | Calculation method and calculation system for operation mode scene of power system |
CN107800140B (en) * | 2017-10-18 | 2021-08-13 | 天津大学 | Large user power supply access decision method considering load characteristics |
CN108334900B (en) * | 2018-01-29 | 2021-08-13 | 上海电气分布式能源科技有限公司 | Generation method and system of classification model of power battery, and classification method and system |
CN108539738B (en) * | 2018-05-10 | 2020-04-21 | 国网山东省电力公司电力科学研究院 | Short-term load prediction method based on gradient lifting decision tree |
CN110766032A (en) * | 2018-07-27 | 2020-02-07 | 国网江西省电力有限公司九江供电分公司 | Power distribution network data clustering integration method based on hierarchical progressive strategy |
CN109214458B (en) * | 2018-09-19 | 2021-08-13 | 合肥工业大学 | Urban load quantification method based on historical data |
CN109597967B (en) * | 2018-11-20 | 2023-08-25 | 江苏云上电力科技有限公司 | Power system distribution substation load data anomaly detection and repair method |
CN109636010B (en) * | 2018-11-23 | 2022-12-02 | 国网湖北省电力有限公司 | Provincial power grid short-term load prediction method and system based on correlation factor matrix |
CN109636035A (en) * | 2018-12-12 | 2019-04-16 | 北京天诚同创电气有限公司 | Load forecasting model creation method and device, Methods of electric load forecasting and device |
CN111461378A (en) * | 2019-01-18 | 2020-07-28 | 电力规划总院有限公司 | Power grid load prediction method and device |
CN109886465B (en) * | 2019-01-20 | 2022-03-18 | 东北电力大学 | Power distribution network load prediction method based on intelligent electric meter user cluster analysis |
CN110175166B (en) * | 2019-04-18 | 2023-04-07 | 红云红河烟草(集团)有限责任公司 | Method for confirming quality abnormity of tobacco wrapping workshop |
CN110570025A (en) * | 2019-08-20 | 2019-12-13 | 精硕科技(北京)股份有限公司 | prediction method, device and equipment for real reading rate of WeChat seal |
CN110489893B (en) * | 2019-08-23 | 2023-02-03 | 国网山东省电力公司莱芜供电公司 | Variable weight-based bus load prediction method and system |
CN110598125A (en) * | 2019-08-26 | 2019-12-20 | 精硕科技(北京)股份有限公司 | Method and device for evaluating key opinion leader investment and storage medium |
CN110648248B (en) * | 2019-09-05 | 2023-04-07 | 广东电网有限责任公司 | Control method, device and equipment for power station |
CN110689195A (en) * | 2019-09-26 | 2020-01-14 | 云南电网有限责任公司电力科学研究院 | Power daily load prediction method |
CN110807508B (en) * | 2019-10-31 | 2023-06-09 | 国网辽宁省电力有限公司经济技术研究院 | Bus peak load prediction method considering complex weather influence |
CN110852628B (en) * | 2019-11-13 | 2023-09-12 | 国网江西省电力有限公司经济技术研究院 | Rural medium-long term load prediction method considering development mode influence |
CN112862142A (en) * | 2019-11-28 | 2021-05-28 | 新奥数能科技有限公司 | Load and price prediction and correction method |
CN112882822B (en) * | 2019-11-29 | 2024-03-01 | 阿里巴巴集团控股有限公司 | Method, apparatus, device and storage medium for generating load prediction model |
CN111210059A (en) * | 2019-12-26 | 2020-05-29 | 国网北京市电力公司 | Bus daily maximum load processing method and device |
CN111476397B (en) * | 2020-03-05 | 2022-10-25 | 国电南瑞科技股份有限公司 | Bus load prediction method and system applied to medium and long term electric quantity safety check |
CN111505406B (en) * | 2020-03-16 | 2022-03-01 | 剑科云智(深圳)科技有限公司 | Power distribution cabinet and wire monitoring method |
CN111581883B (en) * | 2020-05-09 | 2022-09-23 | 国网上海市电力公司 | Method for calculating and predicting load on automation device |
CN111626360B (en) * | 2020-05-28 | 2023-11-17 | 新奥新智科技有限公司 | Method, apparatus, device and storage medium for detecting boiler fault type |
CN112070307B (en) * | 2020-09-10 | 2023-01-20 | 云南电网有限责任公司电力科学研究院 | Method and device for predicting energy source load in region |
CN112330009B (en) * | 2020-11-02 | 2023-09-01 | 广东电网有限责任公司 | Power grid load prediction method |
CN112365039B (en) * | 2020-11-03 | 2023-11-24 | 国网冀北电力有限公司计量中心 | Method and system for predicting monthly electricity consumption based on longitudinal and transverse correlation |
CN112327046B (en) * | 2020-11-09 | 2021-09-07 | 北华航天工业学院 | Non-invasive load monitoring method based on fuzzy clustering and support vector regression |
CN112446509B (en) * | 2020-11-10 | 2023-05-26 | 中国电子科技集团公司第三十八研究所 | Prediction maintenance method for complex electronic equipment |
CN112487612B (en) * | 2020-11-12 | 2022-08-30 | 云南电网有限责任公司 | Bus load combination prediction method and system based on distribution factor and information entropy |
CN112529262A (en) * | 2020-11-27 | 2021-03-19 | 北京京能高安屯燃气热电有限责任公司 | Short-term power prediction method, device, computer equipment and storage medium |
CN112633738A (en) * | 2020-12-30 | 2021-04-09 | 安徽广志电气有限公司 | Distribution control cabinet terminal load distribution method |
CN113283774A (en) * | 2021-06-07 | 2021-08-20 | 润电能源科学技术有限公司 | Deep peak regulation method and device for heating unit, electronic equipment and storage medium |
CN113393028B (en) * | 2021-06-10 | 2022-08-19 | 国网江苏省电力有限公司南通供电分公司 | Load prediction method based on data mining technology |
CN113781254A (en) * | 2021-08-04 | 2021-12-10 | 广西电网有限责任公司 | Abnormal electricity consumption detection algorithm based on weekly electricity consumption correlation coefficient |
CN113689053B (en) * | 2021-09-09 | 2024-03-29 | 国网安徽省电力有限公司电力科学研究院 | Strong convection weather overhead line power failure prediction method based on random forest |
CN113782187B (en) * | 2021-09-10 | 2023-06-27 | 深圳平安智慧医健科技有限公司 | Index data processing method, related equipment and medium |
CN113884807B (en) * | 2021-09-24 | 2023-10-20 | 广西电网有限责任公司电力科学研究院 | Power distribution network fault prediction method based on random forest and multi-layer architecture clustering |
CN113779112A (en) * | 2021-09-27 | 2021-12-10 | 南京乐帆电气科技有限公司 | Electric energy quality analysis system based on spatial information and big data mapping algorithm |
CN113888356A (en) * | 2021-10-19 | 2022-01-04 | 山东大学 | Building energy consumption mode analysis and judgment method and system |
CN114692386B (en) * | 2022-01-26 | 2024-02-06 | 天津大学 | Operation state abnormity monitoring method for industrial bus |
CN114739977A (en) * | 2022-04-13 | 2022-07-12 | 重庆大学 | Method and system for extracting oil paper insulation aging spectral characteristics based on random forest method |
CN115085196B (en) * | 2022-08-19 | 2022-12-23 | 国网信息通信产业集团有限公司 | Power load predicted value determination method, device, equipment and computer readable medium |
CN117977587B (en) * | 2024-04-02 | 2024-06-07 | 南京鼎研电力科技有限公司 | Power load prediction system and method based on deep neural network |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9235208B2 (en) * | 2011-07-19 | 2016-01-12 | GE Intelligent Platforms, Inc | System of sequential kernel regression modeling for forecasting financial data |
CN103106329B (en) * | 2012-11-19 | 2016-01-13 | 华北电力大学 | A kind of training sample constructed in groups method for SVR short-term load forecasting |
CN104123595B (en) * | 2014-07-22 | 2018-09-07 | 国家电网公司 | A kind of distribution network load prediction technique and system |
CN104463445A (en) * | 2014-11-27 | 2015-03-25 | 国网浙江省电力公司嘉兴供电公司 | Load forecasting method based on experience day local correction multipoint extrapolation method |
CN104881706B (en) * | 2014-12-31 | 2018-05-25 | 天津弘源慧能科技有限公司 | A kind of power-system short-term load forecasting method based on big data technology |
CN105303262A (en) * | 2015-11-12 | 2016-02-03 | 河海大学 | Short period load prediction method based on kernel principle component analysis and random forest |
CN105631532A (en) * | 2015-12-07 | 2016-06-01 | 江苏省电力公司检修分公司 | Power system load prediction method using fuzzy decision-based neural network model |
CN105469123A (en) * | 2015-12-30 | 2016-04-06 | 华东理工大学 | Missing data completion method based on k plane regression |
CN105631483B (en) * | 2016-03-08 | 2019-03-12 | 国家电网公司 | A kind of short-term electro-load forecast method and device |
CN105608512A (en) * | 2016-03-24 | 2016-05-25 | 东南大学 | Short-term load forecasting method |
-
2016
- 2016-09-09 CN CN201610812339.0A patent/CN106485262B/en active Active
- 2016-11-30 WO PCT/CN2016/108051 patent/WO2018045642A1/en active Application Filing
- 2016-11-30 AU AU2016325186A patent/AU2016325186A1/en not_active Abandoned
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109345011A (en) * | 2018-09-19 | 2019-02-15 | 中冶赛迪重庆信息技术有限公司 | A kind of Air-conditioning Load Prediction method and system returning forest based on depth |
CN109947079A (en) * | 2019-03-20 | 2019-06-28 | 阿里巴巴集团控股有限公司 | Region method for detecting abnormality and edge calculations equipment based on edge calculations |
CN110826774A (en) * | 2019-10-18 | 2020-02-21 | 广州供电局有限公司 | Bus load prediction method and device, computer equipment and storage medium |
CN110826774B (en) * | 2019-10-18 | 2022-03-22 | 广东电网有限责任公司广州供电局 | Bus load prediction method and device, computer equipment and storage medium |
CN111105098A (en) * | 2019-12-25 | 2020-05-05 | 北京华电天仁电力控制技术有限公司 | Single user algorithm self-matching load prediction method and system |
CN111105098B (en) * | 2019-12-25 | 2023-11-03 | 国能信控互联技术有限公司 | Load prediction method and system for self-matching of single user algorithm |
CN112289031A (en) * | 2020-11-03 | 2021-01-29 | 蚌埠学院 | Method and device for detecting and alarming abnormal conditions in bus driving process |
CN114819380A (en) * | 2022-05-12 | 2022-07-29 | 福州大学 | Power grid bus load prediction method based on model fusion |
CN114819380B (en) * | 2022-05-12 | 2024-06-28 | 福州大学 | Power grid bus load prediction method based on model fusion |
Also Published As
Publication number | Publication date |
---|---|
WO2018045642A1 (en) | 2018-03-15 |
CN106485262B (en) | 2020-02-07 |
CN106485262A (en) | 2017-03-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2016325186A1 (en) | Bus load forecasting method | |
CN107516170B (en) | Difference self-healing control method based on equipment failure probability and power grid operation risk | |
CN114792156B (en) | Photovoltaic output power prediction method and system based on curve characteristic index clustering | |
CN106327357B (en) | A kind of load identification method based on improvement probabilistic neural network | |
CN111091233A (en) | Wind power plant short-term wind power prediction modeling method based on wavelet analysis and multi-model AdaBoost depth network | |
CN108898251A (en) | Consider the marine wind electric field power forecasting method of meteorological similitude and power swing | |
CN109086928A (en) | Photovoltaic plant realtime power prediction technique based on SAGA-FCM-LSSVM model | |
CN111860692B (en) | Abnormal data detection method based on K-media in Internet of things environment | |
CN110717610A (en) | Wind power prediction method based on data mining | |
CN111461921B (en) | Load modeling typical user database updating method based on machine learning | |
CN110942205A (en) | Short-term photovoltaic power generation power prediction method based on HIMVO-SVM | |
CN115907195A (en) | Photovoltaic power generation power prediction method, system, electronic device and medium | |
Peng et al. | A very short term wind power prediction approach based on Multilayer Restricted Boltzmann Machine | |
CN110674864A (en) | Wind power abnormal data identification method with synchronous phasor measurement device | |
CN114611757A (en) | Electric power system short-term load prediction method based on genetic algorithm and improved depth residual error network | |
CN113095547A (en) | Short-term wind power prediction method based on GRA-LSTM-ICE model | |
CN113270898A (en) | Distributed photovoltaic optimization configuration method based on two-stage planning | |
CN112115984A (en) | Tea garden abnormal data correction method and system based on deep learning and storage medium | |
CN110349050B (en) | Intelligent electricity stealing criterion method and device based on power grid parameter key feature extraction | |
CN112734073A (en) | Photovoltaic power generation short-term prediction method based on long and short-term memory network | |
CN114818827A (en) | Non-invasive load decomposition method based on seq2point network | |
CN114881176A (en) | Non-invasive load identification method based on self-adaptive optimization random forest | |
CN113780643A (en) | Photovoltaic power station short-term output prediction method based on case reasoning | |
CN109543976A (en) | A kind of Wind turbines harmonic emissions modal identification method using gauss hybrid models | |
Hanmin et al. | An incremental elm method for hourly load forecasting |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MK5 | Application lapsed section 142(2)(e) - patent request and compl. specification not accepted |