CN112163682B

CN112163682B - Power dispatching automation system fault tracing method based on information difference graph model

Info

Publication number: CN112163682B
Application number: CN202011118535.0A
Authority: CN
Inventors: 任昺; 高欣; 贾欣; 李康生; 刘治宇
Original assignee: Beijing University of Posts and Telecommunications
Current assignee: Beijing University of Posts and Telecommunications
Priority date: 2020-10-19
Filing date: 2020-10-19
Publication date: 2022-05-17
Anticipated expiration: 2040-10-19
Also published as: CN112163682A

Abstract

The embodiment of the invention provides a power dispatching automation system fault tracing method based on an information difference graph model, which comprises the following steps: selecting historical data before and after alarming of the power dispatching automation system, obtaining a clustering center through a k-means algorithm, taking the clustering center as an endpoint of interval division, and taking the mean value of each interval as a discretization result of continuous characteristics; calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the components, establishing an information correlation matrix with or without an alarm section, measuring the difference degree before and after the alarm through the change rate of the information correlation matrix, and obtaining an information difference matrix by adopting a normalization technology; extracting the characteristics with high alarm information change of the power dispatching automation system and the interactive information among the characteristics, further constructing an information difference graph model combining a digraph and node self-information, and fitting fault degree indexes to perform fault degree sequencing. According to the technical scheme provided by the embodiment of the invention, the performance of tracing the fault of the power dispatching automation system is improved.

Description

Power dispatching automation system fault tracing method based on information difference graph model

[ technical field ] A method for producing a semiconductor device

The invention relates to a fault tracing method for solving unknown system topological relation in the field of fault positioning, in particular to a power dispatching automation system fault tracing method based on an information difference graph model.

[ background of the invention ]

With the continuous maturity of intelligent technology and network technology, the power dispatching automation system is a complex system integrating computing, communication and physical environments as a whole of computing and physical processes. The system generally comprises a plurality of components such as a server, a storage, a network device, application software and the like, and once a component fails, the association relationship of the component affects other components, so that the whole system is abnormal or fluctuated. Due to the fact that the topological relation among the components is unknown and complex, fault tracing becomes extremely difficult, and early tracing has important significance for guaranteeing safe and stable operation of the power dispatching automation system. Existing fault tracing methods mainly include rule-based and model-based. The rule-based fault tracing method relies heavily on expert experience, and operation and maintenance personnel are required to clearly master logical topological relations and fault cases. The fault tracing method based on modeling mainly comprises undirected graph models, directed graph models, fault trees, invariant networks and other models, wherein the undirected graph models have the problem that the fault propagation direction cannot be determined; the directed graph model is established in a complex system logic relation, once a part of components are increased or reduced, relevant knowledge in the professional field is required to be modified and updated, and maintenance cost is greatly increased; the fault tree needs the participation of experts in different fields, can be correctly constructed by means of detailed system knowledge, cannot meet the requirements of a large-scale system, neglects certain cause and effect relationships due to subjective factors of expert experience, and finally limits the deployment of the fault tree; the invariant network model is modeled in a data-driven mode, so that the problem of serious dependence on prior knowledge is greatly solved, but a huge fault candidate set needs to be trained to backtrack the root-cause fault component.

[ summary of the invention ]

In view of this, the embodiment of the present invention provides a power dispatching automation system fault tracing method based on an information difference graph model, and under the condition that a logical topological relation is ambiguous, a fault source can be effectively located through information transfer in an information model simulation actual system, and the performance of fault tracing is improved.

The embodiment of the invention provides a power dispatching automation system fault tracing method based on an information difference graph model, which comprises the following steps:

selecting historical data before and after alarming of the power dispatching automation system, obtaining a clustering center through a k-means algorithm, taking the clustering center as an endpoint of interval division, and taking the mean value of each interval as a discretization result of continuous characteristics;

calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the components, establishing an information correlation matrix with or without an alarm section, measuring the difference degree before and after the alarm through the change rate of the information correlation matrix, and obtaining an information difference matrix by adopting a normalization technology;

extracting the characteristics with high alarm information change of the power dispatching automation system and the interactive information among the characteristics, further constructing an information difference graph model combining a digraph and node self-information, and fitting fault degree indexes to perform fault degree sequencing.

In the method, the discretization of the continuous features is adopted, the historical data before and after the alarm of the power dispatching automation system is selected, the clustering center is obtained through a k-means algorithm and is used as an endpoint of interval division, and the method that the mean value of each interval is used as the discretization result of the continuous features comprises the following steps: collecting resource occupation data of CPUs (central processing units), memories, disks, networks and processes of all servers in the power dispatching automation system, wherein the resource occupation conditions of each server comprise IO (input/output) read-write conditions, utilization rates, collision rates and waiting time, taking time sequences of the characteristics as input of a tracing method, and assuming that values of a certain characteristic are distributed in [ a, b ]]And (4) interval, removing the duplication of all values of the characteristic to obtain the total number num, and setting the number k of the centroids of the clusters as

The sum of the squared errors of the centroid and the sample points is SSE, the relationship graph of SSE and k is the shape of one elbow, the k value of the elbow position is selected as the optimal clustering number,

wherein, C_iIs the ith cluster, and poi is C_iSample point of (1), m_iIs C_iK centroids of { m ] are obtained according to a k-means algorithm₁,m₂,...,m_k}(m₁＜m₂＜...＜m_k) The value of the feature is divided into k +1 intervals [ a, m₁],[m₁,m₂],…,[m_k-1,m_k],[m_k,b]And averaging the characteristic values in each interval to obtain a discretization result of the interval.

In the method, the information entropy of the components of the power dispatching automation system and the transfer entropy among the components are calculated, an information correlation matrix with or without an alarm section is established, and the information correlation matrix is used for calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the componentsThe change rate of the matrix measures the difference degree before and after the alarm, and the method for obtaining the information difference matrix by adopting the normalization technology comprises the following steps: discretizing resource occupation data of CPUs, memories, disks, networks and processes of all servers of the power dispatching automation system, collecting discretization characteristics of IO read-write conditions, utilization rates, collision rates and waiting time of the CPUs, the memories, the disks, the networks and the processes to obtain N characteristic time sequences { S }₁,S₂,...,S_NCalculating self-information entropy of each time series { H }₁,H₂,...,H_NH (S) is the self-entropy of time series S:

wherein x represents each time point in the characteristic time sequence S, p (x) represents the output probability of x in the time sequence S, and alpha_xAll possible values of x in the characteristic time series S are represented, and the transfer entropy { T ] of every two characteristic time series is calculated_1→2,T_1→3,...,T_1→N,T_2→1,T_2→3,...,T_2→N,...,T_N→N-1The mutual information of any two time series is defined as the measure of the amount of information shared by two variables, mutual information entropy I (S)_I；S_J) For measuring time series S_IAnd S_JAmount of shared information of (2):

wherein S is_IRepresenting the I-th characteristic time series, S_JRepresents the J-th characteristic time series, and x and y represent the characteristic time series S_IAnd S_JP (x, y) represents the probability of joint distribution, p (x | y) represents the conditional probability, α_xAnd alpha_yRepresenting a characteristic time series S_IAnd S_JAll possible values of x and y are further expanded into transfer entropy on the basis of mutual information entropy, and the transfer entropy is measuredDivide by S in calculation_IIn addition to the information itself, S_JAdditional information is also provided to predict S_I(t+1)Setting the transfer entropy T_J→IFor a characteristic time series S_JTo S_IThe measure of mutual information of (1):

wherein i_tRepresenting a characteristic time series S_IState at time t, i_t ^(k)Represents i_t+1The first k most recent characteristic time series S_IState of (j)_tRepresenting a characteristic time series S_JState at time t, j_t ^(l)Denotes j_t+1Previous l most recent characteristic time series S_JState (b), transfer entropy has directionality, so T_I→JThe cause and effect relationship matrix A can be established by exchanging variables in the formula and calculating self information and mutual information of the time sequence without the alarm period:

similarly, a cause and effect relation matrix B with an alarm section is obtained, and an information difference matrix is established

And respectively carrying out normalization processing on the change rates of the diagonal line information and the off-diagonal line information to obtain a final information difference matrix.

In the method, the characteristics with high alarm information change of the power dispatching automation system and the interactive information among the characteristics are extracted, an information difference graph model combining a digraph and node self-information is further constructed, and fault degree indexes are fitted for fault degree sequencing, specifically: setting a threshold theta epsilon (0,1), c_m,n(m<N，n<N,c_m,nE (0, 1)) represents the value of the m row and n column in the information difference matrix C, traverses the matrix C and marks C_m,nRows and columns > Θ, SaKeeping all marked row and column values and setting other elements to zero to obtain an information difference matrix C ', establishing an information difference graph model according to a causal relationship between the extracted fault characteristics and the links among the characteristics, wherein the model comprises two pieces of information, the diagonal value of the C ' matrix represents the confidence difference value inside the node, the non-diagonal value of the C ' matrix represents the mutual information difference value between the node and the node, and the link between the node and the node represents S_IInfluence S_JAnd S_JInfluence S_ICalculating the Fault degree through a Fault _ degree index:

wherein, V_iRepresenting a single node, BINNs of V, in an information-difference graph model_iIs represented by the formula V_iAdjacent node, V_jIs represented by the formula V_iSingle one of the adjacent nodes, NUM (BINNs of V)_i) Is represented by the formula V_iAnd traversing all nodes of the information difference graph model by the total number of the adjacent nodes, calculating the Fault degree of each node by adopting a Fault _ degree index, calculating the Fault degree of each node, and sequencing from large to small to obtain a final result.

The power dispatching automation system fault tracing method improves the performance of power dispatching automation system fault tracing.

According to the technical scheme, the invention has the following beneficial effects:

according to the technical scheme, the idea of mining information difference before and after alarming is adopted, an information difference graph model is established, and the fault sequencing indexes fused from information difference and mutual information difference are fitted through the network characteristics of the graph, so that the fault tracing positioning is realized under the condition that a system topological graph is unknown, and the fault tracing performance of the power dispatching automation system is improved.

[ description of the drawings ]

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive labor.

Fig. 1 is a schematic flowchart of a method for tracing a fault of an electric power dispatching automation system based on an information difference graph model according to an embodiment of the present invention;

fig. 2 is a flowchart of a framework of a power dispatching automation system fault tracing method based on an information difference graph model according to an embodiment of the present invention.

[ detailed description ] embodiments

For better understanding of the technical solutions of the present invention, the following detailed descriptions of the embodiments of the present invention are provided with reference to the accompanying drawings.

It should be understood that the described embodiments are only some embodiments of the invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

An embodiment of the present invention provides a method for tracing a fault of an electric power dispatching automation system based on an information difference graph model, please refer to fig. 1, which is a schematic flow diagram of the method for tracing a fault of an electric power dispatching automation system based on an information difference graph model, as shown in fig. 1, the method includes the following steps:

Fig. 2 is a flowchart of a frame of a fault tracing method for an automatic power dispatching system based on an information difference graph model according to an embodiment of the present invention, in which a continuous characteristic is discretized, and a single-sequence self-information operation and a double-sequence mutual information operation are performed; secondly, constructing an information correlation matrix with or without an alarm section, and obtaining a normalized information difference matrix by comparing the difference change rate before and after the alarm; and finally, establishing an information difference graph model according to the information difference matrix and fitting the fault degrees for sequencing.

Step 101, selecting historical data before and after alarming of the power dispatching automation system, obtaining a clustering center through a k-means algorithm, taking the clustering center as an endpoint of interval division, and taking the mean value of each interval as a discretization result of continuous characteristics;

specifically, resource occupation data of CPUs (central processing units), memories, disks, networks and processes of all servers in the power dispatching automation system are collected, the resource occupation conditions of each server comprise IO (input/output) read-write conditions, utilization rates, collision rates and waiting time, time sequences of the characteristics are used as input of a tracing method, and values of certain characteristics are assumed to be distributed in [ a, b ]]And (4) interval, removing the duplication of all values of the characteristic to obtain the total number num, and setting the number k of the centroids of the clusters as

The sum of squared errors of the centroid and the sample points is SSE, the relationship graph of SSE and k is the shape of one elbow, the k value of the elbow position is selected as the optimal clustering number,

102, calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the components, establishing an information correlation matrix with or without an alarm section, measuring the difference degree before and after the alarm through the change rate of the information correlation matrix, and obtaining an information difference matrix by adopting a normalization technology;

specifically, discretizing resource occupation data of CPUs (central processing units), memories, disks, networks and processes of all servers of the power dispatching automation system, and collecting discretization characteristics of IO (input/output) read-write conditions, utilization rate, collision rate and waiting time of the CPUs, the memories, the disks, the networks and the processes to obtain N characteristic time sequences { S }₁,S₂,...,S_NCalculating self-information entropy of each time series { H }₁,H₂,...,H_NH (S) is the self-entropy of time series S:

wherein S is_IRepresenting the I-th characteristic time series, S_JRepresents the J-th characteristic time series, and x and y represent the characteristic time series S_IAnd S_JP (x, y) represents the probability of joint distribution, p (x | y) represents the conditional probability, α_xAnd alpha_yRepresenting a characteristic time series S_IAnd S_JAll possible values of x and y are expanded into transfer entropy on the basis of mutual information entropy, and S is divided in the calculation of the transfer entropy_IIn addition to its own information, S_JAdditional information is also provided to predict S_I(t+1)Setting the transfer entropy T_J→IFor a characteristic time series S_JTo S_IThe measure of mutual information of (1):

Respectively carrying out normalization processing on the change rates of diagonal line information and off-diagonal line information to obtain the final informationAn information difference matrix.

103, extracting features with high alarm information change of the power dispatching automation system and interactive information among the features, further constructing an information difference graph model combining a digraph and node self-information, and fitting fault degree indexes to perform fault degree sequencing;

specifically, a threshold Θ ∈ (0,1), c is set_m,n(m<N，n<N,c_m,nE (0, 1)) represents the value of the m row and n column in the information difference matrix C, traverses the matrix C and marks C_m,nKeeping the values of all marked rows and columns and setting other elements to zero to obtain an information difference matrix C ', establishing an information difference graph model according to the causal relationship of the extracted fault characteristics and the links among the characteristics, wherein the model comprises two pieces of information, the diagonal value of the C ' matrix represents the confidence difference value inside the node, the non-diagonal value of the C ' matrix represents the mutual information difference value between the node and the node, and the link between the node and the node represents S_IInfluence S_JAnd S_JInfluence S_IThe Fault causal association degree of (2) is calculated through a Fault _ degree index:

wherein, V_iRepresenting a single node, BINNs of V, in an information-difference graph model_iIs represented by the formula V_iAdjacent node, V_jIs represented by the formula V_iSingle one of the adjacent nodes, NUM (BINNs of V)_i) Is represented by the formula V_iAnd traversing all nodes of the information difference graph model by the total number of the adjacent nodes, calculating the Fault degree of each node by adopting a Fault _ degree index, calculating the Fault degree of each node, and sequencing the Fault degrees from large to small to obtain a final result.

For a specific embodiment, a power dispatching automation system data set is used, the data set comprises 718 time series collected by the power dispatching automation system, each time series has 30 minutes of data, and the sampling period is 1 second. Table 1 shows components and characteristic information of the power dispatching automation system, and the characteristic quantities collected by the components of the system together form the time series.

TABLE 1 Components and characteristic information of Power dispatching Automation System

The resource occupation data of CPUs, memories, disks, networks and processes of all servers in the power dispatching automation system are input, the resource occupation conditions of all the servers include IO read-write conditions, utilization rates, collision rates and waiting time, the characteristics are input as time sequences, the last 5-minute sequence with system alarm information is used as an alarm segment, the rest time sequences are used as normal segments, a threshold theta is set to be 0.7, the number k of the clustered centroids is determined along with the change of the characteristics, and therefore specific numerical values do not need to be set.

In order to visually display the results of the failure tracing, in table 2, the first ten tracing results of different methods are listed. For convenience of description, server numbers 1 to 85 are defined, and comparison methods for calculating fault degrees on the basis of the information difference model include mRank, gRank and RCA. The Benchmark represents that the time series is sequenced according to the sequence of the change rate of the time series from high to low and is used as a reference for verifying the effect of each algorithm. The first three servers are marked with a five-pointed star, a diamond and a triangle, respectively. As can be seen from Table 2, the Fault _ degree performs better than the other three methods, and the server with the top rank can well verify the Fault sorting benchmark.

Table 2 fault tracing result of power dispatching automation system

In order to verify the effectiveness of the method, the accuracy, the recall rate and the nDCG are adopted to evaluate the failure tracing effect of the algorithm. Usually, nDCG (Normalized divided Cumulative Gain) is a commonly used ranking index, and is used to determine whether a ranking result is good or bad, and a larger value of nDCG indicates a better performance of the method.

For the results of the failure tracing, the tracing effect ranked at the top is preferentially considered, so that twenty results before the tracing list are selected for index performance analysis, and the results are shown in table 3. The method provided by the embodiment of the invention improves the accuracy, the recall rate and the nDCG value, wherein the accuracy is improved by 3.6-14%, and the nDCG value is improved by 0.5-0.1. The method for tracing the fault of the power dispatching automation system, which embodies the information difference graph model provided by the embodiment of the invention, has better performance of tracing the fault.

TABLE 3 comparison of Performance of twenty prior to failure tracing

	mRank	gRank	RCA	Fault_degree
					Rate of accuracy	0.75	0.8	0.82	0.857142857
Recall rate	0.86862	0.797831	0.85597	0.876578
					nDCG	0.85	0.8	0.85	0.9

In summary, the embodiments of the present invention have the following beneficial effects:

in the technical scheme, historical data before and after the alarm of the power dispatching automation system is selected, a clustering center is obtained through a k-means algorithm and is used as an endpoint of interval division, and the mean value of each interval is used as a discretization result of continuous characteristics; calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the components, establishing an information correlation matrix with or without an alarm section, measuring the difference degree before and after the alarm through the change rate of the information correlation matrix, and obtaining an information difference matrix by adopting a normalization technology; extracting the characteristics with high alarm information change of the power dispatching automation system and the interactive information among the characteristics, further constructing an information difference graph model combining a digraph and node self-information, and fitting fault degree indexes to perform fault degree sequencing.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A power dispatching automation system fault tracing method based on an information difference graph model is characterized by comprising the following steps:

(1) selecting historical data before and after alarming of the power dispatching automation system, obtaining a clustering center through a k-means algorithm, taking the clustering center as an endpoint of interval division, and taking the mean value of each interval as a discretization result of continuous characteristics;

(2) calculating the information entropy of the components of the power dispatching automation system and the transfer entropy among the components, establishing an information correlation matrix with or without an alarm section, measuring the difference degree before and after the alarm through the change rate of the information correlation matrix, and obtaining an information difference matrix by adopting a normalization technology, wherein the information difference matrix specifically comprises the following steps: discretizing resource occupation data of CPUs, memories, disks, networks and processes of all servers of the power dispatching automation system, collecting discretization characteristics of IO read-write conditions, utilization rates, collision rates and waiting time of the CPUs, the memories, the disks, the networks and the processes to obtain N characteristic time sequences { S }₁,S₂,...,S_NCalculating self-information entropy of each time series { H }₁,H₂,...,H_NH (S) is the self-information entropy of the time series S:

wherein S is_IRepresenting the I-th characteristic time series, S_JRepresents the J-th characteristic time series, and x and y represent the characteristic time series S_IAnd S_JP (x, y) represents the joint distribution probability, and p (x | y) represents the conditional probability，α_xAnd alpha_yRepresenting a characteristic time series S_IAnd S_JAll possible values of x and y are expanded into transfer entropy on the basis of mutual information entropy, and S is divided in the calculation of the transfer entropy_IIn addition to its own information, S_JAdditional information is also provided to predict S_I(t+1)Setting the transfer entropy T_J→IFor a characteristic time series S_JTo S_IThe measure of mutual information of (1):

Respectively carrying out normalization processing on the change rates of diagonal line information and off-diagonal line information to obtain a final information difference matrix;

(3) extracting the characteristics with high alarm information change of the power dispatching automation system and the interactive information among the characteristics, further constructing an information difference graph model combining a digraph and node self-information, and fitting a fault degree index to carry out fault degree rankingThe sequence specifically comprises the following steps: setting a threshold theta epsilon (0,1), c_m,n(m<N，n<N,c_m,nE (0, 1)) represents the value of the m row and n column in the information difference matrix C, traverses the matrix C and marks C_m,nKeeping the values of all marked rows and columns and setting other elements to zero to obtain an information difference matrix C ', establishing an information difference graph model according to the causal relationship of the extracted fault characteristics and the links among the characteristics, wherein the model comprises two pieces of information, the diagonal value of the C ' matrix represents the confidence difference value inside the node, the non-diagonal value of the C ' matrix represents the mutual information difference value between the node and the node, and the link between the node and the node represents S_IInfluence S_JAnd S_JInfluence S_ICalculating the Fault degree through a Fault _ degree index:

2. The method according to claim 1, wherein the discretization of the continuous features is adopted, historical data before and after the alarm of the power dispatching automation system is selected, a clustering center is obtained through a k-means algorithm and is used as an endpoint of interval division, and a mean value of each interval is used as a discretization result of the continuous features, and the method is specifically described as follows: collecting resource occupation data of CPUs (central processing units), memories, disks, networks and processes of all servers in the power dispatching automation system, wherein the resource occupation conditions of each server comprise IO (input/output) read-write conditions, utilization rates and collision ratesWaiting time, using the time series of these characteristics as the input of tracing method, assuming that the value of a certain characteristic is distributed in [ a, b ]]And (4) interval, removing the duplication of all values of the characteristic to obtain the total number num, and setting the number k of the centroids of the clusters as