CN116032828B - Medium number centrality approximate calculation method and device - Google Patents

Medium number centrality approximate calculation method and device Download PDF

Info

Publication number
CN116032828B
CN116032828B CN202310167081.3A CN202310167081A CN116032828B CN 116032828 B CN116032828 B CN 116032828B CN 202310167081 A CN202310167081 A CN 202310167081A CN 116032828 B CN116032828 B CN 116032828B
Authority
CN
China
Prior art keywords
centrality
network
node
vector
nodes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310167081.3A
Other languages
Chinese (zh)
Other versions
CN116032828A (en
Inventor
王怀习
束妮娜
马涛
王晨
冯也来
黄郡
沈培佳
杨成武
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202310167081.3A priority Critical patent/CN116032828B/en
Publication of CN116032828A publication Critical patent/CN116032828A/en
Application granted granted Critical
Publication of CN116032828B publication Critical patent/CN116032828B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Complex Calculations (AREA)

Abstract

The invention disclosesA method and a device for approximate calculation of medium centrality. The method comprises the following steps: computing networkGIs characterized by the feature vector centrality of (1); calculating a network by using the feature vector centralityGThe number of the intermediate nodes is counted again to construct a multiple networkG 2 The method comprises the steps of carrying out a first treatment on the surface of the From the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method comprises the steps of carrying out a first treatment on the surface of the Computing the set of non-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation. Therefore, the invention selects the sample nodes of the medium number centrality calculation based on the characteristic vector centrality as the basis based on the advantage of low characteristic vector centrality calculation complexity, the shortest paths among the sample nodes can better represent all the shortest paths among the network nodes, and the network approximate medium number centrality value is obtained by calculating the shortest paths among the sample nodes, so that the rapid calculation of the medium number centrality of the large-scale network is realized.

Description

Medium number centrality approximate calculation method and device
Technical Field
The invention relates to the technical field of networks, in particular to a method and a device for approximate calculation of betweenness centrality.
Background
Since the 21 st century, human society information has been rapidly developed, mobile internet popularization and application and internet of things deployment have rapidly landed, the number of network devices has been significantly increased, and the complexity of the number of physical network devices and the connection relationship has increased. The real networks in the different fields of power network, telecommunication network, traffic network, social network, communication network, internet of things and the like form the network world with all the inclusive sense.
The real network is a plentiful research object for network science, the abstract network is usually embodied with the same network attribute by abstracting different real networks, and the system research of abstract network property and rule forms complex network science with abundant connotation. The main research topics of network science cover network centrality measure and global network characteristics, network model structure and function analysis, network link prediction and recommendation algorithm, network dynamics, network control and optimization and the like, and the network centrality measure research plays a fundamental role in complex network science. In order to characterize the importance of nodes and edges in a network, researchers have proposed various network centrality measures that evaluate the centrality of nodes and edges in a network from various angles. The network centrality measure mainly characterizes the importance of nodes and edges in a network, and provides theoretical support for researching the identification of key nodes and key edges in a real network.
The betweenness centrality is an index for describing the importance of a certain node or a certain side in a network from the perspective of network connectivity, and is also an important research point of a complex network, and the research of a rapid calculation method has profound practical significance. For example, according to the core node in the social network, an influence ranking list is given, users are attracted, and targeted network marketing is carried out on the influence ranking list; by protecting the key server in the network, the key server can be prevented from being attacked by viruses or hackers, so that the whole network can normally operate; by isolating the infectious source, the transmission and spread of infectious viruses and the like can be effectively prevented, and in the practical application, the importance degree of each node in the network needs to be known so as to find out the key nodes in the network. However, as the number of nodes in a network increases and the connection relationship between the nodes is dense, the topology structure of the network is also more complex, and the conventional medium number centrality calculation method is difficult to completely meet the requirement of large-scale medium number centrality calculation in the network. It is highly desirable to design a fast calculation method that achieves the centrality of the bets.
Disclosure of Invention
In view of the above-mentioned problems, the present invention aims to provide a method and a device for approximate calculation of medium centrality, which are based on the advantage of low complexity of calculation of characteristic vector centrality, and based on the characteristic vector centrality, sample nodes for medium centrality calculation are selected, the shortest paths between the sample nodes can better represent all the shortest paths between network nodes, and then approximate medium centrality values are obtained by calculating the shortest paths between the sample nodes, so that the method and the device are favorable for realizing rapid calculation of medium centrality in a large-scale network.
To achieve the above object, in a first aspect of the present invention, a method for calculating a median centrality approximation is disclosed, the method comprising:
computing networkGThe characteristic vector centrality is obtained; the feature vector centrality
Figure SMS_1
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the The saidnRepresentation vector->
Figure SMS_2
Component numbers of (2); the saidnNetwork =networkGTotal number of medium nodes. />
Calculating a network by using the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2
From the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setS
Computing the set of non-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the calculating network uses the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2 Comprising:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean valuex 0
The saidx 0 =
Figure SMS_3
The method comprises the steps of carrying out a first treatment on the surface of the The saidx i ∈{x 1 x 2 ,…,x n }。
Judging the component mean valuex 0 Whether or not it is smaller than a preset average weight numbercAnd obtaining a judging result.
The average weight numbercThe range of the values is as follows
Figure SMS_4
The method comprises the steps of carrying out a first treatment on the surface of the The range of the average weight value sufficiently ensures the universality and the variability of the sampled samples based on the centrality of the feature vector, wherein ∈ >
Figure SMS_5
For networksGIs the average degree of the node.
According to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficient
Figure SMS_6
Centering the feature vector with the vector coefficients
Figure SMS_7
Multiplying to obtain a second feature vector; said feature vector centrality->
Figure SMS_8
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the Said second eigenvector->
Figure SMS_9
=(τ˙x 1 τ˙x 2 ,…,τ˙x n ) 。
And rounding up each component in the second feature vector to obtain a node weight vector.
The node weight vector
Figure SMS_10
=(μ 1 μ 2 ,…,μ n ) The method comprises the steps of carrying out a first treatment on the surface of the The saidμ i =⌈τ⋅x i ⌉,1≤inThe method comprises the steps of carrying out a first treatment on the surface of the The saidμ i Is a non-negative integer; the node weight vector is the firstiIndividual componentsμ i Characterizing a networkGMiddle (f)iPersonal nodev i Corresponding weight number.
Using the node weight vector to determine the node weight vector for the networkGProcessing to obtain multiple networksG 2
The saidG 2 ={μ v 1 μ v 2 ,…,μ v n }。
In an optional implementation manner, in a first aspect of the embodiment of the present invention, according to the determination result, a vector coefficient is obtained by using a preset vector coefficient calculation modelτComprising:
when the judgment result is yes, calculating the minimum integer for enabling the preset first vector coefficient calculation model to be establishedz 1 The minimum integer is setz 1 As vector coefficientsτIs a value of (2).
The first vector coefficient calculation model is
Figure SMS_11
Wherein, the saidx i ∈{x 1 x 2 ,…,x n };
Figure SMS_12
Representing an upward rounding, saidnNetwork =networkGThe total number of the middle nodes; the said z 1 Is an integer greater than 0; the saidc 1 For a preset average weightc
When the judgment result is NO, calculating the maximum integer for enabling the preset second vector coefficient calculation model to be establishedz 2 The maximum integer is calculatedz 2 Is the inverse of (2) as a vector coefficientτIs a value of (2).
The second vector coefficient calculation model is
Figure SMS_13
Wherein, the saidx i ∈{x 1 x 2 ,…,x n };
Figure SMS_14
Representing an upward rounding, saidnNetwork =networkGThe total number of the middle nodes; the saidz 2 Is an integer greater than 0; the saidc 2 For a preset average weightc
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the multiple network is selected from the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setSComprising:
from the multiple networksG 2 In the method, the method is selected according to uniform random probability distributionhObtaining a first sample node set by the nodes; the saidh=
Figure SMS_15
Said->
Figure SMS_16
Representing an upward rounding; the saidnNetwork =networkGThe total number of the middle nodes; the saidpThe pre-set sampling proportion is characterized in that,pthe value range is 0.1-0pAnd less than or equal to 0.2, the sampling proportion is selected by comprehensively considering the representativeness of the sampling sample and the efficiency of an approximate calculation method.
And judging whether the number of the repeated nodes exists in the first sample node set or not to obtain a second judging result.
If the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in a multiple network, adding the new nodes into the first sample node set, and enabling the total number of the nodes in the first sample node set to reach hAnd triggering and executing the judgment whether the first sample node set has the repeated nodes or not to obtain a second judgment result.
If the second judgment result is negative, the first sample node set is confirmed to be a non-heavy sample node set
Figure SMS_17
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the computing networkGFeature vector centrality, resulting in feature vector centrality, comprising:
constructing a networkGAdjacent matrix a of (a);
the a=
Figure SMS_18
If the network isGMiddle nodev i And nodev j With edges in betweena i,j =1, a step of; otherwisea i,j =0。
And constructing a first characteristic equation based on the adjacency matrix A.
The first characteristic equation is
Figure SMS_19
Where A is the adjacency matrix of the network,λas a value of the characteristic(s),
Figure SMS_20
is a feature vector.
And carrying out calculation processing on the first characteristic equation, calculating to obtain a characteristic vector corresponding to the maximum characteristic value, and taking the characteristic vector corresponding to the maximum characteristic value as characteristic vector centrality.
As an optional implementation manner, in the first aspect of the embodiment of the present invention, the computing the set of non-heavy sample nodesSCenter of bettery, get networkGA median centrality approximation comprising:
using a median centrality calculation model for the non-heavy sample node set SProcessing to obtain a sample node setSIs a median centrality of (a).
The medium centrality calculation model is as follows:
Figure SMS_21
Figure SMS_22
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the node
Figure SMS_23
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a);Srepresenting a set of non-heavy sample nodesS
Aggregating the sample nodesSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGIs a median centrality approximation of (c).
In a second aspect of an embodiment of the present invention, there is disclosed a medium centrality approximation calculation apparatus, the apparatus including:
a first computing module for computing a networkGThe characteristic vector centrality is obtained; the feature vector centrality
Figure SMS_24
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the The saidnRepresentation vector->
Figure SMS_25
Component numbers of (2); the saidnNetwork =networkGThe total number of the middle nodes; />
A first network construction module for calculating a network using the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2
A second network construction module for constructing a network from the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node set S
A second calculation module for calculating the non-heavy sample node setSCenter of bettery, get networkGThe median centrality approximation.
In a second aspect of the embodiment of the present invention, the first computing module computes a networkGThe characteristic vector centrality is obtained by the specific way that:
constructing a networkGIs a contiguous matrix a of (a).
The said
Figure SMS_26
If the network isGMiddle nodev i And nodev j With edges in betweena i,j =1, a step of; otherwisea i,j =0。
And constructing a first characteristic equation based on the adjacency matrix A.
The first characteristic equation is
Figure SMS_27
In the method, in the process of the invention,Aas an adjacency matrix for the network,λas a value of the characteristic(s),
Figure SMS_28
is a feature vector.
And carrying out calculation processing on the first characteristic equation, calculating to obtain a characteristic vector corresponding to the maximum characteristic value, and taking the characteristic vector corresponding to the maximum characteristic value as characteristic vector centrality.
In a second aspect of the embodiment of the present invention, the first network construction module calculates a network using the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2 The method specifically comprises the following steps:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean value x 0
The saidx 0 =
Figure SMS_29
The method comprises the steps of carrying out a first treatment on the surface of the The saidx i ∈{x 1 x 2 ,…,x n }。
Judging the component mean valuex 0 Whether or not it is smaller than a preset average weight numbercAnd obtaining a judging result.
The average weight numbercThe range of the values is as follows
Figure SMS_30
The method comprises the steps of carrying out a first treatment on the surface of the The range of the average weight value sufficiently ensures the universality and the variability of the sampled samples based on the centrality of the feature vector, wherein ∈>
Figure SMS_31
For networksGIs the average degree of the node.
According to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficient
Figure SMS_32
Centering the feature vector with the vector coefficientsτMultiplying to obtain a second feature vector; the feature vector centrality
Figure SMS_33
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the Said second eigenvector->
Figure SMS_34
=(τ˙x 1 τ˙x 2 ,…,τ˙x n )。
And rounding up each component in the second feature vector to obtain a node weight vector.
The node weight vector
Figure SMS_35
=(μ 1 μ 2 ,…,μ n ) The method comprises the steps of carrying out a first treatment on the surface of the The saidμ i =⌈τ⋅x i ⌉,1≤inThe method comprises the steps of carrying out a first treatment on the surface of the The saidμ i Is a non-negative integer; the node weight vector is the firstiIndividual componentsμ i Characterizing a networkGMiddle (f)iPersonal nodev i Corresponding weight number
Using node weight vectors for the networkGProcessing to obtain multiple networksG 2
The saidG 2 ={μ v 1 μ v 2 ,…,μ v n }。
In a second aspect of the embodiment of the present invention, the first network construction module calculates a model by using a preset vector coefficient according to the determination result to obtain a vector coefficient, and specifically includes:
When the judgment result is yes, calculating the minimum integer for enabling the preset first vector coefficient calculation model to be establishedz 1 The minimum integer is setz 1 As vector coefficientsτIs a value of (2).
The first vector coefficient calculation model is
Figure SMS_36
Wherein, the saidx i ∈{x 1 x 2 ,…,x n };
Figure SMS_37
Representing an upward rounding, saidnNetwork =networkGThe total number of the middle nodes; the saidz 1 Is an integer greater than 0; the saidc 1 For a preset average weightc
When the judgment result is NO, calculating the maximum integer for enabling the preset second vector coefficient calculation model to be establishedz 2 The maximum integer is calculatedz 2 Is the inverse of (2) as a vector coefficientτIs a value of (2).
The second vector coefficient calculation model is
Figure SMS_38
Wherein, the saidx i ∈{x 1 x 2 ,…,x n };
Figure SMS_39
Representing an upward rounding, saidnNetwork =networkGThe total number of the middle nodes; the saidz 2 Is an integer greater than 0; the saidc 2 For a preset average weightc
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the second network building module is configured from the multiple networksCollateralsG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method specifically comprises the following steps:
from the multiple networksG 2 In the method, the method is selected according to uniform random probability distributionhObtaining a first sample node set by the nodes; the saidh=⌈p⋅n⌉, said
Figure SMS_40
Representing an upward rounding; the saidnNetwork =networkGThe total number of the middle nodes; the said pThe pre-set sampling proportion is characterized in that,pthe value range is 0.1-0pAnd less than or equal to 0.2, the sampling proportion is selected by comprehensively considering the representativeness of the sampling sample and the efficiency of an approximate calculation method.
And judging whether the number of the repeated nodes exists in the first sample node set or not to obtain a second judging result.
When the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in a multiple network, and adding the new nodes into the first sample node set to enable the total number of the nodes in the first sample node set to reachhAnd triggering and executing the judgment whether the first sample node set has the repeated nodes or not to obtain a second judgment result.
When the second judgment result is no, the first sample node set is confirmed to be a no-heavy sample node setS
As an optional implementation manner, in the second aspect of the embodiment of the present invention, the second calculation module calculates the set of non-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation value specifically comprises:
using a median centrality calculation model for the non-heavy sample node setSProcessing to obtain a sample node set SIs a median centrality of (a).
The medium centrality calculation model is as follows:
Figure SMS_41
Figure SMS_42
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the node
Figure SMS_43
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a);Srepresenting a set of non-heavy sample nodesS
Aggregating the sample nodesSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGThe median centrality approximation.
Another aspect of the invention discloses another medium number centrality approximation calculation device, the device comprising:
a memory storing executable program code;
a processor coupled to the memory;
the processor invokes the executable program code stored in the memory to perform some or all of the steps in the method for median centering approximation calculation disclosed in the first aspect of the embodiment of the present invention.
A fourth aspect of the present invention discloses a computer storage medium storing computer instructions for performing part or all of the steps in the method for central approximation of a betweenness disclosed in the first aspect of the present invention when called.
The invention has the beneficial effects that:
the invention relates to a method for approximate calculation of betweenness centrality, which uses a calculation networkGThe characteristic vector centrality is obtained; calculating a network by using the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2 The method comprises the steps of carrying out a first treatment on the surface of the From the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method comprises the steps of carrying out a first treatment on the surface of the Computing the set of non-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation. Therefore, the invention selects the sample nodes of the medium number centrality calculation based on the characteristic vector centrality as the basis based on the advantage of low characteristic vector centrality calculation complexity, the shortest paths among the sample nodes can better represent all the shortest paths among the network nodes, and the network approximate medium number centrality value is obtained by calculating the shortest paths among the sample nodes, so that the rapid calculation of the medium number centrality of the large-scale network is realized.
Drawings
FIG. 1 is a flow chart of a method for calculating a median centrality approximation in accordance with an embodiment of the present invention;
FIG. 2 is a schematic diagram of an exemplary three-layer routing network according to one embodiment of the present invention;
FIG. 3 is a schematic diagram of a medium-center approximation calculation device according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of another medium-center approximation calculation apparatus according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
For the sake of easy understanding of the embodiments of the present application, the following will briefly introduce related concepts:
the median centrality. The betweenness centrality is generally divided into node betweenness centrality and edge betweenness centrality, and betweenness centrality index is an important centrality index in key node and key link identification. Given a network g= (V, E), V, E represent the node set and edge set of the network respectively,n= |v| represents the number of nodes in the network,m=i E represents the number of edges in the network.
Node bets are defined as the ratio of the number of all shortest paths through the node to the total number of shortest paths in the network. Node in network vThe betweenness of (a) is defined as:
Figure SMS_44
wherein the method comprises the steps ofσv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the nodevIs used for the number of shortest paths of the network.
Edge betweenness is defined as the ratio of the number of all shortest paths through the edge to the total number of shortest paths in the network. Edge(s)eThe betweenness of (a) is defined as:
Figure SMS_45
wherein the method comprises the steps ofσv i, v j e) Is a nodev i To the nodev j Through the edgeeIs used for the number of shortest paths of the network.
For ease of research, node and edge betweenness centrality in a network is often normalized, and normalized betweenness centrality in an undirected network is defined as:
Figure SMS_46
the closer the normalized value is to 1, the higher the frequency of the node on the shortest path between the network nodes is, and the more important is; when the normalized value is 0, the node is not present on the shortest path between other nodes, and the importance is very low.
Theorem: in a connected network, the sum of node normalization medians is equal to the average shortest distance of the networklSubtracting 1; the sum of the edge normalized medians is equal to the average shortest distancelI.e.
Figure SMS_47
The relationship between the mid-distance centrality and the edge mid-distance centrality in the network and the average shortest distance of the network is given by the mid-distance centrality identity, which reveals the inter-implication relationship between the mid-distance centrality and the average shortest path of the network, and establishes the relationship between the mid-distance centrality and the small world network. The medium center identity is established not only on the connected network but also on the general network.
The medium number centrality calculation usually adopts a Brandes algorithm, and when the medium number centrality of an unauthorized network is calculated, a depth-first search algorithm is used for calculating the complexity;Θ(mn)the method comprises the steps of carrying out a first treatment on the surface of the In calculating the betweenness centrality of the authorized network, the Dijkstra algorithm is used, and the calculation complexity is thatΘ(mn+n 2 logn). When the network is
Figure SMS_48
When the network is dense, m is ≡n 2 The central computational complexity of the medium number is thatΘ(n 3 )The computational complexity is high and cannot be applied to a large-scale network.
Feature vector centrality. Feature vector centrality is an important measure of node centrality in a network, the feature vector centrality is related to the number of adjacent nodes and the importance of each adjacent node in the nodes, and node information of the centrality is more abundant than the number of the centrality.
Let a=
Figure SMS_49
Representing a networkGIf nodev i And nodev j With edges in betweena i,j =1, a step of; otherwisea i,j =0. Nodev i The feature vector centrality of (1) may be defined as:
Figure SMS_50
/>
wherein, the liquid crystal display device comprises a liquid crystal display device,M(v)is a nodevThe set of all neighboring nodes,λis a constant.
Feature vector centrality definitions may be expressed as matrix-vector tokens,
Figure SMS_51
constant (constant)λIs a characteristic value of the adjacency matrix A, < >>
Figure SMS_52
Is the characteristic valueλCorresponding feature vectors. Typically, the adjacency matrix A will have a number of different eigenvalues λThere is also a corresponding feature vector. However, the fact that all elements of the matrix are positive means that only the largest eigenvalues will produce the required centrality measure. Feature vector of adjacency matrixiThe individual components give the nodes +.>
Figure SMS_53
Is used for the feature vector centrality value of (1). Since the eigenvector point multiplied by an arbitrary constant is still eigenvalue +.>
Figure SMS_54
Is based on the feature vectors ofFeature vector centrality the average weight parameter is introduced to determine feature vector values when constructing multiple networks. The feature vector centrality time complexity isO(n+m)The computational complexity is low.
Example 1
Referring to fig. 1, fig. 1 is a schematic flow chart of a method for calculating a median centrality approximation according to an embodiment of the present invention. The method described in fig. 1 is applicable to an information network, a social network, an internet of things and a traffic network, and the embodiment of the invention is not limited. As shown in fig. 1, the method for calculating the median centrality approximation of the feature vector centrality may include the following operations:
101. computing networkGAnd obtaining the characteristic vector centrality.
In the embodiment of the invention, the characteristic vector centrality
Figure SMS_55
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the The saidnRepresentation vector- >
Figure SMS_56
Component numbers of (2); the saidnRepresenting a networkGTotal number of medium nodes.
102. Computing networks using feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2
103. From multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setS
104. Computing a set of non-duplicate sample nodesSCenter of bettery, get networkGThe median centrality approximation.
Therefore, by implementing the method for approximate calculation of the betweenness centrality described by the embodiment of the invention, the centrality of the feature vector is obtained by calculating the centrality of the feature vector of the original network, the non-heavy sample node set is selected based on the centrality of the feature vector, and the value of the betweenness centrality of the non-heavy sample node set is calculated to obtain the value of the approximate betweenness centrality of the original network, so that the calculation is simplified, the complexity is reduced, and the rapid calculation of the betweenness centrality of the large-scale network is realized.
In an alternative embodiment, the computing network is calculated in step 101 aboveGFeature vector centrality, resulting in feature vector centrality, comprising:
constructing a networkGIs a contiguous matrix a of (a).
The a=
Figure SMS_57
If the network isGMiddle nodev i And nodev j With edges in betweena i,j =1, a step of; otherwisea i,j =0。
And constructing a first characteristic equation based on the adjacency matrix A.
The first characteristic equation is
Figure SMS_58
Where A is the adjacency matrix of the network,λas a value of the characteristic(s),
Figure SMS_59
is a feature vector; />
Figure SMS_60
=(x 1 x 2 ,…,x n )。
According to the Perron-Frobenius theorem,
Figure SMS_61
. Based on this characteristic, feature vector centrality calculation is often developed by a matrix exponentiation method, i.e., an initial vector +.>
Figure SMS_62
Then iterate to calculate +.>
Figure SMS_63
When iteratingNumber of timestWhen a certain threshold is reached, the person is allowed to go (I)>
Figure SMS_64
The value is close to the maximum characteristic valueλ max The corresponding feature vector, i.e. feature vector centrality.
It can be seen that the value of the centrality of the feature vector characterizes the networkGThe importance of a node in the set determines the probability that the node is selected as a sample node.
In another alternative embodiment, the feature vector centrality is used in step 102 to calculate a networkGNode weight vector of (a) to construct a multiple networkG 2 Comprising:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean valuex 0
The saidx 0 =
Figure SMS_65
The method comprises the steps of carrying out a first treatment on the surface of the The saidx i ∈{x 1 x 2 ,…,x n }。
Judging the component mean valuex 0 Whether or not it is smaller than a preset average weight numbercAnd obtaining a judging result.
The average weight numbercThe range of the values is as follows
Figure SMS_66
c≤5⋅/>
Figure SMS_67
The method comprises the steps of carrying out a first treatment on the surface of the Wherein->
Figure SMS_68
For networksGIs the average degree of the node. According to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficient +. >
Figure SMS_69
To the above featuresVector centrality and vector coefficientsτMultiplying to obtain a second feature vector; center of the feature vector
Figure SMS_70
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the Said second eigenvector->
Figure SMS_71
=(τ˙x 1 τ˙x 2 ,…,τ˙x n )。
And rounding up each component in the second characteristic vector to obtain a node weight vector.
The node weight vector
Figure SMS_72
=(μ 1 μ 2 ,…,μ n ) The method comprises the steps of carrying out a first treatment on the surface of the Above-mentionedμ i =⌈τ˙x i ⌉,1≤inThe method comprises the steps of carrying out a first treatment on the surface of the Above-mentionedμ i Is a non-negative integer; the node weight vector is the firstiIndividual componentsμ i Characterizing a networkGMiddle (f)iPersonal nodev i Corresponding weight number.
Using node weight vector to networkGProcessing to obtain multiple networksG 2
Above-mentionedG 2 ={μ v 1 μ v 2 ,…,μ v n }。
Therefore, the universality and the diversity of the sampled samples based on the characteristic vector centrality are fully ensured by reasonably taking the average weight, the weight of the network node is determined according to the average value of each component of the characteristic vector centrality and the average weight, the weight of the node in the multiple network is in direct proportion to the characteristic vector centrality of the node, and the more important node becomes the sample node with higher probability due to the construction of the multiple network.
In yet another optional embodiment, the above-mentioned method uses a preset vector coefficient calculation model to obtain vector coefficients according to the above-mentioned determination resultτComprising:
when the judgment result is yes, calculating the minimum integer for enabling the preset first vector coefficient calculation model to be established z 1 The minimum integer is setz 1 As vector coefficientsτIs a value of (2).
The first vector coefficient calculation model is as follows
Figure SMS_73
Wherein, the abovex i ∈{x 1 x 2 ,…,x n };
Figure SMS_74
Representing an upward rounding of the abovenRepresenting the number of components in the centrality of the feature vector; above-mentionedz 1 Is an integer greater than 0; above-mentionedc 1 For a preset average weightc
When the judgment result is NO, calculating the maximum integer for enabling the preset second vector coefficient calculation model to be establishedz 2 The maximum integer is calculatedz 2 Is the inverse of (2) as a vector coefficientτIs a value of (2).
The second vector coefficient calculation model is as follows
Figure SMS_75
Wherein, the abovex i ∈{x 1 x 2 ,…,x n };
Figure SMS_76
Representing an upward rounding of the abovenRepresenting the number of components in the centrality of the feature vector; the saidz 2 Is an integer greater than 0; above-mentionedc 2 For a preset average weightc
It can be seen that the value of the centrality of the feature vector represents the importance degree of different nodes, and the average weight is used to control the value of the vector coefficient to construct multiple networksG 2 Thereby ensuring that important nodes appear in the multiple network with higher numbers of weights and that secondary nodes appear in the multiple network with lower numbers of weights while the total number of weights of the multiple network is controlled within a reasonable range.
In yet another alternative embodiment, the step 103 is performed from multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node set SComprising:
from multiple networksG 2 In the method, the method is selected according to uniform random probability distributionhObtaining a first sample node set by the nodes; above-mentionedh=⌈p⋅n⌉ above
Figure SMS_77
Representing an upward rounding; above-mentionednRepresenting the number of components in the centrality of the feature vector; above-mentionedpThe pre-set sampling proportion is characterized in that,pthe value range is 0.1-0pAnd less than or equal to 0.2, the sampling proportion is selected by comprehensively considering the representativeness of the sampling sample and the efficiency of an approximate calculation method.
Judging whether the first sample node set has the number of repeated nodes or not to obtain a second judging result;
if the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in the multiple networks, adding the new nodes into the first sample node set, and enabling the total number of the nodes in the first sample node set to reachhAnd triggering and executing the judgment whether the first sample node set has the repeated node or not, and obtaining a second judgment result.
If the second judgment result is no, the first sample node set is confirmed to be a non-heavy sample node setS
It can be seen that the sampling proportion is preset by comprehensively considering the representativeness of the sampling samples and the efficiency of the approximate calculation method, so that the node set without the heavy samples SScale of (a) is compared with the initial networkGScale of
Figure SMS_78
Between them.
In yet another alternative embodiment, a set of no-heavy sample nodes is calculated in step 104 aboveSCenter of bettery, get networkGA median centrality approximation comprising:
for the node set without heavy sample by using the medium number centrality calculation modelSProcessing to obtain a sample node setSIs a median centrality of (a).
The above-mentioned medium centrality calculation model is:
Figure SMS_79
Figure SMS_80
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the node
Figure SMS_81
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a);Srepresenting a set of non-heavy sample nodesS
Will be put onThe sample node setSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGNode betweenness centrality approximation.
It can be seen that replacing the shortest set of paths between any node that need to be traversed in the exact definition of the median centrality with only needing to traverse the set of non-duplicate sample nodesSThe shortest path set among the inner nodes greatly reduces the scale of path search, and meanwhile, the approximate value of the centrality of the betweenness keeps the magnitude sequence relation of the centrality values of the betweenness of different nodes and edges as much as possible. Node set due to no heavy sample SIs only between 0.1 and 0.2 of the initial node set size, so as to have no heavy sample node setSThe calculation complexity of the approximation of the computation of the betweenness centrality is equivalent to 0.01 to 0.04 of the complexity of the accurate calculation method, so that the betweenness calculation complexity of the large-scale network is rapidly reduced.
To specifically explain the method of the present embodiment, a typical three-layer routing network g= (V, E) is described.
The above-mentioned typical three-layer routing network structure is shown in fig. 2, and numerals 1 to 31 in fig. 2 denote routing networks, respectivelyG31 router nodes in (1), 31 router nodes constitute a router node setV,The connection of the routers forms edges, all of which constitute an edge setE。
Specifically, the networkG=(VE) Medium router node setVSum edge setEThe following are provided:
V={1,2,⋯,31},E={(1,2),(1,3),(1,4),(1,5),(1,6),(2,7),(2,8),(2,9),(2,10),(2,11),(3,12),(3,13),(3,14),(3,15),(3,16),(4,17),(4,18),(4,19),(4,20),(4,21),(5,22),(5,23),(5,24),(5,25),(5,26),(6,27),(6,28),(6,29),(6,30),(6,31)}。
in order to calculate the betweenness centrality of nodes in the network, an approximate calculation method based on the feature vector centrality is performed as follows:
step 1: computing networkGThe characteristic vector values of the nodes, the characteristic vector centrality of the nodes is shown in Table 1The illustration is:
table 1 feature vector centrality for each node
Node Characteristic vector value Node Characteristic vector value Node Characteristic vector value Node Characteristic vector value
1 0.49999939 9 0.09999988 17 0.09999988 25 0.09999988
2 0.31622815 10 0.09999988 18 0.09999988 26 0.09999988
3 0.31622815 11 0.09999988 19 0.09999988 27 0.09999988
4 0.31622815 12 0.09999988 20 0.09999988 28 0.09999988
5 0.31622815 13 0.09999988 21 0.09999988 29 0.09999988
6 0.31622815 14 0.09999988 22 0.09999988 30 0.09999988
7 0.09999988 15 0.09999988 23 0.09999988 31 0.09999988
8 0.09999988 16 0.09999988 24 0.09999988
Step 2: computing networks using feature vector centrality GNode weight vector of (a) to construct a multiple networkG 2
Setting average weight number, due to average degree of node
Figure SMS_82
=2, select average weightc=2⋅/>
Figure SMS_83
=4;
The node weight distribution is determined from the distribution of feature vector centrality as shown in table 2 below:
table 2 node weight distribution
Node Weight number Node Weight number Node Weight number Node Weight number
1 14 9 3 17 3 25 3
2 9 10 3 18 3 26 3
3 9 11 3 19 3 27 3
4 9 12 3 20 3 28 3
5 9 13 3 21 3 29 3
6 9 14 3 22 3 30 3
7 3 15 3 23 3 31 3
8 3 16 3 24 3
Constructing multiple networks based on node reconstruction numbersG 2 As shown in table 2,G 2 including 14 nodes 1,9 nodes 2, … …,3 nodes 31.
Step 3: from multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node set S;
according to the node weight distribution and the method of uniform random probability distribution, the node sampling probability distribution can be calculated as shown in table 3:
TABLE 3 node sampling probability distribution
Node Sampling probability Node Sampling probability Node Sampling probability Node Sampling probability
1 0.10447761 9 0.02238806 17 0.02238806 25 0.02238806
2 0.06716418 10 0.02238806 18 0.02238806 26 0.02238806
3 0.06716418 11 0.02238806 19 0.02238806 27 0.02238806
4 0.06716418 12 0.02238806 20 0.02238806 28 0.02238806
5 0.06716418 13 0.02238806 21 0.02238806 29 0.02238806
6 0.06716418 14 0.02238806 22 0.02238806 30 0.02238806
7 0.02238806 15 0.02238806 23 0.02238806 31 0.02238806
8 0.02238806 16 0.02238806 24 0.02238806
Setting sampling proportionp=0.2。
According to sampling probabilitypSample node set can be knownSIs of the scale of |S|=⌈pIs calculated, and the calculated set of non-heavy sample nodes s= {2,3,5, 16, 21, 23, 28}.
Step 4: computing a set of non-duplicate sample nodesSCenter of bettery, get networkGThe median centrality approximation is shown in table 4:
table 4 networkGIntermediate center approximation
Node Intermediate approximation Node Intermediate approximation Node Intermediate approximation Node Intermediate approximation
1 0.04367816 9 0 17 0 25 0
2 0 10 0 18 0 26 0
3 0.01149425 11 0 19 0 27 0
4 0.01379310 12 0 20 0 28 0
5 0.01149425 13 0 21 0 29 0
6 0.01379310 14 0 22 0 30 0
7 0 15 0 23 0 31 0
8 0 16 0 24 0 0
Since the approximation and the exact value of the median centrality adopt the same normalization coefficient, but the node sets involved between the approximation and the exact value are respectivelySAndVtherefore, there is no practical significance in comparing the actual numerical values between the two. The selection of key nodes and key edges mainly depends on the order of magnitude of the median centrality values. From the approximate distribution of the betweenness centrality, the importance of the nodes is as follows: 1,4,6,3,5,2,7,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31.
To analyze the effect of the median center approximation calculation method, table 5 gives the exact value distribution of the median center, and it is known that the node importance degree is: 1,2,3,4,5,6,7,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31.
Table 5 networkGCenter of median precision value
Node Accurate value of medium number Node Accurate value of medium number Node Accurate value of medium number Node Accurate value of medium number
1 0.82758621 9 0 17 0 25 0
2 0.31034483 10 0 18 0 26 0
3 0.31034482 11 0 19 0 27 0
4 0.31034482 12 0 20 0 28 0
5 0.31034482 13 0 21 0 29 0
6 0.31034482 14 0 22 0 30 0
7 0 15 0 23 0 31 0
8 0 16 0 24 0 0
By comparing the approximate value and the accurate value sequencing result of the centrality of the medians, the accuracy rate reaches more than 90 percent except that the importance degree of the nodes 2,3 and 5 is changed and the sequences of other nodes are maintained.
When the node of the node set without the heavy sample is selected, the network needs to be traversed for accurate calculation of the medium number centrality
Figure SMS_84
The shortest paths between any pair of nodes are counted, the frequency of each shortest path passing through the nodes and the edges is counted, and the medium centrality of the nodes and the edges is calculated. Because the upper limit of the number of node pairs in the network is +.>
Figure SMS_85
nIs the number of nodes in the network. The time complexity Θ (mn+n) of the exact computation of the median centrality 2 log n )=Θ(n 3 ) The computational complexity is high and cannot be applied to a large-scale network. In the middle ofn=V represents the number of nodes in the network,m=i E represents the number of edges in the network. In order to solve the problem of central rapid calculation of medium numbers in a large-scale network, the network is replaced by a node set without heavy samples in the methodSThe number of node pairs is from
Figure SMS_86
Reduced to->
Figure SMS_87
The workload of shortest path traversal can be reduced to global shortest path traversal workloadp 2 . Meanwhile, sample nodes generated based on feature vector centrality sampling represent important nodes in a network, shortest paths among the sample nodes represent typical shortest paths in the network, the relative sequence of the betweenness centrality accurate values of the nodes and the edges can be well reserved based on the approximate value of the betweenness centrality calculated by the shortest paths among the sample nodes, the betweenness centrality relative sequence of the nodes and the edges is a problem which is focused on in a real network application scene, and the order keeping performance of the betweenness centrality approximate value can effectively solve the real network application problem.
Example two
Referring to fig. 3, fig. 3 is a schematic diagram of a median center approximation calculation apparatus according to an embodiment of the invention. The device described in fig. 3 is applicable to an information network, a social network, an internet of things and a traffic network, and the embodiment of the invention is not limited. As shown in fig. 3, the apparatus may include:
a first computing module for computing a networkGThe characteristic vector centrality is obtained; center of the feature vector
Figure SMS_88
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the Above-mentionednRepresentation vector->
Figure SMS_89
Component numbers of (2); above-mentionednEqual to the networkGThe total number of the middle nodes;
a first network construction module for calculating a network using feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2
A second network construction module for constructing multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setS
A second calculation module for calculating a set of non-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation.
Therefore, by implementing the intermediate number centrality approximation calculation device described in fig. 3, the intermediate number centrality of the non-heavy sample node set can be used as the intermediate number centrality approximation value of the original network by reasonably selecting the non-heavy sample node set, so that the scale of path search can be greatly reduced, and meanwhile, the intermediate number centrality approximation value also maintains the magnitude sequence relation of intermediate number centrality values of different nodes and edges as much as possible.
In another alternative embodiment, as shown in FIG. 3, the first computing module computes a networkGThe characteristic vector centrality is obtained by the specific way that:
constructing a networkGIs a contiguous matrix a of (a).
The above A=
Figure SMS_90
If the network isGMiddle nodev i And nodev j With edges in betweena i,j =1, a step of; otherwisea i,j =0。/>
Based on the adjacency matrix A, a first characteristic equation is constructed.
The first characteristic equation is
Figure SMS_91
Where A is the adjacency matrix of the network,λas a value of the characteristic(s),
Figure SMS_92
is a feature vector.
And carrying out calculation processing on the first characteristic equation, calculating to obtain a characteristic vector corresponding to the maximum characteristic value, and taking the characteristic vector corresponding to the maximum characteristic value as characteristic vector centrality.
It can be seen that the value of the centrality of the feature vector characterizes the networkGThe importance of a node in the set determines the probability that the node is selected as a sample node.
In yet another alternative embodiment, as shown in FIG. 3, the first network construction module calculates the network using feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2 The method specifically comprises the following steps:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean valuex 0
Above-mentionedx 0 =
Figure SMS_93
The method comprises the steps of carrying out a first treatment on the surface of the The saidx i ∈{x 1 x 2 ,…,x n }。
Judging the component mean value x 0 Whether or not it is smaller than a preset average weight numbercAnd obtaining a judging result.
The average weight numbercThe range of the values is as follows
Figure SMS_94
c≤5⋅/>
Figure SMS_95
The method comprises the steps of carrying out a first treatment on the surface of the The range of the average weight value sufficiently ensures the universality and the variability of the sampled samples based on the centrality of the feature vector, wherein ∈>
Figure SMS_96
For networksGIs the average degree of the node.
According to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficient
Figure SMS_97
Multiplying the centrality of the feature vector by the vector coefficient tau to obtain a second feature vector; center of the feature vector
Figure SMS_98
=(x 1 x 2 ,…,x n ) The method comprises the steps of carrying out a first treatment on the surface of the Said second eigenvector->
Figure SMS_99
=(τ˙x 1 τ˙x 2 ,…,τ˙x n );
And rounding up each component in the second characteristic vector to obtain a node weight vector.
The node weight vector
Figure SMS_100
=(μ 1 μ 2 ,…,μ n ) The method comprises the steps of carrying out a first treatment on the surface of the Above-mentionedμ i =⌈τ⋅x i ⌉,1≤inThe method comprises the steps of carrying out a first treatment on the surface of the Above-mentionedμ i Is a non-negative integer; the node weight vector is the firstiIndividual componentsμ i Characterizing a networkGMiddle (f)iPersonal nodev i Corresponding weight number.
Using node weight vector to networkGProcessing to obtain multiple networksG 2
Above-mentionedG 2 ={μ v 1 μ v 2 ,…,μ v n }。
Therefore, the universality and the diversity of the sampled samples based on the characteristic vector centrality are fully ensured by reasonably taking the average weight, the weight of the network node is determined according to the average value of each component of the characteristic vector centrality and the average weight, the weight of the node in the multiple network is in direct proportion to the characteristic vector centrality of the node, and the more important node becomes the sample node with higher probability due to the construction of the multiple network.
In yet another optional embodiment, the first network building module obtains the vector coefficients according to the determination result and by using a preset vector coefficient calculation modelτThe method specifically comprises the following steps:
when the judgment result is yes, calculating the minimum integer for enabling the preset first vector coefficient calculation model to be establishedz 1 The minimum integer is setz 1 As vector coefficientsτIs a value of (2).
The first vector coefficient calculation model is as follows
Figure SMS_101
Wherein, the abovex i ∈{x 1 x 2 ,…,x n };
Figure SMS_102
Representing an upward rounding of the abovenThe number of components in the centrality of the feature vector; the saidz 1 Is an integer greater than 0; the saidc 1 For a preset average weightc
When the judgment result is NO, calculating the maximum integer for enabling the preset second vector coefficient calculation model to be establishedz 2 The maximum integer is calculatedz 2 Is the inverse of (2) as a vector coefficientτIs a value of (2).
The second vector coefficient calculation model is as follows
Figure SMS_103
Wherein, the abovex i ∈{x 1 x 2 ,…,x n };
Figure SMS_104
Representing an upward rounding of the abovenRepresenting the number of components in the centrality of the feature vector; above-mentionedz 2 Is an integer greater than 0; the saidc 2 For a preset average weightc
In yet another alternative embodiment, as shown in FIG. 3, the second network building block is derived from multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method specifically comprises the following steps:
from multiple networksG 2 In the method, the method is selected according to uniform random probability distribution hObtaining a first sample node set by the nodes; above-mentionedh=⌈p⋅n⌉ above
Figure SMS_105
Representing an upward rounding; above-mentionednRepresenting the number of components in the centrality of the feature vector; above-mentionedpThe pre-set sampling proportion is characterized in that,pthe value range is 0.1-0pAnd less than or equal to 0.2, the sampling proportion is selected by comprehensively considering the representativeness of the sampling sample and the efficiency of an approximate calculation method.
Judging whether the first sample node set has the number of repeated nodes or not to obtain a second judging result;
if the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in the multiple networks, adding the new nodes into the first sample node set, and enabling the total number of the nodes in the first sample node set to reachhAnd triggering and executing the judgment whether the first sample node set has the repeated node or not, and obtaining a second judgment result.
If the second judgment result is no, the first sample node set is confirmed to be a non-heavy sample node set
Figure SMS_106
In yet another alternative embodiment, as shown in FIG. 3, the second calculation module calculates a set of no-heavy sample nodesSCenter of bettery, get networkGThe median centrality approximation value specifically comprises:
Using the medium number centrality calculation model to collect the non-heavy sample nodesSProcessing to obtain a sample node setSIs a median centrality of (a).
The above-mentioned medium centrality calculation model is:
Figure SMS_107
Figure SMS_108
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the process ofNode
Figure SMS_109
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a);Srepresenting a set of non-heavy sample nodesS
Aggregating the sample nodesSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGNode betweenness centrality approximation.
Example III
Referring to fig. 4, fig. 4 is a schematic structural diagram of another intermediate center-to-center approximation calculation apparatus according to an embodiment of the present invention. The device described in fig. 4 is applicable to an information network, a social network, an internet of things and a traffic network, and the embodiment of the invention is not limited. As shown in fig. 4, the apparatus may include:
a memory 401 storing executable program codes;
a processor 402 coupled with the memory 401;
the processor 402 invokes executable program code stored in the memory 401 for performing the steps in the medium centrality approximation calculation method of feature vector centrality described in embodiment one.
Example IV
The embodiment of the invention discloses a computer-readable storage medium storing a computer program for electronic data exchange, wherein the computer program causes a computer to execute the steps in the medium centrality approximation calculation method characterized by the feature vector centrality described in the embodiment.
Example five
The embodiment of the invention discloses a computer program product, which comprises a non-transitory computer readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute the steps in the method for calculating the centrality approximation of the betweenness of the centrality vectors described in the embodiment.
Finally, it should be noted that: the embodiment of the invention discloses a method and a device for approximate calculation of medium centrality, which are only disclosed as the preferred embodiment of the invention, and are only used for illustrating the technical scheme of the invention, but not limiting the technical scheme; although the invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that; the technical scheme recorded in the various embodiments can be modified or part of technical features in the technical scheme can be replaced equivalently; such modifications and substitutions do not depart from the spirit and scope of the corresponding technical solutions.

Claims (6)

1. A method for central approximation calculation of a betweenness, comprising:
step 101, computing networkGThe characteristic vector centrality is obtained; the number of components in the centrality of the characteristic vector is equal to the networkGThe total number of the middle nodes;
102, calculating a network by utilizing the centrality of the feature vectorsGNode weight vector of (a) to construct a multiple networkG 2 The method comprises the steps of carrying out a first treatment on the surface of the Specific:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean value;
judging whether the component mean value is smaller than a preset average weight numbercObtaining a judgment result;
according to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficientτ
Centering the feature vector with the vector coefficientsτMultiplying to obtain a second feature vector;
rounding up each component in the second feature vector to obtain a node weight vector;
using the node weight vector to determine the node weight vector for the networkGProcessing to obtain multiple networksG 2
Step 103, from the multiple networkG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method comprises the steps of carrying out a first treatment on the surface of the Specific:
from the multiple networksG 2 In the method, the method is selected according to uniform random probability distributionhObtaining a first sample node set by the nodes; the said h=p⋅n⌉; the saidpRepresenting a preset sampling proportion; the saidnFor networksGThe total number of the middle nodes;
judging whether repeated nodes exist in the first sample node set or not to obtain a second judging result;
when the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in a multiple network, and adding the new nodes into the first sample node set to enable the total number of the nodes in the first sample node set to reachhTriggering and executing the judgment whether repeated nodes exist in the first sample node set to obtain a second judgment result;
when the second judgment result is no, the first sample node set is confirmed to be a no-heavy sample node setS
104, calculating the node set without the heavy sampleSCenter of bettery, get networkGA median centrality approximation; specific:
using a median centrality calculation model for the non-heavy sample node setSProcessing to obtain a sample node setSIs the median centrality of (2);
the medium centrality calculation model is as follows:
Figure QLYQS_1
Figure QLYQS_2
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a node v i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the node
Figure QLYQS_3
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a);Srepresenting a set of non-heavy sample nodesS
Aggregating the sample nodesSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGThe median centrality approximation.
2. The method of claim 1, wherein the vector coefficients are obtained by using a predetermined vector coefficient calculation model according to the determination resultτComprising:
when the judgment result is yes, calculating a minimum integer for enabling a preset first vector coefficient calculation model to be establishedz 1 The minimum integer is setz 1 As vector coefficients
Figure QLYQS_4
Is a value of (2);
the first vector coefficient calculation model is
Figure QLYQS_5
Wherein, the saidx i Any component of the feature vector centrality;
Figure QLYQS_6
representing an upward rounding;saidnA component number in the centrality of the feature vector; the saidz 1 Is an integer greater than 0; the saidc 1 For a preset average weightc
When the judging result is NO, calculating a maximum integer for enabling a preset second vector coefficient calculation model to be establishedz 2 The maximum integer is calculated z 2 Is the inverse of (2) as a vector coefficientτIs a value of (2);
the second vector coefficient calculation model is
Figure QLYQS_7
Wherein, the saidx i Any component of the feature vector centrality;
Figure QLYQS_8
representing an upward rounding; the saidnA component number in the centrality of the feature vector; the saidz 2 Is an integer greater than 0; the saidc 2 For a preset average weightc
3. The method of median centrality approximation calculation of claim 1, wherein the calculation networkGFeature vector centrality, resulting in feature vector centrality, comprising:
constructing a networkGAdjacent matrix of (a)A
Based on the adjacency matrixAConstructing a first characteristic equation;
the first characteristic equation is
Figure QLYQS_9
In the method, in the process of the invention,Aas an adjacency matrix for the network,λas a value of the characteristic(s),
Figure QLYQS_10
is a feature vector;
and carrying out calculation processing on the first characteristic equation, calculating to obtain a characteristic vector corresponding to the maximum characteristic value, and taking the characteristic vector corresponding to the maximum characteristic value as characteristic vector centrality.
4. A medium centrality approximation calculation apparatus, the apparatus comprising:
a first computing module for computing a networkGThe characteristic vector centrality is obtained; the number of components in the centrality of the characteristic vector is equal to the network GThe total number of the middle nodes;
a first construction module for calculating a network using the feature vector centralityGNode weight vector of (a) to construct a multiple networkG 2 The method comprises the steps of carrying out a first treatment on the surface of the Specific:
calculating the mean value of each component of the centrality of the feature vector to obtain a component mean value;
judging whether the component mean value is smaller than a preset average weight numbercObtaining a judgment result;
according to the judgment result, a preset vector coefficient calculation model is utilized to obtain a vector coefficientτ
Centering the feature vector with the vector coefficientsτMultiplying to obtain a second feature vector;
rounding up each component in the second feature vector to obtain a node weight vector;
using the node weight vector to determine the node weight vector for the networkGProcessing to obtain multiple networksG 2
A second building block for building up a network from the multiple networksG 2 Sample nodes are selected to obtain a non-heavy sample node setSThe method comprises the steps of carrying out a first treatment on the surface of the Specific:
from the multiple networksG 2 In the method, the method is selected according to uniform random probability distributionhObtaining a first sample node set by the nodes; the saidh=p⋅n⌉; the saidpRepresenting a preset sampling proportion; the saidnFor networksGThe total number of the middle nodes;
judging whether repeated nodes exist in the first sample node set or not to obtain a second judging result;
When the second judgment result is yes, deleting repeated nodes in the first sample node set, selecting new nodes from a method of uniform random probability distribution in a multiple network, and adding the new nodes into the first sample node set to enable the total number of the nodes in the first sample node set to reachhTriggering and executing the judgment whether repeated nodes exist in the first sample node set to obtain a second judgment result;
when the second judgment result is no, the first sample node set is confirmed to be a no-heavy sample node setS
A second calculation module for calculating the non-heavy sample node setSCenter of bettery, get networkGA median centrality approximation; specific:
using a median centrality calculation model for the non-heavy sample node setSProcessing to obtain a sample node setSIs the median centrality of (2);
the medium centrality calculation model is as follows:
Figure QLYQS_11
Figure QLYQS_12
in the method, in the process of the invention,Bet(v)representing nodesvThe centrality of the medium number,Bet(e)representing edgeseThe centrality of the medium number,σv i, v j ) Is a nodev i To the nodev j Is used to determine the number of shortest paths,σv i, v j v) Is a nodev i To the nodev j Through the node
Figure QLYQS_13
The number of shortest paths of (a);σv i, v j e) Is a nodev i To the nodev j Through the edgeeThe number of shortest paths of (a); SRepresenting a set of non-heavy sample nodesS
Aggregating the sample nodesSIs determined as a networkGIs approximated to the median centrality of (a) to obtain a networkGThe median centrality approximation.
5. A medium centrality approximation calculation apparatus, the apparatus comprising:
a memory storing executable program code;
a processor coupled to the memory;
the processor invokes the executable program code stored in the memory to perform the betweenness centrality approximation method of any of claims 1-3.
6. A computer storage medium storing a computer program, the computer program stored in the storage medium being called by a processor to perform the betweenness centrality approximation calculation method according to any one of claims 1-3.
CN202310167081.3A 2023-02-27 2023-02-27 Medium number centrality approximate calculation method and device Active CN116032828B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310167081.3A CN116032828B (en) 2023-02-27 2023-02-27 Medium number centrality approximate calculation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310167081.3A CN116032828B (en) 2023-02-27 2023-02-27 Medium number centrality approximate calculation method and device

Publications (2)

Publication Number Publication Date
CN116032828A CN116032828A (en) 2023-04-28
CN116032828B true CN116032828B (en) 2023-06-13

Family

ID=86074164

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310167081.3A Active CN116032828B (en) 2023-02-27 2023-02-27 Medium number centrality approximate calculation method and device

Country Status (1)

Country Link
CN (1) CN116032828B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115622902A (en) * 2022-12-19 2023-01-17 中国人民解放军国防科技大学 Telecommunication network node importance calculation method based on network structure and node value

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140068039A1 (en) * 2012-08-30 2014-03-06 Vodafone Ip Licensing Limited Information flow optimization in communication systems
CN104168158B (en) * 2014-08-15 2018-05-01 清华大学 The method that multidimensional network carries out network analysis based on betweenness center degree
KR101927613B1 (en) * 2017-03-29 2018-12-10 한국과학기술원 Method of computing betwenness centrality in directed hypergraphs and apparatus thereof
CN108255987A (en) * 2017-12-28 2018-07-06 清华大学 Community-based node betweenness center degree update method
CN109740106A (en) * 2019-01-09 2019-05-10 中国人民解放军国防科技大学 Large-scale network betweenness approximation method based on graph convolution neural network, storage device and storage medium
CN110135092A (en) * 2019-05-21 2019-08-16 江苏开放大学(江苏城市职业学院) Complicated weighting network of communication lines key node recognition methods based on half local center
CN114169591A (en) * 2021-11-19 2022-03-11 浙江大学 Complex network-based equipment support system key damage node identification method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115622902A (en) * 2022-12-19 2023-01-17 中国人民解放军国防科技大学 Telecommunication network node importance calculation method based on network structure and node value

Also Published As

Publication number Publication date
CN116032828A (en) 2023-04-28

Similar Documents

Publication Publication Date Title
US9426233B2 (en) Multi-objective server placement determination
CN111932386B (en) User account determining method and device, information pushing method and device, and electronic equipment
US20180357335A1 (en) Systems for solving general and user preference-based constrained multi-objective optimization problems
CN113379176B (en) Method, device, equipment and readable storage medium for detecting abnormal data of telecommunication network
CN107623924A (en) It is a kind of to verify the method and apparatus for influenceing the related Key Performance Indicator KPI of Key Quality Indicator KQI
CN113422695A (en) Optimization method for improving robustness of topological structure of Internet of things
CN106506231A (en) The charge network key node recognition methodss in structure-oriented hole
CN107517201A (en) A kind of network vulnerability discrimination method removed based on sequential
WO2023207013A1 (en) Graph embedding-based relational graph key personnel analysis method and system
Xiao et al. Network security situation prediction method based on MEA-BP
Wang et al. Evolutionary algorithm-based and network architecture search-enabled multiobjective traffic classification
Shakarian et al. Fast and deterministic computation of fixation probability in evolutionary graphs
CN116032828B (en) Medium number centrality approximate calculation method and device
CN117829307A (en) Federal learning method and system for data heterogeneity
Data et al. Byzantine-resilient high-dimensional federated learning
CN116796481A (en) Social network key node identification method and system based on improved elastic model
Lai et al. Task assignment and capacity allocation for ml-based intrusion detection as a service in a multi-tier architecture
Pan et al. Is Stochastic Mirror Descent Vulnerable to Adversarial Delay Attacks? A Traffic Assignment Resilience Study
CN106911512B (en) Game-based link prediction method and system in exchangeable graph
CN111917589B (en) Power communication network resource backup method and related device
Krishnamurthy et al. Segregation in social networks: Markov bridge models and estimation
Reisizadeh et al. Almost sure convergence of distributed optimization with imperfect information sharing
Li et al. Influencing factors of job waiting time variance on a single machine
Gupta et al. A Spreader Ranking Algorithm for Extremely Low-budget Influence Maximization in Social Networks using Community Bridge Nodes
De Rossi et al. Meshless partition of unity method for attraction basins of periodic orbits: Fast detection of separatrix points

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant