CN114827021A - Multimedia service flow acceleration system based on SDN and machine learning - Google Patents

Multimedia service flow acceleration system based on SDN and machine learning Download PDF

Info

Publication number
CN114827021A
CN114827021A CN202210732754.0A CN202210732754A CN114827021A CN 114827021 A CN114827021 A CN 114827021A CN 202210732754 A CN202210732754 A CN 202210732754A CN 114827021 A CN114827021 A CN 114827021A
Authority
CN
China
Prior art keywords
flow
information
service
packet
component
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210732754.0A
Other languages
Chinese (zh)
Other versions
CN114827021B (en
Inventor
郭永安
吴庆鹏
张啸
余昊
钱琪杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Posts and Telecommunications filed Critical Nanjing University of Posts and Telecommunications
Priority to CN202210732754.0A priority Critical patent/CN114827021B/en
Publication of CN114827021A publication Critical patent/CN114827021A/en
Application granted granted Critical
Publication of CN114827021B publication Critical patent/CN114827021B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/302Route determination based on requested QoS
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/12Shortest path evaluation
    • H04L45/123Evaluation of link metrics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/12Shortest path evaluation
    • H04L45/124Shortest path evaluation using a combination of metrics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/30Routing of multiclass traffic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2425Traffic characterised by specific attributes, e.g. priority or QoS for supporting services specification, e.g. SLA
    • H04L47/2433Allocation of priorities to traffic types
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2441Traffic characterised by specific attributes, e.g. priority or QoS relying on flow classification, e.g. using integrated services [IntServ]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/24Traffic characterised by specific attributes, e.g. priority or QoS
    • H04L47/2483Traffic characterised by specific attributes, e.g. priority or QoS involving identification of individual flows

Abstract

The invention discloses a multimedia service flow acceleration system based on SDN and machine learning, which comprises a flow classification module and a path selection module; the flow classification module trains a machine learning model according to multimedia service flow information in a network to generate a service flow classifier; the service flow classifier is used for classifying the imported network service and identifying the flow requirement corresponding to the network service, wherein the parameters of the flow requirement comprise packet loss rate, time delay and bandwidth; and calculating a corresponding routing strategy according to the flow requirement corresponding to the network service of the path selection module, and meeting the requirements of each parameter of the flow requirement on the basis of the path accessibility. The invention can carry out optimal deployment service flow classification and scheduling strategy, realize rapid deployment and implementation and improve the utilization rate of network resources.

Description

Multimedia service flow acceleration system based on SDN and machine learning
Technical Field
The invention belongs to the technical field of computer networks, and particularly relates to a multimedia service flow acceleration system based on SDN and machine learning.
Background
With the expansion of internet application scale, various multimedia services such as video service, audio service, game service, etc. are in endless, and these services have higher requirements on service quality indexes such as time delay and bandwidth. Therefore, network congestion is easily caused, which makes management difficult. Under the traditional network architecture, a best effort forwarding strategy is adopted, and a Differentiated Services model (Differentiated Services, Diffserv) aims to set different coding points and bind behavior sets for different network Services, so that more fine-grained Differentiated Services can be provided for multimedia service flows. However, when there are many nodes, it is difficult to deploy new functions on each routing node in time. Therefore, the traditional network framework faces the problems of difficult function deployment and simple scheduling strategy.
The invention with publication number CN113114573A provides a video stream classification and scheduling system in an SDN network, which includes: the flow classification module is used for analyzing the packet header of a data packet in the edge router, extracting the flow characteristics, training the classifier, predicting unknown data by using the classifier, forwarding the classified data packet to an output port of the edge router, and issuing a flow table containing modified DSCP domain and output port forwarding behaviors; the path selection module is used for acquiring a global network topology, monitoring state changes of time delay, residual bandwidth and packet loss rate of each link, normalizing and scaling a link state value, calculating to obtain a path list meeting QoS (quality of service) constraint through an FPTAS-MMCP (fast forward adaptive multi-path-multicast service) algorithm, sequentially distributing paths in the feasible path list according to the priorities of different video services, and issuing a flow table to routing nodes on the paths to control data packet forwarding.
Generally speaking, in multimedia services, the priority of a video session is higher, and the priority of an elastic stream and a background stream is lower, however, in the real routing process, routing forwarding cannot be performed discretely according to the priority of the streams, and a routing path needs to be arranged by considering various factors. Therefore, a solution is needed to make a logic control policy more flexible for multimedia traffic flows and implement fast deployment and implementation of the policy.
Disclosure of Invention
The technical problem to be solved is as follows: based on the technical problems, the invention provides a multimedia service flow acceleration system based on SDN and machine learning, which can optimally deploy service flow classification and scheduling strategies, realize rapid deployment and implementation and improve the utilization rate of network resources.
The technical scheme is as follows:
a multimedia service flow acceleration system based on SDN and machine learning comprises a flow classification module and a path selection module which are deployed in an SDN control plane, wherein the SDN control plane issues a flow table to a switch network through an openflow protocol, so that an SDN data plane where the switch network is located executes data forwarding service according to the flow table;
the flow classification module trains a machine learning model according to multimedia service flow information in a network to generate a service flow classifier; the service flow classifier is used for classifying the imported network service and identifying the flow requirement corresponding to the network service, wherein the parameters of the flow requirement comprise packet loss rate, time delay and bandwidth; and calculating a corresponding routing strategy according to the flow requirement corresponding to the network service of the path selection module, and meeting the requirements of each parameter of the flow requirement on the basis of the path accessibility.
Further, the flow classification module comprises a packet information acquisition sub-module, an offline training sub-module and a flow classification sub-module;
the packet information acquisition sub-module comprises a packet information analysis component and a flow characteristic calculation component;
the packet information analysis component is used for acquiring service flow packet information from a data plane through a packet-in event processing function in an openflow protocol, wherein the service flow packet information comprises a source IP address, a destination IP address, a source port, a target port, effective length and arrival time of a data packet; the traffic characteristic calculation component calculates a plurality of characteristic vectors including packet average size, packet size variance, packet average arrival time interval, packet arrival time interval variance and packet size conversion count according to the traffic packet information, and sends the traffic packet information and the corresponding characteristic vectors to the offline training sub-module and the traffic classification sub-module;
the off-line training submodule is used for receiving different types of service traffic packet information and corresponding feature vectors sent by the packet information acquisition submodule, performing off-line learning on the service traffic packet information and the corresponding feature vectors through a machine learning algorithm, and generating a service traffic classifier according to a training result;
and the flow classification submodule loads a service flow classifier, classifies the flow according to the packet information of the network service sent by the packet information acquisition submodule, and identifies the flow type and the flow requirement of the network service.
Further, the offline training sub-module comprises a flow information collection component, a GCN classification component, a classification evaluation component and an evaluation optimization component;
the traffic information collection component is used for receiving different types of service traffic packet information and corresponding characteristic vectors sent by the packet information acquisition sub-module, arranging the service traffic packet information and the corresponding characteristic vectors into training samples, sending the training samples to a sample data set, and updating the sample data set; the GCN classification component adopts a machine learning algorithm to perform off-line learning on training samples in the sample data set, and generates a service flow classifier according to a training result; the classification evaluation component is used for evaluating the classification precision and accuracy of the generated service flow classifier, if the evaluation is qualified, the service flow classifier is output to the flow classification submodule, otherwise, the evaluation result is sent to the evaluation optimization component, and the evaluation optimization component optimizes the training samples in the sample data set according to the evaluation result of the classification evaluation component;
the evaluation process of the classification evaluation component comprises the following steps: randomly extracting 20% of data from the sample data set to perform classification test, and judging that the classifier is qualified if the classification accuracy reaches 90% or more;
the strategy of the evaluation optimization component for optimizing the training samples of the sample data set comprises: according to the sequence of the timestamps of the training samples entering the sample data set from far to near, the training samples are ranked, the training sample with the farthest timestamp and the training sample with the business type accounting for less than 5% are removed regularly, and the data in the sample data set is maintained within a preset quantity range.
Further, the GCN classification component performs offline learning on the training samples in the sample data set by using a machine learning algorithm, and the process of generating the traffic classifier according to the training result includes the following steps:
s1, acquiring the topology structure information of the whole network, generating a graph G (V, E), wherein V is the collection of nodes V in the graph, and E is the collection of edges E of the graph; generating an adjacency matrix A of the weighted graph, wherein in the adjacency matrix A, the weight between two adjacent nodes is set to be 1, and the rest is 0;
s2, generating a node degree matrix D using the adjacency matrix a:
Figure 413652DEST_PATH_IMAGE002
adjacency matrix a is a diagonal matrix;
s3, taking a plurality of eigenvectors output by the flow characteristic calculation component as five-dimensional eigenvectors of each node, and constructing an eigenvector matrix X:
Figure 16671DEST_PATH_IMAGE004
wherein m represents the number of nodes in the network and n represents the dimension of the feature;
s4, constructing a flow calculation model based on a GCN algorithm, wherein the input of the flow calculation model is an adjacency matrix A, a degree matrix D and a feature matrix X, the output of the flow calculation model is a flow demand feature matrix, and the flow demand feature matrix comprises full graph node information, link connection state information and data packet information;
the flow calculation model is composed of
Figure DEST_PATH_IMAGE005
Rolling and laminating the layer diagrams; for the k-th layer map convolution layer, it is provided
Figure 988038DEST_PATH_IMAGE006
Which represents the input of the k-th layer,
Figure DEST_PATH_IMAGE007
an output node representation representing the k-th layer is obtained
Figure 884057DEST_PATH_IMAGE008
Feature matrix
Figure DEST_PATH_IMAGE009
Is the input to the first graph convolution layer; hidden feature representation of node vi in feature propagation process of GCN k layer
Figure 675295DEST_PATH_IMAGE010
Is the average value of its local neighbors, the update rule is as follows:
Figure DEST_PATH_IMAGE011
in the formula (I), the compound is shown in the specification,
Figure 816427DEST_PATH_IMAGE012
Figure DEST_PATH_IMAGE013
wherein
Figure 143765DEST_PATH_IMAGE014
And
Figure DEST_PATH_IMAGE015
representing the degree of node i and the degree of node j respectively,
Figure 977729DEST_PATH_IMAGE016
representing the values of i rows and j columns in the adjacency matrix A, and n represents the number of nodes;
s5, constructing three types of feature matrices according to the parameter value range of the flow demand: a first traffic information feature matrix T for limiting packet loss rate, a second traffic information feature matrix D for limiting time delay and a third traffic information feature matrix B for limiting bandwidth;
s6, respectively calculating the similarity between the flow demand characteristic matrix output by the flow calculation model and the three types of flow information characteristic matrices, and taking the similarity as the flow demand characteristic matrix
Figure DEST_PATH_IMAGE017
Setting similarity weights: (
Figure 990684DEST_PATH_IMAGE018
) Order:
Figure DEST_PATH_IMAGE019
in the formula, the similarity defined by the similarity is characterized in an approximately mode; the process of calculating the similarity between the flow demand characteristic matrix and the three types of flow information characteristic matrices is as follows: respectively solving flow demand characteristic matrix by using 1 norm method
Figure 935506DEST_PATH_IMAGE017
And the Manhattan distances of the first flow information characteristic matrix T, the second flow information characteristic matrix D and the third flow information characteristic matrix B are normalized to respectively obtain the three distances
Figure 819149DEST_PATH_IMAGE020
Order:
Figure DEST_PATH_IMAGE021
Figure 593988DEST_PATH_IMAGE022
Figure DEST_PATH_IMAGE023
further, the
Figure 94239DEST_PATH_IMAGE005
Is 3.
Further, the path selection module comprises a global view acquisition sub-module, a weight processing sub-module and a path calculation sub-module;
the global view acquisition submodule is used for acquiring current network link time delay, global topology information, switch port data rate, maximum data rate and configuration information through an LLDP data packet, an Echo message request and a switch port and flow table statistical information query request, and calculating to obtain the state information of the link of the whole network including the time delay of the whole network, the residual bandwidth and the packet loss rate;
the weight processing submodule is used for carrying out normalization and scaling on the link state information of the whole network;
and the path calculation submodule is used for carrying out route selection by utilizing a routing algorithm according to the link state information and the flow demand corresponding to the network service.
Further, the path computation submodule comprises a reachable path computation component, a path state acquisition component and an optimal path selection component;
the reachable path calculation component obtains all reachable paths corresponding to the imported network service through DFS algorithm query;
the path state acquisition component is used for acquiring the time delay d, the available bandwidth b and the packet loss rate t of all reachable paths;
the optimal path selection component calculates the time delay, the available bandwidth and the packet loss rate of each reachable path according to the similarity weight to obtain corresponding evaluation indexes
Figure 46015DEST_PATH_IMAGE024
Selecting the reachable path with the minimum evaluation index value as the optimal pathPath:
Figure DEST_PATH_IMAGE025
has the advantages that:
the multimedia service flow acceleration system based on the SDN and the machine learning combines the flexibility characteristic of the SDN and the intelligence of the machine learning, and can train the complicated multimedia network service flow and generate a classification model based on a machine learning model of a control domain; and through the classification model, the routing strategy is quickly calculated aiming at the current network state and is issued to the data plane through an openflow protocol. In the flow classification process, the invention not only considers the priority degree of the flow service, but also considers the position of a flow publisher in the network, and classifies the flow in a continuous mode, thereby optimally deploying the service flow classification and scheduling strategy, realizing rapid deployment and implementation and improving the utilization rate of network resources.
Drawings
Fig. 1 is a schematic structural diagram of a multimedia service traffic acceleration system based on SDN and machine learning;
FIG. 2 is a schematic diagram of a traffic classification module;
fig. 3 is a schematic structural diagram of a path selection module.
Detailed Description
The following examples are presented to enable one of ordinary skill in the art to more fully understand the present invention and are not intended to limit the invention in any way.
The embodiment provides a multimedia service traffic acceleration system based on SDN (Software Defined Defined Network, Software Defined Network) and machine learning, which can obtain an optimal deployment service flow classification and scheduling policy, implement rapid deployment and implementation, and improve Network resource utilization rate.
The multimedia service flow acceleration system is applied to the SDN environment. As shown in fig. 1, the multimedia service traffic acceleration system includes a traffic classification module and a path selection module deployed in an SDN control plane, where the SDN control plane (SDN controller) issues a flow table to a switch network through an openflow protocol, so that an SDN data plane where the switch network is located executes a data forwarding service according to the flow table.
The flow classification module trains a machine learning model according to multimedia service flow information in a network to generate a service flow classifier; when a service is generated in a network and needs to be classified, the service flow classifier classifies the introduced network service and identifies the flow requirement corresponding to the network service, wherein parameters of the flow requirement comprise packet loss rate, time delay and bandwidth; and calculating a corresponding routing strategy according to the flow requirement corresponding to the network service of the path selection module, and meeting the requirements of each parameter of the flow requirement on the basis of the path accessibility.
Flow classifying module
As shown in fig. 2, the traffic classification module includes a packet information acquisition sub-module, an offline training sub-module, and a traffic classification sub-module. The packet information acquisition submodule is used for acquiring packet information of unknown flow through an openflow protocol, extracting packet information characteristics and sending the packet information characteristics to the offline training submodule and the flow classification submodule;
the off-line training submodule is used for receiving packet information characteristics sent by the report information acquisition submodule and performing off-line learning on the packet information characteristics through a machine learning algorithm, a training set and a testing machine are input at the beginning of the submodule or manually, and the module can update the training set and the testing set through new network information along with the deployment of the module in a network so as to realize an extensible function of changing in real time according to network changes. Meanwhile, the off-line training submodule can also generate a service flow classifier according to the training result and load the service flow classifier into the flow classification submodule.
And the flow classification submodule loads a flow classifier, classifies the flow according to the packet information sent by the packet information acquisition submodule, and identifies the flow type and the flow requirement.
The flow classification module comprises a packet information acquisition submodule, an off-line training submodule and a flow classification submodule.
The packet information acquisition submodule comprises a packet information analysis component and a flow characteristic calculation component. The packet information parsing component is used for acquiring service traffic packet information from a data plane through a packet-in event processing function in an openflow protocol, wherein the service traffic packet information comprises src (source IP address), dst (destination IP address), src _ port (source port), dst _ port (target port), and effective length and arrival time of a data packet acquired through a len () function and a time () function. The traffic characteristic calculation component calculates a plurality of characteristic vectors including len _ mean (packet average size), len _ std (packet size variance), time _ mean (packet average arrival time interval), time _ std (packet arrival time interval variance) and count (packet size conversion count) according to the traffic packet information, and sends the traffic packet information and the corresponding characteristic vectors to the offline training submodule and the traffic classification submodule.
The off-line training sub-module is used for receiving different types of service flow packet information and corresponding feature vectors sent by the packet information acquisition sub-module, performing off-line learning on the service flow packet information and the corresponding feature vectors through a machine learning algorithm, and generating a service flow classifier according to a training result. In the initial training stage, a training set and a testing machine can be manually input; with the deployment of the offline training submodule in the network, the offline training submodule can update the training set and the test set through new network information so as to realize the extensible function which changes in real time according to the network change.
And the flow classification submodule loads a service flow classifier, classifies the flow according to the packet information of the network service sent by the packet information acquisition submodule, and identifies the flow type and the flow requirement of the network service.
Illustratively, the offline training sub-module includes a traffic information collection component, a GCN classification component, a classification evaluation component, and an evaluation optimization component.
The traffic information collection component is used for receiving different types of service traffic packet information and corresponding characteristic vectors sent by the packet information acquisition sub-module, arranging the service traffic packet information and the corresponding characteristic vectors into training samples, sending the training samples to a sample data set and updating the sample data set; the GCN classification component adopts a machine learning algorithm to perform off-line learning on training samples in the sample data set, and generates a service flow classifier according to a training result; and the classification evaluation component is used for evaluating the classification precision and accuracy of the generated service flow classifier, outputting the service flow classifier to the flow classification submodule if the evaluation is qualified, and otherwise, sending the evaluation result to the evaluation optimization component so that the evaluation optimization component optimizes the training samples in the sample data set according to the evaluation result of the classification evaluation component.
The evaluation process of the classification evaluation component is as follows: randomly extracting 20% of data from the sample data set to perform classification test, and judging that the classifier is qualified if the classification accuracy reaches 90% or more; the evaluation optimization component optimizes the training samples of the sample data set according to the following schemes: 1. ensuring that the data in the sample data set is kept in a stable quantity, namely, timely getting clear the data which is long in time; 2. and removing the data of the service type with the proportion lower than 5% in the sample data set.
Illustratively, traffic classification using the GCN algorithm is described in detail below. Generally speaking, in multimedia services, the priority of a video session is higher, and the priority of an elastic stream and a background stream is lower, however, in the real routing process, routing forwarding cannot be performed discretely according to the priority of the streams, and secondly, routing paths need to be arranged by considering various factors. Therefore, the process of traffic classification is important, which not only takes into account the priority of traffic but also the location of the traffic publisher in the network, classifying the traffic in a continuous manner. The specific training process comprises the following steps:
the process of generating the traffic classifier according to the training result comprises the following steps:
s1, acquiring the topology structure information of the whole network, and generating a graph G (V, E), wherein V is the set of nodes V in the graph, and E is the set of edges E of the graph; an adjacency matrix a of the weighted graph is generated, in which the weight between two adjacent nodes is set to 1, and the rest is 0.
S2, generating a node degree matrix D using the adjacency matrix a:
Figure DEST_PATH_IMAGE027
the adjacency matrix a is a diagonal matrix.
S3, taking a plurality of eigenvectors output by the flow characteristic calculation component as five-dimensional eigenvectors of each node, and constructing an eigenvector matrix X:
Figure DEST_PATH_IMAGE029
where m represents the number of nodes in the network and n represents the dimension of the feature.
S4, constructing a flow calculation model based on the GCN algorithm, wherein the input of the flow calculation model is an adjacency matrix A, a degree matrix D and a characteristic matrix X, the output of the flow calculation model is a flow demand characteristic matrix, and the flow demand characteristic matrix comprises full graph node information, link connection state information and data packet information.
The flow calculation model is composed of
Figure 410262DEST_PATH_IMAGE005
Layer diagram rolling and laminating; for the kth picture convolution layer, let
Figure 54870DEST_PATH_IMAGE006
Which represents the input of the k-th layer,
Figure 511259DEST_PATH_IMAGE007
an output node representation representing the k-th layer is obtained
Figure 266726DEST_PATH_IMAGE008
Feature matrix
Figure 859381DEST_PATH_IMAGE009
Is the input to the first graph convolution layer; hidden feature representation of node vi in feature propagation process of GCN k layer
Figure 674890DEST_PATH_IMAGE030
Is the average value of its local neighbors, the update rule is as follows:
Figure 618575DEST_PATH_IMAGE011
in the formula (I), the compound is shown in the specification,
Figure 941847DEST_PATH_IMAGE012
Figure 654588DEST_PATH_IMAGE013
. Preferably, the first and second liquid crystal materials are,
Figure 375419DEST_PATH_IMAGE005
is 3.
S5, constructing three types of feature matrices according to the parameter value range of the flow demand: a first traffic information feature matrix T for limiting packet loss rate, a second traffic information feature matrix D for limiting time delay and a third traffic information feature matrix B for limiting bandwidth.
S6, respectively calculating the similarity between the flow demand characteristic matrix output by the flow calculation model and the three types of flow information characteristic matrices, and taking the similarity as the flow demand characteristic matrix
Figure 806401DEST_PATH_IMAGE017
Setting similarity weights: (
Figure 903670DEST_PATH_IMAGE018
):
Figure DEST_PATH_IMAGE031
In the formula, the ≈ characterize the similarity defined by the similarity. The process of calculating the similarity between the flow demand characteristic matrix and the three types of flow information characteristic matrices is as follows: respectively solving flow demand characteristic matrix by using 1 norm method
Figure 533234DEST_PATH_IMAGE017
And the Manhattan distances of the first flow information characteristic matrix T, the second flow information characteristic matrix D and the third flow information characteristic matrix B, then normalizing the three distances,are respectively obtained
Figure 926432DEST_PATH_IMAGE020
Order:
Figure 844709DEST_PATH_IMAGE021
Figure 745669DEST_PATH_IMAGE022
Figure 167423DEST_PATH_IMAGE023
(II) route selection module
As shown in fig. 3, the path selection module is configured to select an optimal routing policy based on a current network state according to the traffic type and the traffic demand sent by the traffic classification module. The path selection module comprises a global view acquisition sub-module, a weight processing sub-module and a path calculation sub-module.
The global view obtaining sub-module is configured to obtain current network Link delay, global topology information, switch port data rate, maximum data rate, configuration information, and the like through an LLDP (Link Layer Discovery Protocol) data packet, an Echo message request, and a switch port and flow table statistical information query request, so as to obtain information such as a full network delay, a remaining bandwidth, and a packet loss rate.
And the weight processing submodule is used for normalizing and scaling the link state information of the whole network.
And the path calculation submodule is used for selecting a route by using a routing algorithm according to the link state information and the service requirement.
Illustratively, the specific workflow of the path selection module includes:
step 1: and the global view acquisition sub-module acquires global network information through an Openflow message mechanism. The method specifically comprises the following steps: by adding the timestamp information into the Echo message, the time of data going to and going from the controller and the switch can be acquired, and the subsequent acquisition of the link delay is facilitated. In addition, the OFPFlow states Request message can be used to obtain the packet count, bit number, flow table lifetime and other statistical information meeting the conditions of the flow table matching domain. The Event OFPPort states Reply request message is used for acquiring information such as the receiving/sending bit number of the port of the switch, the survival time of the port and the like. The Event OFPPort Desc states Reply request message is used for acquiring the hardware attributes of the port, such as the current data rate, the maximum data rate, the configuration information and the like of the port. Meanwhile, the LLDP data packet encapsulated with the timestamp information is issued to each switch through an OFPAction Output message. The LLDP Packet is forwarded to the adjacent switch through the port and sent back to the controller through the Packet-In message, so that the adjacency matrix of the switch can be obtained by statistics, and the maintenance and update of the global topology are realized by the message triggered by the data layer behaviors, such as the addition/departure of the switch, the addition/modification/deletion of the port, the addition/deletion of the link, and the like. Based on the basic information, the packet loss rate, the residual bandwidth and the link time delay of the whole network link can be calculated.
And 2, the weight processing submodule performs normalization and scaling processing on the full-network state information obtained by the global trying acquisition submodule.
And 3, the path calculation submodule selects a path by using a routing algorithm according to the network state information and the service requirement.
Illustratively, the specific process of path selection includes:
for a certain service flow, finding all reachable paths through a DFS algorithm; acquiring time delay d, available bandwidth b and packet loss rate t of all paths; calculating the time delay, the available bandwidth and the packet loss rate of each path according to the similarity weight, wherein the formula is as follows:
Figure 495636DEST_PATH_IMAGE025
(ii) a And finally, selecting the path with the minimum y value as the optimal path.

Claims (7)

1. The multimedia service flow acceleration system is characterized by comprising a flow classification module and a path selection module which are deployed in an SDN control plane, wherein the SDN control plane issues a flow table to a switch network through an openflow protocol, so that an SDN data plane where the switch network is located executes data forwarding service according to the flow table;
the flow classification module trains a machine learning model according to multimedia service flow information in a network to generate a service flow classifier; the service flow classifier is used for classifying the imported network service and identifying the flow requirement corresponding to the network service, wherein the parameters of the flow requirement comprise packet loss rate, time delay and bandwidth; and calculating a corresponding routing strategy according to the flow requirement corresponding to the network service of the path selection module, and meeting the requirements of each parameter of the flow requirement on the basis of the path accessibility.
2. The SDN and machine learning based multimedia service traffic acceleration system of claim 1, wherein the traffic classification module comprises a packet information acquisition sub-module, an offline training sub-module, and a traffic classification sub-module;
the packet information acquisition sub-module comprises a packet information analysis component and a flow characteristic calculation component;
the packet information analysis component is used for acquiring service flow packet information from a data plane through a packet-in event processing function in an openflow protocol, wherein the service flow packet information comprises a source IP address, a destination IP address, a source port, a target port, effective length and arrival time of a data packet; the traffic characteristic calculation component calculates a plurality of characteristic vectors including packet average size, packet size variance, packet average arrival time interval, packet arrival time interval variance and packet size conversion count according to the traffic packet information, and sends the traffic packet information and the corresponding characteristic vectors to an offline training submodule and a traffic classification submodule;
the off-line training submodule is used for receiving different types of service traffic packet information and corresponding feature vectors sent by the packet information acquisition submodule, performing off-line learning on the service traffic packet information and the corresponding feature vectors through a machine learning algorithm, and generating a service traffic classifier according to a training result;
and the flow classification submodule loads a service flow classifier, classifies the flow according to the packet information of the network service sent by the packet information acquisition submodule, and identifies the flow type and the flow requirement of the network service.
3. The SDN and machine learning based multimedia service traffic acceleration system of claim 2, wherein the offline training sub-module comprises a traffic information collection component, a GCN classification component, a classification evaluation component, and an evaluation optimization component;
the traffic information collection component is used for receiving different types of service traffic packet information and corresponding characteristic vectors sent by the packet information acquisition sub-module, arranging the service traffic packet information and the corresponding characteristic vectors into training samples, sending the training samples to a sample data set, and updating the sample data set; the GCN classification component adopts a machine learning algorithm to perform off-line learning on training samples in the sample data set, and generates a service flow classifier according to a training result; the classification evaluation component is used for evaluating the classification precision and accuracy of the generated service flow classifier, if the evaluation is qualified, the service flow classifier is output to the flow classification submodule, otherwise, the evaluation result is sent to the evaluation optimization component, and the evaluation optimization component optimizes the training samples in the sample data set according to the evaluation result of the classification evaluation component;
the evaluation process of the classification evaluation component comprises the following steps: randomly extracting 20% of data from the sample data set to perform classification test, and judging that the classifier is qualified if the classification accuracy reaches 90% or more;
the strategy of the evaluation optimization component for optimizing the training samples of the sample data set comprises: according to the sequence of the timestamps of the training samples entering the sample data set from far to near, the training samples are ranked, the training sample with the farthest timestamp and the training sample with the business type accounting for less than 5% are removed regularly, and the data in the sample data set is maintained within a preset quantity range.
4. The SDN and machine learning based multimedia service traffic acceleration system of claim 3, wherein the GCN classification component employs a machine learning algorithm to perform offline learning on training samples in the sample data set, and the process of generating the service traffic classifier according to the training result comprises the following steps:
s1, acquiring the topology structure information of the whole network, and generating a graph G (V, E), wherein V is the set of nodes V in the graph, and E is the set of edges E of the graph; generating an adjacency matrix A of the weighted graph, wherein in the adjacency matrix A, the weight between two adjacent nodes is set to be 1, and the rest are 0;
s2, generating a node degree matrix D using the adjacency matrix a:
Figure DEST_PATH_IMAGE001
adjacency matrix a is a diagonal matrix;
s3, taking a plurality of eigenvectors output by the flow characteristic calculation component as five-dimensional eigenvectors of each node, and constructing an eigenvector matrix X:
Figure 257824DEST_PATH_IMAGE002
wherein m represents the number of nodes in the network and n represents the dimension of the feature;
s4, constructing a flow calculation model based on a GCN algorithm, wherein the input of the flow calculation model is an adjacency matrix A, a degree matrix D and a feature matrix X, the output of the flow calculation model is a flow demand feature matrix, and the flow demand feature matrix comprises full graph node information, link connection state information and data packet information;
the flow calculation model is composed of
Figure DEST_PATH_IMAGE003
Layer diagram rolling and laminating; for the k-th layer map convolution layer, it is provided
Figure 361915DEST_PATH_IMAGE004
Which represents the input to the k-th layer,
Figure 407232DEST_PATH_IMAGE005
an output node representation representing the k-th layer is obtained
Figure 208966DEST_PATH_IMAGE006
The feature matrix is the input to the first graph convolution layer; hidden feature representation of node vi in feature propagation process of GCN k layer
Figure 288917DEST_PATH_IMAGE007
Is the average value of its local neighbors, the update rule is as follows:
Figure 767172DEST_PATH_IMAGE008
in the formula (I), the compound is shown in the specification,
Figure 565363DEST_PATH_IMAGE009
Figure 29843DEST_PATH_IMAGE010
wherein
Figure 105246DEST_PATH_IMAGE011
And
Figure 364189DEST_PATH_IMAGE012
representing the degree of node i and the degree of node j respectively,
Figure 774311DEST_PATH_IMAGE013
representing the values of i rows and j columns in the adjacency matrix A, and n represents the number of nodes;
s5, constructing three types of feature matrices according to the parameter value range of the flow demand: a first traffic information feature matrix T for limiting packet loss rate, a second traffic information feature matrix D for limiting time delay and a third traffic information feature matrix B for limiting bandwidth;
s6, respectively calculating the similarity between the flow demand characteristic matrix output by the flow calculation model and the three types of flow information characteristic matrices, and taking the similarity as the flow demand characteristic matrix
Figure 42481DEST_PATH_IMAGE014
Setting similarity weights: (
Figure 831446DEST_PATH_IMAGE015
) Order:
Figure 667815DEST_PATH_IMAGE016
in the formula, the similarity defined by the similarity is characterized in an approximately mode; the process of calculating the similarity between the flow demand characteristic matrix and the three types of flow information characteristic matrices is as follows: respectively solving flow demand characteristic matrix by using 1 norm method
Figure 175019DEST_PATH_IMAGE014
And the Manhattan distances of the first flow information characteristic matrix T, the second flow information characteristic matrix D and the third flow information characteristic matrix B are normalized to respectively obtain the three distances
Figure 105935DEST_PATH_IMAGE017
Order:
Figure 14985DEST_PATH_IMAGE018
Figure 881310DEST_PATH_IMAGE019
Figure 16756DEST_PATH_IMAGE020
5. the SDN and machine learning based multimedia service traffic acceleration system of claim 4, wherein the SDN and machine learning based multimedia service traffic acceleration system
Figure 361150DEST_PATH_IMAGE003
Is 3.
6. The SDN and machine learning based multimedia service traffic acceleration system of claim 4, wherein the path selection module comprises a global view acquisition sub-module, a weight processing sub-module, and a path computation sub-module;
the global view acquisition submodule is used for acquiring current network link time delay, global topology information, switch port data rate, maximum data rate and configuration information through an LLDP data packet, an Echo message request and a switch port and flow table statistical information query request, and calculating to obtain the state information of the link of the whole network including the time delay of the whole network, the residual bandwidth and the packet loss rate;
the weight processing submodule is used for carrying out normalization and scaling on the link state information of the whole network;
and the path calculation submodule is used for carrying out route selection by utilizing a routing algorithm according to the link state information and the flow demand corresponding to the network service.
7. The SDN and machine learning based multimedia service traffic acceleration system of claim 6, wherein the path computation sub-module comprises a reachable path computation component, a path state acquisition component, and an optimal path selection component;
the reachable path calculation component obtains all reachable paths corresponding to the imported network service through DFS algorithm query;
the path state acquisition component is used for acquiring the time delay d, the available bandwidth b and the packet loss rate t of all reachable paths;
the optimal path selection component calculates the time delay, the available bandwidth and the packet loss rate of each reachable path according to the similarity weight to obtain corresponding evaluation indexesSign board
Figure 514920DEST_PATH_IMAGE021
Selecting the reachable path with the minimum evaluation index value as the optimal path:
Figure 817725DEST_PATH_IMAGE022
CN202210732754.0A 2022-06-27 2022-06-27 Multimedia service flow acceleration system based on SDN and machine learning Active CN114827021B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210732754.0A CN114827021B (en) 2022-06-27 2022-06-27 Multimedia service flow acceleration system based on SDN and machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210732754.0A CN114827021B (en) 2022-06-27 2022-06-27 Multimedia service flow acceleration system based on SDN and machine learning

Publications (2)

Publication Number Publication Date
CN114827021A true CN114827021A (en) 2022-07-29
CN114827021B CN114827021B (en) 2022-11-25

Family

ID=82523230

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210732754.0A Active CN114827021B (en) 2022-06-27 2022-06-27 Multimedia service flow acceleration system based on SDN and machine learning

Country Status (1)

Country Link
CN (1) CN114827021B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116055378A (en) * 2023-01-10 2023-05-02 中国联合网络通信集团有限公司 Training method and device for traffic scheduling strategy generation model
CN116137630A (en) * 2023-04-19 2023-05-19 井芯微电子技术(天津)有限公司 Method and device for quantitatively processing network service demands
CN116781418A (en) * 2023-08-16 2023-09-19 南京邮电大学 SDN malicious controller detection method based on neural network and SVM

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108259367A (en) * 2018-01-11 2018-07-06 重庆邮电大学 A kind of Flow Policy method for customizing of the service-aware based on software defined network
CN111090764A (en) * 2019-12-20 2020-05-01 中南大学 Image classification method and device based on multitask learning and graph convolution neural network
CN113114573A (en) * 2021-04-16 2021-07-13 重庆邮电大学 Video stream classification and scheduling system in SDN network
WO2021243585A1 (en) * 2020-06-03 2021-12-09 Huawei Technologies Co., Ltd. Method and system for generating network configurations using graph neural network
CN113794695A (en) * 2021-08-26 2021-12-14 浙江工业大学 GRU-based network abnormal flow detection and identification method
CN113890857A (en) * 2021-09-30 2022-01-04 华南师范大学 Qos level marking method and system for SDN network
CN114124823A (en) * 2021-10-18 2022-03-01 西安电子科技大学 Self-adaptive routing method, system and equipment oriented to high-dynamic network topology
CN114417958A (en) * 2021-12-06 2022-04-29 天翼电子商务有限公司 Unbalanced financial data credit evaluation method based on improved graph convolution neural network

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108259367A (en) * 2018-01-11 2018-07-06 重庆邮电大学 A kind of Flow Policy method for customizing of the service-aware based on software defined network
CN111090764A (en) * 2019-12-20 2020-05-01 中南大学 Image classification method and device based on multitask learning and graph convolution neural network
WO2021243585A1 (en) * 2020-06-03 2021-12-09 Huawei Technologies Co., Ltd. Method and system for generating network configurations using graph neural network
CN113114573A (en) * 2021-04-16 2021-07-13 重庆邮电大学 Video stream classification and scheduling system in SDN network
CN113794695A (en) * 2021-08-26 2021-12-14 浙江工业大学 GRU-based network abnormal flow detection and identification method
CN113890857A (en) * 2021-09-30 2022-01-04 华南师范大学 Qos level marking method and system for SDN network
CN114124823A (en) * 2021-10-18 2022-03-01 西安电子科技大学 Self-adaptive routing method, system and equipment oriented to high-dynamic network topology
CN114417958A (en) * 2021-12-06 2022-04-29 天翼电子商务有限公司 Unbalanced financial data credit evaluation method based on improved graph convolution neural network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘超等: ""融合多角度信息和图卷积网络的社交网络节点分类模型"", 《重庆理工大学学报(自然学报)》 *
戚琦等: "GAD:基于拓扑感知的时间序列异常检测", 《通信学报》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116055378A (en) * 2023-01-10 2023-05-02 中国联合网络通信集团有限公司 Training method and device for traffic scheduling strategy generation model
CN116137630A (en) * 2023-04-19 2023-05-19 井芯微电子技术(天津)有限公司 Method and device for quantitatively processing network service demands
CN116137630B (en) * 2023-04-19 2023-08-18 井芯微电子技术(天津)有限公司 Method and device for quantitatively processing network service demands
CN116781418A (en) * 2023-08-16 2023-09-19 南京邮电大学 SDN malicious controller detection method based on neural network and SVM
CN116781418B (en) * 2023-08-16 2023-10-31 南京邮电大学 SDN malicious controller detection method based on neural network and SVM

Also Published As

Publication number Publication date
CN114827021B (en) 2022-11-25

Similar Documents

Publication Publication Date Title
CN114827021B (en) Multimedia service flow acceleration system based on SDN and machine learning
CN111600754B (en) Industrial heterogeneous network scheduling method for interconnection of TSN (transmission time network) and non-TSN (non-Transmission time network)
CN109818865B (en) SDN enhanced path boxing device and method
Chao et al. Flow classification for software-defined data centers using stream mining
CN108833279B (en) Method for multi-constraint QoS routing based on service classification in software defined network
CN109831386B (en) Optimal path selection algorithm based on machine learning under SDN
US20120087377A1 (en) Methods and apparatus for hierarchical routing in communication networks
CN102025620B (en) Cognitive network QoS (quality of service) guarantee method on basis of service differentiation
CN114143264B (en) Flow scheduling method based on reinforcement learning under SRv network
CN105791151B (en) A kind of dynamic flow control method and device
CN111628942A (en) Resource allocation method in time-sensitive network
CN113114573A (en) Video stream classification and scheduling system in SDN network
CN115277574B (en) Data center network load balancing method under SDN architecture
CN108768713A (en) A kind of guarantee QoS dynamic routing algorithms in SDN
CN105743804A (en) Data flow control method and system
CN109617819A (en) Software definition backhaul network method for routing based on traffic classification
CN103188152A (en) Cognitive network quality of service (QoS) routing method based on service differentiation
Zhou et al. Multi-task deep learning based dynamic service function chains routing in SDN/NFV-enabled networks
Bruni et al. Quality of experience provision in the future internet
CN114448899A (en) Method for balancing network load of data center
CN112350948B (en) Distributed network tracing method of SDN-based distributed network tracing system
CN116455824A (en) Network traffic load balancing method based on reinforcement learning
Gupta et al. Improved route selection approaches using Q-learning framework for 2D NoCs
CN113259263B (en) Data packet scheduling method in deep packet inspection cluster
Bolodurina et al. Comprehensive approach for optimization traffic routing and using network resources in a virtual data center

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant