CN113490254B

CN113490254B - VNF migration method based on bidirectional GRU resource demand prediction in federal learning

Info

Publication number: CN113490254B
Application number: CN202110919085.3A
Authority: CN
Inventors: 唐伦; 吴婷; 周鑫隆; 陈前斌
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2021-08-11
Filing date: 2021-08-11
Publication date: 2022-03-29
Anticipated expiration: 2041-08-11
Also published as: CN113490254A

Abstract

The invention relates to a VNF migration method based on bidirectional GRU resource demand prediction in federal learning, and belongs to the technical field of mobile communication. The method comprises the following steps: s1: in a network slice scene, a VNF migration problem caused by time-varying network flow and a VNF migration delay problem caused by the fact that the resource demand of the VNF is lack of prediction are considered, and the resource demand of the VNF is predicted by adopting a FedBi-GRU algorithm; s2: according to the resource demand prediction result, calculating the resource utilization rate of the physical node, judging the physical node with overloaded resource use or underloaded resource use in the network system, and realizing system energy consumption optimization and load balance while ensuring network performance through VNF migration; s3: and obtaining the optimal decision of VNF migration by adopting a DPPO deep reinforcement learning method. The invention can reduce the migration times of the virtual network function and the energy consumption of the network system, and can ensure the load balance of the network system.

Description

VNF migration method based on bidirectional GRU resource demand prediction in federal learning

Technical Field

The invention belongs to the technical field of mobile communication, and relates to a VNF migration method based on federal learning bidirectional GRU resource demand prediction.

Background

Software Defined Networking (SDN) and Network Function Virtualization (NFV) are two new architectures for managing network systems, the main idea behind SDN is to make the network directly programmable and separate the control plane from the data plane, provide a centralized view for the network, provide manageability for complex network systems, NFV converts physical layer resources into virtual resources, separates software instances from underlying dedicated hardware, and makes the network flexible.

By using the NFV technology, traditional network hardware resources can be virtualized into a plurality of virtual machines, various network element function software of an operator network can be instantiated into a Virtual Network Function (VNF), and the VNF can be deployed in a general server, so that the flexibility of network service deployment is greatly improved, the investment cost of network equipment of the operator is reduced, and the high-efficiency network resource utilization rate is realized. The dynamic change of the network flow causes that the available resources of the physical nodes are not matched with the resources required by the VNF, if the new VNF is re-instantiated, the state information of the original VNF is lost, the VNF can be migrated to solve the problem of resource mismatching, and in addition, the VNF is migrated by utilizing the VNF to realize the integration of the VNF, so that the energy consumption of a system can be reduced.

The existing VNF migration problem is that VNF migration is carried out based on real-time network information, few documents are used for carrying out resource demand prediction work of the VNF, a VNF migration plan is made in advance, and prediction methods adopted by the documents are all centralized machine learning. In addition, in the face of high-dimensional complexity of a VNF migration remapping space, an existing heuristic algorithm is not enough to find an optimal VNF migration scheme.

Disclosure of Invention

In view of this, the present invention provides a VNF migration method based on federal learning bidirectional GRU resource demand prediction, which reduces the training burden of the existing centralized prediction mode, improves the generalization of the prediction model, and simultaneously can realize network system energy consumption optimization and load balancing under the limitation of the bottom CPU, storage and bandwidth resources, and end-to-end network delay.

In order to achieve the purpose, the invention provides the following technical scheme:

a VNF migration method based on bidirectional GRU resource demand forecasting of federal learning specifically comprises the following steps:

s1: under a network slice scene, a Virtual Network Function (VNF) migration problem caused by time-varying network flow and a VNF migration delay problem caused by the lack of prediction of VNF resource demand are considered, and the resource demand of the VNF is predicted by adopting a Federal learning-based bidirectional gating cycle unit network (FedBi-GRU) algorithm;

s2: calculating the resource utilization rate of the physical node according to the resource demand prediction result obtained in the step S1, determining the physical node with overloaded resource use or underloaded resource use in the network system, and migrating through the VNF, thereby realizing system energy consumption optimization and load balancing while ensuring network performance;

s3: in consideration of the complexity and high dimensionality of the virtual network function migration decision problem, the optimal decision of VNF migration is obtained by adopting a deep reinforcement learning method of distributed near-end policy optimization (DPPO).

Further, in step S1, the network slice scene includes three layers of architectures: the physical infrastructure layer provides physical resources of the system and mainly comprises a plurality of physical nodes; the virtualization layer abstracts physical resources such as an access network, a core network, a frequency spectrum and the like into virtual resources and provides resource support for the slice layer; the network slice layer is used for processing user network services and is processed by SFC formed by arranging a series of VNFs according to a set sequence; wherein SFC represents a service function chain;

the VNF migration problem is selecting a remapped physical node for a VNF;

the resource requirements of the VNF are CPU, storage and bandwidth resources required by the VNF at a future time.

Further, in step S1, the FedBi-GRU algorithm performs local Bi-GRU resource demand prediction model training for VNFs on SFCs on respective deployed physical nodes, performs aggregation and averaging of prediction model parameters at the SDN controller, and then issues the prediction model parameters to each VNF for continuing model training, where Bi-GRU represents a bidirectional gating cycle unit, and SDN represents a software defined network; the method specifically comprises the following steps:

s11: issuing initial global parameters by the SDN controller;

s12: each VNF on the SFC receives the initial parameters to carry out local Bi-GRU resource demand prediction model training;

s13: uploading parameters of a local Bi-GRU resource demand prediction model by each VNF on the SFC;

s14: the SDN controller performs federal averaging on the collected parameters to obtain new global parameters;

s15: the VNF on the SFC receives the new global parameters and continues to train the Bi-GRU resource demand prediction model;

s16: and repeating the steps S11-S15 until the Bi-GRU resource demand forecasting models of all VNFs on the SFC converge.

Further, in step S2, the resource utilization rate of the physical node is a ratio of the sum of CPU resource requirements of all VNFs on the physical node to the CPU capacity of the physical node;

the resource utilization rate of the physical node with the resource use overload or the resource use underload is higher than the highest CPU resource use threshold value of the physical node or lower than the lowest CPU resource use threshold value;

the network performance is to satisfy the network service quality of the user.

Further, in step S2, the system energy consumption is optimized to minimize the network system energy consumption after VNF migration, where the network system energy consumption includes the operation state energy consumption and the state switching energy consumption of the bottom physical node; the operation state energy consumption comprises basic energy consumption of physical nodes capable of operating and load energy consumption of the nodes, the load energy consumption is in direct proportion to the utilization rate of CPU resources, and the operation energy consumption of the physical nodes at the moment t is represented as:

wherein Z is_n(t) a binary variable indicating whether physical node n is on, where physical node n remains on as long as VNF is mapped to physical node n, and Z is at this time_n(t) 1, otherwise the physical node enters the sleep state, Z_n(t) ═ 0; is provided with

For the fundamental energy consumption of the operation of the physical node n,

the energy consumption is the energy consumption that the CPU resource of the physical node is occupied;

the CPU resource utilization rate of the physical node n at time t is expressed as:

wherein the content of the first and second substances,

indicating whether VNF j in the ith SFC at time instant tth maps to a binary variable of physical node n,

indicating that VNF j maps onto physical node n; f is a set of SFCs,

all VNFs on the ith SFC are collected; c_nRepresenting the computing resources owned by node n;

let S_n(t) represents a binary variable of which the operating state of the physical node n changes at the time t, if S_nWhen (t) '1' indicates that the operating state of the physical node n has changed, S_n(t) is represented as follows:

the switching energy consumption generated by the change of the working state of the node n at the moment t is expressed as

Therefore, when the VNF migrates from physical node n to physical node m, the energy consumption of the entire network system is expressed as:

wherein N is^PIs a set of underlying physical nodes.

Further, in step S2, the load balancing is to minimize a variance of network resources, where the variance of network resources is a variance of utilization rates of CPUs and storage resources of all physical nodes of the network system;

assuming that at time t, VNF j on SFC i triggers a migration condition, after VNF j is migrated from physical node n to physical node m, load change of physical node m may be caused, and the load of physical node m at time t is represented as:

wherein the content of the first and second substances,

and

respectively representing the CPU and storage resource loads of the physical node m at the moment t;

a binary variable indicating whether VNF j in the ith SFC maps to physical node n,

indicating that VNF j maps onto physical node n;

respectively representing the computing resource requirement and the storage resource requirement of the jth VNF on the ith SFC at time instant tth,

all VNFs on the ith SFC are collected; the network resource variance of a single physical node cannot measure the load balance of the whole network system, and the load balance of the whole network system can only be measuredEnsuring good network performance. The mean value of the CPUs of the entire network system at time t

Mean value of storage load

Is represented as follows:

wherein N is^PIs a set of underlying physical nodes, C_mAnd M_mRepresenting the CPU and storage resources owned by the physical node m;

the CPU and storage resource variance of the entire network system is represented as:

the resource variance of the entire network system is expressed as:

further, in step S3, the complexity and high-dimensional property of the virtual network function migration decision problem is that the VNF mapping decision space is high-dimensional and complex due to the diversity of network traffic flows and the dynamic variability of the network traffic environment.

Further, in step S3, the deep reinforcement learning method for distributed near-end policy optimization (DPPO) has a structure jointly trained by a global network and a plurality of agent local networks, and the global network and the agent local networks have the same near-end policy optimization (PPO) network structure, and specifically includes the following steps:

s31: taking an SFC as an agent, and independently and synchronously carrying out PPO algorithm training on each agent in different threads to obtain different PPO network parameters; wherein PPO represents near-end policy optimization;

s32: each agent regularly pushes a loss function gradient to the global PPO network;

s33: when the gradient of each agent is accumulated to the updating frequency times of the global network by collecting the gradient of each agent, the global network updates the global network parameters by using a gradient descent mode, wherein the updating mode of the global network parameters is asynchronous updating;

s34: pushing parameters obtained by the global network to each agent, enabling the agents to interactively update the gradient with the slice network system, pushing the parameters to the global network to update global network parameters, and repeating the steps until the DPPO algorithm is converged; where DPPO represents distributed near-end policy optimization.

Further, in step S3, the optimal decision of VNF migration is to find a VNF mapping policy that achieves approximately the minimum system energy consumption and network resource variance by using a DPPO algorithm.

The invention has the beneficial effects that: aiming at the problem of VNF migration delay caused by lack of effective resource prediction in the existing VNF migration problem, the invention provides a distributed Federal learning bidirectional gating unit resource prediction algorithm, the distributed prediction architecture relieves the problems of centralized prediction data training pressure and memory shortage, and an intelligent VNF migration algorithm based on distributed near-end strategy optimization is provided based on the resource demand prediction result, so that energy consumption optimization and load balance of a network system are realized.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention may be realized and attained by the means of the instrumentalities and combinations particularly pointed out hereinafter.

Drawings

For the purposes of promoting a better understanding of the objects, aspects and advantages of the invention, reference will now be made to the following detailed description taken in conjunction with the accompanying drawings in which:

FIG. 1 is a VNF resource demand prediction flow diagram for a Federal bidirectional gated cycle Unit (FedBi-GRU) of the present invention;

fig. 2 is a flowchart of a VNF migration algorithm for distributed near-end policy optimization.

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention in a schematic way, and the features in the following embodiments and examples may be combined with each other without conflict.

Referring to fig. 1-2, the physical layer of the network slice of the embodiment is defined as a full-connection undirected graph G^P＝(N^P,L^P)，N^PIs a set of underlying physical nodes, L^PIs the set of underlying physical links. The invention considers two resource use conditions, namely computing resources and storage resources. Let C_nRepresenting the computational resources owned by node n, M_nRepresenting the storage resources owned by node N, where N, m ∈ N^p，l_nmRepresents a physical link between physical nodes n and m, and l_nm∈L^P。

A network slice layer is defined as a set of SFCs, denoted as F ═ F_i1,2, I, the ith SFC chain is abstracted as a directed graph

For all VNFs on the ith SFC,

is the set of all virtual links on the ith SFC. The computational resource requirement of the jth VNF on the ith SFC is

The storage resource needs are

Virtual link between jth VNF and kth VNF on SFC i

The bandwidth requirement is

Is provided with

meaning that VNF j maps onto physical node n,

representing a virtual link between the jth VNF and the kth VNF on SFC i

Whether or not to map to physical link/_nmIn the above-mentioned manner,

to represent

Mapping to l_nmIn the above-mentioned manner,

in this embodiment, when predicting VNF resource demand, each physical node acquires CPU, storage, and bandwidth resource characteristics of a VNF thereon, and for a VNF j on an SFC i, the network resource demand characteristics may be expressed as

Respectively, the CPU, storage, and bandwidth resource characteristics of VNF j.

Referring to fig. 1, fig. 1 is a flowchart of a VNF resource demand prediction algorithm of the present invention, including the following steps:

step 1): parameter aggregator SDN controller issuing initial parameter w₀To each VNF on SFC i.

Step 2): the VNF on the SFC collects network resource usage history data, and for the VNF j on the SFC i, a series of data feature sample sets o ═ o of the VNF j can be obtained by collecting receipts₁,o₂,...,o_s,., inputting the feature sample set into a local Bi-GRU model of VNF j in combination with an initial parameter w₀And (5) training.

Step 3): each VNF on SFC i will train the resulting parameter w₁,w₂,...,w_jUploading to an SDN controller for federal averaging to obtain new global parameters

Step 4): SDN controller assigns new global parameters

And issuing all VNFs on the SFC i, and continuing Bi-GRU model training by the local VNFs until the training times are reached.

In the embodiment, the energy consumption of the physical node running state and the energy consumption of state switching are mainly considered when the energy consumption of the network system is calculated. The operation state energy consumption comprises the basic energy consumption of the operation of the physical node and the load energy consumption of the node, the load energy consumption is in direct proportion to the load of the CPU, and the operation energy consumption of the physical node at the moment t is represented as follows:

wherein Z is_n(t) a binary variable indicating whether physical node n is on, where physical node n remains on as long as VNF is mapped to physical node n, and Z is at this time_n(t) 1, otherwise the physical node enters the sleep state, Z_n(t) is 0. Is provided with

For the fundamental energy consumption of the operation of the physical node n,

the energy consumption that the CPU resource is full is the physical node.

let S_n(t) represents a binary variable of which the operating state of the physical node n changes at the time t, if S_nWhen (t) '1' indicates that the operating state of the physical node n has changed, S_n(t) may be expressed as follows:

the switching energy consumption generated by the change of the working state of the node n at the time t can be expressed as

Therefore, when the VNF migrates from physical node n to physical node m, the energy consumption of the entire network system can be expressed as:

load balancing model:

the closer the average network resource variance usage is, the stronger the network load balancing capability of the physical node is, assuming that at time t, the VNF j on the SFC i triggers the migration condition, then the VNF j may cause load change of the physical node m after being migrated from the physical node n to the physical node m, and the load of the physical node m at time t may be represented as:

wherein the content of the first and second substances,

and

the loads of the CPU and the storage resource of the physical node m at the time t are respectively represented, the network resource variance of a single physical node cannot measure the load balance of the whole network system, and the load balance of the whole network system can ensure good network performance. Therefore, the CPU, storage load mean value of the whole network system at time t is expressed as follows:

wherein, C_mAnd M_mRepresenting the CPU and storage resources owned by the physical node m, the CPU and storage resource variance of the whole network system can be represented as:

the resource variance of the entire network system can be expressed as:

referring to fig. 2, fig. 2 is a VNF migration algorithm for distributed near-end policy optimization according to this embodiment, and the steps are as follows:

step 1): taking an SFC as an agent, placing each agent in different threads to independently and synchronously train a PPO algorithm to obtain different PPO network parameters, wherein for an agent n, the PPO network updating mode is as follows:

the criticic network process of agent n is a loss function for minimizing TD error, and the loss function is expressed as follows:

the Actor network loss function of the n-local PPO network of the agent is as follows:

wherein s is_n(T) represents the operating state of agent n at iteration step T, V(s)_n(T +1)) means that the agent n is in the working state s_nFunction of state value of (T +1), a_n(t) represents the VNF mapping action taken by agent n at time t, E_t[·]Represents the expectation function at time t, sigma represents the hyperparameter of the PPO network,

an Actor network parameter representing agent n;

for instant awards, gamma_v∈[0，1]For the discounting factor, T is the iteration step size in a round of training, A(s)_n(t),a_n(t)) is a merit function;

for the ratio of old and new mapping strategies, σ is set to 0.2 for the over-parameter.

Step 2): loss function of agent n

Critic network gradient for updating global PPO network, global Critic network gradient delta theta_cAnd a parameter theta_cThe update mode is as follows:

θ_c＝θ_c+ε_cΔθ_c

wherein epsilon_cIs the learning rate of the Critic network.

Gradient delta theta of global Actor network_aAnd a parameter theta_aThe update method (2) is as follows:

θ_a＝θ_a+ε_aΔθ_a

wherein epsilon_aIs the learning rate of the Actor network.

Step 3): when the gradient is accumulated to the updating frequency times of the global network, the global network updates the global network parameters by using a gradient descent mode, wherein the updating mode of the global network parameters is asynchronous updating;

step 4): and pushing the parameters obtained by the global network to each intelligent agent, wherein the intelligent agent interactively updates the gradient between the obtained parameters and the slice network system, pushes the parameters to the global network to update the global network parameters, and repeats the steps until the energy consumption and the load of the network system are balanced.

Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the present invention, and although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all of them should be covered by the claims of the present invention.

Claims

1. A VNF migration method based on bidirectional GRU resource demand forecasting of federal learning is characterized by comprising the following steps:

s1: in a network slice scene, a VNF migration problem caused by time-varying network flow and a VNF migration delay problem caused by the fact that the resource demand of the VNF is lack of prediction are considered, and the resource demand of the VNF is predicted by adopting a FedBi-GRU algorithm; the FedBi-GRU represents a bidirectional gating cycle unit network based on federal learning, and the VNF represents a virtual network function;

in step S1, the FedBi-GRU algorithm performs local Bi-GRU resource demand prediction model training on the respective deployed physical nodes for the VNFs on the SFC, performs aggregation averaging of prediction model parameters at the SDN controller, and then issues the prediction model parameters to each VNF for continuing model training, where Bi-GRU represents a bidirectional gated cyclic unit, and SDN represents a software defined network; the method specifically comprises the following steps:

s11: issuing initial global parameters by the SDN controller;

s16: repeating the steps S11-S15 until the Bi-GRU resource demand prediction models of all VNFs on the SFC are converged;

s3: obtaining an optimal decision of VNF migration by adopting a deep reinforcement learning method of distributed near-end strategy optimization;

in step S3, the method for deep reinforcement learning by distributed near-end policy optimization specifically includes the following steps:

s34: pushing parameters obtained by the global network to each agent, enabling the agents to interactively update the gradient with the slice network system, pushing the parameters to the global network to update global network parameters, and repeating the steps until the DPPO algorithm is converged; wherein DPPO represents distributed near-end policy optimization;

in step S3, the optimal decision of VNF migration is to find a VNF mapping policy that minimizes system energy consumption and network resource variance by using a DPPO algorithm.

2. The VNF migration method according to claim 1, wherein in step S1, the network slice scenario includes a three-tier architecture: the physical infrastructure layer provides physical resources of the system and consists of a plurality of physical nodes; the virtualization layer abstracts physical resources into virtual resources and provides resource support for the slice layer; the network slice layer is used for processing user network services and is processed by SFC formed by arranging a series of VNFs according to a set sequence; wherein SFC represents a service function chain;

the VNF migration problem is selecting a remapped physical node for a VNF;

3. The VNF migration method according to claim 1, wherein in step S2, the resource utilization of the physical node is a ratio of a sum of CPU resource requirements of all VNFs on the physical node to a CPU capacity of the physical node;

the network performance is to satisfy the network service quality of the user.

4. The VNF migration method according to claim 1, wherein in step S2, the system energy consumption is optimized to minimize network system energy consumption after VNF migration, where the network system energy consumption includes an operation state energy consumption and a state switching energy consumption of an underlying physical node; the operation state energy consumption comprises basic energy consumption of physical nodes capable of operating and load energy consumption of the nodes, the load energy consumption is in direct proportion to the utilization rate of CPU resources, and the operation energy consumption of the physical nodes at the moment t is represented as:

wherein Z is_n(t) binary variable indicating whether physical node n is on, physical node n remaining on as long as VNF is mapped to physical node n, which isTime Z_n(t) 1, otherwise the physical node enters the sleep state, Z_n(t) ═ 0; is provided with

For the fundamental energy consumption of the operation of the physical node n,

wherein the content of the first and second substances,

indicating that VNF j maps onto physical node n; f is a set of SFCs,

representing the computational resource requirements of the jth VNF on the ith SFC at time t;

wherein N is^PIs a set of underlying physical nodes.

5. The VNF migration method according to claim 1, wherein in step S2, the load balancing is to minimize a network resource variance, where the network resource variance is a variance of utilization rates of all physical nodes CPU and storage resources of the network system;

wherein the content of the first and second substances,

and

indicating that VNF j maps onto physical node n;

all VNFs on the ith SFC are collected; CPU mean value of the entire network system at time t

Mean value of storage load

Is represented as follows:

the resource variance of the entire network system is expressed as:

6. the VNF migration method according to claim 1, wherein in step S31, for agent n, the PPO network update method is as follows:

the loss function for agent n is:

wherein r is_n(t) is the instant prize, γ_v∈[0，1]For the discounting factor, T is the iteration step size in a round of training, s_n(T) represents the operating state of agent n at iteration step T, V(s)_n(T +1)) means that the agent n is in the working state s_nA state value function of (T + 1);

wherein the content of the first and second substances,

for the ratio of old and new mapping strategies, A(s)_n(t),a_n(t)) is the merit function, a_n(t) represents the VNF mapping action taken by agent n at time t, E_t[·]Represents the expectation function at time t, sigma represents the hyperparameter of the PPO network,

representing the Actor network parameter of agent n.

7. The VNF migration method according to claim 6, wherein in step S32, a loss function of agent n is used

Updating the Critic network gradient of the global PPO network, wherein the global Critic network gradient delta theta_cAnd a parameter theta_cThe updating method comprises the following steps:

θ_c＝θ_c+ε_cΔθ_c

wherein epsilon_cLearning rate of Critic network;

gradient delta theta of global Actor network_aAnd a parameter theta_aThe updating method comprises the following steps:

θ_a＝θ_a+ε_aΔθ_a

wherein epsilon_aIs the learning rate of the Actor network.