CN112148492A - Service deployment and resource allocation method considering multi-user mobility - Google Patents

Info

Publication number: CN112148492A (granted as CN112148492B)
Authority: CN (China)
Application number: CN202011038113.2A
Original language: Chinese (zh)
Prior art keywords: service, user, deployment, computing, overhead
Legal status: Granted; Active
Inventors: 陈智麒, 张胜, 钱柱中, 李文中, 陆桑璐
Current and original assignee: Nanjing University
Application filed by Nanjing University
Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 9/00 - Arrangements for program control, e.g. control units
    • G06F 9/06 - Arrangements for program control using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F 9/46 - Multiprogramming arrangements
    • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F 9/5005 - Allocation of resources to service a request
    • G06F 9/5027 - Allocation of resources to service a request, the resource being a machine, e.g. CPUs, servers, terminals
    • G06F 30/00 - Computer-aided design [CAD]
    • G06F 30/20 - Design optimisation, verification or simulation
    • G06F 2111/00 - Details relating to CAD techniques
    • G06F 2111/04 - Constraint-based CAD
    • G06F 2111/10 - Numerical modelling
    • Y - GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 - TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D - CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D 30/00 - Reducing energy consumption in communication networks
    • Y02D 30/70 - Reducing energy consumption in wireless communication networks

Abstract

The invention provides a service deployment and resource allocation method considering multi-user mobility, applied to edge computing network scenarios. The method models the edge computing scenario as an optimization problem that jointly accounts for service computation delay overhead, transmission delay overhead, and service migration overhead, and, subject to decision constraints, solves the optimization problem to obtain a multi-user service deployment and computing resource allocation scheme. The invention fills a gap in the field: it supports multi-user service deployment, accounts for user mobility, has wide applicability, and improves task allocation and execution efficiency in edge computing scenarios, thereby improving the overall processing performance of the network.

Description

Service deployment and resource allocation method considering multi-user mobility
Technical Field
The invention relates to the field of edge computing, in particular to a service deployment and resource allocation method considering multi-user mobility under an edge computing network environment.
Background
The last decade has seen rapid development of cloud computing technology. Cloud computing provides users with a dynamic network resource pool, virtualization, and high availability, making the internet the user's data and computing center; a large number of services can run in the cloud, and cloud computing has become an important direction of current information technology. However, everything has two sides, and as the understanding of cloud computing has deepened, its disadvantages have been revealed. (1) Privacy and security: since users must provide their own information to the cloud computing platform in order to use a service, the platform risks leaking these users' privacy. (2) Data transmission cost: the development of the internet of things brings a continuous stream of new smart devices, which generate an amount of data far exceeding the carrying capacity of the network and the cloud computing center. For example, the now-ubiquitous surveillance cameras generate large amounts of video data every day, and an autonomous car can generate up to 5 TB of data per day; under current technical conditions it is impractical to transmit all of this data to the cloud for processing. (3) High latency: due to the centralized nature of cloud computing, some nodes are necessarily far from the computing center, and their real-time performance cannot be guaranteed. Autonomous driving and industrial automation, however, have strict real-time requirements for data processing; if the network delay caused by data transmission cannot meet these requirements, catastrophic consequences may result.
To address these problems of cloud computing, Mobile Edge Computing (MEC) has emerged. By pushing resources such as computing and storage to the edge of the network, mobile edge computing overcomes a series of defects of the traditional cloud computing architecture and provides a low-latency, privacy-preserving, highly scalable, fast, and efficient network computing mode. A typical mobile edge computing scenario contains three kinds of computing devices: cloud computing nodes, edge computing nodes (deployed on edge servers), and user equipment nodes, with the following characteristics:
cloud computing node: generally, computing resources of cloud computing nodes are not limited, and any task can be processed by high-speed computing power after being submitted to a cloud, but the disadvantage is that communication cost for uploading local data to the cloud is high. Representative public Cloud services are AWS, Google Cloud, Microsoft Azure, and the like.
User equipment node: user devices may be smartphones, smart vehicle systems, embedded internet-of-things devices, and the like. They can be regarded as the origin of the data, or as close to it, so their data communication cost is negligible; however, given device hardware cost and long-term battery endurance, they are usually not equipped with high computing performance.
Edge computing node: a trade-off between the two above, with a certain amount of computational processing power and a data communication cost within an acceptable range. Edge computing nodes are often deployed at Access Points (APs) of a mobile network and can provide edge computing services for mobile devices within an AP's coverage. Edge computing nodes may be considered extensions of cloud computing nodes, but, subject to server performance and cost considerations, they provide limited computing resources. When facing multiple services, it is assumed that an edge computing node can divide its CPU computing resources proportionally; different services occupy different CPU proportions and thereby incur different service computation delays.
In a typical mobile edge computing environment, however, the mobility of the user must often be taken into account. Over a period of time a user may roam from one location to another; the data latency to the cloud computing node and to the user equipment node does not change with network location, but the latency to the respective edge computing nodes does. The edge node providing the computing service must therefore decide whether to continue running the service in place or to migrate the service to another edge computing node with lower latency. The contradiction is that the former merely increases data communication delay, while the latter effectively reduces communication delay but incurs extra migration overhead, including the overhead of destroying and recreating the virtual machine, the bandwidth occupied by migrating the virtual machine state, and even the overhead of migration failure. According to the inventors' survey, current research has many shortcomings, such as considering only single-user service deployment decisions and only coarse-grained allocation of computing resources, which cannot satisfy high-performance task allocation and execution under multi-user mobility.
Disclosure of Invention
The purpose of the invention: aiming at the gap in the prior art, the invention provides a service deployment and resource allocation method considering multi-user mobility in an edge computing network environment, which jointly decides how to deploy user services onto edge computing nodes and how the edge computing nodes allocate CPU shares to those services.
The technical scheme is as follows: a service deployment and resource allocation method considering multi-user mobility comprises the following steps:
(1) establishing a mathematical model for the mobile edge computing scenario:
the computing nodes of the model are obtained from the cloud computing node, the user equipment nodes, and the edge computing node set {1, 2, …, E} of the mobile edge computing scenario; service deployment decisions are made at discrete time slices, so model time is defined as the set of discrete time slices {1, 2, …, T};
analyzing the decision space formed by the service deployment and resource allocation problems: when time slice t arrives, the service of user u must decide on which computing node to deploy; and for each computing node, if more than one user service is deployed on it in time slice t, the CPU computing resource allocation ratio among these services must be considered;
computing the three overheads contained in the mobile edge computing scenario, namely computation delay overhead, transmission delay overhead, and service migration overhead, and setting the optimization target of the model as minimizing the weighted sum of the three overheads, wherein the computation delay overhead is the delay from the moment a user's service request reaches the computing node until the computation of the request is completed; the transmission delay overhead is the delay from the moment the user sends the service request until the computing node receives it; and the service migration overhead is the overhead generated by migrating the service correspondingly after the user moves;
(2) solving the established mathematical model to obtain a multi-user service deployment and resource allocation scheme.
Further, the optimization objective of the model is expressed as:

min_{x,y} Σ_t Σ_j ( w_1 · T_j^comp(t) + w_2 · T_j^trans(t) + w_3 · T_j^mig(t) )

wherein x_ij^t is a two-dimensional indicator variable that indicates whether, at time slice t, user service j is deployed to computing node i: x_ij^t = 1 represents deployment, and x_ij^t = 0 the opposite; y_ij^t is the CPU computing resource allocation ratio, a variable between 0 and 1 that represents the proportion of computing resources allocated to user service j at computing node i at time slice t; T_j^comp(t) represents the computation delay overhead of user service j at time slice t; T_j^trans(t) represents the transmission delay overhead of user service j at time slice t; T_j^mig(t) represents the migration overhead of user service j at time slice t; and w_1, w_2, w_3 are the weights of the three overheads, respectively.
Further, the step (2) of solving the established mathematical model includes:
denote the decision space of x = {x_ij^t} as X; any x' ∈ X has a unique corresponding optimal solution y*, so a service deployment and computing resource allocation state is defined as s = (x', y*) ∈ S, and the total cost of the state, C_{s'→s}, is the added overhead needed to transition from the state s' of the last time slice to the state s of the current time slice, where S is the feasible solution set of service deployment and computing resource allocation schemes;
construct a graph G = (V, L), where the vertex set V represents the set of states and the edge set L represents the set of total costs between two states; specifically, the weight of the edge between two adjacent states s_i^{t-1} and s_j^t represents the service migration cost involved in going from state i to state j, plus the service computation delay and transmission delay costs in time slice t;
add artificial nodes S and D on the two sides of graph G, and solve the shortest path between S and D by the following algorithm:
A. receive the input and initialize a dynamic programming state table Φ, which records the mapping from a deployment scheme s_t to its corresponding cost;
B. loop t from 1 to T;
C. in each iteration, obtain all (E+2)^U possible deployment schemes by Cartesian product, recorded as the set H, where E is the number of edge computing nodes and U is the number of user services;
D. traverse the deployment schemes in each set H, and update Φ to the entry that minimizes the sum of the current scheme's overhead and the overhead of the schemes of the last time slice t-1;
E. return the deployment scheme with the lowest cost among all deployment schemes in Φ.
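A minimal sketch of steps A-E above; `stage_cost` and `migration_cost` are placeholder callbacks standing in for the per-slice (computation plus transmission) overhead and the migration overhead, not the patent's exact implementation:

```python
from itertools import product

def fdsp(T, nodes, services, stage_cost, migration_cost):
    """Exact dynamic program over deployment schemes (FDSP sketch).

    A scheme is a tuple mapping each service to a node id; per time slice
    all (E+2)^U schemes are enumerated via Cartesian product (step C).
    """
    # phi maps a scheme to (best cumulative cost, scheme chosen per slice)
    phi = {None: (0.0, [])}
    for t in range(T):  # step B: loop t from 1 to T
        nxt = {}
        for scheme in product(nodes, repeat=len(services)):  # step C
            best = None
            for prev, (cost, path) in phi.items():  # step D: min over t-1 schemes
                mig = migration_cost(prev, scheme) if prev is not None else 0.0
                cand = cost + stage_cost(t, scheme) + mig
                if best is None or cand < best[0]:
                    best = (cand, path + [scheme])
            nxt[scheme] = best
        phi = nxt
    return min(phi.values(), key=lambda v: v[0])  # step E: lowest-cost scheme
```

For example, with two nodes where node t is cheapest at slice t and a migration cost of 0.4, the optimum follows the user (total cost 0.4) rather than staying put (cost 1.0).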
As a more preferred embodiment, the step (2) of solving the established mathematical model includes:
denote the decision space of x = {x_ij^t} as X; any x' ∈ X has a unique corresponding optimal solution y*, so a service deployment and computing resource allocation state is defined as s = (x', y*) ∈ S, and the total cost of the state, C_{s'→s}, is the added overhead needed to transition from the state s' of the last time slice to the state s of the current time slice, where S is the feasible solution set of service deployment and computing resource allocation schemes;
construct a graph G = (V, L), where the vertex set V represents the set of states and the edge set L represents the set of total costs between two states; specifically, the weight of the edge between two adjacent states s_i^{t-1} and s_j^t represents the service migration cost involved in going from state i to state j, plus the service computation delay and transmission delay costs in time slice t;
add artificial nodes S and D on the two sides of graph G, and solve the shortest path between S and D by the following algorithm:
a. receive the input and initialize the state table Φ, which records the mapping from a deployment scheme s_t to its corresponding cost;
b. loop t from 1 to T;
c. in each iteration, obtain all (E+2)^U possible deployment schemes, recorded as the set H, where E is the number of edge computing nodes and U is the number of user services;
d. obtain κ schemes from the deployment schemes in each set H by uniform sampling, and update Φ to the entry that minimizes the sum of the overhead of the sampled schemes of the last time slice t-1 and the overhead of the current sampled scheme;
e. return the deployment scheme with the lowest cost among all deployment schemes in Φ.
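The sampling variant differs from the exact dynamic program only in steps c-d: per time slice, κ uniformly sampled schemes are kept instead of all (E+2)^U. A sketch, again with assumed cost callbacks:

```python
import random
from itertools import product

def sdsp(T, nodes, services, stage_cost, migration_cost, kappa, seed=0):
    """Sampling-based variant (SDSP sketch): same recurrence as the exact
    algorithm, but only kappa uniformly sampled schemes per time slice."""
    rng = random.Random(seed)
    all_schemes = list(product(nodes, repeat=len(services)))
    phi = {None: 0.0}  # scheme -> best cumulative cost so far
    for t in range(T):
        sampled = [rng.choice(all_schemes) for _ in range(kappa)]  # step d
        nxt = {}
        for scheme in sampled:
            best = min(cost + stage_cost(t, scheme)
                       + (migration_cost(prev, scheme) if prev is not None else 0.0)
                       for prev, cost in phi.items())
            nxt[scheme] = min(best, nxt.get(scheme, float('inf')))
        phi = nxt
    return min(phi.values())
```

Because only sampled schemes survive each slice, the result is an approximation: it lies between the exact optimum and the cost of the worst feasible path, trading optimality for per-slice work proportional to κ rather than (E+2)^U.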
As a more preferred embodiment, the step (2) of solving the established mathematical model includes:
denote the service set deployed on edge computing node i as Λ_i = SUBSET({λ_1, λ_2, …, λ_n}), where λ_n denotes the n-th service; use w_ij to represent the transmission delay and service migration cost of deploying user service j onto edge computing node i, and v_ij = λ_j / c_i to represent the computation delay overhead of deploying user service j onto edge computing node i, where c_i is the computing power of edge computing node i;
deploy {λ_1, λ_2, …, λ_n} in arbitrary order, considering the deployment of each λ_j onto every edge computing node; for the i-th computing node with service set Λ_i, define LOAD(Λ_i) as the sum of the costs of the user services already carried by the current node, and define LOAD(Λ_i, λ_j) as the total cost generated by deploying user service j on node i on top of its existing services;
then execute the following update algorithm, selecting the deployment with the minimum increase in cost:
1) for each edge computing node i, set LOAD(Λ_i) = 0;
2) when time slice t arrives, execute the following steps:
i. for each user service j, compute LOAD(Λ_i, λ_j) for every node and select the edge computing node with the minimum cost;
ii. deploy user service j on that edge computing node, and then update LOAD(Λ_i) with LOAD(Λ_i, λ_j).
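Steps 1)-2) amount to a greedy marginal-cost rule. A sketch of one time slice, where `added_cost` stands in for LOAD(Λ_i, λ_j) and is an assumed callback rather than the patent's exact cost function:

```python
def osp_step(services, nodes, added_cost, load):
    """One time slice of the greedy online algorithm (OSP sketch).

    services   -- user services arriving in this slice
    nodes      -- edge compute node ids
    added_cost(i, carried, j) -- total extra overhead of placing service j
                 on node i given the services it already carries,
                 i.e. LOAD(Lambda_i, lambda_j)
    load       -- dict: node id -> list of services currently deployed
    """
    placement = {}
    for j in services:
        # step i: pick the node whose cost increases least when j is added
        best = min(nodes, key=lambda i: added_cost(i, load[i], j))
        load[best].append(j)  # step ii: deploy j, then update LOAD
        placement[j] = best
    return placement
```

Because `load` is updated after every placement, later services in the same slice see the extra load that earlier placements introduced, which is what keeps the greedy rule from piling every service onto one node.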
Advantageous effects: aiming at the problem of multi-user service deployment and computing resource allocation in the edge computing environment, the invention models the edge computing scenario as an optimization problem that integrates service computation delay overhead, transmission delay overhead, and service migration overhead, combines the decision constraints, and solves the optimization problem to obtain a multi-user service deployment and computing resource allocation scheme. The invention fills a gap in the field: it supports multi-user service deployment, accounts for user mobility, has wide applicability, and improves task allocation and execution efficiency in edge computing scenarios, thereby improving the overall processing performance of the network.
Drawings
FIG. 1 is a schematic diagram of a service deployment and computing resource allocation scenario provided by an embodiment of the present invention;
fig. 2 is an exemplary diagram of a dynamic planning algorithm of an offline algorithm FDSP according to an embodiment of the present invention;
fig. 3 is an exemplary diagram of the online algorithm OSP provided by an embodiment of the present invention.
Detailed Description
The technical solution of the present invention is further explained below with reference to the accompanying drawings.
Referring to the edge computing scenario of fig. 1, in one embodiment, a service deployment and resource allocation method considering multi-user mobility includes the following steps:
step (1), establishing a mathematical model for the service deployment and resource allocation problem according to the mobile edge computing scene.
In a mobile edge computing scenario, the related devices include the cloud computing node (also called the cloud server), the user equipment nodes, and the edge computing nodes (also called the edge servers) {1, 2, …, E}. All three kinds of computing devices can be regarded as computing nodes of the model. To facilitate modeling of user movement, service deployment decisions are made at discrete time slices, so time is defined as the set of discrete time slices {1, 2, …, T}.
Two points must be considered in analyzing the decision space formed by the service deployment and resource allocation problems: first, when time slice t arrives, the service of user u must decide on which computing node to deploy; second, for each computing node, if more than one user service is deployed on it at time slice t, the CPU computing resource allocation ratio among these services must be considered.
For the first point: first, the user equipment cannot be too far from the edge computing node running the service, otherwise excessive network transmission delay results; second, user services should not all be deployed on the same edge computing node, otherwise the contention of multiple services for the same server resources significantly increases service computation delay; in addition, because the edge computing nodes are heterogeneous, different nodes provide different computing capabilities for the same user service depending on the current machine state, such as CPU clock frequency, memory frequency, instruction set, and other hardware and software configuration. The invention uses the decision variable x_ij^t to characterize a service deployment scheme, where x_ij^t is an 0/1 indicator variable that indicates whether, at time slice t, user service j is deployed to computing node i: x_ij^t = 1 represents deployment, and x_ij^t = 0 the opposite. Note that in time slice t one service can be, and can only be, deployed on one computing node, so there are the following decision constraints:

Σ_i x_ij^t = 1, for every user service j and time slice t
x_ij^t ∈ {0, 1}
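As a sanity check, these deployment constraints can be verified programmatically. The x[t][i][j] array layout below is an assumed encoding for illustration, not something the patent prescribes:

```python
def deployment_feasible(x):
    """Check the deployment constraints: at every time slice t, each user
    service j sits on exactly one computing node (sum_i x[t][i][j] == 1)
    and every entry is a 0/1 indicator.  x is indexed x[t][i][j]."""
    for slice_x in x:  # one 0/1 matrix (nodes x services) per time slice
        n_nodes = len(slice_x)
        n_services = len(slice_x[0])
        for j in range(n_services):
            column = [slice_x[i][j] for i in range(n_nodes)]
            if any(v not in (0, 1) for v in column) or sum(column) != 1:
                return False
    return True
```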
for the second point, a simplest allocation manner is to allocate CPU computing resources uniformly, and assuming that a certain node provides a computing power with a size of c (e.g. using cycles per minute of CPU to characterize), and n user services have been deployed on the node, according to the principle of uniform allocation, each service obtains a computing power of c
Figure BDA00027057725300000611
For the ith service, the service calculation delay can be used
Figure BDA00027057725300000612
To indicate. It is easy to find that this distribution method following the uniform principle is not an optimal distribution method, a better method is according to the data processing amount λ of the serviceiTo dynamically decide the CPU computing resource allocation rate of the service. The invention uses decision variables
Figure BDA00027057725300000613
To indicate that the user is not in a normal position,
Figure BDA00027057725300000614
is a ratio variable from 0 to 1 onwards that represents the ratio of computing resources that user service j has been allocated on compute node i during time slice t. Note that at time slice t, a compute node can provide a compute resource allocation ratio that sums up to 100%, so there are the following decision constraints:
Figure BDA0002705772530000071
Figure BDA0002705772530000072
Figure BDA0002705772530000073
where the last inequality expresses that if x_ij^t = 0, indicating that in time slice t user service j is not deployed on computing node i, then naturally y_ij^t = 0; and if x_ij^t = 1, user service j is deployed on computing node i, and y_ij^t may take any value between 0 and 1.
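The text only states that the allocation ratio should depend on each service's demand λ. One standard closed form for the model's delay expression Σ_j λ_j/(y_j·c) under Σ_j y_j = 1 allocates shares proportional to √λ_j (a Lagrange-multiplier result, shown here purely as an illustration of demand-aware allocation, not claimed to be the patent's exact rule):

```python
import math

def proportional_allocation(lams, c):
    """CPU shares minimizing total computation delay sum_j lam_j/(y_j*c)
    subject to sum_j y_j = 1; the optimum is y_j proportional to
    sqrt(lam_j), by Lagrange multipliers (or Cauchy-Schwarz)."""
    roots = [math.sqrt(lam) for lam in lams]
    total = sum(roots)
    y = [r / total for r in roots]                      # allocation ratios
    delays = [lam / (yi * c) for lam, yi in zip(lams, y)]  # per-service delay
    return y, delays
```

For demands [1, 4] on a unit-speed node this yields shares [1/3, 2/3] and a total delay of 9, versus 10 under uniform allocation, illustrating why demand-aware splitting beats the uniform principle.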
Analyzing the process from the generation of a user's service to its execution in the mobile edge computing scenario, there are mainly three overheads: computation delay overhead, transmission delay overhead, and service migration overhead. The computation delay overhead is the delay from the moment the user's service request reaches the computing node until the computation of the request is completed; the transmission delay overhead is the delay from the moment the user sends the service request until the computing node receives it; the service migration overhead is the overhead generated by migrating the service correspondingly after the user moves. The goal is then to minimize the weighted sum of these three overheads. According to the above analysis of the decision space, the optimization objective and constraints of the mathematical model are given as follows:
in time slice t, the user has a service to calculate the demand amount and use according to the application demand of the user
Figure BDA0002705772530000078
Meaning specifically the computational demand of user service j at time slice t. It can be noted that the service calculation demand of the user changes with the time slice t, which is the result of the combination of factors of user mobility, service timeliness and user demand change, and it is assumed that the demand is directly known. Taking the video analysis service as an example,
Figure BDA0002705772530000079
the method can be obtained according to the size and frame rate of the input video, the precision requirement of the analysis task and the like. In addition, the present invention uses the symbol ciTo represent the computing power (e.g., CPU clock frequency) provided by the computing node i, it is noted here that for cloud computing nodes
Figure BDA00027057725300000710
And edge computing nodes, which provide no difference in computing power to different users, but the local computing power of each user device may be considered different because internet-of-things devices, such as embedded computing devices, themselves often provide different computing power due to hardware cost and power endurance, among other considerations. Given service deployment and computing resource allocation decisions in accordance with the above definitions
Figure BDA00027057725300000711
And
Figure BDA00027057725300000712
the calculation delay cost of the user service j in the time slice t can be obtained as follows:
Figure BDA00027057725300000713
the transmission delay is usually composed of an access delay and a propagation delay. The access delay refers to a delay from the user equipment to the nearby access point, which is generally determined by the wireless environment and the terminal equipment together. For propagation delay, if the user service is deployed locally, the propagation delay is almost negligible and can be regarded as 0; if the user service is deployed at the cloud end, the propagation delay from the user equipment to the cloud end can be considered as a constant, and the specific numerical value is determined by a cloud service provider; if the user service is deployed on an edge compute node, the delay is typically related to the edge-to-edge delay between the access point to the edge compute node where the service is located. In the most ideal case, the service is deployed directly on the access point, and the edge-to-edge propagation delay is negligible. Aiming at the transmission time delay, the invention is uniformly used
Figure BDA0002705772530000081
Meaning that the time delay from the service request of user j to compute node i at time slice t. Given a service deployment decision, according to the above definition
Figure BDA0002705772530000082
The transmission delay overhead of the user service j in the time slice t can be obtained as follows:
Figure BDA0002705772530000083
because the network environment is changed due to the movement of the user, the edge computing node closest to the user can also be usedCorrespondingly, when the user generates a new movement, the deployment location of the user service needs to be reconsidered: one way is to always run the service in the original location no matter how the user moves; another way is to migrate services as the user moves, which incurs service migration overhead. The service migration overhead includes the overhead of destroying and recreating the virtual machine, the bandwidth occupation overhead of migrating the virtual machine state, and even the failure overhead of migration failure. The invention uses symbols
Figure BDA0002705772530000084
And representing service migration overhead which represents migration overhead for migrating the user service j from the computing node i to i' in the time slice t, wherein the overhead is related to a specific user service type and a user service running state, can be obtained through measurement in practical application, and can also be assumed to be a constant for convenience. Note that when i is equal to i', i.e. no migration of the service has occurred, the service migration overhead can be considered to be 0. According to the above definition, it is easy to derive the service migration cost of the user service j at the time slice t:
Figure BDA0002705772530000085
where x_ij^{t-1} indicates whether, at time slice t-1, user service j is deployed on computing node i, and x_{i'j}^t indicates whether, at time slice t, user service j is deployed on computing node i'. For a fixed service j and time slice, the vector of indicator variables over the computing nodes has the form [0 0 0 … 1 … 0 0 0], with exactly one entry equal to 1 at the currently deployed node, so the product x_ij^{t-1} · x_{i'j}^t picks out the service migration overhead from the previous node to the next node. Note that i' and i are independent indices, both used to traverse the set of computing nodes; the difference is that i is the computing node in time slice t-1 and i' is the computing node in time slice t.
Having discussed the service computation delay, transmission delay overhead, and service migration overhead, the overall goal is to optimize these three overheads so as to minimize their combined cost. One way is to give the three overheads different weights and then sum them; denote the weights of the three overheads as w_1, w_2, w_3 respectively. Accordingly, at time slice t, the total weighted cost of user service j is:

C_j(t) = w_1 · T_j^comp(t) + w_2 · T_j^trans(t) + w_3 · T_j^mig(t)

Given the time sequence {1, 2, …, T}, we wish to minimize the overall overhead of all users over time, so the overall problem can be formally expressed as:

min_{x,y} Σ_t Σ_j C_j(t), subject to the deployment and allocation constraints above
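Under these definitions, the objective can be evaluated for any candidate decision. The sketch below assumes a compact encoding (x[t][j] stores the single hosting node, which is valid because each service sits on exactly one node per slice) rather than the full 0/1 tensor; all argument names are illustrative:

```python
def total_cost(x, y, lam, c, d, m, w):
    """Weighted sum of computation, transmission and migration overheads.

    x[t][j]    -- index i of the node hosting service j at time slice t
    y[t][j]    -- fraction of node x[t][j]'s CPU allocated to service j
    lam[t][j]  -- computational demand of service j at time slice t
    c[i]       -- computing power of node i
    d[t][j][i] -- transmission delay from service j's user to node i at t
    m[i][ip]   -- migration overhead from node i to node ip (0 on diagonal)
    w          -- (w1, w2, w3) overhead weights
    """
    w1, w2, w3 = w
    cost = 0.0
    for t in range(len(x)):
        for j in range(len(x[t])):
            i = x[t][j]
            comp = lam[t][j] / (y[t][j] * c[i])      # computation delay
            trans = d[t][j][i]                        # transmission delay
            mig = m[x[t - 1][j]][i] if t > 0 else 0.0  # migration overhead
            cost += w1 * comp + w2 * trans + w3 * mig
    return cost
```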
the following table shows the meaning of the above symbols:
TABLE 1 symbolic meanings used in the mathematical model
Figure BDA0002705772530000094
Figure BDA0002705772530000101
And (2) solving the established mathematical model to obtain a multi-user service deployment and resource allocation scheme.
For the optimization problem established in step (1), when the problem scale is small, the invention provides the FDSP (Full Dynamic Service Plan) algorithm based on dynamic programming, which obtains an exact solution; when the problem scale grows, the invention provides the improved SDSP (Sampling Dynamic Service Plan) algorithm, which, based on the idea of sampling, effectively mitigates the combinatorial explosion in the number of FDSP states. Considering that future system information is unknown when making service decisions, the invention further provides the OSP (Online Service Plan) online algorithm based on a greedy idea. Each is described in detail below.
FDSP is an exact algorithm based on dynamic programming. Because the optimal computing-resource allocation scheme is uniquely determined by the service deployment scheme, and the problem has a time-series structure, FDSP can divide the problem into stages and compute them iteratively. By adding two artificial nodes the problem is converted into finding the shortest path between them, and a state transition equation is given. Specifically, a service deployment and computing-resource allocation scheme can be regarded as a state; counting the combinations, there are (E+2)^U states in total. Suppose the space of service deployment schemes x is X; then for any x' ∈ X there is a unique corresponding optimal solution y*, so a service deployment and computing-resource allocation state can be defined as s = (x', y*) ∈ S, where S refers to the set of feasible solutions of the service deployment and computing-resource allocation schemes. From the foregoing definitions of computation delay, transmission delay overhead, and service migration overhead of a service, the cost of this state can be defined as C_{s'→s}, i.e. the additional overhead required to transition from the state s' of the previous time slice to the state s of the current time slice. In fact, a graph G = (V, L) can be constructed, where the vertex set V represents states and the edge set L between vertices represents the overhead between two states. For each time slice t there are (E+2)^U states, representing all (E+2)^U service deployment schemes; E+2 is the number of all computing nodes, the extra 2 being the cloud server and the user equipment. The weight of the edge between the states s_i^{t-1} and s_j^t of two adjacent time slices represents the service migration overhead incurred from state i to state j, plus the service computation and transmission delay overheads at time slice t. By adding artificial nodes S and D on the two sides of graph G as the start and end points of the shortest path, the problem translates into finding the shortest path between S and D. In fact, the problem can be solved using dynamic programming, whose core is the state transition equation

Φ(t, s) = min_{s'} { Φ(t-1, s') + C_{s'→s} }

where Φ(t, s) represents the cumulative minimum cost of choosing state s in the last step at time slice t, and can be regarded as a subproblem of the original problem.
The algorithm is based on the dynamic programming idea; the specific steps are as follows:
A. Receive the input, including the computing power c_i of each computing node, the service computing demand λ_j^t, the transmission delay d_{i,j}^t, the service migration overhead M_{i→i',j}^t, the number of time slices T, the number of edge servers E, and the number of user services U. Initialize the dynamic programming state table Φ; Φ records the mapping from a deployment scheme s_t to its corresponding overhead;
B. Loop t from 1 to T, performing steps C and D;
C. In each iteration, obtain all (E+2)^U possible deployment schemes by Cartesian product, denoted as the set H;
D. Traverse the deployment schemes in each set H, updating Φ to the entry that minimizes, over all schemes of the previous time slice t-1, the sum of their overhead and the current overhead;
E. Return the deployment scheme with the lowest cost among all deployment schemes in Φ.
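Steps A to E can be sketched as a dynamic program in Python. The equal-share resource allocation (each of the n services on a node gets a 1/n share of its capacity, so its delay scales by n) and the single migration-cost matrix M shared by all services are simplifying assumptions of this sketch, not requirements of the method:

```python
import itertools

def fdsp(T, E, U, c, lam, d, M, w=(1.0, 1.0, 1.0)):
    """Exact DP over all (E+2)**U deployment schemes per time slice.

    c[i]       -- computing power of node i (nodes 0..E+1: device, edges, cloud)
    lam[t][j]  -- computational demand of service j at slice t
    d[t][i][j] -- transmission delay of service j to node i at slice t
    M[i][i2]   -- constant migration-cost matrix (assumption of this sketch)
    """
    nodes = range(E + 2)
    w1, w2, w3 = w

    def step_cost(t, prev, cur):
        load = [cur.count(i) for i in nodes]  # services per node -> equal share
        cost = 0.0
        for j in range(U):
            i = cur[j]
            cost += w1 * lam[t][j] * load[i] / c[i]  # computation delay
            cost += w2 * d[t][i][j]                  # transmission delay
            if prev is not None:
                cost += w3 * M[prev[j]][i]           # migration overhead
        return cost

    phi = {None: 0.0}  # artificial start node S
    for t in range(T):
        phi = {cur: min(phi[p] + step_cost(t, p, cur) for p in phi)
               for cur in itertools.product(nodes, repeat=U)}
    return min(phi.values())  # best path into the artificial end node D

# Tiny illustrative instance: one service, one slice, device (node 0,
# slower but zero transmission delay) vs. cloud (node 1, faster but 0.5 delay).
print(fdsp(T=1, E=0, U=1, c=[1.0, 2.0], lam=[[2.0]],
           d=[[[0.0], [0.5]]], M=[[0, 1], [1, 0]]))  # -> 1.5
```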
Taking fig. 2 as an example, with T = 4, E = 2 and U = 2, at t = 1 one can enumerate all (E+2)^U = 16 possible deployment schemes. One such scheme, for example, represents the deployment in which user service 1 is deployed on edge server 1 while user service 2 is also deployed on edge server 1. The optimal path can then be found between S and D in the figure, yielding the optimal deployment scheme.
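For the sizes in this example (E = 2, U = 2), the enumeration of step C is a plain Cartesian product over the E + 2 = 4 candidate nodes per service; the node labels below are illustrative:

```python
import itertools

nodes = ["device", "edge1", "edge2", "cloud"]       # E + 2 = 4 candidate nodes
schemes = list(itertools.product(nodes, repeat=2))  # one slot per user service
print(len(schemes))  # 16 = (E + 2)**U

# One of the 16 schemes places both user services on edge server 1,
# matching the scheme singled out in the Fig. 2 walkthrough:
print(("edge1", "edge1") in schemes)  # True
```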
FDSP obtains an exact solution, but the dynamic-programming approach faces a combinatorial explosion in the number of states, so an offline algorithm SDSP based on state sampling is provided. The algorithm is dynamic programming with state sampling; the specific steps are as follows:
a. Receive the input, including the computing power c_i of each computing node, the service computing demand λ_j^t, the transmission delay d_{i,j}^t, the service migration overhead M_{i→i',j}^t, the number of time slices T, the number of edge servers E, and the number of user services U. Initialize the dynamic programming state table Φ; Φ records the mapping from a deployment scheme s_t to its corresponding overhead;
b. Loop t from 1 to T, performing steps c and d;
c. In each iteration, obtain all possible deployment schemes, denoted as the set H;
d. From the (E+2)^U deployment schemes in each set H, obtain κ schemes by uniform sampling, and update Φ to the entry that minimizes the sum of the overhead of the best sampled scheme in the previous time slice t-1 and the overhead of the current sampled scheme;
e. Return the deployment scheme with the lowest cost among all deployment schemes in Φ.
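Steps a to e differ from the exact DP only in step d's uniform sampling. The sketch below uses an illustrative cost model (equal-share allocation, a constant migration-cost matrix) that is an assumption of this sketch rather than part of the claims; with κ at least (E+2)^U it degenerates to the exact algorithm:

```python
import itertools
import random

def sdsp(T, E, U, c, lam, d, M, kappa, w=(1.0, 1.0, 1.0), seed=0):
    """Sampled DP: per time slice keep only kappa uniformly sampled schemes."""
    rng = random.Random(seed)
    nodes = range(E + 2)
    w1, w2, w3 = w

    def step_cost(t, prev, cur):
        load = [cur.count(i) for i in nodes]  # equal-share allocation
        cost = 0.0
        for j in range(U):
            i = cur[j]
            cost += w1 * lam[t][j] * load[i] / c[i]
            cost += w2 * d[t][i][j]
            if prev is not None:
                cost += w3 * M[prev[j]][i]
        return cost

    phi = {None: 0.0}
    for t in range(T):
        # For clarity the sketch materializes all (E+2)**U schemes before
        # sampling; a practical implementation would sample node indices
        # directly to avoid building the exponential list.
        H = list(itertools.product(nodes, repeat=U))
        sampled = rng.sample(H, min(kappa, len(H)))
        phi = {cur: min(phi[p] + step_cost(t, p, cur) for p in phi)
               for cur in sampled}
    return min(phi.values())
```

On the tiny one-service instance used for FDSP, kappa = 2 covers both schemes and returns the exact optimum; kappa = 1 returns the cost of whichever scheme was sampled.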
Considering that the offline algorithms need to know the sequence of service demands {λ_j^t} over a period of time in advance, the invention converts the original continuous-time optimization problem into a series of single-step optimization problems and provides the online algorithm OSP to solve them.
For the online problem, the service deployment scheme x_{t-1} of the previous time slice is known, and the goal is to obtain the service deployment scheme x_t for the current time slice t. In the original model the computation delay, transmission delay, and service migration overheads are accounted per user; in fact they can equally be accounted per computing node. For an edge computing node ℰ_i, if the set of services deployed on it is Λ_i ⊆ {λ_1, λ_2, …, λ_n}, then its transmission delay overhead and service migration overhead are the sums of those overheads over the services in Λ_i; here w_{ij} denotes the combined transmission delay and service migration overhead of deploying user service j on edge computing node ℰ_i. Likewise, v_{ij} = λ_j / c_i denotes the computation delay overhead of deploying user service j on edge computing node ℰ_i. For this problem the following greedy strategy may be used: deploy λ_1, λ_2, …, λ_n in arbitrary order; for each λ_j, try deploying it on every edge server and select the deployment with the smallest final cost increase. For each edge server ℰ_i, define LOAD(ℰ_i) as the sum of the costs incurred by the user services already carried by the current server, and let LOAD(ℰ_i ∪ λ_j) denote the total cost generated by deploying one more user service j on ℰ_i in addition to its existing services.
The algorithm is as follows:
1) For each edge server ℰ_i, set LOAD(ℰ_i) = 0;
2) When time slice t arrives, execute the following steps:
i. For each user service j, compute LOAD(ℰ_i ∪ λ_j) and select the edge server with the smallest cost;
ii. Deploy user service j on that edge server, and then update LOAD(ℰ_i) with LOAD(ℰ_i ∪ λ_j).
Taking fig. 3 as an example, edge computing node ℰ_1 has deployed the two user services {λ_1, λ_2}, so its existing overhead is LOAD(ℰ_1) = 2(v_{1,1} + v_{1,2}) + w_{1,1} + w_{1,2}; similarly, LOAD(ℰ_2) = v_{2,3} + w_{2,3} and LOAD(ℰ_3) = 3(v_{3,4} + v_{3,5} + v_{3,6}) + w_{3,4} + w_{3,5} + w_{3,6}. Now consider deploying λ_7 onto one of ℰ_1, ℰ_2, ℰ_3; first try λ_7 on each of them:

LOAD(ℰ_1 ∪ λ_7) = 3(v_{1,1} + v_{1,2} + v_{1,7}) + w_{1,1} + w_{1,2} + w_{1,7}
LOAD(ℰ_2 ∪ λ_7) = 2(v_{2,3} + v_{2,7}) + w_{2,3} + w_{2,7}
LOAD(ℰ_3 ∪ λ_7) = 4(v_{3,4} + v_{3,5} + v_{3,6} + v_{3,7}) + w_{3,4} + w_{3,5} + w_{3,6} + w_{3,7}

and select the deployment with the smallest cost increase among the three schemes.
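The LOAD bookkeeping above can be sketched as follows. The uniform costs v = 1.0 and w = 0.5 in the example are made-up numbers chosen only to mirror the structure of Fig. 3, and the factor n models equal resource sharing among the n services on a node:

```python
def osp_place(loads, placements, v, wcost, j, nodes):
    """Greedily place service j on the node whose LOAD increases least.

    loads[i]      -- current LOAD of node i
    placements[i] -- services already carried by node i
    v[i][k]       -- computation-delay overhead of service k alone on node i
    wcost[i][k]   -- transmission + migration overhead of service k on node i
    """
    best, best_load, best_inc = None, 0.0, float("inf")
    for i in nodes:
        svcs = placements[i] + [j]
        n = len(svcs)
        # n services share node i equally, so each runs n times slower
        new_load = (n * sum(v[i][k] for k in svcs)
                    + sum(wcost[i][k] for k in svcs))
        inc = new_load - loads[i]
        if inc < best_inc:
            best, best_load, best_inc = i, new_load, inc
    placements[best].append(j)
    loads[best] = best_load
    return best

# Mirroring the Fig. 3 example with hypothetical uniform costs:
v = {i: {k: 1.0 for k in range(1, 8)} for i in range(3)}
wc = {i: {k: 0.5 for k in range(1, 8)} for i in range(3)}
placements = {0: [1, 2], 1: [3], 2: [4, 5, 6]}
loads = {0: 5.0, 1: 1.5, 2: 10.5}   # LOAD of each node under the formula above
print(osp_place(loads, placements, v, wc, 7, [0, 1, 2]))  # -> 1 (lowest increase)
```

With these numbers the candidate increases are 5.5, 3.5, and 7.5, so the lightly loaded node ℰ_2 (index 1) wins, matching the selection rule in the text.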
While the present invention has been described in detail with reference to the specific embodiments thereof, it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (6)

1. A service deployment and resource allocation method considering multi-user mobility, applied to an edge computing network environment, characterized in that the method comprises the following steps:
(1) establishing a mathematical model for the mobile edge computing scene:
taking the cloud node and the user equipment node of the mobile edge computing scene, together with the edge computing node set {1, 2, …, E}, as the computing nodes of the model, E+2 computing nodes in total; defining the model time as a set of discrete time slices {1, 2, …, T}, the decision to deploy a service being made at each discrete time slice; and analyzing the decision space formed by the service deployment and resource allocation problem: when time slice t arrives, the service of each user u must decide on which computing node to deploy; and, for each computing node, if more than 1 user service is deployed on it in time slice t, the CPU computing resource allocation ratio among those services must be considered;
computing the three overheads contained in the mobile edge computing scene, namely computation delay overhead, transmission delay overhead and service migration overhead, and determining the optimization target of the model as: minimizing the weighted sum of the three overheads, wherein the computation delay overhead refers to the delay from the time a user's service request reaches the computing node until the computation of the request is completed; the transmission delay overhead refers to the delay from the time the user sends the service request until the computing node receives the request; and the service migration overhead refers to the overhead generated by correspondingly migrating the service after the user moves;
(2) solving the established mathematical model to obtain a multi-user service deployment and resource allocation scheme.
2. The method for service deployment and resource allocation considering multi-user mobility according to claim 1, wherein the optimization objective of the model is expressed as:

min_{x,y} Σ_{t=1}^T Σ_{j=1}^U ( w1 · l_j^{comp,t} + w2 · l_j^{tran,t} + w3 · m_j^t )
s.t.  Σ_i x_{i,j}^t = 1, for all j, t
      Σ_j x_{i,j}^t · y_{i,j}^t ≤ 1, for all i, t
      x_{i,j}^t ∈ {0, 1}, y_{i,j}^t ∈ [0, 1], for all i, j, t

wherein x_{i,j}^t is a two-dimensional indicator variable that indicates whether, at time slice t, a user service j is deployed to compute node i; x_{i,j}^t = 1 represents deployment, and x_{i,j}^t = 0 the contrary; y_{i,j}^t indicates the CPU computing resource allocation ratio and is a ratio variable between 0 and 1, which represents the ratio of the computing resources allocated to user service j on computing node i in time slice t; l_j^{comp,t} represents the computation delay overhead of user service j at time slice t; l_j^{tran,t} represents the transmission delay overhead of user service j at time slice t; m_j^t represents the migration overhead of user service j at time slice t; w1, w2, w3 are the weights of the three overheads respectively; and T is the number of time slices.
3. The method for service deployment and resource allocation considering multi-user mobility according to claim 2, wherein the l_j^{comp,t} is calculated as follows:

l_j^{comp,t} = Σ_i x_{i,j}^t · λ_j^t / (c_i · y_{i,j}^t)

wherein λ_j^t represents the computational demand of user service j at time slice t, and c_i represents the computing power provided by computing node i;
the l_j^{tran,t} is calculated as follows:

l_j^{tran,t} = Σ_i x_{i,j}^t · d_{i,j}^t

wherein d_{i,j}^t represents the transmission delay of the service request of user j to computing node i at time slice t;
the m_j^t is calculated as follows:

m_j^t = Σ_i Σ_{i'} x_{i,j}^{t-1} · x_{i',j}^t · M_{i→i',j}^t

wherein M_{i→i',j}^t represents the migration overhead for migrating user service j from computing node i to i' at time slice t.
4. The method for service deployment and resource allocation considering multi-user mobility according to claim 2, wherein the step (2) of solving the established mathematical model comprises:
denoting the space of service deployment schemes x as X, any x' ∈ X has a unique corresponding optimal solution y*; defining a service deployment and computing resource allocation state as s = (x', y*) ∈ S, the total cost of the state being C_{s'→s}, the additional overhead needed to migrate from the state s' of the previous time slice to the state s of the current time slice, where S refers to the feasible solution set of the service deployment and computing resource allocation schemes;
constructing a graph G = (V, L), where the vertex set V represents the set of states and the edge set L represents the set of total costs between two states; in particular, the weight of the edge between the states s_i^{t-1} and s_j^t of two adjacent time slices represents the service migration cost incurred from state i to state j, plus the service computation delay and transmission delay costs at time slice t;
adding artificial nodes S and D on the two sides of graph G, and solving the shortest path between S and D through the following algorithm:
A. receiving the input and initializing the dynamic programming state table Φ, Φ recording the mapping from a deployment scheme s_t to its corresponding overhead;
B. looping t from 1 to T, performing steps C and D;
C. in each iteration, obtaining all (E+2)^U possible deployment schemes by Cartesian product, denoted as a set H, where E is the number of edge computing nodes and U is the number of user services;
D. traversing the deployment schemes in each set H, and updating Φ to the entry that minimizes, over all schemes of the previous time slice t-1, the sum of their overhead and the current overhead;
E. returning the deployment scheme with the lowest cost among all deployment schemes in Φ.
5. The method for service deployment and resource allocation considering multi-user mobility according to claim 2, wherein the step (2) of solving the established mathematical model comprises:
denoting the space of service deployment schemes x as X, any x' ∈ X has a unique corresponding optimal solution y*; defining a service deployment and computing resource allocation state as s = (x', y*) ∈ S, the total cost of the state being C_{s'→s}, the additional overhead needed to migrate from the state s' of the previous time slice to the state s of the current time slice, where S refers to the feasible solution set of the service deployment and computing resource allocation schemes;
constructing a graph G = (V, L), where the vertex set V represents the set of states and the edge set L represents the set of total costs between two states; in particular, the weight of the edge between the states s_i^{t-1} and s_j^t of two adjacent time slices represents the service migration cost incurred from state i to state j, plus the service computation delay and transmission delay costs at time slice t;
adding artificial nodes S and D on the two sides of graph G, and solving the shortest path between S and D through the following algorithm:
a. receiving the input and initializing the dynamic programming state table Φ, Φ recording the mapping from a deployment scheme s_t to its corresponding overhead;
b. looping t from 1 to T, performing steps c and d;
c. in each iteration, obtaining all (E+2)^U possible deployment schemes, denoted as a set H, where E is the number of edge computing nodes and U is the number of user services;
d. obtaining κ schemes from the deployment schemes in each set H by uniform sampling, and updating Φ to the entry that minimizes the sum of the overhead of the best sampled scheme in the previous time slice t-1 and the overhead of the current sampled scheme;
e. returning the deployment scheme with the lowest cost among all deployment schemes in Φ.
6. The method for service deployment and resource allocation considering multi-user mobility according to claim 2, wherein the step (2) of solving the established mathematical model comprises:
denoting the set of services deployed on edge computing node ℰ_i as Λ_i ⊆ {λ_1, λ_2, …, λ_n}, where λ_n denotes the n-th service; denoting by w_{ij} the combined transmission delay and service migration overhead of deploying user service j on edge computing node ℰ_i, and by v_{ij} = λ_j / c_i the computation delay overhead of deploying user service j on edge computing node ℰ_i, where c_i is the computing power of edge computing node i;
deploying {λ_1, λ_2, …, λ_n} in arbitrary order, trying each λ_j on all edge computing nodes; for the i-th edge computing node ℰ_i, defining LOAD(ℰ_i) as the sum of the costs incurred by the user services already carried by the current node, and defining LOAD(ℰ_i ∪ λ_j) as the total cost generated by deploying one more user service j on ℰ_i in addition to its existing services;
and executing the following update algorithm, selecting the deployment with the smallest final cost increase:
1) for each edge computing node ℰ_i, setting LOAD(ℰ_i) = 0;
2) when time slice t arrives, executing the following steps:
i. for each user service j, computing LOAD(ℰ_i ∪ λ_j) and selecting the edge computing node with the smallest cost;
ii. deploying user service j on that edge computing node, and then updating LOAD(ℰ_i) with LOAD(ℰ_i ∪ λ_j).
CN202011038113.2A 2020-09-28 2020-09-28 Service deployment and resource allocation method considering multi-user mobility Active CN112148492B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011038113.2A CN112148492B (en) 2020-09-28 2020-09-28 Service deployment and resource allocation method considering multi-user mobility


Publications (2)

Publication Number Publication Date
CN112148492A true CN112148492A (en) 2020-12-29
CN112148492B CN112148492B (en) 2023-07-28

Family

ID=73895122

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011038113.2A Active CN112148492B (en) 2020-09-28 2020-09-28 Service deployment and resource allocation method considering multi-user mobility

Country Status (1)

Country Link
CN (1) CN112148492B (en)


Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107846704A (en) * 2017-10-26 2018-03-27 北京邮电大学 A kind of resource allocation and base station service arrangement method based on mobile edge calculations
CN109151864A (en) * 2018-09-18 2019-01-04 贵州电网有限责任公司 A kind of migration decision and resource optimal distribution method towards mobile edge calculations super-intensive network
US20190098474A1 (en) * 2016-05-28 2019-03-28 Huawei Technologies Co., Ltd. Application migration method in a mobile edge system, related device, and system
CN109862592A (en) * 2018-12-06 2019-06-07 北京邮电大学 Resource management and dispatching method under a kind of mobile edge calculations environment based on multi-base station cooperative
CN111090522A (en) * 2019-12-13 2020-05-01 南京邮电大学 Scheduling system and decision method for service deployment and migration in mobile edge computing environment
WO2020119648A1 (en) * 2018-12-14 2020-06-18 深圳先进技术研究院 Computing task unloading algorithm based on cost optimization
CN111585916A (en) * 2019-12-26 2020-08-25 国网辽宁省电力有限公司电力科学研究院 LTE electric power wireless private network task unloading and resource allocation method based on cloud edge cooperation


Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CAN WANG: "Edge Cloud Capacity Allocation for Low Delay Computing on Mobile Devices", 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC) *
CAI ZHENG: "Task Migration and Task Deployment in Mobile Edge Computing", China Master's Theses Full-text Database, Information Science and Technology, page 3 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112822701A (en) * 2020-12-31 2021-05-18 中山大学 Multi-user deep neural network model segmentation and resource allocation optimization method in edge computing scene
CN112511652A (en) * 2021-02-03 2021-03-16 电子科技大学 Cooperative computing task allocation method under edge computing
CN112511652B (en) * 2021-02-03 2021-04-30 电子科技大学 Cooperative computing task allocation method under edge computing
CN113301151A (en) * 2021-05-24 2021-08-24 南京大学 Low-delay containerized task deployment method and device based on cloud edge cooperation
CN113301151B (en) * 2021-05-24 2023-01-06 南京大学 Low-delay containerized task deployment method and device based on cloud edge cooperation
CN113259472A (en) * 2021-06-08 2021-08-13 江苏电力信息技术有限公司 Edge node resource allocation method for video analysis task
CN114139730A (en) * 2021-06-30 2022-03-04 武汉大学 Dynamic pricing and deployment method for machine learning task in edge cloud network
CN114139730B (en) * 2021-06-30 2024-04-19 武汉大学 Dynamic pricing and deployment method for machine learning tasks in edge cloud network
CN113595801A (en) * 2021-08-09 2021-11-02 湘潭大学 Deployment method of edge cloud network server based on task flow and timeliness

Also Published As

Publication number Publication date
CN112148492B (en) 2023-07-28

Similar Documents

Publication Publication Date Title
CN112148492A (en) Service deployment and resource allocation method considering multi-user mobility
Cicconetti et al. A decentralized framework for serverless edge computing in the internet of things
Sefati et al. Load balancing in cloud computing environment using the Grey wolf optimization algorithm based on the reliability: performance evaluation
Maleki et al. Mobility-aware computation offloading in edge computing using machine learning
Abd Elaziz et al. IoT workflow scheduling using intelligent arithmetic optimization algorithm in fog computing
CN112199189A (en) Adaptation of deep learning models to resource constrained edge devices
CN114915630A (en) Task allocation method based on Internet of things equipment, network training method and device
Lee et al. Adaptive fault-tolerant scheduling strategies for mobile cloud computing
WO2021087639A1 (en) Cdn optimization platform
CN114595049A (en) Cloud-edge cooperative task scheduling method and device
Abreu et al. A rank scheduling mechanism for fog environments
Gkatzikis et al. Mobiles on cloud nine: Efficient task migration policies for cloud computing systems
Ziafat et al. A hierarchical structure for optimal resource allocation in geographically distributed clouds
CN115134371A (en) Scheduling method, system, equipment and medium containing edge network computing resources
Badri et al. A sample average approximation-based parallel algorithm for application placement in edge computing systems
Kumar et al. C-FDLA: Crow search with integrated fractional dragonfly algorithm for load balancing in cloud computing environments
Filelis-Papadopoulos et al. Towards simulation and optimization of cache placement on large virtual content distribution networks
Cao et al. Distributed workflow mapping algorithm for maximized reliability under end-to-end delay constraint
Premalatha et al. Optimal Energy-efficient Resource Allocation and Fault Tolerance scheme for task offloading in IoT-FoG Computing Networks
Chen et al. A game theoretic approach to task offloading for multi-data-source tasks in mobile edge computing
Patel et al. An improved load-balancing mechanism based on deadline failure recovery on GridSim
CN113190342A (en) Method and system architecture for multi-application fine-grained unloading of cloud-edge cooperative network
Khanh et al. Fuzzy-based mobile edge orchestrators in heterogeneous IoT environments: An online workload balancing approach
Asghari et al. Server placement in mobile cloud computing: a comprehensive survey for edge computing, fog computing and cloudlet
EP3096227A1 (en) Resource allocation method in distributed clouds

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant