CN111866979A

CN111866979A - Base station and channel dynamic allocation method based on multi-arm slot machine online learning mechanism

Info

Publication number: CN111866979A
Application number: CN202010473494.0A
Authority: CN
Inventors: 李峰; 于东晓; 刘政阳
Original assignee: Shandong University
Current assignee: Shandong University
Priority date: 2020-05-29
Filing date: 2020-05-29
Publication date: 2020-10-30
Anticipated expiration: 2040-05-29
Also published as: CN111866979B

Abstract

The invention belongs to the technical field of cognitive radio networks, and particularly relates to a method for dynamically allocating base stations and channels. The method comprises the following steps. Initialization: for any user i, base station j, and channel k, initialize γ_{i,j,k}(0) = 1; let each user i connect in turn to base stations j = 1, 2, …, M and transmit data using channels k = 1, 2, …, K in turn; if the transmission succeeds, r_{i,j,k}(0) = 1; otherwise, r_{i,j,k}(0) = 0; the cumulative success count is initialized from these results. In each time slot t = 1, 2, …, T, the multi-arm slot machine online learning mechanism is used to obtain a dynamic base station and channel allocation scheme Φ(t), and data is transmitted according to that scheme. The algorithm is simple, improves the overall data transmission throughput of the secondary users as far as possible, and realizes dynamic allocation of base stations and channels.

Description

Base station and channel dynamic allocation method based on multi-arm slot machine online learning mechanism

Technical Field

The invention belongs to the technical field of cognitive radio networks, and particularly relates to a dynamic allocation method of a base station and a channel.

Background

Cognitive radio (CR) technology can use dynamic channel access to improve the utilization efficiency of limited spectrum resources. In cognitive radio networks (CRNs), users licensed to use the spectrum are called primary users, and users without a license are called secondary users. Primary users have priority over the spectrum resources: they can use the spectrum at any time, and they can also yield idle spectrum to secondary users, so that secondary users share the spectrum resources without affecting the primary users.

In a cognitive radio network, a core problem is how to improve the efficiency with which secondary users (hereinafter referred to simply as "users" where no ambiguity arises) utilize frequency band resources. We consider a typical cognitive radio network system: a group of users connects to a given group of base stations and transmits data by accessing (orthogonal) channels. The problem is then how to dynamically allocate base stations and channels to these users so as to improve the users' overall data throughput. Because base stations and users are widely distributed, the transmission links between base stations and (secondary) users may be close to different primary users, so the quality of even the same channel differs across transmission links. Furthermore, it is not always possible to measure in advance the quality distribution of a channel over the different transmission links. At the same time, base stations usually have different capacities, i.e., different maximum numbers of users that can be connected for parallel data transmission. Although some existing channel allocation strategies use machine learning techniques, such as the multi-arm slot machine (multi-armed bandit) method, to address the uncertainty of channel quality, most of them consider only the matching between channels and users and cannot be applied directly to the joint allocation of base stations and channels.

Disclosure of Invention

In view of the above technical problems, the invention provides a base station and channel dynamic allocation method based on a multi-arm slot machine online learning mechanism. The method is oriented to cognitive radio networks: it learns the quality distribution of the channels on the different user-base station transmission links through the multi-arm slot machine mechanism, dynamically allocates base stations and channels to users, and improves the overall throughput of the users as far as possible, subject to the following constraints: 1) at any time, a user can link to only one base station and access only one channel; 2) at any time, a base station can access only one channel; 3) at any time, the number of users linked to a base station cannot exceed its capacity.
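The three constraints can be checked mechanically for any candidate allocation. The following is a minimal sketch, assuming an allocation is represented as a list of (user, base station, channel) triples and capacities as a dictionary; the function name `is_feasible` and both data layouts are illustrative assumptions, not part of the patent text.

```python
from collections import defaultdict


def is_feasible(allocation, capacity):
    """Check an allocation against the three stated constraints.

    allocation: iterable of (user, base_station, channel) triples (assumed layout).
    capacity:   dict mapping base_station -> maximum number of linked users.
    """
    users_seen = set()
    channel_of_station = {}
    load = defaultdict(int)
    for user, station, channel in allocation:
        # Constraint 1: a user links one base station and accesses one channel.
        if user in users_seen:
            return False
        users_seen.add(user)
        # Constraint 2: a base station accesses only one channel.
        if channel_of_station.setdefault(station, channel) != channel:
            return False
        # Constraint 3: the number of linked users cannot exceed the capacity.
        load[station] += 1
        if load[station] > capacity[station]:
            return False
    return True
```

For example, with capacities `{0: 2, 1: 1}`, the allocation `[(0, 0, 1), (1, 0, 1), (2, 1, 0)]` passes all three checks, while assigning two users to base station 1 violates the capacity constraint.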

In order to solve the technical problem, the invention adopts the technical scheme that: a base station and channel dynamic allocation method based on a multi-arm slot machine online learning mechanism comprises the following steps:

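S101: Initialization. For any user i, base station j, and channel k, initialize γ_{i,j,k}(0) = 1; let each user i connect in turn to base stations j = 1, 2, …, M and transmit data using channels k = 1, 2, …, K in turn; if the transmission succeeds, r_{i,j,k}(0) = 1; otherwise, r_{i,j,k}(0) = 0; the cumulative success count is initialized from these results. Here, γ_{i,j,k}(t) denotes the number of times user i has accessed channel k through base station j up to time slot t; r_{i,j,k}(t) ∈ {0, 1} indicates whether user i successfully transmits data through base station j and channel k in time slot t; and the cumulative success count records the number of times user i has successfully transmitted data by accessing channel k through base station j up to time slot t;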

S102: In each time slot t = 1, 2, …, T, the following steps are performed:

1021. For each user i, base station j, and channel k, calculate its weight u_{i,j,k}(t);

1022. Using a greedy algorithm based on the weight set {u_{i,j,k}(t)}, calculate an allocation scheme Φ(t) = {(i, b_i(t), c_i(t))}, where b_i(t) denotes the base station to which user i is to connect in time slot t, and c_i(t) denotes the channel accessed by user i and base station b_i(t) in time slot t;

1023. Let each user i connect to base station b_i(t) and transmit data using channel c_i(t), and update r_{i,j,k}(t) according to the transmission result: when the transmission succeeds, r_{i,j,k}(t) = 1; otherwise, r_{i,j,k}(t) = 0;

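1024. For each user i, base station j, and channel k, update its cumulative success count and γ_{i,j,k}(t): if (i, j, k) ∈ Φ(t), the cumulative success count increases by r_{i,j,k}(t) and γ_{i,j,k}(t) = γ_{i,j,k}(t−1) + 1; otherwise, both keep their values from time slot t−1; then jump to step 1021.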
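The formula for the weight u_{i,j,k}(t) in step 1021 is not reproduced in this text. Purely as an illustration of how a multi-armed bandit weight can be built from the two statistics defined above, the sketch below uses a standard UCB1-style index (empirical success rate plus an exploration bonus). The choice of UCB1, the constant 2, and the name `ucb_weight` are assumptions and should not be read as the patented formula.

```python
import math


def ucb_weight(successes, pulls, t):
    """UCB1-style index for one (user, base station, channel) triple.

    successes: cumulative number of successful transmissions so far.
    pulls:     number of accesses so far (gamma; at least 1 after S101).
    t:         current time slot (t >= 1).
    Assumption: this stands in for u_{i,j,k}(t), whose formula is not given here.
    """
    if pulls <= 0:
        raise ValueError("pulls must be positive (S101 initializes it to 1)")
    mean = successes / pulls                                # empirical success rate
    bonus = math.sqrt(2.0 * math.log(max(t, 1)) / pulls)    # exploration bonus
    return mean + bonus
```

With this kind of index, a triple that has been tried rarely receives a larger bonus, so it is revisited until its estimate is reliable.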

As a preferred aspect of the present invention, the allocation scheme Φ(t) in step 1022 is calculated as follows:

(1) Initialization: set Φ(t) to the empty set.

(2) If the set to be allocated is empty, output the allocation scheme Φ(t); otherwise, execute the following steps:

a. For any base station j and any channel k, let l_{j,k} = min{N′, τ_j}, and compute the set of the l_{j,k} users with the largest weights u_{i,j,k}(t).

b. Compute the base station j^* and channel k^* whose user set from step a attains the largest value.

c. For each user i in that set, allocate base station j^* and channel k^*, i.e.,

Φ(t) ← Φ(t) ∪ {(i, j^*, k^*)}

d. Update the remaining sets and jump to step (2).
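The greedy steps above can be sketched as follows. Several details are lost in this text, so the sketch makes explicit assumptions: the value maximized in step b is taken to be the total weight of the selected users, step d is taken to remove the allocated users, the chosen base station, and the chosen channel, and the loop stops when any of the three sets is exhausted. The function name `greedy_allocation` and the nested-list layout of `weights` are likewise hypothetical.

```python
def greedy_allocation(weights, capacity):
    """Greedy base station and channel allocation (illustrative sketch).

    weights[i][j][k]: weight of user i on base station j and channel k.
    capacity[j]:      capacity tau_j of base station j.
    Returns a set of (user, base_station, channel) triples.
    """
    users = set(range(len(weights)))
    stations = set(range(len(capacity)))
    channels = set(range(len(weights[0][0])))
    phi = set()  # step (1): start from an empty allocation
    # Step (2): assumed stopping rule, stop when any set is exhausted.
    while users and stations and channels:
        best = None  # (total weight, station, channel, chosen users)
        for j in sorted(stations):
            for k in sorted(channels):
                # Step a: the l_jk users with the largest weights on (j, k).
                l_jk = min(len(users), capacity[j])
                ranked = sorted(users, key=lambda i: (-weights[i][j][k], i))
                chosen = ranked[:l_jk]
                # Step b (assumed objective): total weight of the chosen users.
                total = sum(weights[i][j][k] for i in chosen)
                if best is None or total > best[0]:
                    best = (total, j, k, chosen)
        _, j_star, k_star, chosen = best
        # Step c: allocate (j*, k*) to every chosen user.
        for i in chosen:
            phi.add((i, j_star, k_star))
        # Step d (assumed): remove what has been allocated.
        users -= set(chosen)
        stations.discard(j_star)
        channels.discard(k_star)
    return phi
```

Because each round removes one base station and one channel, the loop runs at most min(M, K) times, and the result satisfies the one-station-per-user, one-channel-per-station, and capacity constraints by construction.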

Compared with the prior art, the base station and channel dynamic allocation method based on the multi-arm slot machine online learning mechanism of the invention uses the multi-arm slot machine mechanism to learn online the quality distribution of the channels on the different user-base station transmission links, and improves the overall throughput of the users as far as possible by dynamically allocating base stations and channels to them. The gap between the expected cumulative throughput achieved by the method and the expected cumulative throughput achievable offline (i.e., with the channel quality distribution known) is linear in the number of users, the number of base stations, and the number of channels, and grows only logarithmically, on the order of O(log T), over a finite time horizon T.
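The throughput gap described in this paragraph is what the bandit literature calls regret. One way to write it down is sketched below; the symbols θ_{i,j,k} (the unknown expected success probability of user i on base station j and channel k) and Φ^* (the allocation an offline scheme would use when these probabilities are known) are introduced here for illustration only and do not appear in the text.

```latex
R(T) \;=\; \sum_{t=1}^{T} \sum_{(i,j,k)\in\Phi^{*}} \theta_{i,j,k}
      \;-\; \mathbb{E}\!\left[\, \sum_{t=1}^{T} \sum_{(i,j,k)\in\Phi(t)} \theta_{i,j,k} \right]
```

Under this notation, the stated property is that R(T) is linear in N, M, and K and grows on the order of O(log T), so the average per-slot loss R(T)/T tends to zero as T increases.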

Drawings

FIG. 1 is an overall flowchart of a base station and channel dynamic allocation method according to an embodiment of the present invention;

FIG. 2 is a flowchart of calculating the base station and channel allocation scheme Φ(t) from the weight set {u_{i,j,k}(t)} in an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

On the contrary, the invention is intended to cover alternatives, modifications, and equivalents which may be included within the spirit and scope of the invention as defined by the appended claims. Furthermore, in the following detailed description of the present invention, certain specific details are set forth in order to provide a better understanding of the present invention. It will be apparent to one skilled in the art that the present invention may be practiced without these specific details.

One embodiment provided by the invention is as follows. The flow of the base station and channel dynamic allocation method based on the multi-arm slot machine online learning mechanism is shown in FIG. 1, and specifically comprises the following steps:

Consider a set of N (secondary) users, a set of M base stations, and a set of K channels. Without loss of generality, assume M < K < N. Let τ_j denote the capacity of base station j. Suppose that time is divided into time slots t = 0, 1, 2, …, T. Let γ_{i,j,k}(t) denote the number of times user i has accessed channel k through base station j up to time slot t, let r_{i,j,k}(t) ∈ {0, 1} indicate whether user i successfully transmits data through base station j and channel k in time slot t, and let a cumulative success count record the number of times user i has successfully transmitted data by accessing channel k through base station j up to time slot t. The invention is realized by the following technical scheme:

S101, initialization phase. For any user i, base station j, and channel k, initialize γ_{i,j,k}(0) = 1. Each user i connects in turn to base stations j = 1, 2, …, M and transmits data using channels k = 1, 2, …, K in turn. If the transmission succeeds, r_{i,j,k}(0) = 1; otherwise, r_{i,j,k}(0) = 0. The cumulative success count is initialized from these results.

S102, in each time slot t = 1, 2, …, T, the following steps are executed:

1021. For each user i, base station j, and channel k, calculate its weight u_{i,j,k}(t);

1022. Using a greedy algorithm based on the weight set {u_{i,j,k}(t)}, calculate an allocation scheme Φ(t) = {(i, b_i(t), c_i(t))}, where b_i(t) denotes the base station to which user i is to connect in time slot t, and c_i(t) denotes the channel accessed by user i and base station b_i(t) in time slot t;

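The method for calculating the base station and channel allocation scheme Φ(t) is shown in FIG. 2, and comprises the following specific steps:

(1) Initialization: set Φ(t) to the empty set.

(2) If the set to be allocated is empty, output the allocation scheme Φ(t); otherwise, execute the following steps:

a. For any base station j and any channel k, let l_{j,k} = min{N′, τ_j}, and compute the set of the l_{j,k} users with the largest weights u_{i,j,k}(t).

b. Compute the base station j^* and channel k^* whose user set from step a attains the largest value.

c. For each user i in that set, allocate base station j^* and channel k^*, namely:

Φ(t) ← Φ(t) ∪ {(i, j^*, k^*)}

d. Update the remaining sets and jump to step (2).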

1023. Let each user i connect to base station b_i(t) and transmit data using channel c_i(t), and update r_{i,j,k}(t) according to the transmission result: when the transmission succeeds, r_{i,j,k}(t) = 1; otherwise, r_{i,j,k}(t) = 0;

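1024. For each user i, base station j, and channel k, update its cumulative success count and γ_{i,j,k}(t): if (i, j, k) ∈ Φ(t), the cumulative success count increases by r_{i,j,k}(t) and γ_{i,j,k}(t) = γ_{i,j,k}(t−1) + 1; otherwise, both keep their values from time slot t−1; then jump to step 1021.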
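The bookkeeping in steps S101, 1023, and 1024 can be simulated with a short loop. This is a sketch under stated assumptions: each transmission is modeled as an independent Bernoulli trial with a hidden success probability `success_prob[i][j][k]`, the cumulative success count starts at the outcome of the initialization trial, and steps 1021 and 1022 are delegated to a caller-supplied `allocate` callback. None of these names come from the patent text.

```python
import random


def run_online(success_prob, allocate, horizon, seed=0):
    """Simulate the initialization sweep and the per-slot statistic updates.

    success_prob[i][j][k]: hidden success probability (simulation assumption).
    allocate(successes, pulls, t): returns (i, j, k) triples for slot t,
        standing in for steps 1021 and 1022.
    Returns (total successes over slots 1..horizon, successes, pulls).
    """
    rng = random.Random(seed)
    n = len(success_prob)
    m = len(success_prob[0])
    k_count = len(success_prob[0][0])
    # S101: every triple is tried once, so every access count starts at 1.
    pulls = [[[1] * k_count for _ in range(m)] for _ in range(n)]
    # Assumption: the cumulative success count starts at the trial outcome r(0).
    successes = [[[int(rng.random() < success_prob[i][j][k])
                   for k in range(k_count)]
                  for j in range(m)]
                 for i in range(n)]
    total = 0
    for t in range(1, horizon + 1):
        for i, j, k in allocate(successes, pulls, t):
            r = int(rng.random() < success_prob[i][j][k])  # step 1023: outcome
            successes[i][j][k] += r                        # step 1024: update
            pulls[i][j][k] += 1                            # only allocated triples
            total += r
    return total, successes, pulls
```

Triples that are not in the allocation for a slot are left untouched, which matches the "otherwise" branch of step 1024.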

Claims

1. A base station and channel dynamic allocation method based on a multi-arm slot machine online learning mechanism is characterized by comprising the following steps:

S101: Initialization. For any user i, base station j, and channel k, initialize γ_{i,j,k}(0) = 1; let each user i connect in turn to base stations j = 1, 2, …, M and transmit data using channels k = 1, 2, …, K in turn; if the transmission succeeds, r_{i,j,k}(0) = 1; otherwise, r_{i,j,k}(0) = 0; the cumulative success count is initialized from these results; where γ_{i,j,k}(t) denotes the number of times user i has accessed channel k through base station j up to time slot t;

S102: In each time slot t = 1, 2, …, T, the following steps are performed:

1021. For each user i, base station j, and channel k, calculate its weight u_{i,j,k}(t);

1022. Using a greedy algorithm based on the weight set {u_{i,j,k}(t)}, calculate an allocation scheme Φ(t) = {(i, b_i(t), c_i(t))}, where b_i(t) denotes the base station to which user i is to connect in time slot t, and c_i(t) denotes the channel accessed by user i and base station b_i(t) in time slot t;

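1024. For each user i, base station j, and channel k, update its cumulative success count and γ_{i,j,k}(t): if (i, j, k) ∈ Φ(t), the cumulative success count increases by r_{i,j,k}(t) and γ_{i,j,k}(t) = γ_{i,j,k}(t−1) + 1; otherwise, both keep their values from time slot t−1; then jump to step 1021.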

2. The base station and channel dynamic allocation method based on the multi-arm slot machine online learning mechanism as claimed in claim 1, wherein the method for calculating the allocation scheme Φ(t) in step 1022 comprises:

(1) initialization