CN110365375B

CN110365375B - Beam alignment and tracking method in millimeter wave communication system and computer equipment

Info

Publication number: CN110365375B
Application number: CN201910560646.8A
Authority: CN
Inventors: 黄永明; 刘婷薇; 章建军
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2019-06-26
Filing date: 2019-06-26
Publication date: 2021-06-08
Anticipated expiration: 2039-06-26
Also published as: CN110365375A

Abstract

The invention discloses a beam alignment and tracking method in a millimeter wave communication system. The proposed method models the beam alignment and tracking problem as a random selection optimization problem, where each option is defined by an offset and the number of scanned beams. In the initial stage of each time slot, an option is determined by using a confidence interval upper bound algorithm, a beam set needing training is determined according to the option, then each beam in the training beam set is used for transmitting a pilot signal in turn, and the optimal transmitting beam is determined based on the received signal. And performing data transmission by using the found optimal sending beam in the data transmission stage of the corresponding time slot, and simultaneously updating the related information of the option and determining the option of the next time slot. The beam alignment and tracking method designed by the invention can sense the change rate of the environment, adaptively adjust the beam training strategy, improve the successful probability of beam alignment, reduce the beam training overhead and effectively improve the throughput of the system.

Description

Beam alignment and tracking method in millimeter wave communication system and computer equipment

Technical Field

The invention belongs to the field of wireless communication, and particularly relates to a beam alignment and tracking method in a millimeter wave communication system and computer equipment.

Background

With the continuous development of wireless communication technology, high-speed data services and ubiquitous access demands are increasing explosively. The millimeter wave has a large available bandwidth, and can meet the exponentially-increased wireless transmission demand, thereby drawing wide attention of people. But the path loss of the millimeter wave signal is larger compared to the microwave communication. In order to solve the problem of large path loss of millimeter waves, a large-scale antenna array must be installed at a receiving/transmitting end. Fortunately, the shorter wavelength of the millimeter wave signal facilitates the placement of a large number of antennas in a small space. In wireless communication, channel state information plays an extremely important role in achieving high-speed transmission, but when the number of antennas is large and thus the beam width is narrow, it becomes more difficult to acquire Channel State Information (CSI), particularly in a dynamic environment such as a mobile environment.

Considering that the millimeter wave channel has sparsity, an effective method for obtaining CSI is through beam training/alignment, and accordingly obtains information such as an equivalent channel vector. The transmitting end/receiving end can determine the optimal transmitting/receiving beam through exhaustive search, adaptive search and other modes, but the beam search scheme is mainly suitable for single-user single-stream transmission, and the beam training overhead of a large-scale antenna array system is very large. More seriously, in dynamic or time-varying scenarios, the millimeter wave channel may change so quickly and with too short a coherence time that frequent and accurate beam alignment cannot be achieved. Although beam tracking techniques can be used to reduce training overhead for time-varying channels, beam tracking techniques are very demanding for channel modeling. Since channel modeling depends on the environment, factors influencing channel modeling include the positions of buildings, base stations and users, pedestrians, vehicles and the like, and it is difficult to provide a closed channel modeling expression for a practical complex scene, the conventional beam/channel tracking technology is only suitable for simpler or special occasions.

Frequent beam alignment in dynamic environments is unavoidable for better transmission performance and therefore more challenging. In order to reduce the overhead of beam alignment, effective information must be extracted from past beam alignment information to reduce the search space for subsequent beam alignment. With the rapid development of machine learning, low-overhead beam alignment is possible. The invention provides a beam alignment algorithm based on multi-arm gambling machine learning (bandit learning) from the perspective of sensing the rate of change of the environment by exploring/utilizing correlation techniques in machine/reinforcement learning. The proposed beam alignment method can sense the speed of environmental changes and learn the hidden probability structure information in the environment. Based on the sensed environment and probability structure information, the algorithm only needs to search a very small training beam set, and the corresponding training overhead can be ignored. Moreover, the proposed beam alignment algorithm can learn the required information from the environment, so that the prior knowledge such as the rigorous assumption required by time-varying channel modeling is not required, and the method has wider applicability.

Disclosure of Invention

The purpose of the invention is as follows: aiming at the problems in the prior art, the invention aims to provide a method, a device and equipment for aligning and tracking beams in a millimeter wave communication system, which are used for adaptively adjusting a beam training strategy, so that not only is the mismatching of the beams effectively avoided, but also the beam training overhead is reduced, and the throughput of the system is improved.

The technical scheme is as follows: in order to realize the purpose of the invention, the technical scheme adopted by the invention is as follows: a method for beam alignment and tracking in a millimeter wave communication system includes the following steps:

step (1): the beam alignment and tracking problem is modeled as a random optimization problem.

In this step, a randomly selected optimization model is constructed for a given millimeter wave communication system and a corresponding beam training codebook. For descriptive convenience, a mathematical model and a solving algorithm for random selection optimization are given, and then a beam alignment and tracking problem in communication is modeled into a corresponding random selection optimization problem.

Considering the most basic random selection optimization model, from a random variable family X_i,n(1 ≦ i ≦ K, n ≧ 1), where i represents an index of a random variable type and n represents an index of a random variable time. Successive selection/sampling of type i yields independent identically distributed returns, i.e., random variable X_i,1,X_i,2…. Recording random variable { X_i,nThe mathematical expectation of (u) } is_iNote that the mathematics expects μ_iIs unknown. It should be noted that independence is also applicable to different types of rewards, namely X_i,sAnd X_j,tAre independent of one another (where s, t ≧ 1 and 1 ≦ i < j ≦ K) but generally obey different distributions. Given family of random variables or randomSelection model { X_i,nI is more than or equal to 1 and less than or equal to K, n is more than or equal to 1, and the optimization goal is to select the type of the random variable at each moment n according to a certain strategy so as to maximize the accumulated return.

One effective random variable type selection strategy is the Upper Confidence Bound (UCB) algorithm, which selects the type of random variable at the next time based on past operations and the obtained returns. The UCB algorithm first selects a random variable of each type once and obtains a corresponding reward, and simultaneously counts the usage of each type by n_iInitialisation to 1 and total counter n initialised to K. For the nth (n is more than or equal to K +1) operation, the selected types are as follows:

wherein the content of the first and second substances,

corresponding to the average return of type j after the first n-1 operations, n_jIndicating the number of times type j was selected in the first n-1 operations.

The following is a modeling of the beam alignment and tracking problem in the millimeter wave communication system as a random selection optimization problem, assuming the used beam training code notation:

wherein M represents the number of codebooks, and when the transmitting end uses the ith beam f_iWhen sending signals, the receiving signal expression of the receiving end is as follows:

where P is signal transmission power, h is channel vector of transmitting and receiving end, s represents transmitted pilot symbol/sequence, n_iIs a complex gaussian noise vector.

As shown in fig. 2, each slot includes two phases, i.e., a beam training phase and a data transmission phase, a beam pairThe goal is to determine the best transmit and/or receive beams to improve the transmission efficiency of the system. Considering that the wave beam training occupies certain time resource, the practical reachable rate R is adopted_effAs a performance index for measuring throughput of different beam alignment algorithms, the actual achievable rate is defined as:

R_eff＝(1-t_B/t_S)log(1+P|h^Hf_i|²)

wherein, t_BAnd t_SRespectively representing the beam training time and the total time of each slot.

In order to sense the environment and learn the hidden probability structure information in the environment, the selection type in the random selection optimization model is defined as an integer pair (a, b), wherein a represents the offset of the beam index used by the current time slot relative to the beam index used by the last time slot, b represents the number of beams used by the scanning beam space/subspace, and each type (a, b) corresponds to one beam scanning space

Where c represents the optimal beam index at the last time, and the corresponding single return when type (a, b) is selected is:

wherein, t_BAnd t_SRespectively representing the beam training time and the total time of each slot,

scanning a space corresponding to a beam

The best beam of (2) is formed by a set of all types

Size N, i.e.:

it should be noted that the value of N is related to the specific communication system and the usage environment, such as: n should be increased if the number of antennas at the transmitting end is large or the environment in which the antennas are used is changed drastically.

Step (2): the initialization operation, two steps, namely, determining an initial reference beam and initializing each type, namely, performing each type once respectively.

(2.1) finding an initial reference beam: using codebooks in sequence in the first time slot

The beam in (1) sends a pilot signal, and for the received signal of the ith beam mobile user terminal, the following is:

y_i＝h^Hf_is+n_i

where h is the channel vector between the base station and the mobile subscriber,

representing a complex Gaussian noise vector, s representing a length L_PPilot sequence of { y }_iDetermine the best transmit beam as

Wherein i ∈ argmax_i||y_i||/L_PI.e. the index of the beam with the largest average received signal power.

(2.2) initialize each type: selecting each type once in each time slot of the next N time slots, and constructing corresponding beam scanning space for each type (a, b)

And transmitting a pilot signal using a beam in a beam scanning space and determining an initial backoff value

Wherein the content of the first and second substances,

to correspond to the beam space

The optimal transmission beam of (1) initializes each type of use count or the number of operations to 1, and makes the total number of operations N equal to N + 1.

And (3): the beam alignment and tracking operation is executed in the beginning stage or the beam training stage of each following time slot, and the related information is updated at the same time, which includes two steps:

(3.1) selecting a type using a confidence interval upper bound algorithm, i.e. as follows:

from which the arm (a) can be determined^*,b^*) Beam scanning space of

n_a,bIndicating the usage count of the arms (a, b).

(3.2) use of

The beam in (1) transmits a pilot signal, and the received signal can determine the space corresponding to the beam

The best transmit beam.

In particular, assume beam f_iIs expressed as

The length of the pilot signal s being L_pThen beam f_iCorresponding average received power Y_iIs composed of

The beam having the largest average received power is regarded as the best transmission beam.

(3.3) transmitting data using the best beam obtained in step (3) for the remaining time of the slot, i.e., the effective data transmission stage, while updating n and type (a) as follows^*,b^*) The related information of (2):

n←n+1,

wherein the content of the first and second substances,

type of representation (a)^*,b^*) The count of the number of uses of (c),

type of representation (a)^*,b^*) The average return of (a) is,

type of representation (a)^*,b^*) The rate is effectively reached in this report.

Furthermore, the present invention also proposes a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, which computer program, when loaded into the processor, implements the beam alignment and tracking method in a millimeter wave communication system as described.

Has the advantages that: compared with the prior art, the technical scheme of the invention has the following beneficial technical effects:

(1) the beam alignment and tracking method disclosed by the invention can sense the change rate of the environment, adaptively adjust the beam training strategy, effectively avoid beam mismatch and improve the success rate of beam alignment.

(2) The beam alignment and tracking method disclosed by the invention has low beam training overhead and reserves more time for data transmission, thereby effectively improving the throughput of the system.

Drawings

FIG. 1 is an algorithmic flow chart of a beam alignment and tracking method of an embodiment of the present invention;

FIG. 2 is a diagram illustrating beam training and data transmission time allocation for each slot in an embodiment of the present invention;

FIG. 3 is a diagram illustrating a dynamic communication environment in an embodiment of the present invention;

FIG. 4 is an alignment success rate performance curve of the beam alignment and tracking method in an embodiment of the present invention;

fig. 5 is a plot of achievable rate performance for the beam alignment and tracking method in an embodiment of the invention.

Detailed Description

The invention is further elucidated with reference to the drawings and the embodiments.

As shown in fig. 1, the method for beam alignment and tracking in a millimeter wave communication system disclosed in the embodiment of the present invention mainly includes the following steps:

Considering the most basic random selection optimization model, from a random variable family X_i,n(1 ≦ i ≦ K, n ≧ 1), where i represents an index of a random variable type and n represents an index of a random variable time. Successive selection/sampling of type i yields independent identically distributed returns, i.e., random variable X_i,1,X_i,2…. Recording random variable { X_i,nThe mathematical expectation of (u) } is_iNote that the mathematics expects μ_iIs unknown. Finger for patientIt is noted that independence is also applicable to different types of rewards, namely X_i,sAnd X_j,tAre independent of one another (where s, t ≧ 1 and 1 ≦ i < j ≦ K) but generally obey different distributions. Given a family of random variables or a randomly selected model { X_i,nI is more than or equal to 1 and less than or equal to K, n is more than or equal to 1, and the optimization goal is to select the type of the random variable at each moment n according to a certain strategy so as to maximize the accumulated return.

wherein the content of the first and second substances,

wherein, P is signal transmission power, h is channel vector of transmitting and receiving end, s represents transmissionSending pilot symbols/sequences, n_iIs a complex gaussian noise vector.

As shown in fig. 2, each slot includes two phases, i.e., a beam training phase and a data transmission phase, and the aim of beam alignment is to determine an optimal transmission and/or reception beam to improve the transmission efficiency of the system. Considering that the wave beam training occupies certain time resource, the practical reachable rate R is adopted_effAs a performance index for measuring throughput of different beam alignment algorithms, the actual achievable rate is defined as:

R_eff＝(1-t_B/t_S)log(1+P|h^Hf_i|²)

scanning a space corresponding to a beam

The best beam of (2) is formed by a set of all types

Size N, i.e.:

y_i＝h^Hf_is+n_i

And using beam sweepingPilot signals are transmitted for beams in space and an initial return value is determined

Wherein the content of the first and second substances,

to correspond to the beam space

from which the arm (a) can be determined^*,b^*) Beam scanning space of

n_a,bIndicating the usage count of the arms (a, b).

(3.2) use of

The best transmit beam.

In particular, assume beam f_iIs expressed as

n←n+1,

wherein the content of the first and second substances,

type of representation (a)^*,b^*) The count of the number of uses of (c),

type of representation (a)^*,b^*) The average return of (a) is,

The method for aligning and tracking the wave beam in the millimeter wave communication system provided by the embodiment of the invention can learn the hidden probability structure information in the environment from the perspective of sensing the change rate of the communication environment. Based on the learned probability structure information, the interval of the optimal beam of the next time slot can be effectively determined, and because the interval is usually very small, the corresponding training overhead can be ignored, and the throughput of the system is effectively improved. Moreover, the proposed algorithm can learn the required information from the environment, does not need prior knowledge related to channel modeling, and can be suitable for more complex dynamic scenes.

It should be noted that the beam alignment and tracking method in the millimeter wave communication system disclosed by the invention is suitable for various complex dynamic scenes. To facilitate an understanding of the aspects and effects of the present invention, the present invention provides another exemplary embodiment considering a communication system environment as shown in fig. 3. The motion trail of the mobile user is a circle, and the base station is positioned at the center of the circle. The speed of motion of the mobile user is assumed to be random, but obeys a certain probability distribution. Due to the randomness of the motion of the mobile user, the beams switched in the next time slot are also random and obey a certain probability distribution as well. In order to capture the random behavior of the mobile user, a Markov random process is introduced to model the beam switching process of different time slots. Specifically, let f_cThe beam used by the current time slot is represented, and the base station of the next time slot is switched to { f_c+1,f_c+2,...,f_c+SHas a probability p of one of_sStill using the current beam f_cHas a probability of 1-p_s. Is switching to { f_c+1,f_c+2,...,f_c+SUnder the condition of a certain wave beam in the wave beam f, the base station is switched to the wave beam f_c+kThe conditional probability of (2) decays exponentially with k, i.e.

Where β represents the decay rate.

Assuming that the antenna array used by the base station is a uniform linear array, and the number of antennas is N-64, the mobile subscriber only installs a single antenna. Codebook used by base station

By uniformly sampling the beam space configuration, namely:

where a (-) denotes an array response vector of the antenna array, and the codebook size is M ═ 64. Let d and λ denote the spacing between adjacent antennas and the signal wavelength, respectively, then the mathematical expression for a (-) is:

the channel model is an extended Saleh-Vallenzuela geometric channel model, and the channel vector h between the base station and the mobile user can be expressed as

Where L and β are the number of paths and the average path loss, respectively (β is set to 1 in the simulation), α_lComplex path gain, phi, representing the first path_l＝cos(θ_l) Is the cosine of the angle of departure (where θ_lIndicating the exit angle of the first diameter). Assuming that the time consumed for transmitting a pilot symbol is a unit time t_UWhen the length of the pilot signal is L_pAnd the time consumed for selecting type (a, b) for beam training is t_B＝bL_pt_U。

The beam alignment and tracking method in the millimeter wave communication system provided by the embodiment comprises the following steps:

step (1): and (3) performing mathematical modeling on the beam alignment and tracking problem, and randomly selecting an optimization model by modeling, wherein the key of the mathematical modeling is a definition type. In the present embodiment, the number of types is K-9, i.e.

Where a represents an offset of a beam index used in a current slot with respect to a beam index used in a previous slot, and b represents a size of a beam scanning space.

Assuming a last time slotThe beam used for data transmission is f_cThen the beam corresponding to type (a, b) scans the space

Is composed of

The corresponding single reward is:

wherein the content of the first and second substances,

to correspond to the beam space

The best transmit beam.

Step (2): the initialization operation mainly includes two steps, i.e., determining an initial reference beam and initializing each type (i.e., selecting and executing a beam training process corresponding to each type).

First, finding an initial reference beam: using codebooks in sequence in the first time slot

The beam in (1) transmits pilot signal, and for the received signal of the ith beam mobile user terminal is

y_i＝h^Hf_is+n_i，

Where P is the transmit power, h is the channel vector between the base station and the mobile user,

representing a complex Gaussian noise vector, s representing a length L_PThe pilot sequence of (1). By { y_iDetermine the best transmit beam as

Second, initialize each type: performing a beam training operation for each type in the next time slot of K-9, and constructing a corresponding beam scanning space for each type (a, b)

And using the beam in the beam scanning space to transmit the pilot signal and determine the initial return value

Wherein the content of the first and second substances,

to correspond to the beam space

The best transmit beam. The usage count or the number of operations of each arm is initialized to 1, and the total number of operations is made n + K +1 to 10.

first, a confidence interval upper bound algorithm is used, i.e., a type is selected as follows:

from which the type (a) can be determined^*,b^*) Beam scanning space of

Second step, using in sequence

The best transmit beam. In particular, assume beam f_iIs expressed as

Thirdly, in the remaining time of the time slot, namely the effective data transmission stage, the data is transmitted by using the optimal beam obtained in the step (3), and the n and the type (a) are updated according to the following formula^*,b^*) The related information of (2):

n←n+1,

wherein the content of the first and second substances,

type of representation (a)^*,b^*) The count of the number of uses of (c),

type of representation (a)^*,b^*) The average return of (a) is,

The success rate of beam alignment for the beam alignment and tracking method in the millimeter wave communication system disclosed in this embodiment is shown in fig. 4. In order to show the superiority of the method disclosed by the present invention, fig. 4 also provides the beam alignment success rate performance of the other two algorithms (beam alignment algorithm ESBB based on exhaustive search and beam alignment algorithm DUBB directly based on UCB). The DUBB approach performs the worst in terms of alignment success rate among the three algorithms because the DUBB only chooses one beam at a time randomly, often with beam mismatch. The ESBB method performs best because the exhaustive search can find the best transmit beam. It can be seen from fig. 4 that the success rate of the beam alignment and tracking method disclosed in the present embodiment gradually approaches the ESBB as the signal-to-noise ratio increases.

The actual achievable rate performance of the beam alignment and tracking method disclosed in this embodiment is shown in fig. 5. In order to embody the superiority of the method disclosed by the present invention, fig. 5 also provides the actual achievable rate performance of the other three algorithms (beam alignment algorithm ESBB based on exhaustive search, beam alignment algorithm DUBB directly based on UCB, and algorithm-idealized algorithm based on Oracle, without training overhead and providing the best transmission beam each time). Compared with an ESBB algorithm and an algorithm directly based on UCB, the method for aligning and tracking the beam disclosed by the embodiment can obtain the optimal reachable rate performance, and is characterized in that the method disclosed by the invention can sense the change of the environment and adaptively adjust the beam training strategy, so that the training overhead is effectively reduced, more time is reserved for a data transmission stage, and the throughput of the system is improved.

Claims

1. A method for beam alignment and tracking in a millimeter wave communication system, the method comprising the steps of:

(1) modeling a beam alignment and tracking problem in a millimeter wave communication system as a random selection optimization problem, wherein options in the optimization model are defined as integer pairs (a, b), and an optimization target is maximized cumulative return, namely a cumulative reachable rate;

the method of step (1) is specifically as follows:

(1.1) the beam training code used is assumed to be written as:

where P is signal transmission power, h is channel vector of transmitting and receiving end, s represents transmitted pilot symbol/sequence, n_iIs a complex gaussian noise vector;

(1.2) each time slot comprises two stages, namely a beam training stage and a data transmission stage, the aim of beam alignment is to determine the optimal receiving and transmitting beam to improve the transmission efficiency of the system, and the actual achievable rate R is adopted in consideration of certain time resources occupied by beam training_effAs a performance index for measuring throughput of different beam alignment algorithms, the actual achievable rate is defined as:

R_eff＝(1-t_B/t_S)log(1+P|h^Hf_i|²)

wherein, t_BAnd t_SRespectively representing the beam training time and the total time of each time slot;

(1.3) defining each selectable item in the randomly selected optimization model as an integer pair (a, b), where a represents the offset of the beam index used by the current slot relative to the beam index used by the last slot, b represents the number of beams used when scanning the beam space/subspace, and each option (a, b) corresponds to one beam scanning space

Where c represents the optimal beam index at the last time, and the corresponding single return when (a, b) is selected is:

scanning a space corresponding to a beam

The best beam in the set of global options is

Its size is N, i.e.:

(2) initializing operation, namely determining an initial optimal reference beam and initializing each option;

(3) in the wave beam training stage of each time slot, a confidence interval upper bound algorithm is applied to determine an option, a corresponding training wave beam set is determined according to the option, wave beams in the set are used for sending pilot signals in sequence, and the best wave beam is found out according to the received signals.

2. The method for beam alignment and tracking in a millimeter wave communication system according to claim 1, wherein the specific method in step (2) is as follows:

The beam in (1) sends a pilot signal, and for a received signal of the ith beam user terminal, the received signal is:

y_i＝h^Hf_is+n_i

wherein the content of the first and second substances,h is the channel vector between the base station and the mobile user,

representing a complex Gaussian noise vector, s representing a length L_PPilot sequence of { y }_iDetermine the best transmission beam as

Wherein i^*∈arg max_i||y_i||/L_PI is the index of the beam with the maximum average power of the received signal, and I is an identity matrix;

(2.2) initialize each option: selecting each option once in each of the next N time slots, and constructing corresponding beam scanning space for each option (a, b)

Transmitting a pilot signal using a beam in a beam sweep space and determining an initial reward value

Wherein the content of the first and second substances,

to correspond to the beam space

The number of times of operation or the use count of each option is initialized to 1, and the total number of operations is made N + 1.

3. The method for beam alignment and tracking in a millimeter wave communication system according to claim 2, wherein the specific method in step (3) is as follows:

(3.1) selecting an option using a confidence interval upper bound algorithm, i.e. as follows:

accordingly, determine (a)^*,b^*) Beam scanning space of

n_a,bRepresents the usage count of (a, b);

(3.2) use of

The beam in (1) transmits a pilot signal, and the beam space is determined from the received signal

The optimal transmit beam of (a);

suppose a beam f_iIs expressed as

The length of the pilot signal s being L_pThen beam f_iCorresponding average received power Y_iComprises the following steps:

regarding the beam with the maximum average received power as the optimal transmission beam;

(3.3) transmitting data using the best beam obtained in step (3.2) for the remaining time of the slot, i.e., the effective data transmission stage, while updating n and option (a) as follows^*,b^*) The related information of (2):

wherein the content of the first and second substances,

presentation options (a)^*,b^*) The count of the number of uses of (c),

presentation options (a)^*,b^*) The average return of (a) is,

presentation options (a)^*,b^*) The rate is effectively reached in this report.

4. A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, the computer program when loaded into the processor implementing the beam alignment and tracking method in a millimeter wave communication system of any of claims 1 to 3.