CN113708886A - Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method - Google Patents

Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method Download PDF

Info

Publication number
CN113708886A
CN113708886A CN202110983755.8A CN202110983755A CN113708886A CN 113708886 A CN113708886 A CN 113708886A CN 202110983755 A CN202110983755 A CN 202110983755A CN 113708886 A CN113708886 A CN 113708886A
Authority
CN
China
Prior art keywords
interference
signal
intelligent
beam forming
unmanned aerial
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110983755.8A
Other languages
Chinese (zh)
Inventor
陈瑾
侯志峰
罗屹洁
方贵
谷江春
徐逸凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Army Engineering University of PLA
Original Assignee
Army Engineering University of PLA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Army Engineering University of PLA filed Critical Army Engineering University of PLA
Priority to CN202110983755.8A priority Critical patent/CN113708886A/en
Publication of CN113708886A publication Critical patent/CN113708886A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04KSECRET COMMUNICATION; JAMMING OF COMMUNICATION
    • H04K3/00Jamming of communication; Counter-measures
    • H04K3/20Countermeasures against jamming
    • H04K3/22Countermeasures against jamming including jamming detection and monitoring
    • H04K3/224Countermeasures against jamming including jamming detection and monitoring with countermeasures at transmission and/or reception of the jammed signal, e.g. stopping operation of transmitter or receiver, nulling or enhancing transmitted power in direction of or at frequency of jammer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/02Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas
    • H04B7/04Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas
    • H04B7/04013Intelligent reflective surfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/02Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas
    • H04B7/04Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas
    • H04B7/06Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station
    • H04B7/0613Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission
    • H04B7/0615Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal
    • H04B7/0617Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the transmitting station using simultaneous transmission of weighted versions of same signal for beam forming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/02Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas
    • H04B7/04Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas
    • H04B7/08Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the receiving station
    • H04B7/0837Diversity systems; Multi-antenna system, i.e. transmission or reception using multiple antennas using two or more spaced independent antennas at the receiving station using pre-detection combining
    • H04B7/0842Weighted combining
    • H04B7/086Weighted combining using weights depending on external parameters, e.g. direction of arrival [DOA], predetermined weights or beamforming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04KSECRET COMMUNICATION; JAMMING OF COMMUNICATION
    • H04K3/00Jamming of communication; Counter-measures
    • H04K3/20Countermeasures against jamming
    • H04K3/22Countermeasures against jamming including jamming detection and monitoring
    • H04K3/224Countermeasures against jamming including jamming detection and monitoring with countermeasures at transmission and/or reception of the jammed signal, e.g. stopping operation of transmitter or receiver, nulling or enhancing transmitted power in direction of or at frequency of jammer
    • H04K3/228Elimination in the received signal of jamming or of data corrupted by jamming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04KSECRET COMMUNICATION; JAMMING OF COMMUNICATION
    • H04K3/00Jamming of communication; Counter-measures
    • H04K3/40Jamming having variable characteristics
    • H04K3/43Jamming having variable characteristics characterized by the control of the jamming power, signal-to-noise ratio or geographic coverage area
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04KSECRET COMMUNICATION; JAMMING OF COMMUNICATION
    • H04K3/00Jamming of communication; Counter-measures
    • H04K3/40Jamming having variable characteristics
    • H04K3/45Jamming having variable characteristics characterized by including monitoring of the target or target signal, e.g. in reactive jammers or follower jammers for example by means of an alternation of jamming phases and monitoring phases, called "look-through mode"
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W16/00Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
    • H04W16/22Traffic simulation tools or models
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

An unmanned aerial vehicle anti-interference communication system and a joint track and beam forming optimization method belong to the technical field of wireless communication anti-interference, a communication network resisting full-band strong power interference is considered, a base station in the network provides communication service for mobile users, and meanwhile, a full-band omnidirectional jammer interferes a legal communication link in a power suppression mode. The jammer has certain concealment, and information of the jammer cannot be accurately acquired by both legal communication parties. And acquiring state information by using the unmanned aerial vehicle to sense the environment. And selecting the action of the optimal return value estimated by the neural network according to the probability by searching and utilizing the strategy through epsilon-greedy. The intelligent reflection unmanned aerial vehicle obtains a real return value of the action through an execution scheme, and stores experience information in a sliding storage window. And after the experience information in the multi-step experience pool reaches a certain amount, updating the neural network, and fitting the environment through the neural network to perform optimal decision. The method can be well applied to a real-time anti-interference communication scene.

Description

Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method
Technical Field
The invention relates to the technical field of wireless communication anti-interference, in particular to an unmanned aerial vehicle anti-interference communication system and a joint track and beam forming optimization method.
Background
When the full-band interference with strong power exists in a scene, the realization of reliable and robust information transmission is an important research direction. Due to the double suppression of interference in the frequency domain and the power domain, the common anti-interference means is difficult to solve. The intelligent reflection surface has the advantages of passive reflection, flexible phase adjustment, convenient deployment and the like as a new technology, and has wide prospect in solving the problem of anti-interference communication in the scene. Through combining the unmanned aerial vehicle technique, with intelligent reflection surface deployment on unmanned aerial vehicle, through adjusting the unmanned aerial vehicle position in real time, in the dynamic environment, the interference immunity of system can obtain further promotion.
Disclosure of Invention
The invention provides an unmanned aerial vehicle anti-interference communication system and a joint track and beam forming optimization method, which are well applied to a real-time anti-interference communication network scene to realize efficient anti-interference communication under the condition of full-band strong-power interference. On the basis of a deep reinforcement learning algorithm, the passive beam forming action decision space is greatly reduced by utilizing the phase change of the intelligent reflection surface and the high coupling of beam forming, and meanwhile, multi-step experience citation is introduced into the trajectory planning, so that the accuracy of path decision is further improved through the analysis of multi-step paths.
The technical solution for realizing the purpose of the invention is as follows: an anti-interference communication model under a dynamic unknown environment is characterized in that: consider a communication network that is resistant to full-band strong power interference, in which there is a base station for performing communication coverage service for mobile users, but a full-band omni-directional jammer interferes with legitimate communications in a power-suppressing manner. The jammer has certain concealment, and information of the jammer cannot be accurately acquired by both legal communication parties.
In a certain topological state, the signal received by the mobile user D is defined as:
Figure BDA0003229862780000011
wherein, PSFor base station transmitting power, PJIs the jammer interference power, HS、HJ、hSDAnd hJDThe gain of a direct link channel between a base station S and a mobile user D, the gain of a direct link channel between an interference machine J and the mobile user D, the combined channel gain of a signal of the base station S which is reflected by an intelligent reflection surface U and reaches the mobile user D, and the combined channel gain of a signal of the interference machine J which is reflected by the intelligent reflection surface U and reaches the mobile user D are obtained. x is the number ofSFor transmitting signals to the base station, xJFor jammers to transmit signals, z is white gaussian noise received by mobile user D, obeying a mean of 0 and a variance of δ2A gaussian distribution of (a). Therefore, the signal to interference plus noise ratio of the mobile user D in a certain topological state is specifically expressed as:
Figure BDA0003229862780000021
thus, the achievable communication rate received by mobile user D is expressed as:
Figure BDA0003229862780000022
considering that the unmanned aerial vehicle motion trajectory and the intelligent reflection surface passive beam forming are optimized in a combined mode, wherein a strong coupling relation exists between an intelligent reflection surface channel and a beam forming direction, and therefore a user receiving channel after being reflected by the intelligent reflection surface is specifically represented as follows:
Figure BDA0003229862780000023
wherein G ist,GrAnd G is the antenna gain of the transmitting end, the receiving end and the intelligent reflection surface reflection unit, M and K are the number of rows and columns of the intelligent reflection surface including the reflection unit, and dxAnd dyIs the row and column width of the reflective element, d1And d2Is the distance, θ, between the intelligent reflective surface and the transmitting and receiving endstAnd thetarIs the angle between the transmitting end signal and the receiving end signal and the x axis,
Figure BDA0003229862780000024
and
Figure BDA0003229862780000025
is the included angle between the transmitting end signal and the receiving end signal and the z axis, A is the adjustment value of the intelligent reflecting surface to the signal amplitude, lambda is the wavelength of the signal,
Figure BDA0003229862780000026
is the normalized power radiation function of the reflecting element,
Figure BDA0003229862780000027
is a beamforming direction offset function, which reflects the signal amplitude attenuation caused by beamforming direction offset, and is specifically expressed as:
Figure BDA0003229862780000028
wherein,
Figure BDA0003229862780000029
φk,mis the phase variation value corresponding to the first row and column of the reflective unit, when the beam forming direction is completely aligned with the receiving end, the phase variation value of the reflective unit can be expressed as
Figure BDA0003229862780000031
According to the coupling relation between the intelligent reflecting surface phase and the beam forming direction, the algorithm complexity of phase design can be reduced through the beam forming direction.
In the communication process of the sender and the intelligent reflection surface, firstly state perception is carried out, then an optimization algorithm based on deep reinforcement learning is executed, optimization of position deployment and beam forming is carried out according to probability, and the probability is adjusted according to a feedback value after communication. Wherein, the optimization algorithm comprises the following steps:
step 1, modeling the anti-interference communication problem into a Markov decision process by designing a state, an action, a return value and a transition probability function
And 2, initializing a system scene and a neural network, and updating parameters of the neural network according to state collection, action decision and feedback.
And 3, recording the path after the algorithm is executed and the beam forming scheme.
According to the invention, through the design of the passive beam forming model, the action state space of the phase change optimization algorithm of the large-scale intelligent reflection surface is reduced, the enhancement of the reinforcement learning convergence speed in the path planning process is facilitated, and the reaction speed of the algorithm and the adaptability under an anti-interference scene are further improved.
The transmitting end of the invention obtains the position and the channel state information of the mobile user through the channel perception and the state information communication link of the unmanned aerial vehicle, and can directly correct the beam forming direction from the transmitting end to the intelligent reflecting surface end under the condition of setting the beam forming direction. On the other hand, because the information of the jammer is unknown, the information of the jammer is explored in a deep reinforcement learning interaction mode, and the optimal intelligent reflector passive beam forming scheme and the flight track of the unmanned aerial vehicle are searched by utilizing the real-time control of the beam forming direction.
The passive beam forming design which can be executed by the intelligent reflecting surface after the channel state is sensed can be divided into two conditions: the useful signal is enhanced and the interference signal is reduced.
The useful signal is enhanced, namely the phase between the signal which reaches the receiving end of the user after the transmitting end is reflected by the intelligent reflection surface and the direct signal of the transmitting end and the receiving end is the same, the strength of the useful signal after the receiving end superposes the signal is far greater than that of the interference signal, and therefore the anti-interference purpose is achieved.
Reduce interfering signal, no matter how big power interfering signal interferes with, intelligence reflecting surface carries out refocusing at the receiving terminal after the signal that sends with the interference reflects, but the signal is through reverse phase processing, consequently the interfering signal of direct incidence and the interfering signal of reflection at the receiving terminal can offset, for useful signal, interfering signal obtains reducing, anti-interference communication effect obtains promoting.
The invention provides an intelligent reflection surface unmanned aerial vehicle anti-interference communication system and a joint track and beam forming optimization method based on deep reinforcement learning, and aims to provide a scheme for realizing efficient anti-interference communication under the condition of full-band strong-power interference. On the basis of a deep reinforcement learning algorithm, the passive beam forming action decision space is greatly reduced by utilizing the phase change of the intelligent reflection surface and the high coupling of beam forming, and meanwhile, multi-step experience citation is introduced into the trajectory planning, so that the accuracy of path decision is further improved through the analysis of multi-step paths.
Drawings
Fig. 1 is a network schematic diagram of an unmanned aerial vehicle anti-jamming communication system.
Fig. 2 is a schematic diagram of a trajectory and beam forming joint optimization method based on deep reinforcement learning.
Fig. 3 is a schematic topology diagram in embodiment 1 of the present invention.
FIG. 4 is a graph comparing the performance of the algorithm of the present patent with other algorithms
Fig. 5 is a graph comparing the performance of the algorithm of the present patent for different jammer locations.
Detailed Description
The technical scheme of the invention is described in detail in the following with reference to the attached drawings:
as shown in fig. 1, an anti-interference communication system for an unmanned aerial vehicle considers a communication network resistant to full-band strong power interference, where a base station exists in the communication network to perform communication coverage service for a mobile user, but a full-band omni-directional jammer interferes with legal communication in a power-suppressing manner. This jammer possesses certain disguise, and its information can't accurately acquire to legal both sides of communication, in order to reach maximum throughput, through jointly adjusting intelligent reflection surface unmanned aerial vehicle's position and the passive beam forming of intelligent reflection surface, fully explores the anti-interference efficiency of this system.
Further, in a certain topological state, the signal received by the mobile user D is defined as:
Figure BDA0003229862780000041
wherein, PSFor base station transmitting power, PJIs the jammer interference power, HS、HJ、hSDAnd hJDThe gain of a direct link channel between a base station S and a mobile user D, the gain of a direct link channel between an interference machine J and the mobile user D, the combined channel gain of a signal of the base station S which is reflected by an intelligent reflection surface U and reaches the mobile user D, and the combined channel gain of a signal of the interference machine J which is reflected by the intelligent reflection surface U and reaches the mobile user D are obtained. x is the number ofSFor transmitting signals to the base station, xJFor jammers to transmit signals, z is white gaussian noise received by mobile user D, obeying a mean of 0 and a variance of δ2A gaussian distribution of (a). Therefore, the signal to interference plus noise ratio of the mobile user D in a certain topological state is specifically expressed as:
Figure BDA0003229862780000051
thus, the achievable communication rate received by mobile user D is expressed as:
Figure BDA0003229862780000052
considering that the unmanned aerial vehicle motion trajectory and the intelligent reflection surface passive beam forming are optimized in a combined mode, wherein a strong coupling relation exists between an intelligent reflection surface channel and a beam forming direction, and therefore a user receiving channel after being reflected by the intelligent reflection surface is specifically represented as follows:
Figure BDA0003229862780000053
wherein G ist,GrAnd G is the antenna gain of the transmitting end, the receiving end and the intelligent reflection surface reflection unit, M and K are the number of rows and columns of the intelligent reflection surface including the reflection unit, and dxAnd dyIs the row and column width of the reflective element, d1And d2Is the distance, θ, between the intelligent reflective surface and the transmitting and receiving endstAnd thetarIs the angle between the transmitting end signal and the receiving end signal and the x axis,
Figure BDA0003229862780000054
and
Figure BDA0003229862780000055
is the included angle between the transmitting end signal and the receiving end signal and the z axis, A is the adjustment value of the intelligent reflecting surface to the signal amplitude, lambda is the wavelength of the signal,
Figure BDA0003229862780000056
is the normalized power radiation function of the reflecting element,
Figure BDA0003229862780000057
is a beamforming direction offset function, which reflects the signal amplitude attenuation caused by beamforming direction offset, and is specifically expressed as:
Figure BDA0003229862780000058
wherein,
Figure BDA0003229862780000059
φk,mis a first row and column reflection unitThe corresponding phase variation value, when the beam forming direction is completely aligned with the receiving end, the phase variation value of the reflection unit can be expressed as
Figure BDA00032298627800000510
According to the coupling relation between the intelligent reflecting surface phase and the beam forming direction, the algorithm complexity of phase design can be reduced through the beam forming direction.
As shown in fig. 2, the specific process of the deep reinforcement learning-based joint trajectory and beamforming optimization method of the present invention is as follows: in the communication process of the sender and the intelligent reflection surface, firstly state perception is carried out, then an optimization algorithm based on deep reinforcement learning is executed, optimization of position deployment and beam forming is carried out according to probability, and the probability is adjusted according to a feedback value after communication. Wherein, the optimization algorithm comprises the following steps:
step 1, modeling the anti-interference communication problem into a Markov decision process by designing a state, an action, a return value and a transition probability function
And 2, initializing a system scene and a neural network, and updating parameters of the neural network according to state collection, action decision and feedback.
And 3, recording the path after the algorithm is executed and the beam forming scheme.
In the step 1, through the design of a passive beam forming model, the action state space of the phase change optimization algorithm of the large-scale intelligent reflection surface is reduced, the enhancement of the reinforcement learning convergence speed in the path planning process is facilitated, and the reaction speed of the algorithm and the adaptability of the algorithm in an anti-interference scene are further improved. In step 2, the transmitting end obtains the position and channel state information of the mobile user through channel sensing and a state information communication link of the unmanned aerial vehicle, and under the condition of a set beam forming direction, the beam forming direction from the transmitting end to the intelligent reflection surface end can be directly corrected. On the other hand, because the information of the jammer is unknown, the information of the jammer is explored in a deep reinforcement learning interaction mode, and the optimal intelligent reflector passive beam forming scheme and the flight track of the unmanned aerial vehicle are searched by utilizing the real-time control of the beam forming direction.
In the deep reinforcement learning algorithm, the method comprises the following steps:
initialization, setting the number of iterations to i-1, setting state-action-reporting storage space, neural network initialization parameters, and exploration-utilization probability epsilon.
Step 11, the intelligent reflection surface-unmanned aerial vehicle interacts with the environment to obtain the state s at the moment ii
And step 12, selecting the motion of the motion trail and the passive beam forming phase design by the intelligent reflecting surface-unmanned aerial vehicle according to the probability. Exploration-utilization strategy through epsilon-greeny, which is specifically expressed as:
Figure BDA0003229862780000061
wherein, aiFor the selection of the action of the intelligent reflecting surface-unmanned plane at time i, siFor the Intelligent reflective surface-the State of the drone at time i, Q(s)i,ai) For intelligent reflective surfaces-unmanned aerial vehicle at siIs made in a state ofiThe return value after the action is according to formula (7) at siThe intelligent reflecting surface has the action with the probability of epsilon to select the action with the maximum return value, and has the probability of 1-epsilon to randomly select the action. By gradually increasing the value of epsilon in the iterative process-the drone can seek a balance between exploration and exploitation, getting the optimal solution faster, exploring all the actions as much as possible.
Step 13, perform action aiThe next time state s is reachedi+1And obtaining the return value, and sending the state-action-return value-next-time state [ s ]i,ai,ri,si+1]Stored in a sliding memory window.
Step 14, let i equal i +1 and si=si+1And repeating the steps 11-14 until the sliding memory window is filled with historical experience data.
Step 15, after the sliding storage window is filled with the historical empirical data, calculating the multistep empirical data, wherein the multistep return value is specifically represented as:
Figure BDA0003229862780000071
where N is the length of the sliding storage window, γ is the discount rate of the reported value, which reflects the influence of future reported value on the current action, and when γ is 1, the influence of unreported value is fully considered, but the excessive oscillation of the reported value is caused, which affects the convergence effect. When γ is equal to 0, only the influence of the reported value at the next time is considered, and the algorithm has a fast decision speed, but loses the grasp of the future trend.
After obtaining the multi-step reward value, the multi-step experience information si,ai:i+N,si+N,ri:i+N]Stored in an experience pool for further updating of the neural network. At the same time, new s is continuously acquiredi,ai,ri,si+1]When the information is stored in the sliding memory window, the old information is discarded, and new multi-step experience information s is obtained continuously through the sliding memory windowi,ai:i+N,si+N,ri:i+N]And continuously storing the data into an experience pool.
And step 16, when the multi-step experience information in the experience pool reaches a certain amount, updating the neural network, otherwise, repeating the steps 11-15.
And step 17, updating the neural network parameters. And the value of epsilon is continuously increased during the iteration process. So that the intelligent reflective surface-unmanned aerial vehicle makes action selection with a large return value in a trending manner.
The passive beam forming design which can be executed by the intelligent reflecting surface after the channel state is sensed can be divided into two conditions: the useful signal is enhanced and the interference signal is reduced.
The useful signal is enhanced, namely the phase between the signal which reaches the receiving end of the user after the transmitting end is reflected by the intelligent reflection surface and the direct signal of the transmitting end and the receiving end is the same, the strength of the useful signal after the receiving end superposes the signal is far greater than that of the interference signal, and therefore the anti-interference purpose is achieved.
Reduce interfering signal, no matter how big power interfering signal interferes with, intelligence reflecting surface carries out refocusing at the receiving terminal after the signal that sends with the interference reflects, but the signal is through reverse phase processing, consequently the interfering signal of direct incidence and the interfering signal of reflection at the receiving terminal can offset, for useful signal, interfering signal obtains reducing, anti-interference communication effect obtains promoting.
Example 1
The first embodiment of the invention is specifically described as follows, the system simulation adopts python language, and the parameter setting does not affect the generality. Assume that the relative positions of the nodes are as shown in fig. 3. Base station position WS=[10,10]Mobile user initial WD[0]=[1000,1000]The simulation results are shown in fig. 4.
Fig. 4 shows a schematic diagram of the anti-interference communication performance of each optimization scheme under different interference powers, and it can be seen that the anti-interference performance of the system can be significantly improved in a dynamic environment compared to a single optimization method through the proposed joint trajectory and beam forming optimization. And when the interference power is higher, the obvious anti-interference communication effect can be kept, and the steady anti-interference communication under the strong-power full-frequency-band interference condition is realized.
Example 2
In the second embodiment of the invention, the system simulation adopts python software, and the parameter setting does not affect the generality. Aiming at different interference positions, the provided combined optimization method can keep higher anti-interference communication rate under the condition of unknown interference machine positions, and the anti-interference efficiency is obviously improved compared with a single optimization method. The suitability of the algorithm in unknown environments is demonstrated.
In addition, the performance of the algorithm is simulated and analyzed, and the algorithm is compared with different algorithms, and a comparison graph of the achievable communication rates is shown in fig. 5. The joint optimization algorithm proposed herein is significantly superior to the general random algorithm for different interference locations.
In summary, the patent provides a joint path and beam forming optimization scheme, which can optimize the anti-interference communication performance under the condition of a full-band high-power jammer.

Claims (9)

1. An unmanned aerial vehicle anti-interference communication system is characterized by comprising an anti-interference communication network under the condition of full-band strong power interference, wherein a base station for carrying out communication coverage service on a mobile user is arranged in the network; the power of omnidirectional interference machine and the dual suppression characteristic of frequency channel carry on intelligent reflection surface reinforcing useful signal/offset interference signal through deploying unmanned aerial vehicle and improve the anti-interference communication performance of system, through the unmanned aerial vehicle motion trail with intelligent reflection surface passive beam forming's joint optimization, promote the anti-interference communication ability of the unknown environment of developments.
2. The unmanned aerial vehicle anti-jamming communication system of claim 1, wherein within a topological state, the signal received by mobile user D is defined as:
Figure FDA0003229862770000011
wherein, PSFor base station transmitting power, PJIs the jammer interference power, HS、HJ、hSDAnd hJDThe gain of a direct link channel between a base station S and a mobile user D, the gain of a direct link channel between an interference machine J and the mobile user D, the combined channel gain of a signal of the base station S which is reflected by an intelligent reflection surface U and reaches the mobile user D, and the combined channel gain of a signal of the interference machine J which is reflected by the intelligent reflection surface U and reaches the mobile user D, xSFor transmitting signals to the base station, xJFor jammers to transmit signals, z is white gaussian noise received by mobile user D, with a obedient mean of 0 and a variance of δ2(ii) a gaussian distribution of;
the signal-to-interference-and-noise ratio of the mobile user D in the topological state is specifically represented as:
Figure FDA0003229862770000012
the achievable communication rate received by mobile user D is expressed as:
Figure FDA0003229862770000013
the achievable communication rate at the mobile user side is affected by the combination of the position of the intelligent reflective surface and the passive beamforming.
3. The anti-jamming communication system for unmanned aerial vehicles according to claim 1, wherein the channel of the intelligent reflective surface is strongly coupled to the beam forming direction, so that the user reception channel after reflection by the intelligent reflective surface is specifically represented as:
Figure FDA0003229862770000014
wherein G ist,GrAnd G is the antenna gain of the transmitting end, the receiving end and the reflection unit of the intelligent reflection surface, M and K are the number of rows and columns of the reflection unit contained in the intelligent reflection surface, and dxAnd dyIs the row and column width of the reflective element, d1And d2Is the distance, θ, between the intelligent reflective surface and the transmitting and receiving endstAnd thetarIs the angle between the transmitting end signal and the receiving end signal and the x axis,
Figure FDA0003229862770000021
and
Figure FDA0003229862770000022
for the transmission and reception of end signalsAngle, a is the adjustment of the intelligent reflective surface to the amplitude of the signal, λ is the wavelength of the signal,
Figure FDA0003229862770000023
is the normalized power radiation function of the reflecting element,
Figure FDA0003229862770000024
is a beamforming direction offset function, which reflects the signal amplitude attenuation caused by the beamforming direction offset, and is specifically expressed as:
Figure FDA0003229862770000025
wherein,
Figure FDA0003229862770000026
φk,mis the phase variation value corresponding to the first row and column of the reflective unit, when the beam forming is completely aligned with the receiving end, the phase variation value of the reflective unit can be expressed as
Figure FDA0003229862770000027
And according to the coupling relation between the phase of the intelligent reflection surface and the beam forming direction, the algorithm complexity of phase design is reduced through the beam forming direction.
4. The joint trajectory and beam forming optimization method of the unmanned aerial vehicle anti-interference communication system based on claim 1, wherein in the communication process of the sender and the intelligent reflection surface, state sensing is firstly performed, then an optimization algorithm based on deep reinforcement learning is executed, optimization of position deployment and beam forming is performed according to probability, and the probability is adjusted according to a feedback value after communication.
5. The joint trajectory and beam forming optimization method of claim 4, characterized in that the optimization algorithm comprises the steps of:
step 1, modeling the anti-interference communication problem into a Markov decision process by designing a state, an action, a return value and a transition probability function
And 2, initializing a system scene and a neural network, and updating parameters of the neural network according to state collection, action decision and feedback.
And 3, recording the path after the algorithm is executed and the beam forming scheme.
6. The joint trajectory and beam forming optimization method according to claim 5, characterized in that in step 1, through the design of the passive beam forming model, the motion state space of the phase change optimization algorithm of the large-scale intelligent reflection surface is reduced, which is beneficial to the enhancement of the reinforcement learning convergence speed in the path planning process, and further improves the reaction speed of the algorithm and the adaptability under the anti-interference scene.
7. The joint trajectory and beam forming optimization method of claim 5, wherein in step 1, the passive beam forming design that the intelligent reflection surface can perform after sensing the channel state is divided into two cases: enhancing the useful signal or reducing the interference signal;
the useful signal is enhanced, namely the phase between the signal which reaches the receiving end of the user after the transmitting end is reflected by the intelligent reflecting surface and the direct signal of the transmitting end and the receiving end is the same, and the strength of the useful signal after the receiving end superposes the signal is far greater than that of the interference signal, so that the aim of resisting interference is fulfilled;
reduce interfering signal, no matter how big power interfering signal interferes with, intelligence reflecting surface carries out refocusing at the receiving terminal after the signal that sends with the interference reflects, but the signal is through reverse phase processing, and interfering signal and the interfering signal of reflection of penetrating directly at the receiving terminal can offset, for useful signal, interfering signal obtains reducing, and anti-interference communication effect obtains promoting.
8. The joint trajectory and beamforming optimization method according to claim 5, wherein in step 2, the transmitting end obtains the position and channel state information of the mobile user through channel sensing and the state information communication link of the unmanned aerial vehicle, and directly corrects the beamforming direction from the transmitting end to the intelligent reflective surface end under the condition of a given beamforming direction; because the information of the jammer is unknown, the information of the jammer is explored in a deep reinforcement learning interaction mode, and an optimal intelligent reflector passive beam forming scheme and an unmanned aerial vehicle flight track are searched by utilizing the real-time control of the beam forming direction.
9. The joint trajectory and beam forming optimization method of claim 8, wherein the deep reinforcement learning algorithm comprises the steps of:
initializing, setting the iteration number to be i-1, setting a state-action-report storage space, initializing parameters of a neural network, and exploring-utilizing a probability epsilon;
step 11, the intelligent reflection surface-unmanned aerial vehicle interacts with the environment to obtain the state s at the moment ii
And step 12, selecting the motion of the motion trail and the passive beam forming phase design by the intelligent reflecting surface-unmanned aerial vehicle according to the probability. Exploration-utilization strategy through epsilon-greeny, which is specifically expressed as:
Figure FDA0003229862770000031
wherein, aiFor the selection of the action of the intelligent reflecting surface-unmanned plane at time i, siFor the Intelligent reflective surface-the State of the drone at time i, Q(s)i,ai) For intelligent reflective surfaces-unmanned aerial vehicle at siIs made in a state ofiThe return value after the action is according to formula (7) at siUnder the state of (1), the intelligent reflecting surface has the action of selecting the probability of epsilon to ensure that the return value is maximum, and has the action of randomly selecting the probability of 1-epsilon; intelligent reflective surface by gradually increasing the value of epsilon in an iterative process-unmannedThe machine seeks balance between exploration and utilization, and an optimal scheme is obtained quickly under the condition of exploring all actions as much as possible;
step 13, perform action aiThe next time state s is reachedi+1And obtaining the return value, and sending the state-action-return value-next-time state [ s ]i,ai,ri,si+1]Storing in a sliding memory window;
step 14, let i equal i +1 and si=si+1Repeating the steps 11-14 until the sliding memory window is filled with historical experience data;
step 15, after the sliding storage window is filled with the historical empirical data, calculating the multistep empirical data, wherein the multistep return value is specifically represented as:
Figure FDA0003229862770000041
when the gamma is 1, the influence of the unreturned value is fully considered, but the excessive oscillation of the reported value is caused, so that the convergence effect is influenced; when gamma is 0, only the influence of the reported value at the next moment is considered, the algorithm decision speed is high, but the grasp on the future trend is lost;
after obtaining the multi-step reward value, the multi-step experience information si,ai:i+N,si+N,ri:i+N]Stored in an experience pool for further updating of the neural network. At the same time, new s is continuously acquiredi,ai,ri,si+1]When the information is stored in the sliding memory window, the old information is discarded, and new multi-step experience information s is obtained continuously through the sliding memory windowi,ai:i+N,si+N,ri:i+N]Continuously storing the data into an experience pool;
step 16, when the multi-step experience information in the experience pool reaches a certain amount, updating the neural network, otherwise, repeating the steps 11-15;
and step 17, updating the neural network parameters. And the value of epsilon is continuously increased during the iteration process. So that the intelligent reflective surface-unmanned aerial vehicle makes action selection with a large return value in a trending manner.
CN202110983755.8A 2021-08-25 2021-08-25 Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method Pending CN113708886A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110983755.8A CN113708886A (en) 2021-08-25 2021-08-25 Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110983755.8A CN113708886A (en) 2021-08-25 2021-08-25 Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method

Publications (1)

Publication Number Publication Date
CN113708886A true CN113708886A (en) 2021-11-26

Family

ID=78654797

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110983755.8A Pending CN113708886A (en) 2021-08-25 2021-08-25 Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method

Country Status (1)

Country Link
CN (1) CN113708886A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114025330A (en) * 2022-01-07 2022-02-08 北京航空航天大学 Air-ground cooperative self-organizing network data transmission method
CN114257298A (en) * 2022-01-17 2022-03-29 电子科技大学 Intelligent reflecting surface phase shift and unmanned aerial vehicle path planning method
CN114585005A (en) * 2022-03-04 2022-06-03 大连理工大学 Intelligent reflector assisted wireless enabling safety communication method
CN115348577A (en) * 2022-08-10 2022-11-15 福州大学 Beam scanning method based on reinforcement learning in covert communication system
CN116545489A (en) * 2023-07-04 2023-08-04 南京邮电大学 Vibration robustness improving method and system of unmanned aerial vehicle, storage medium and computing equipment
CN116614143A (en) * 2023-02-09 2023-08-18 四川九强通信科技有限公司 Unmanned aerial vehicle communication link anti-interference system and method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100178874A1 (en) * 2009-01-13 2010-07-15 Hsien-Chyi Chiou Method for performing active jammer suppression on electronic device, and associated apparatus
US20190097712A1 (en) * 2016-04-18 2019-03-28 Intel Corporation Selection of beamforming directions based on learned performance
CN111917508A (en) * 2020-08-10 2020-11-10 中国人民解放军陆军工程大学 Anti-interference communication model based on multiple antennas and dynamic spatial spectrum anti-interference method
CN112954690A (en) * 2021-01-22 2021-06-11 西北工业大学 Anti-interference method and system based on space-based reconfigurable intelligent surface
WO2021120425A1 (en) * 2019-12-17 2021-06-24 北京航空航天大学 Millimeter wave full-duplex unmanned aerial vehicle communication relay transmission method
CN113162679A (en) * 2021-04-01 2021-07-23 南京邮电大学 DDPG algorithm-based IRS (inter-Range instrumentation System) auxiliary unmanned aerial vehicle communication joint optimization method
CN113194488A (en) * 2021-03-31 2021-07-30 西安交通大学 Unmanned aerial vehicle track and intelligent reflecting surface phase shift joint optimization method and system
CN113225119A (en) * 2021-05-11 2021-08-06 中国人民解放军国防科技大学 Wave beam domain anti-interference and anti-interception communication method based on information super-surface
CN113259836A (en) * 2021-04-26 2021-08-13 北京科技大学 IRS-assisted unmanned aerial vehicle communication network joint optimization method

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100178874A1 (en) * 2009-01-13 2010-07-15 Hsien-Chyi Chiou Method for performing active jammer suppression on electronic device, and associated apparatus
US20190097712A1 (en) * 2016-04-18 2019-03-28 Intel Corporation Selection of beamforming directions based on learned performance
WO2021120425A1 (en) * 2019-12-17 2021-06-24 北京航空航天大学 Millimeter wave full-duplex unmanned aerial vehicle communication relay transmission method
CN111917508A (en) * 2020-08-10 2020-11-10 中国人民解放军陆军工程大学 Anti-interference communication model based on multiple antennas and dynamic spatial spectrum anti-interference method
CN112954690A (en) * 2021-01-22 2021-06-11 西北工业大学 Anti-interference method and system based on space-based reconfigurable intelligent surface
CN113194488A (en) * 2021-03-31 2021-07-30 西安交通大学 Unmanned aerial vehicle track and intelligent reflecting surface phase shift joint optimization method and system
CN113162679A (en) * 2021-04-01 2021-07-23 南京邮电大学 DDPG algorithm-based IRS (inter-Range instrumentation System) auxiliary unmanned aerial vehicle communication joint optimization method
CN113259836A (en) * 2021-04-26 2021-08-13 北京科技大学 IRS-assisted unmanned aerial vehicle communication network joint optimization method
CN113225119A (en) * 2021-05-11 2021-08-06 中国人民解放军国防科技大学 Wave beam domain anti-interference and anti-interception communication method based on information super-surface

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
JINGYI LI ET AL.: "Sum Rate Maximization via Reconfigurable Intelligent Surface in UAV Communication:Phase Shift and Trajectory Optimization", 《2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC)》, pages 131 *
WANKAI TANG ET AL.: "Wireless Communications With Reconfigurable Intelligent Surface: Path Loss Modeling and Experimental Measurement Wankai", 《IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS》, pages 2 *
XIAO LIU ET AL.: "Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks", 《IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS》 *
姚建文 等: "智能反射面――大有前景的6G技术", 《电信快报》 *
程吟轩 等: "基于空中智能表面的毫米波通信性能分析", 《计算机与现代化》 *
罗屹洁 等: "依概率主动窃听下D2D通信的物理层安全研究", 《信号处理》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114025330A (en) * 2022-01-07 2022-02-08 北京航空航天大学 Air-ground cooperative self-organizing network data transmission method
CN114025330B (en) * 2022-01-07 2022-03-25 北京航空航天大学 Air-ground cooperative self-organizing network data transmission method
CN114257298A (en) * 2022-01-17 2022-03-29 电子科技大学 Intelligent reflecting surface phase shift and unmanned aerial vehicle path planning method
CN114257298B (en) * 2022-01-17 2022-09-27 电子科技大学 Intelligent reflecting surface phase shift and unmanned aerial vehicle path planning method
CN114585005A (en) * 2022-03-04 2022-06-03 大连理工大学 Intelligent reflector assisted wireless enabling safety communication method
CN114585005B (en) * 2022-03-04 2023-07-25 大连理工大学 Intelligent reflector-assisted wireless energized secure communication method
CN115348577A (en) * 2022-08-10 2022-11-15 福州大学 Beam scanning method based on reinforcement learning in covert communication system
CN115348577B (en) * 2022-08-10 2024-04-30 福州大学 Beam scanning method based on reinforcement learning in hidden communication system
CN116614143A (en) * 2023-02-09 2023-08-18 四川九强通信科技有限公司 Unmanned aerial vehicle communication link anti-interference system and method
CN116614143B (en) * 2023-02-09 2024-01-26 四川九强通信科技有限公司 Unmanned aerial vehicle communication link anti-interference system and method
CN116545489A (en) * 2023-07-04 2023-08-04 南京邮电大学 Vibration robustness improving method and system of unmanned aerial vehicle, storage medium and computing equipment
CN116545489B (en) * 2023-07-04 2023-12-12 南京邮电大学 Vibration robustness improving method and system of unmanned aerial vehicle, storage medium and computing equipment

Similar Documents

Publication Publication Date Title
CN113708886A (en) Unmanned aerial vehicle anti-interference communication system and joint track and beam forming optimization method
US8254988B2 (en) Method of controlling wireless communication system and wireless communication system
CN114422056B (en) Space-to-ground non-orthogonal multiple access uplink transmission method based on intelligent reflecting surface
CN102394680B (en) Beam searching method used in multibeam switching antenna system
CN111917508A (en) Anti-interference communication model based on multiple antennas and dynamic spatial spectrum anti-interference method
CN114422363A (en) Unmanned aerial vehicle loaded RIS auxiliary communication system capacity optimization method and device
CN113727405B (en) Method for improving safety rate of wireless communication system based on intelligent reflecting surface
CN112738764A (en) Broadband millimeter wave beam tracking method based on vehicle motion track cognition
CN114666803B (en) Deployment and control method and system of mobile edge computing system
Rahmati et al. Probabilistic spatially-divided multiple access in underwater acoustic sparse networks
Jiang et al. Reconfigurable intelligent surface assisted mmWave UAV wireless cellular networks
Hou et al. Joint trajectory and passive beamforming optimization in IRS-UAV enhanced anti-jamming communication networks
CN110708129A (en) Wireless channel state information acquisition method
CN115412159B (en) Safety communication method based on air intelligent reflecting surface assistance
US11303348B1 (en) Systems and methods for enhancing communication network performance using vector based deep learning
Ghaseminajm et al. RIS-aided mobile localization error bounds under hardware impairments
CN116669073A (en) Resource allocation and track optimization method based on intelligent reflecting surface auxiliary unmanned aerial vehicle cognitive network
CN115348577B (en) Beam scanning method based on reinforcement learning in hidden communication system
CN115334524B (en) Communication and radar target detection method based on omnidirectional intelligent super surface
CN116390114A (en) Internet of vehicles perception auxiliary communication pre-beam alignment method based on non-line-of-sight link identification
CN114866377A (en) Pilot frequency reconstruction-based reflection channel estimation method in industrial Internet of things RIS auxiliary communication
CN112383336B (en) Beam tracking method for long-distance high-speed flying object
Tapio et al. Near-field beamforming at intelligent re-configurable surface in uplink transmission
CN118300645A (en) Intelligent reflector auxiliary communication anti-interference method based on receiving feedback
CN117177253B (en) Intelligent super-surface tracking and coverage enhancing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination