JP7505569B2

JP7505569B2 - Communication information prediction device, communication information prediction method, and communication information prediction program

Info

Publication number: JP7505569B2
Application number: JP2022556754A
Authority: JP
Inventors: 理一工藤; 馨子高橋; 友規村上; 匡史岩渕; 陸大宮; 智明小川
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2020-10-14
Filing date: 2020-10-14
Publication date: 2024-06-25
Anticipated expiration: 2040-10-14
Also published as: JPWO2022079833A1; WO2022079833A1

Description

本発明は、環境情報を用いて予測モデルを生成し、無線通信システムの未来の通信情報を予測する技術に関する。 The present invention relates to a technology that uses environmental information to generate a predictive model and predict future communication information of a wireless communication system.

近年、様々な装置がインターネットにつながるＩｏＴ（Internet of Things）の実現が進んでおり、自動車、ドローン、建設機械車両など様々な機器が無線により接続されつつある。また、ＩＥＥＥ８０２．１１規格の無線ＬＡＮ（Local Area Network）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、ＬＴＥ（Long Term Evolution）や５Ｇによるセルラー通信、ＩｏＴ向けのＬＰＷＡ（Low Power Wide Area）通信、車通信に用いられるＥＴＣ（Electronic Toll Collection System）、ＶＩＣＳ（Vehicle Information and Communication System（登録商標））、ＡＲＩＢ－ＳＴＤ－Ｔ１０９など、多種多様な無線通信規格が展開され、様々な無線通信サービスが広く利用されている。In recent years, the realization of the Internet of Things (IoT), in which various devices are connected to the Internet, is progressing, and various devices such as automobiles, drones, and construction machinery vehicles are being connected wirelessly. In addition, a wide variety of wireless communication standards have been developed, such as IEEE 802.11 wireless LAN (Local Area Network), Bluetooth (registered trademark), LTE (Long Term Evolution) and 5G cellular communications, IoT-oriented LPWA (Low Power Wide Area) communications, ETC (Electronic Toll Collection System) used for vehicle communications, VICS (Vehicle Information and Communication System (registered trademark)), and ARIB-STD-T109, and various wireless communication services are widely used.

しかしながら、無線通信サービスによっては、通信品質の要求条件を必ずしも満たすことができない、という問題がある。特に端末装置や周辺のオブジェクトが動く場合、アンテナの指向性や伝搬環境が変動するため、通信品質が大きな影響を受ける。例えば非特許文献１では、移動するロボットと基地局装置との間の距離情報を用いて無線通信の予測モデルを生成し、通信品質を予測する技術が検討されている。However, there is a problem that some wireless communication services may not necessarily satisfy the required conditions for communication quality. In particular, when the terminal device or surrounding objects move, the antenna directivity and propagation environment fluctuate, which significantly affects communication quality. For example, Non-Patent Document 1 considers a technology that uses distance information between a moving robot and a base station device to generate a wireless communication prediction model and predict communication quality.

C.J.Lowrance, A.P.Lauf,“An active and incremental learning framework for the online prediction of link quality in robot networks”,Engineering Applications of Artificial Intelligence,77,pp.197-211,2019.C. J. Lowrance, A. P. Lauf, “An active and incremental learning framework for the online prediction of link quality in robot networks,” Engineering Applications of Artificial Intelligence, 77, pp. 197-211, 2019.

無線通信システムでは、無線通信機能が搭載された端末装置の位置、姿勢および動きなどの端末装置の状態、端末装置周辺の静的または動的な物体、などにより、通信相手との間の電波伝搬環境が変化して通信品質が影響を受け、無線通信により実現されるサービスやシステムに大きな影響を及ぼすことがある。例えば、高い周波数を用いる場合は電波の直進性が強く、通信品質は電波伝搬環境の変化による影響を受けやすい。このため、端末装置の安定した通信や通信品質をより高度に管理するには、端末装置自身と端末装置の周辺環境による通信品質の変動に対する対策が必要である。 In wireless communication systems, the radio wave propagation environment between communication partners can change depending on the state of the terminal device equipped with wireless communication capabilities, such as the position, attitude, and movement of the terminal device, as well as static or dynamic objects around the terminal device, and this can affect communication quality and have a significant impact on services and systems realized through wireless communication. For example, when using high frequencies, radio waves tend to travel in a very straight line, and communication quality is easily affected by changes in the radio wave propagation environment. For this reason, in order to ensure stable communication and more advanced management of communication quality for terminal devices, measures are needed to deal with fluctuations in communication quality caused by the terminal device itself and its surrounding environment.

この対策として、機械学習を用いた通信品質と環境情報の関係性をモデル化し、未来の通信品質を予測する技術は、良好な通信品質を維持する上で有用な手段の一つである。 As a countermeasure to this issue, technology that uses machine learning to model the relationship between communication quality and environmental information and predict future communication quality is one useful means of maintaining good communication quality.

ところが、通信に関する情報は時間的な相関が低く、例えば１秒後の通信環境が全く異なる場合が少なくないため、通信品質や、通信品質を高めるための制御方法を高精度に予測することが難しいという課題がある。以下、通信品質や通信品質を高めるための通信や端末装置の制御情報を通信情報と定義する。However, communication-related information has low temporal correlation, and it is not rare for the communication environment to be completely different one second later, for example, which makes it difficult to accurately predict communication quality or control methods for improving communication quality. Hereinafter, communication information is defined as communication quality and control information for communications and terminal devices for improving communication quality.

上記課題に鑑み、本発明は、ターゲット情報となる未来の通信情報の時間に対応する補助情報を生成し、補助情報または補助情報と入力情報からターゲットとなる未来の時間の通信情報を出力する機械学習モデルを訓練して生成することにより、未来の通信情報の予測を高精度に行うことができる通信情報予測装置、通信情報予測方法、および通信情報予測プログラムを提供することを目的とする。In view of the above problems, the present invention aims to provide a communication information prediction device, a communication information prediction method, and a communication information prediction program that are capable of predicting future communication information with high accuracy by generating auxiliary information corresponding to the time of future communication information that is target information, and training and generating a machine learning model that outputs communication information at the target future time from the auxiliary information or the auxiliary information and input information.

本発明は、移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測する通信情報予測装置において、前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成部と、前記端末装置の無線通信に係る通信情報を生成する通信部と、前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成部と、前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用部とを有することを特徴とする。 The present invention is characterized in that the communication information prediction device generates a communication environment model related to wireless communication of a moving terminal device and predicts communication information of the terminal device, the device having an environmental information generation unit that generates environmental information related to the device environment of at least one of the terminal device and a wireless communication destination device of the terminal device, a communication unit that generates communication information related to the wireless communication of the terminal device, a communication environment model generation unit that uses the environmental information, or the environmental information and the communication information, as input information, to generate auxiliary information corresponding to the time of target information, and generates a communication environment model that outputs the communication information at a future time related to the wireless communication of the terminal device as target information based on the auxiliary information or the auxiliary information and the input information, and a model utilization unit that predicts communication information of the terminal device using the communication environment model.

また、本発明は、移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測する通信情報予測方法であって、前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成処理と、前記端末装置の無線通信に係る通信情報を生成する通信処理と、前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成処理と、前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用処理とを行うことを特徴とする。The present invention also provides a communication information prediction method for generating a communication environment model related to wireless communication of a moving terminal device and predicting communication information of the terminal device, comprising the steps of: an environmental information generation process for generating environmental information related to the device environment of at least one of the terminal device and a wireless communication destination device of the terminal device; a communication process for generating communication information related to the wireless communication of the terminal device; a communication environment model generation process for using the environmental information, or the environmental information and the communication information, as input information, to generate auxiliary information corresponding to the time of target information, and generating a communication environment model that outputs the communication information at a future time related to the wireless communication of the terminal device as target information based on the auxiliary information or the auxiliary information and the input information; and a model utilization process for predicting communication information of the terminal device using the communication environment model.

また、本発明は、移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測するコンピュータで実行可能な通信情報予測プログラムであって、前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成処理と、前記端末装置の無線通信に係る通信情報を生成する通信処理と、前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成処理と、前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用処理とをコンピュータで実行することを特徴とする。The present invention also provides a communication information prediction program executable on a computer that generates a communication environment model related to wireless communication of a moving terminal device and predicts communication information of the terminal device, characterized in that the program executes on a computer an environmental information generation process that generates environmental information related to the device environment of at least one of the terminal device and a wireless communication destination device of the terminal device, a communication process that generates communication information related to the wireless communication of the terminal device, a communication environment model generation process that uses the environmental information, or the environmental information and the communication information, as input information, to generate auxiliary information corresponding to the time of target information, and generates a communication environment model that outputs the communication information at a future time related to the wireless communication of the terminal device as target information based on the auxiliary information or the auxiliary information and the input information, and a model utilization process that predicts communication information of the terminal device using the communication environment model.

本発明に係る通信情報予測装置、通信情報予測方法、および通信情報予測プログラムは、時間的に相関の高いパラメータを入力情報として補助情報を生成し、補助情報からターゲットとなる未来の時間の通信情報を出力する機械学習モデルを訓練して生成することにより、通信情報の予測を高精度に行うことができる。 The communication information prediction device, communication information prediction method, and communication information prediction program of the present invention generate auxiliary information using parameters that are highly temporally correlated as input information, and train and generate a machine learning model that outputs communication information for a target future time from the auxiliary information, thereby making it possible to predict communication information with high accuracy.

無線通信システムの構成例を示す図である。FIG. 1 is a diagram illustrating an example of the configuration of a wireless communication system. 通信情報に関するパラメータと環境情報に係るパラメータの関係を示す図である。FIG. 11 is a diagram showing the relationship between parameters relating to communication information and parameters relating to environmental information. 通信環境モデル生成部を訓練する場合の構成例を示す図である。FIG. 13 is a diagram illustrating an example of a configuration when training a communication environment model generating unit. 訓練時の通信環境モデルの一例を示す図である。FIG. 13 is a diagram illustrating an example of a communication environment model during training. 利用時の通信環境モデルの一例を示す図である。FIG. 13 is a diagram illustrating an example of a communication environment model during use. 自律移動ロボットの一例を示す図である。FIG. 1 is a diagram illustrating an example of an autonomous mobile robot. 自律移動ロボットが認識する屋内マップとゴールの位置を示す図である。FIG. 2 is a diagram showing an indoor map recognized by an autonomous mobile robot and the position of a goal. 比較例の通信環境モデルの一例を示す図である。FIG. 1 is a diagram illustrating an example of a communication environment model of a comparative example. 第１の構成の通信環境モデルの一例を示す図である。FIG. 2 is a diagram illustrating an example of a communication environment model of a first configuration. 第２の構成の通信環境モデルの一例を示す図である。FIG. 13 is a diagram illustrating an example of a communication environment model of a second configuration. 実証実験における性能評価の結果を示す図である。FIG. 13 is a diagram showing the results of performance evaluation in a demonstration experiment.

以下、図面を参照して本発明に係る通信情報予測装置、通信情報予測方法、および通信情報予測プログラムの実施形態について説明する。なお、以下に説明する実施形態は本発明の一例であり、本発明は以下の実施形態に制限されるものではない。Hereinafter, an embodiment of a communication information prediction device, a communication information prediction method, and a communication information prediction program according to the present invention will be described with reference to the drawings. Note that the embodiment described below is an example of the present invention, and the present invention is not limited to the following embodiment.

図１は、無線通信システム１００の構成例を示す。無線通信システム１００は、基地局装置１０１、基地局装置１０１と無線通信を行うＭ台（Ｍは正の整数）の端末装置１０２（１）から１０２（Ｍ）で構成されている。ここで、Ｍ台の端末装置１０２（１）から１０２（Ｍ）に共通する説明を行う場合は、符号末尾の（番号）を省略して端末装置１０２と記載する。なお、端末装置１０２を構成する複数の同じブロックに対しても同様に記載する。 Figure 1 shows an example configuration of a wireless communication system 100. The wireless communication system 100 is composed of a base station device 101 and M terminal devices 102(1) to 102(M) (M is a positive integer) that perform wireless communication with the base station device 101. Here, when providing an explanation common to the M terminal devices 102(1) to 102(M), the (number) at the end of the reference number will be omitted and the device will be referred to as terminal device 102. The same description will be used for multiple identical blocks that make up terminal device 102.

図１の例では、基地局装置１０１およびＭ台の端末装置１０２の全てが本発明に係る通信情報予測装置としての機能を有し、通信情報の予測を行うための通信環境モデルを生成し、通信情報予測を行うことができる。In the example of Figure 1, the base station device 101 and all of the M terminal devices 102 have the function of a communication information prediction device according to the present invention, and can generate a communication environment model for predicting communication information and perform communication information prediction.

なお、本実施形態では、基地局装置１０１を介してネットワークに接続する構成として説明するが、基地局装置１０１とは別にネットワークに接続可能な通信装置を有していてもよい。そして、端末装置１０２は、基地局装置１０１またはネットワークに接続可能な通信装置を介して、ネットワークに接続されたカメラやセンサを搭載する機器（基地局装置１０１を含むアクセスポイントや周辺路上の監視カメラなど）からカメラ情報やセンサ情報などの非通信情報を環境情報として収集することができる。また、端末装置１０２は、他の端末装置１０２と直接、通信を行って端末装置１０２に搭載されたカメラ／センサからカメラ情報やセンサ情報を収集してもよい。例えば端末装置１０２が自動車に搭載された通信装置の場合、車車間通信（車対車間の通信）により、カメラ情報やセンサ情報（位置情報などを含む）が収集される。このようにして、ターゲットとする基地局装置１０１と端末装置１０２との間の無線通信のモデルを機械学習により生成するための環境情報が収集される。そして、形成された通信環境モデルを用いて基地局装置１０１と端末装置１０２との間の通信情報の予測が行われる。In this embodiment, the configuration is described as being connected to the network via the base station device 101, but the terminal device 102 may have a communication device connectable to the network separately from the base station device 101. The terminal device 102 can collect non-communication information such as camera information and sensor information from devices equipped with cameras and sensors connected to the network (such as access points including the base station device 101 and surveillance cameras on surrounding roads) as environmental information via the base station device 101 or a communication device connectable to the network. The terminal device 102 may also directly communicate with other terminal devices 102 to collect camera information and sensor information from cameras/sensors mounted on the terminal device 102. For example, when the terminal device 102 is a communication device mounted on an automobile, camera information and sensor information (including location information, etc.) are collected by vehicle-to-vehicle communication (vehicle-to-vehicle communication). In this way, environmental information for generating a model of wireless communication between the target base station device 101 and the terminal device 102 by machine learning is collected. Then, the communication environment model formed is used to predict communication information between the base station device 101 and the terminal device 102.

特に、以降で説明する実施形態では、時間的な相関の高いパラメータを入力情報として一旦、補助情報を生成して機械学習を行うことにより、精度の高い通信情報の予測を行うことができる。ここで、補助情報は、未来の通信情報（例えば受信電力など）に対応する環境情報のパラメータであり、ターゲット情報の予測に利用可能な非通信情報である。In particular, in the embodiment described below, parameters with high temporal correlation are used as input information to generate auxiliary information, which is then used for machine learning, allowing for highly accurate prediction of communication information. Here, the auxiliary information is a parameter of environmental information corresponding to future communication information (e.g., received power, etc.), and is non-communication information that can be used to predict target information.

（基地局装置１０１の構成例）
図１において、基地局装置１０１は、ネットワークに接続する機能を有し、Ｍ台の端末装置１０２との間で無線通信を行う。基地局装置１０１は、ＮＷ部２０１、通信部２０２（１）から通信部２０２（Ｎ）、通信環境モデル生成部２０３、環境情報生成部２０４およびモデル利用部２０５を有する。 (Example of the configuration of the base station device 101)
1 , a base station device 101 has a function of connecting to a network, and performs wireless communication with M terminal devices 102. The base station device 101 has a NW unit 201, communication units 202(1) to 202(N), a communication environment model generation unit 203, an environment information generation unit 204, and a model utilization unit 205.

ＮＷ部２０１は、外部のネットワークに接続するためのインターフェースである。基地局装置１０１は、ＮＷ部２０１を介して、外部のネットワークに接続されるカメラやセンサなどの装置からカメラ情報（撮影画像など）やセンサ情報（三次元レーザなど）を取得することができる。これらの情報は、基地局装置１０１に接続される端末装置１０２も取得可能である。The NW unit 201 is an interface for connecting to an external network. The base station device 101 can acquire camera information (such as captured images) and sensor information (such as three-dimensional lasers) from devices such as cameras and sensors connected to the external network via the NW unit 201. This information can also be acquired by the terminal device 102 connected to the base station device 101.

通信部２０２は、通信部２０２（１）から通信部２０２（Ｎ）までのＮ個（Ｎは正の整数）を有し、複数の端末装置１０２との間で無線通信を行うことができる（通信処理）。また、通信部２０２は、無線通信に関係する通信情報を生成し、通信環境モデル生成部２０３およびモデル利用部２０５に出力する。通信情報は、例えば、受信信号電力、信号対雑音電力比（ＳＮＲ）、信号対干渉雑音電力比（ＳＩＮＲ）、ＲＳＳＩ（Received Signal Strength Indication）、ＲＳＲＱ（Received Signal Reference Quality）、パケット誤り率、到達ビット数、単位時間当たりの到達ビット数、ＭＣＳ（Modulation and Coding Scheme index）、再送回数、遅延時間、誤り訂正方式、通信システムの周波数、利用するリソースの帯域幅などの周波数条件、などである。さらに、これらの値の微分情報、これらの値から計算式を用いて算出される指標、これらの指標に影響を与えるシステムの設定項目などであってもよい。The communication unit 202 has N units (N is a positive integer) from communication unit 202(1) to communication unit 202(N), and can perform wireless communication with multiple terminal devices 102 (communication processing). In addition, the communication unit 202 generates communication information related to wireless communication and outputs it to the communication environment model generation unit 203 and the model utilization unit 205. The communication information is, for example, received signal power, signal-to-noise power ratio (SNR), signal-to-interference-plus-noise power ratio (SINR), received signal strength indication (RSSI), received signal reference quality (RSRQ), packet error rate, number of arriving bits, number of arriving bits per unit time, modulation and coding scheme index (MCS), number of retransmissions, delay time, error correction method, frequency of the communication system, frequency conditions such as bandwidth of the resource to be used, etc. Furthermore, it may be differential information of these values, indexes calculated from these values using a formula, and system setting items that affect these indexes.

通信環境モデル生成部２０３は、環境情報生成部２０４が生成する環境情報と、通信部２０２（１）から通信部２０２（Ｎ）、通信部３０１（１）から通信部３０１（Ｍ）のうち少なくとも一つが生成する通信情報と、を用いて、通信情報を予測するための通信環境モデルを生成する（通信環境モデル生成処理）。なお、通信環境モデルの生成方法については後述する。The communication environment model generation unit 203 generates a communication environment model for predicting communication information using the environment information generated by the environment information generation unit 204 and communication information generated by at least one of the communication units 202(1) to 202(N) and the communication units 301(1) to 301(M) (communication environment model generation process). The method of generating the communication environment model will be described later.

環境情報生成部２０４は、装置環境に係る環境情報を生成する（環境情報生成処理）。環境情報は、端末装置１０２および基地局装置１０１の位置／姿勢／状態／動き／制御指令／端末装置１０２および基地局装置１０１またはネットワークを介して取得可能な機器のカメラ情報およびセンサ情報（物体の有無、サイズ、材質など様々な情報を含む）／構造物の動作／端末装置１０２およびネットワークの行動戦略、などの少なくとも一つの情報を含む。The environmental information generating unit 204 generates environmental information related to the device environment (environmental information generating process). The environmental information includes at least one of the following information: the position/attitude/state/movement/control command of the terminal device 102 and the base station device 101; camera information and sensor information (including various information such as the presence or absence, size, and material of an object) of the terminal device 102 and the base station device 101 or of equipment obtainable via the network; the operation of a structure; and the behavioral strategy of the terminal device 102 and the network.

ここで、制御指令は、例えば、端末装置１０２が自律移動ロボットである場合、ロボット正面方向へのタイヤ回転指令、ロボットの回転指令などに対応する。また、端末装置１０２およびネットワークの行動戦略は、例えば、端末装置１０２やロボットの予定移動経路、端末装置１０２の電力モード、端末装置１０２が利用するアンテナ数、端末装置１０２へのＱｏＳ設定、ネットワークの経路設定、ネットワークの混雑状況、ネットワークにおけるルーティングルール、アプリケーションの設定、などである。Here, if the terminal device 102 is an autonomous mobile robot, the control command corresponds to, for example, a command to rotate the tires in the direction toward the front of the robot, a command to rotate the robot, etc. Also, the behavioral strategy of the terminal device 102 and the network corresponds to, for example, the planned movement path of the terminal device 102 and the robot, the power mode of the terminal device 102, the number of antennas used by the terminal device 102, QoS settings for the terminal device 102, network route settings, network congestion status, routing rules in the network, application settings, etc.

さらに、環境情報は、端末装置１０２の所有者情報やタイプ（スマートフォンなどの携帯端末、自動車、ロボット、ドローンなど）の情報、入店者情報、自動車の移動履歴、など、通信環境モデルでターゲット情報を予測するために使えるのではないかと思われる全ての非通信情報を含む情報であってもよい。環境情報が多いほど、通信環境モデルの予測精度が向上する。 Furthermore, the environmental information may be information including all non-communication information that may be used to predict target information with the communication environment model, such as owner information and type information of the terminal device 102 (mobile terminal such as a smartphone, automobile, robot, drone, etc.), store visitor information, and automobile movement history. The more environmental information there is, the higher the prediction accuracy of the communication environment model will be.

モデル利用部２０５は、通信環境モデル生成部２０３により生成された通信環境モデルを用いて、ターゲット情報の出力、または報酬を最大化するような戦略情報の出力を行う（モデル利用処理）。例えば、端末装置１０２が自律移動ロボットの場合、受信電力を最大化することに報酬を設定し、報酬が高くなるようなＸ座標速度、Ｙ座標速度、回転指令などを戦略情報として出力することができる。なお、モデル利用部２０５の動作については後述する。The model utilization unit 205 uses the communication environment model generated by the communication environment model generation unit 203 to output target information or strategy information that maximizes the reward (model utilization process). For example, if the terminal device 102 is an autonomous mobile robot, the reward can be set to maximizing the received power, and X-coordinate speed, Y-coordinate speed, rotation commands, and the like that will increase the reward can be output as strategy information. The operation of the model utilization unit 205 will be described later.

このようにして、基地局装置１０１が通信情報予測装置として機能する場合、通信環境モデル生成部２０３は、環境情報生成部２０４から得られた環境情報と、通信部２０２から得られる通信情報と、を用いて、通信環境モデルを生成する。そして、モデル利用部２０５は、通信環境モデル生成部２０３が生成した通信環境モデルを用いて、通信情報を予測することができる。In this way, when the base station device 101 functions as a communication information prediction device, the communication environment model generation unit 203 generates a communication environment model using the environmental information obtained from the environmental information generation unit 204 and the communication information obtained from the communication unit 202. Then, the model utilization unit 205 can predict communication information using the communication environment model generated by the communication environment model generation unit 203.

（端末装置１０２の構成例）
端末装置１０２（１）から端末装置１０２（Ｍ）は、それぞれ基地局装置１０１との間で無線通信を行う。 (Example of the configuration of the terminal device 102)
Each of the terminal devices 102 ( 1 ) to 102 (M) performs wireless communication with the base station device 101 .

端末装置１０２は、通信部３０１、通信環境モデル生成部３０２、環境情報生成部３０３およびモデル利用部３０４を有する。 The terminal device 102 has a communication unit 301, a communication environment model generation unit 302, an environmental information generation unit 303 and a model utilization unit 304.

通信部３０１は、基地局装置１０１の通信部２０２、または他の端末装置１０２との間で無線通信を行う（通信処理）。また、通信部３０１は、基地局装置１０１の通信部２０２と同様に、無線通信に関係する通信情報を生成し、通信環境モデル生成部３０２およびモデル利用部３０４に出力する。The communication unit 301 performs wireless communication with the communication unit 202 of the base station device 101 or with other terminal devices 102 (communication processing). Similarly to the communication unit 202 of the base station device 101, the communication unit 301 generates communication information related to wireless communication and outputs it to the communication environment model generation unit 302 and the model utilization unit 304.

通信環境モデル生成部３０２は、基地局装置１０１の通信環境モデル生成部２０３と同様に、環境情報生成部３０３から得られた環境情報と、通信部３０１から得られる通信情報と、を用いて通信環境モデルを生成する（通信環境モデル生成処理）。The communication environment model generation unit 302, similar to the communication environment model generation unit 203 of the base station device 101, generates a communication environment model using the environmental information obtained from the environmental information generation unit 303 and the communication information obtained from the communication unit 301 (communication environment model generation process).

環境情報生成部３０３は、基地局装置１０１の環境情報生成部２０４と同様に、環境情報を生成する（環境情報生成処理）。なお、通信環境モデルの生成方法については後述する。The environmental information generating unit 303 generates environmental information (environmental information generation process) in the same manner as the environmental information generating unit 204 of the base station device 101. The method of generating the communication environment model will be described later.

モデル利用部３０４は、基地局装置１０１のモデル利用部２０５と同様に、通信環境モデル生成部３０２により生成された通信環境モデルを用いて、ターゲット情報の出力、または報酬を最大化するような戦略情報の出力を行う（モデル利用処理）。 The model utilization unit 304, similar to the model utilization unit 205 of the base station device 101, uses the communication environment model generated by the communication environment model generation unit 302 to output target information or strategy information that maximizes rewards (model utilization processing).

このようにして、端末装置１０２が通信情報予測装置として機能する場合、通信環境モデル生成部３０２は、環境情報生成部３０３から得られた環境情報と、通信部３０１から得られる通信情報と、を用いて、通信環境モデルを生成する。そして、モデル利用部３０４は、通信環境モデル生成部３０２が生成した通信環境モデルを用いて、通信情報を予測することができる。In this way, when the terminal device 102 functions as a communication information prediction device, the communication environment model generation unit 302 generates a communication environment model using the environmental information obtained from the environmental information generation unit 303 and the communication information obtained from the communication unit 301. Then, the model utilization unit 304 can predict communication information using the communication environment model generated by the communication environment model generation unit 302.

ここで、上述の説明では、基地局装置１０１と端末装置１０２とに分けて説明したが、基地局装置１０１と端末装置１０２との間で機能を分担してもよい。例えば、通信環境モデル生成部２０３で生成された通信環境モデルを各端末装置１０２のモデル利用部３０４が利用するようにしてもよい。Here, in the above explanation, the base station device 101 and the terminal device 102 are described separately, but the functions may be shared between the base station device 101 and the terminal device 102. For example, the communication environment model generated by the communication environment model generation unit 203 may be used by the model utilization unit 304 of each terminal device 102.

（通信環境モデル生成部）
次に、基地局装置１０１の通信環境モデル生成部２０３および端末装置１０２の通信環境モデル生成部３０２について詳しく説明する。なお、ここでは、基地局装置１０１の通信環境モデル生成部２０３として説明するが、端末装置１０２の通信環境モデル生成部３０２についても基本的な動作は同じである。 (Communication environment model generation unit)
Next, a detailed description will be given of the communication environment model generating unit 203 of the base station device 101 and the communication environment model generating unit 302 of the terminal device 102. Note that, although the description will be given here of the communication environment model generating unit 203 of the base station device 101, the basic operation of the communication environment model generating unit 302 of the terminal device 102 is the same.

基地局装置１０１の通信環境モデル生成部２０３は、複数の端末装置１０２との間の無線通信における通信情報および環境情報を収集して多くの訓練データを用いることができるので、端末装置１０２の通信環境モデル生成部３０２よりも望ましい。しかし、端末装置１０２ごとに最適化されたモデルとなるように転移学習やファインチューニングなどの技術で調整することが可能である。The communication environment model generation unit 203 of the base station device 101 is more preferable than the communication environment model generation unit 302 of the terminal device 102 because it can collect communication information and environmental information in wireless communication between the base station device 101 and multiple terminal devices 102 and use a large amount of training data. However, it is possible to adjust the model by techniques such as transfer learning and fine tuning so that it is optimized for each terminal device 102.

通信環境モデルは、環境情報、または、環境情報および通信情報を入力情報として、同じ時間または未来の時間におけるターゲット情報を出力するように、生成されるモデルである。通信環境モデルは、出力の値が高精度となるように、生成されてもよいし、出力値とは異なるあるパラメータを最大化するように強化学習により生成されてもよい。 A communication environment model is a model that is generated so as to use environmental information, or environmental information and communication information, as input information and output target information at the same time or at a future time. The communication environment model may be generated so that the output value is highly accurate, or may be generated by reinforcement learning so as to maximize a certain parameter different from the output value.

例えば、未来の時間の受信電力を通信環境モデルにより出力する場合には、未来の時間の受信電力をターゲット情報のパラメータとして訓練データを形成し、通信環境モデルの出力となる受信電力の予測値と実際の受信電力との誤差が少なくなるように、通信環境モデルが生成される。あるいは、端末装置１０２が自律移動ロボットである場合、受信電力を最大化するように、ロボットのＸ座標速度、Ｙ座標位置速度、回転指令を出力させるために、強化学習により、受信電力を最大化することに報酬を設定し、報酬が高くなるようなＸ座標速度、Ｙ座標速度、回転指令を出力するように通信環境モデルが生成される。For example, when the received power at a future time is output by the communication environment model, training data is formed using the received power at a future time as a parameter of the target information, and the communication environment model is generated so as to reduce the error between the predicted value of the received power output by the communication environment model and the actual received power. Alternatively, when the terminal device 102 is an autonomous mobile robot, in order to output the robot's X-coordinate velocity, Y-coordinate position velocity, and rotation command so as to maximize the received power, a reward is set by reinforcement learning to maximizing the received power, and the communication environment model is generated so as to output the X-coordinate velocity, Y-coordinate velocity, and rotation command that will increase the reward.

通信環境モデルをできるだけ多くの訓練データにより生成するためには、できるだけ共通して利用できるデータを増やして、モデルの生成を行う方が高い特性が得られると想定される。一方、通信システムは一般に広い帯域幅が利用可能であり、その中の周波数チャネルの利用方法も通信システムにより様々なものが存在する。通信システムの設定値ごとに独立の通信環境モデルを生成すれば、訓練に要するデータ量が十分に得られないなどの問題が生じる。従って、できるだけ多くのデータを用いた共通の通信環境モデルとすることが望ましい。 In order to generate a communication environment model using as much training data as possible, it is assumed that higher performance will be obtained by generating the model using as much commonly available data as possible. On the other hand, communication systems generally have a wide bandwidth available, and the methods of using frequency channels within that bandwidth vary depending on the communication system. If an independent communication environment model is generated for each setting value of the communication system, problems such as insufficient data required for training will arise. Therefore, it is desirable to create a common communication environment model using as much data as possible.

図２は、通信情報に関するパラメータと環境情報に係るパラメータの関係を示す。図２において、縦軸は時間を示し、ｔ_－ＮはＮスロット過去の時間、ｔ_０は現在、ｔ_ＭはＭスロット未来の時間を表す。ここで、スロットとは、任意の時間幅での区切りを表し、無線通信のシンボル間隔と考えてもよいし、環境情報が得られる周期と考えてもよい。 Fig. 2 shows the relationship between parameters related to communication information and parameters related to environmental information. In Fig. 2, the vertical axis indicates time, with t _-N representing time N slots in the past, t ₀ representing the present, and t _M representing time M slots in the future. Here, a slot represents a division of an arbitrary time width, and may be considered as a symbol interval in wireless communication, or as a period in which environmental information is obtained.

図２に示すように、一般に、環境情報は時間的な相関が高いが、通信情報は時間的な相関が低い。映像や位置情報などの環境情報は、物理的な環境における情報なので時間的な相関が高い。例えば１秒間で物体が動作する範囲は限られているため、現在と過去の情報から１秒後の映像や物体の位置などの状態を高精度に予測することが可能である。As shown in Figure 2, environmental information generally has a high temporal correlation, while communication information has a low temporal correlation. Environmental information such as video and location information has a high temporal correlation because it is information about the physical environment. For example, since the range in which an object moves within one second is limited, it is possible to predict with high accuracy the state of the video, object position, etc. one second from now based on current and past information.

一方、受信電力などの通信情報は、１秒後の通信環境が全く異なるものになることが少なくない。例えば、移動する端末装置の通信状態が伝搬特性の変動などにより急激に悪化することがある。このため、実際の無線通信システムでは、パイロット信号を使ったチャネル情報の取得をマイクロ秒オーダーの短い間隔で実施しており、数十ミリ秒未来の時間の通信情報の予測は困難となる。 On the other hand, communication information such as received power often changes completely depending on the communication environment one second later. For example, the communication status of a moving terminal device can suddenly deteriorate due to fluctuations in propagation characteristics. For this reason, in actual wireless communication systems, channel information is obtained using pilot signals at short intervals on the order of microseconds, making it difficult to predict communication information tens of milliseconds into the future.

このように、未来の通信情報に関するパラメータを予測するアルゴリズムでは、先ず時間的に相関の高いパラメータを用いて未来の予測すべき時間における情報を生成した後、当該情報を通信情報に変換することが有効であると考えられる。 In this way, in an algorithm for predicting parameters related to future communication information, it is considered effective to first generate information for the future time to be predicted using parameters that are highly correlated in time, and then convert this information into communication information.

そこで、本実施形態では、時間的に相関の高いパラメータを用いて補助情報を一旦生成し、補助情報を用いて通信情報を予測する通信環境モデルを生成する。Therefore, in this embodiment, auxiliary information is first generated using parameters that are highly correlated over time, and then a communication environment model is generated that predicts communication information using the auxiliary information.

図２において、未来の時間ｔ_Ｍにおける通信情報に関するパラメータを予測するために、現在および過去の通信情報を用いた場合、時間的な相関が低いので、推定精度が低くなるという問題がある。 In FIG. 2, when current and past communication information is used to predict parameters related to communication information at future time _tM , there is a problem that the estimation accuracy is low because of low temporal correlation.

このため、本実施形態では、未来の時間ｔ_Ｍにおける通信情報を得るために、現在時間ｔ_０から未来の時間ｔ_Ｍの間の任意の時間の環境情報または通信情報を補助情報として生成し、さらに生成した補助情報を用いて、時間ｔ_Ｍにおける通信情報に関するパラメータを予測することにより、推定精度を高めることができる。 Therefore, in this embodiment, in order to obtain communication information at a future time _tM , environmental information or communication information at any time between the current time _t0 and the future time _tM is generated as auxiliary information, and the generated auxiliary information is used to predict parameters related to the communication information at the time _tM , thereby improving the estimation accuracy.

図３は、通信環境モデル生成部２０３を訓練する場合の構成例を示す。なお、ここでは、基地局装置１０１の通信環境モデル生成部２０３として説明するが、端末装置１０２の通信環境モデル生成部３０２の場合も同様である。 Figure 3 shows an example of the configuration when training the communication environment model generation unit 203. Note that, although the explanation here is of the communication environment model generation unit 203 of the base station device 101, the same applies to the communication environment model generation unit 302 of the terminal device 102.

モデル訓練時には、通信環境モデル生成部２０３は、訓練データを用いて、通信環境モデル４０１を生成する。訓練データは、入力情報、ターゲット情報および補助情報からなる。入力情報は、環境情報のみ、または環境情報および通信情報の両方からなる。ターゲット情報は、通信情報または通信情報のパラメータを最大化する制御に関するパラメータからなる。制御に関するパラメータとは、端末装置１０２の動き、基地局装置１０１の動き、物理層からアプリケーション層までのＯＳＩ参照モデルにおけるモード／通信宛先／通信経路／通信方式などの制御、電波伝搬環境に影響を与える構造物／メタマテリアル／誘電体の位置・動き・設定の制御、などが挙げられる。補助情報は、ターゲットとなる出力と同じ時間の情報である環境情報と通信情報の中から選択された任意のパラメータからなる。During model training, the communication environment model generation unit 203 generates the communication environment model 401 using training data. The training data consists of input information, target information, and auxiliary information. The input information consists of only environmental information, or both environmental information and communication information. The target information consists of communication information or parameters related to control that maximize the parameters of the communication information. The parameters related to control include the movement of the terminal device 102, the movement of the base station device 101, control of the mode/communication destination/communication path/communication method in the OSI reference model from the physical layer to the application layer, and control of the position, movement, and setting of structures/metamaterials/dielectrics that affect the radio wave propagation environment. The auxiliary information consists of any parameter selected from the environmental information and communication information, which are information at the same time as the target output.

ここで、入力情報は、通信環境モデル４０１の利用時にも入力され、ターゲット情報を出力するために必要な情報である。また、補助情報は、訓練時には入力されるが、通信環境モデル４０１の利用時には入力されず、通信環境モデル生成部２０３の内部で用いられる情報である。 Here, the input information is also input when the communication environment model 401 is used, and is information necessary for outputting target information. In addition, the auxiliary information is input during training, but is not input when the communication environment model 401 is used, and is information used inside the communication environment model generation unit 203.

なお、入力情報として、図２で説明したように、時間的な相関が高い環境情報の使用が想定される。通信情報は時間的な相関が低い傾向にあるが、時間的な相関が高い通信情報のパラメータを選択して入力情報として用いてもよい。例えば、過去の時間ｔ_Ｎと現在の時間ｔ_０との間隔が短い場合、通信情報の時間的な相関はある程度得られる考えることができる。 As the input information, it is assumed that environmental information with high temporal correlation is used as described in Fig. 2. Although communication information tends to have low temporal correlation, parameters of communication information with high temporal correlation may be selected and used as the input information. For example, when the interval between the past time _tN and the current time _t0 is short, it is considered that the temporal correlation of the communication information can be obtained to a certain extent.

このようにして、通信環境モデル生成部２０３（または通信環境モデル生成部３０２）は、通信環境モデル４０１を生成することができる。In this way, the communication environment model generation unit 203 (or the communication environment model generation unit 302) can generate the communication environment model 401.

図４は、訓練時の通信環境モデル４０１の一例を示す。図４において、訓練データとして、利用時に用いる入力情報と、利用時には入力されないがモデルの訓練に利用する補助情報と、利用時に出力するターゲット情報を、教師データとして利用する。なお、機械学習アルゴリズムとして、ニューラルネットワーク、決定木、ランダムフォレスト、など任意のアルゴリズムが利用可能である。 Figure 4 shows an example of a communication environment model 401 during training. In Figure 4, input information used during use, auxiliary information that is not input during use but is used to train the model, and target information that is output during use are used as training data, as teacher data. Note that any algorithm, such as a neural network, a decision tree, or a random forest, can be used as a machine learning algorithm.

図４において、通信環境モデル４０１は、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２を有する。なお、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２は、それぞれ複数具備されていてもよい。In FIG. 4, the communication environment model 401 has a first machine learning block 501 and a second machine learning block 502. Note that the first machine learning block 501 and the second machine learning block 502 may each be provided in multiple numbers.

第１の機械学習ブロック５０１は、入力情報が入力されると、予め決められた機械学習のアルゴリズムを用い、補助情報を含む情報を出力する。訓練時には、第１の機械学習ブロック５０１から出力された補助情報は、補助情報出力５０３において、既知の訓練データの補助情報と比較される。例えば第１の機械学習ブロック５０１から出力される補助情報と、訓練データの補助情報との誤差（例えばＭＳＥ（Mean square error））を求め、より精度の高い補助情報が出力されるように、第１の機械学習ブロック５０１内のアルゴリズムの係数が更新される。このようにして、第１の機械学習ブロック５０１は、より精度の高い補助情報が得られるように訓練される。この補助情報は、ターゲット情報が対応する未来の時間の環境情報とすることで、第２の機械学習ブロック５０２は、補助情報とターゲット情報の時間差の小さい関係性を学習できる。つまり、入力情報に含まれる時間ｔ_０までの環境情報を用い、同じ種類の情報であるが、それより未来の時間ｔ_Ｆの環境情報を補助情報として用いることができる。この場合、訓練データは、入力信号の少なくとも一部の時系列データを、時間的に未来にシフトしたものを補助情報として用いることができるため、時系列で訓練データが得られている場合、同じデータから入力情報と補助情報を得ることができるメリットもある。 When input information is input, the first machine learning block 501 uses a predetermined machine learning algorithm to output information including auxiliary information. During training, the auxiliary information output from the first machine learning block 501 is compared with the auxiliary information of known training data in the auxiliary information output 503. For example, an error (e.g., MSE (Mean Square Error)) between the auxiliary information output from the first machine learning block 501 and the auxiliary information of the training data is obtained, and the coefficient of the algorithm in the first machine learning block 501 is updated so that more accurate auxiliary information is output. In this way, the first machine learning block 501 is trained to obtain more accurate auxiliary information. By using this auxiliary information as environmental information of a future time corresponding to the target information, the second machine learning block 502 can learn a relationship with a small time difference between the auxiliary information and the target information. In other words, the environmental information up to the time t ₀ included in the input information can be used as auxiliary information, which is the same type of information, but is environmental information of a time t _F in the future. In this case, the training data can be at least a portion of the time series data of the input signal shifted forward in time and used as auxiliary information. Therefore, if training data is obtained in time series, there is also the advantage that the input information and auxiliary information can be obtained from the same data.

第２の機械学習ブロック５０２は、第１の機械学習ブロック５０１が補助情報出力５０３を介して出力する補助情報を少なくとも含む入力情報に基づいて、ターゲット情報を出力する。なお、第２の機械学習ブロック５０２は、補助情報に加えて、第１の機械学習ブロック５０１を介して入力情報を入力するようにしてもよい。訓練時には、第２の機械学習ブロック５０２から出力されたターゲット情報は、ターゲット情報出力５０４において、既知の訓練データのターゲット情報と比較される。例えば第２の機械学習ブロック５０２から出力されるターゲット情報と、訓練データのターゲット情報との誤差（例えばＭＳＥ）を求め、より精度の高いターゲット情報が出力されるように、第２の機械学習ブロック５０２内のアルゴリズムの係数が更新される。このようにして、第２の機械学習ブロック５０２は、より精度の高いターゲット情報が得られるように訓練される。The second machine learning block 502 outputs target information based on input information including at least auxiliary information output by the first machine learning block 501 via the auxiliary information output 503. In addition to the auxiliary information, the second machine learning block 502 may input input information via the first machine learning block 501. During training, the target information output from the second machine learning block 502 is compared with the target information of known training data in the target information output 504. For example, the error (e.g., MSE) between the target information output from the second machine learning block 502 and the target information of the training data is calculated, and the coefficients of the algorithm in the second machine learning block 502 are updated so that more accurate target information is output. In this way, the second machine learning block 502 is trained to obtain more accurate target information.

なお、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２がニューラルネットワークで構成されている場合には、補助情報やターゲット情報に対する誤差に基づいて、それぞれバックプロパゲーションにより、ニューラルネットワーク内の係数（重みおよびバイアス）が更新され、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２がより正確な補助情報およびターゲット情報を出力できるように訓練される。このとき、ターゲット情報に対するバックプロパゲーションは、第１の機械学習ブロック５０１と第２の機械学習ブロック５０２の両方の係数を更新するように訓練してもよいし、第２の機械学習ブロック５０２のみを訓練するようにしてもよい。In addition, when the first machine learning block 501 and the second machine learning block 502 are configured as neural networks, the coefficients (weights and biases) in the neural networks are updated by backpropagation based on the errors for the auxiliary information and the target information, and the first machine learning block 501 and the second machine learning block 502 are trained to output more accurate auxiliary information and target information. At this time, the backpropagation for the target information may be trained to update the coefficients of both the first machine learning block 501 and the second machine learning block 502, or only the second machine learning block 502 may be trained.

このように、機械学習ブロックの係数を更新することにより、図２で説明した時間的な相関の低い通信情報からターゲット情報を出力するのではなく、第１の機械学習ブロック５０１により、ターゲット情報と同じ時間軸（ｔ_０）における環境情報や通信情報を補助情報として用い、補助情報を高精度に入力情報から生成できるように通信環境モデル４０１を訓練することが可能である。これにより、ターゲット情報の精度が向上され、強化学習による訓練で設定された通信情報の任意のパラメータを最大化するようなターゲット情報の出力が可能になる。 In this way, by updating the coefficients of the machine learning block, it is possible to train the communication environment model 401 so that the auxiliary information can be generated from the input information with high accuracy by using environmental information and communication information on the same time axis (t ₀ ) as the target information as auxiliary information by the first machine learning block 501, rather than outputting target information from communication information with low temporal correlation as described in Fig. 2. This improves the accuracy of the target information, and makes it possible to output target information that maximizes any parameter of the communication information set in training by reinforcement learning.

図５は、利用時の通信環境モデル４０１の一例を示す。通信環境モデル生成部２０３は、図４で説明した訓練時とは異なり、入力情報を入力してターゲット情報を出力する。 Figure 5 shows an example of a communication environment model 401 during use. Unlike during training described in Figure 4, the communication environment model generation unit 203 receives input information and outputs target information.

図５において、通信環境モデル４０１は、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２を有する。なお、第１の機械学習ブロック５０１および第２の機械学習ブロック５０２は、それぞれ複数具備されていてもよい。 In FIG. 5, the communication environment model 401 has a first machine learning block 501 and a second machine learning block 502. Note that the first machine learning block 501 and the second machine learning block 502 may each be provided in multiple numbers.

第１の機械学習ブロック５０１は、図４の訓練時とは異なり、入力情報から補助情報を生成し、補助情報を第２の機械学習ブロック５０２に出力する。ここで、補助情報は、図４に示した訓練時に強化学習された精度の高い補助情報である。Unlike the training in FIG. 4, the first machine learning block 501 generates auxiliary information from the input information and outputs the auxiliary information to the second machine learning block 502. Here, the auxiliary information is highly accurate auxiliary information that has been reinforced learned during the training shown in FIG. 4.

第２の機械学習ブロック５０２は、第１の機械学習ブロック５０１から出力される補助情報を少なくとも含む入力に基づいて、ターゲット情報を出力する。ここで、ターゲット情報は、図４に示した訓練時に強化学習された精度の高いターゲット情報である。The second machine learning block 502 outputs target information based on an input including at least auxiliary information output from the first machine learning block 501. Here, the target information is highly accurate target information that has been reinforced learned during training as shown in FIG.

このようにして、通信環境モデル４０１は、通信情報の予測を精度高く行うことができる。特に、ターゲット情報と同じ時間における環境情報や通信情報を補助情報として用い、補助情報を高精度に入力情報から生成できるように訓練された通信環境モデルを用いるので、精度の高いターゲット情報を得ることができる。In this way, the communication environment model 401 can predict communication information with high accuracy. In particular, it uses environmental information and communication information at the same time as the target information as auxiliary information, and uses a communication environment model that has been trained to be able to generate auxiliary information from input information with high accuracy, so that highly accurate target information can be obtained.

［実証実験］
実施形態の効果を実証するために行った実験とその結果について説明する。 [Demonstration experiment]
Experiments conducted to verify the effects of the embodiment and the results thereof will be described.

実験に用いた移動する端末装置１０２は、パルス状に発光するレーザー照射に対する散乱）を用いたリモートセンシング技術の一つであるＬＩＤＡＲ（"Light Detection and Ranging"または"Laser Imaging Detection and Ranging"）を搭載した自律移動ロボットである。自律移動ロボットは、ＬＩＤＡＲにより、自己の位置情報、向き情報、オドメトリ情報、及び制御指令情報を端末情報として収集することができる。The mobile terminal device 102 used in the experiment was an autonomous mobile robot equipped with LIDAR ("Light Detection and Ranging" or "Laser Imaging Detection and Ranging"), a remote sensing technology that uses scattering of pulsed laser radiation. The autonomous mobile robot can collect its own position information, orientation information, odometry information, and control command information as terminal information using LIDAR.

また、無線通信はＩＥＥＥ８０２．１１ａｃ規格の無線ＬＡＮを用い、０．２秒間に通信するビット量をスループットとして計測し、性能を評価するものとする。 In addition, wireless communication will use IEEE 802.11ac standard wireless LAN, and the amount of bits communicated per 0.2 seconds will be measured as throughput to evaluate performance.

図６は、自律移動ロボットの一例を示す。図６において、横軸はｘ－ａｘｉｓ［ｍ］、縦軸はｙ－ａｘｉｓ［ｍ］の二次元平面を示し、時刻τにおける自律移動ロボットの位置は（ｘ［τ］、ｙ［τ］）である。自己の位置情報は、オドメトリ情報とＬＩＤＡＲの測定結果から、ＲｏｂｏｔＯＳのＡＭＣＬ（Adaptive Monte Carlo Localization）のアルゴリズムにより計算される。向きは、ロボット分野の座標系で用いられるクォータニオンにより、｛ｋｚ［τ］、ｗ［τ］｝＝｛ｓｉｎ（θ［τ］／２）、ｃｏｓ（θ［τ］／２）｝で表される。また、ロボットのタイヤの回転から検出されるロボットの正面方向に対する速度や回転速度などのオドメトリ情報、ロボット正面方向へ対応するタイヤ回転指令、ロボットの回転に対応する指令などの制御指令情報、などの特徴量として、オドメトリ（Ｖｘ［τ］、Ｖ_θ［τ］）、制御指令情報（Ｏｘ［τ］、Ｏ_θ［τ］）を用いることができる。後述の自律移動ロボットを用いた実証実験では、（ｘ［τ］、ｙ［τ］、ｋｚ［τ］、ｗ［τ］、Ｖｘ［τ］、Ｖ_θ［τ］、Ｏｘ［τ］、Ｏ_θ［τ］）の合計８つの特徴量のうち、すべての特徴量（８つの特徴量）を入力情報として使用する場合と、オドメトリおよび制御指令情報を除く（Ｘ［τ］、Ｙ［τ］、ｋｚ［τ］、ｗ［τ］）の４つの特徴量を入力情報として使用する場合とについて行う。 FIG. 6 shows an example of an autonomous mobile robot. In FIG. 6, the horizontal axis indicates a two-dimensional plane with the x-axis [m] and the vertical axis indicates the y-axis [m], and the position of the autonomous mobile robot at time τ is (x[τ], y[τ]). The position information of the autonomous mobile robot itself is calculated from the odometry information and the measurement results of the LIDAR by the AMCL (Adaptive Monte Carlo Localization) algorithm of the Robot OS. The direction is expressed by {kz[τ], w[τ]}={sin(θ[τ]/2), cos(θ[τ]/2)} according to the quaternion used in the coordinate system in the robotics field. In addition, odometry (Vx[τ], V θ[τ]) and control command information (Ox[τ], O _θ [τ]) can be used as feature quantities such as odometry information such as the speed and rotation speed of the robot in the forward direction detected from the rotation of the tires of the robot, tire rotation commands corresponding to the robot's forward direction, and control command information such as commands corresponding to the rotation of the robot. In a demonstration experiment using _an autonomous mobile robot described later, a case in which all of the eight feature quantities (x[τ], y[τ], kz[τ], w[τ], _Vx [τ], V θ[τ], Ox[τ], O _θ [τ]) are used as input information, and a case in which four feature quantities (X[τ], Y[τ], kz[τ], w[τ]) excluding the odometry and control command information are used as input information are performed.

さらに、通信情報はＲＳＳＩとスループットとを用い、ＲＳＳＩは１０ｍｓごとに取得し、０．２秒のタイムスロット内のＲＳＳＩの中央値をγ_ａｖｅ［τ］、分散値をγ_ｓｔｄ［τ］、スループットをＣ［τ］とする。 Furthermore, the communication information used is RSSI and throughput, RSSI is acquired every 10 ms, the median value of RSSI within a 0.2 second time slot is γ _ave [τ], the variance value is γ _std [τ], and the throughput is C[τ].

ここで、実証実験における環境情報は、ロボットから得られる、位置（Ｘ軸、Ｙ軸）、向き、オドメトリ（自己位置推定）（Ｘ軸上の速度、Ｙ軸上の速度、回転速度）、制御指令情報（ロボット正面方向へのタイヤ回転指令、ロボットの回転指令）、および過去のスループット（現在から過去２秒まで）を用いる。Here, the environmental information used in the demonstration experiment is the position (X-axis, Y-axis), orientation, odometry (self-position estimation) (speed on the X-axis, speed on the Y-axis, rotation speed), control command information (tire rotation command toward the front of the robot, robot rotation command), and past throughput (from the present to the past 2 seconds) obtained from the robot.

また、時間方向には１０サンプルのパラメータを用いるので、入力情報は、パラメータが４つの場合は４×１０＝４０、パラメータが８つの場合は８×１０＝８０となる。出力情報（ターゲット情報）は、現在時間ｔ_０の１．２秒後から２秒後までを０．２秒間ウィンドウで規格化したスループットＣ［ｔ_Ｍ－４］、Ｃ［ｔ_Ｍ－３］、Ｃ［ｔ_Ｍ－２］、Ｃ［ｔ_Ｍ－１］、Ｃ［ｔ_Ｍ］を予測する。ここで、ｔ_Ｍ＝ｔ_０＋２［ｓｅｃ］、ｔ_Ｍ－１＝ｔ_０＋１．８［ｓｅｃ］、ｔ_Ｍ－２＝ｔ_０＋１．６［ｓｅｃ］、ｔ_Ｍ－３＝ｔ_０＋１．４［ｓｅｃ］、ｔ_Ｍ－４＝ｔ_０＋１．２［ｓｅｃ］である。 In addition, since parameters of 10 samples are used in the time direction, the input information is 4×10=40 when there are four parameters, and 8×10=80 when there are eight parameters. The output information (target information) predicts throughputs C[t _M -4], C[t _M -3], C[t _M -2], C[t M -1], and C[t _M ] standardized in a 0.2 second window from 1.2 seconds to ₂ seconds after the current time t 0. Here, t _M = t ₀ + 2 [sec], t _M -1 = t ₀ + 1.8 [sec], t _M -2 = t ₀ + 1.6 [sec], t _M _-3 = t ₀ + 1.4 [sec], and t _M -4 = t ₀ + 1.2 [sec].

また、訓練時には、出力されるターゲット情報のスループットＣ［ｔ_Ｍ－４］からＣ［ｔ_Ｍ］と、既知の訓練データのスループットＣ［ｔ_Ｍ－４］からＣ［ｔ_Ｍ］とを比較して誤差を算出し、誤差が最小になるように逆伝播により機械学習ブロックの係数が更新される。 During training, the throughput of the output target information C[t _M -4] to C[t _M ] is compared with the throughput of the known training data C[t _M -4] to C[t _M ] to calculate the error, and the coefficients of the machine learning block are updated by backpropagation so as to minimize the error.

なお、後述のＲ２スコアで評価するスループットＣ_ｉは、（Ｃ［ｔ_Ｍ－４］＋Ｃ［ｔ_Ｍ－３］＋Ｃ［ｔ_Ｍ－２］＋Ｃ［ｔ_Ｍ－１］＋Ｃ［ｔ_Ｍ］）／５として評価する。 The throughput C _i evaluated by the R2 score described below is evaluated as (C[t _M -4] + C[t _M -3] + C[t _M -2] + C[t _M -1] + C[t _M ])/5.

図７は、自律移動ロボットが認識する屋内マップとゴールの位置を示す。図７に示すように、ロボットは、設置された１１３個のゴールの中から一つを選択して、自律移動を続ける。そして、ロボットが選択されたゴールの周辺３０ｃｍに入れば、次のゴールを再設定して、ロボットは次のゴールに向かうように設計されている。 Figure 7 shows the indoor map and goal locations recognized by the autonomous mobile robot. As shown in Figure 7, the robot selects one of the 113 goals set up and continues moving autonomously. When the robot comes within 30 cm of the selected goal, the next goal is reset and the robot moves toward the next goal.

実証実験では、深層学習により通信環境モデルを生成するため、前述の入力情報およびターゲット情報のいずれも、およそ－１から＋１の分布となるように前処理を行っている。また、予測精度は、式（１）に示すＲ２スコアにより評価する。In the demonstration experiment, in order to generate a communication environment model using deep learning, both the input information and the target information mentioned above are preprocessed so that they have a distribution ranging from approximately -1 to +1. In addition, the prediction accuracy is evaluated using the R2 score shown in formula (1).

ここで、Ｃ_ｉはスループットの実測値、^Ｃ_ｉは機械学習による予測値、ｎはサンプル数、ｉはサンプル番号、Ｃ_ａｖｅはスループットの予測値をそれぞれ表す。 Here, C _i represents the actual measured value of throughput, ^C _i represents the predicted value by machine learning, n represents the number of samples, i represents the sample number, and C _ave represents the predicted value of throughput.

Ｒ２スコアは機械学習による通信環境モデルがどの程度の精度でデータを予測できるかを表し、一般に、Ｒ２スコアが０．６以上で正常にモデル化ができていることを示し、Ｒ２スコアが０．７の場合はモデルの精度が高いことを示す。 The R2 score indicates how accurately a machine learning communication environment model can predict data. Generally, an R2 score of 0.6 or higher indicates successful modeling, while an R2 score of 0.7 indicates high model accuracy.

図８は、比較例の通信環境モデルの一例を示す。比較例は、従来方式の通信環境モデルであり、リカレントニューラルネットワーク（ＲＮＮ）と全結合層３層を用いて構成した通信環境モデルの例を示す。 Figure 8 shows an example of a communication environment model for the comparative example. The comparative example is a conventional communication environment model, and shows an example of a communication environment model constructed using a recurrent neural network (RNN) and three fully connected layers.

図８において、環境情報（８つのパラメータ）と通信情報（２つのパラメータ）の入力情報がＲＮＮに入力される。入力情報は、過去２秒間に０．２秒ごとの１０のタイミングのパラメータを用いる。つまり、現在時間ｔ_０から０．２秒ごとの過去の時間ｔ_０－０．２秒、ｔ_０－０．４秒、ｔ_０－０．６秒、ｔ_０－０．８秒、ｔ_０－１．０秒、ｔ_０－１．２秒、ｔ_０－１．４秒、ｔ_０－１．６秒、ｔ_０－１．８秒、の１０のタイミングのパラメータを用いる。また、ＲＮＮは、ＧＲＵ（Gated Recurrent Unit）を用い、隠れ層を１、隠れ層の次元を３５とする。 In FIG. 8, input information of environmental information (8 parameters) and communication information (2 parameters) is input to the RNN. The input information uses parameters at 10 timings every 0.2 seconds for the past 2 seconds. That is, parameters at ₁₀ timings are used at past times t ₀ -0.2 seconds, t ₀ -0.4 seconds, t ₀ -0.6 seconds, t ₀ -0.8 seconds, t ₀ -1.0 seconds, t ₀ -1.2 seconds, t ₀ -1.4 seconds, t ₀ -1.6 seconds, and t ₀ -1.8 seconds every 0.2 seconds from the current time t 0. In addition, the RNN uses a GRU (Gated Recurrent Unit), has one hidden layer, and has a dimension of 35.

また、全結合層は、入力３５および出力３５の層が２つと、入力３５および出力１の層が１つとで構成され、ターゲット情報を出力する。なお、全結合層の間にＲｅＬＵ（Rectified Linear Unit）による活性化層が用いられる。 The fully connected layer is composed of two layers with input 35 and output 35, and one layer with input 35 and output 1, and outputs target information. An activation layer using ReLU (Rectified Linear Unit) is used between the fully connected layers.

図９は、本実施形態に係る第１の構成の通信環境モデル４０１の一例を示す。図９に示す第１の構成の通信環境モデル４０１は、ＲＮＮと全結合層３層を基本構造とする第１の機械学習ブロック５０１と第２の機械学習ブロック５０２とを有する。 Figure 9 shows an example of a communication environment model 401 of a first configuration according to this embodiment. The communication environment model 401 of the first configuration shown in Figure 9 has a first machine learning block 501 and a second machine learning block 502, each of which has a basic structure of an RNN and three fully connected layers.

第１の機械学習ブロック５０１は、ＲＮＮとして隠れ層が１、隠れ層の次元が３５のＧＲＵを用い、入力３５で出力３５の層が２つ、入力３５で出力Ｍ（Ｍは正の整数）の層が１つ、を有する全結合層３層で構成され、ターゲット情報の時間ｔ_０に対応するＭ個の補助情報を出力する。 The first machine learning block 501 uses a GRU with one hidden layer as an RNN and a hidden layer dimension of 35, and is composed of three fully connected layers having two layers with input 35 and output 35 and one layer with input 35 and output M (M is a positive integer), and outputs M pieces of auxiliary information corresponding to time _t0 of the target information.

第２の機械学習ブロック５０２は、ＲＮＮとして隠れ層が１、隠れ層の次元が３５のＧＲＵを用い、入力３５で出力３５の層が２つ、入力３５で出力１の層が１つ、を有する全結合層３層で構成され、ターゲット情報を出力する。The second machine learning block 502 uses a GRU with one hidden layer as an RNN and a hidden layer dimension of 35, and is composed of three fully connected layers with two layers with input 35 and output 35 and one layer with input 35 and output 1, and outputs target information.

ここで、先に述べたように、訓練時には、第１の機械学習ブロック５０１の出力となる補助情報が訓練データに含まれるように構成する。そして、出力される補助情報と、教師データの補助情報とを比較して、誤差が最小となるように、第１の機械学習ブロック５０１のニューラルネットワークの係数を更新することにより、補助情報の精度が高められる。Here, as mentioned above, during training, the auxiliary information output by the first machine learning block 501 is configured to be included in the training data. The output auxiliary information is then compared with the auxiliary information of the teacher data, and the coefficients of the neural network of the first machine learning block 501 are updated so as to minimize the error, thereby improving the accuracy of the auxiliary information.

また、実証実験の入力情報は、先に述べたように、（Ｘ［τ］、Ｙ［τ］、ｋｚ［τ］、ｗ［τ］）の４つの場合と、（Ｘ［τ］、Ｙ［τ］、ｋｚ［τ］、ｗ［τ］、Ｖｘ［τ］、Ｖ_θ［τ］、Ｏｘ［τ］、Ｏ_θ［τ］）の８つの場合と、を想定する。 As mentioned above, the input information for the demonstration experiment is assumed to be four cases: (X[τ], Y[τ], kz[τ], w[τ]) and eight cases: (X[τ], Y[τ], kz[τ], w[τ], Vx[τ], V _θ [τ], Ox[τ], O _θ [τ]).

また、補助情報出力は、（Ｘ［ｔ］、Ｙ［ｔ］、ｋｚ［ｔ］、ｗ［ｔ］、Ｖｘ［ｔ］、Ｖ_θ［ｔ］、γ_ａｖｅ［ｔ］、γ_ｓｔｄ［ｔ］、Ｃ［ｔ］）の１１の係数を８つの時間（ｔ_Ｍ－７、ｔ_Ｍ－６、・・・、ｔ_Ｍ）のサンプル分だけ生成される。つまり、補助情報出力は、ｔ_Ｍ－７、ｔ_Ｍ－６、・・・、ｔ_Ｍの時間に対応し、１１×８＝８８の補助情報が出力される。 Furthermore, the auxiliary information output is generated by generating 11 coefficients (X[t], Y[t], kz[t], w[t], Vx[t], V _θ [t], γ _ave [t], γ _std [t], C[t]) for eight time samples (t _M-7 , t _M-6 , ..., t _M ). That is, the auxiliary information output corresponds to times t _M-7 , t _M-6 , ..., t _M , and 11×8=88 pieces of auxiliary information are output.

このようにして得られる補助情報出力の時系列データは、さらに第２の機械学習ブロック５０２のＲＮＮ（ＧＲＵ）に入力され、全結合層３層からターゲット情報が出力される。第２の機械学習ブロック５０２から出力されるターゲット情報は、現在時間ｔ_０の１．２秒後から２秒後までを０．２秒間ウィンドウで規格化したスループットＣ［ｔ_Ｍ－４］、Ｃ［ｔ_Ｍ－３］、Ｃ［ｔ_Ｍ－２］、Ｃ［ｔ_Ｍ－１］、Ｃ［ｔ_Ｍ］となる。ここで、ｔ_Ｍ＝ｔ_０＋２［ｓｅｃ］、ｔ_Ｍ－１＝ｔ_０＋１．８［ｓｅｃ］、ｔ_Ｍ－２＝ｔ_０＋１．６［ｓｅｃ］、ｔ_Ｍ－３＝ｔ_０＋１．４［ｓｅｃ］、ｔ_Ｍ－４＝ｔ_０＋１．２［ｓｅｃ］である。 The time series data of the auxiliary information output obtained in this manner is further input to the RNN (GRU) of the second machine learning block 502, and target information is output from the three fully connected layers. The target information output from the second machine learning block 502 is the throughputs C[ _tM-4 ], C[tM _- 3], C[ _tM-2 ], C[ _tM-1 _] , and C[ _tM ] normalized in a 0.2 second window from 1.2 seconds to 2 seconds after the current time t0. Here, _tM = _t0 + 2 [sec], tM _-1 = _t0 + 1.8 [sec], _tM-2 = t0 ₊ 1.6 [sec], _tM-3 = _t0 + 1.4 [sec], and _tM-4 = _t0 + 1.2 [sec].

訓練時には、８８の補助情報と、５つの出力情報（ターゲット情報）と、対応する訓練データのターゲット情報とを比較して、誤差が最小になるように逆伝播を実施し、第１の機械学習ブロック５０１のニューラルネットワークの係数を更新することにより、補助情報の精度が高められる。During training, 88 pieces of auxiliary information, five pieces of output information (target information), and the target information of the corresponding training data are compared, and backpropagation is performed to minimize the error, and the accuracy of the auxiliary information is improved by updating the coefficients of the neural network of the first machine learning block 501.

図１０は、本実施形態に係る第２の構成の通信環境モデル４０１の一例を示す。図１０に示す第２の構成の通信環境モデル４０１は、ＲＮＮと全結合層３層を基本構造とする第１の機械学習ブロック５０１と第２の機械学習ブロック５０２とを有する。 Figure 10 shows an example of a communication environment model 401 of a second configuration according to this embodiment. The communication environment model 401 of the second configuration shown in Figure 10 has a first machine learning block 501 and a second machine learning block 502 having a basic structure of an RNN and three fully connected layers.

第１の機械学習ブロック５０１は、図９の第１の構成と同じ構成であり、ＲＮＮとして隠れ層が１、隠れ層の次元が３５のＧＲＵを用い、入力３５で出力３５の層が２つ、入力３５で出力Ｍ（Ｍは正の整数）の層が１つ、を有する全結合層３層で構成され、ターゲット情報の時間ｔ_０に対応するＭ個の第１の補助情報を出力する。 The first machine learning block 501 has the same configuration as the first configuration in Figure 9, and uses a GRU with one hidden layer as an RNN and a hidden layer dimension of 35, and is composed of three fully connected layers having two layers with input 35 and output 35 and one layer with input 35 and output M (M is a positive integer), and outputs M pieces of first auxiliary information corresponding to time _t0 of the target information.

第２の機械学習ブロック５０２は、図９の第１の構成と少し異なり、第２の機械学習ブロック５０２の中に第２’の機械学習ブロック５０２’を有する。第２’の機械学習ブロック５０２’は、入力３５で出力３５の層が２つ、入力３５で出力１の層が１つ、を有する全結合層３層で構成され、第２の補助情報を出力する。そして、第２の補助情報は、入力３７で出力３５の層が１つ、入力３５で出力３５の層が１つ、入力３５で出力１の層が１つ、を有する全結合層３層にＧＲＵの出力とともに入力され、ターゲット情報が出力される。The second machine learning block 502 is slightly different from the first configuration in FIG. 9, and has a second' machine learning block 502' in the second machine learning block 502. The second' machine learning block 502' is composed of three fully connected layers having two layers with input 35 and output 35, and one layer with input 35 and output 1, and outputs the second auxiliary information. The second auxiliary information is then input together with the output of the GRU to three fully connected layers having one layer with input 37 and output 35, one layer with input 35 and output 35, and one layer with input 35 and output 1, and target information is output.

図１０の第２の構成においても、実証実験の入力情報は、先に述べたように、（Ｘ［τ］、Ｙ［τ］、ｋｚ［τ］、ｗ［τ］）の４つの場合と、（Ｘ［τ］、Ｙ［τ］、ｋｚ［τ］、ｗ［τ］、Ｖｘ［τ］、Ｖ_θ［τ］、Ｏｘ［τ］、Ｏ_θ［τ］）の８つの場合と、を想定する。 In the second configuration of Figure 10, as mentioned above, the input information for the demonstration experiment is assumed to be four cases of (X[τ], Y[τ], kz[τ], w[τ]) and eight cases of (X[τ], Y[τ], kz[τ], w[τ], Vx[τ], V _θ [τ], Ox[τ], O _θ [τ]).

また、第１の補助情報の出力は、（Ｘ［ｔ］、Ｙ［ｔ］、ｋｚ［ｔ］、ｗ［ｔ］、Ｖｘ［ｔ］、Ｖ_θ［ｔ］）の８つの係数を８つの時間（ｔ_Ｍ－７、ｔ_Ｍ－６、・・・、ｔ_Ｍ）のサンプル分だけ生成される。つまり、第１の補助情報の出力は、ｔ_Ｍ－７、ｔ_Ｍ－６、・・・、ｔ_Ｍの時間に対応し、８×８＝６４の補助情報が出力される。 Furthermore, the output of the first auxiliary information is generated by generating eight coefficients (X[t], Y[t], kz[t], w[t], Vx[t], _Vθ [t]) for eight time samples ( _tM-7 , tM _-6 , ..., _tM ). That is, the output of the first auxiliary information corresponds to times _tM-7 , _tM-6 , ..., _tM , and 8×8=64 pieces of auxiliary information are output.

図１０の第２の構成では、入力情報を非通信情報である環境情報と通信情報とに分けて扱い、先ず時間的に相関の高い環境情報を未来の時間（時間ｔ_Ｍ－７からｔ_Ｍまで）の環境情報（第１の補助情報）に変換する。そして、未来の時間の第１の補助情報から、同じ時間の第２の補助情報として通信情報（γ_ａｖｅ［ｔ］、γ_ｓｔｄ［ｔ］、Ｃ［ｔ］）を出力するようにすることで、環境情報と通信情報とを時間的に相関の高い特徴量同士として訓練が実施可能である。これにより、第１の構成に比べて、ターゲット情報の精度が向上する。なお、第２の補助情報の出力は、ｔ_Ｍ－４、ｔ_Ｍ－３、・・・、ｔ_Ｍの出力時間と同様に５サンプルとする。第１の補助情報を入力する第２の機械学習ブロック５０２の出力と、第２’の機械学習ブロック５０２’が出力する第２の補助情報とに基づいて、全結合層で最後の予測が行われ、ターゲット情報が出力される。 In the second configuration of FIG. 10, the input information is divided into environmental information, which is non-communication information, and communication information, and first, environmental information with high temporal correlation is converted into environmental information (first auxiliary information) of a future time (from time t _M-7 to t _M ). Then, communication information (γ _ave [t], γ _std [t], C[t]) is output as second auxiliary information of the same time from the first auxiliary information of the future time, so that training can be performed with the environmental information and the communication information as features with high temporal correlation. This improves the accuracy of the target information compared to the first configuration. Note that the output of the second auxiliary information is 5 samples, the same as the output times of t _M-4 , t _M-3 , ..., t _M. Based on the output of the second machine learning block 502 that inputs the first auxiliary information and the second auxiliary information output by the second' machine learning block 502', the final prediction is performed in the fully connected layer, and the target information is output.

なお、実証実験において、第２の機械学習ブロック５０２から出力されるターゲット情報は、現在時間ｔ_０の１．２秒後から２秒後までを０．２秒間ウィンドウで規格化したスループットＣ［ｔ_Ｍ－４］、Ｃ［ｔ_Ｍ－３］、Ｃ［ｔ_Ｍ－２］、Ｃ［ｔ_Ｍ－１］、Ｃ［ｔ_Ｍ］となる。ここで、ｔ_Ｍ＝ｔ_０＋２［ｓｅｃ］、ｔ_Ｍ－１＝ｔ_０＋１．８［ｓｅｃ］、ｔ_Ｍ－２＝ｔ_０＋１．６［ｓｅｃ］、ｔ_Ｍ－３＝ｔ_０＋１．４［ｓｅｃ］、ｔ_Ｍ－４＝ｔ_０＋１．２［ｓｅｃ］である。 In the demonstration experiment, the target information output from the second machine learning block 502 is the throughputs C[ _tM-4 ], C[ _tM-3 ], C[tM-2], _C [ _tM- ₁ ], and C[ _tM ] standardized in a 0.2 second window from 1.2 seconds to 2 seconds after the current time t0. Here, _tM = _t0 + 2 [sec], _tM-1 = _t0 + 1.8 [sec], tM _-2 = t0 ₊ 1.6 [sec], tM _-3 = _t0 + 1.4 [sec], and _tM-4 = _t0 + 1.2 [sec].

訓練時には、６４の第１の補助情報と、５つ時間の３つの特徴量に対する１５の第２の補助情報と、最終出力の５つのターゲット情報との合計８４の出力情報と、この出力情報に対応する訓練データの情報とを比較して誤差が最小になるように逆伝播を実施し、各機械学習ブロックの係数が更新される。During training, a total of 84 pieces of output information (64 pieces of first auxiliary information, 15 pieces of second auxiliary information for three features over five times, and five pieces of target information for the final output) are compared with the training data information corresponding to this output information, and backpropagation is performed to minimize the error, and the coefficients of each machine learning block are updated.

本実施形態の副次的な効果は、第１の機械学習ブロック５０１の入力情報を変更しても、第１の補助情報の出力を予め決めておけば、第２の機械学習ブロック５０２を共通に使用できることである。つまり、予測に使用できる情報が、端末装置１０２によってバラバラであったとしても、第１の機械学習ブロック５０１の第１の補助情報の出力内容を予め決めておくことで、各端末装置１０２は同じ補助情報が出力できる第１の機械学習ブロック５０１をそれぞれ有していればよい。つまり、第２の機械学習ブロック５０２は他の端末装置１０２と同じものが使える。A secondary effect of this embodiment is that even if the input information of the first machine learning block 501 is changed, the second machine learning block 502 can be used in common as long as the output of the first auxiliary information is determined in advance. In other words, even if the information that can be used for prediction varies depending on the terminal device 102, by determining in advance the output content of the first auxiliary information of the first machine learning block 501, each terminal device 102 only needs to have a first machine learning block 501 that can output the same auxiliary information. In other words, the second machine learning block 502 can be the same as that of the other terminal devices 102.

図１１は、実証実験における性能評価の結果を示す。なお、実証実験では、先に説明した実証実験で取得された１００時間分のデータに基づいて訓練を行った後、別に用意したテスト用の６時間分のデータセットにより、１秒後から２秒後までの規格化されたスループットが得られた。 Figure 11 shows the results of performance evaluation in the demonstration experiment. In the demonstration experiment, training was performed based on 100 hours of data acquired in the demonstration experiment described above, and then normalized throughput from 1 second to 2 seconds was obtained using a separately prepared 6-hour data set for testing.

図１１において、入力情報として（Ｘ［ｔ］、Ｙ［ｔ］、ｋｚ［ｔ］、ｗ［ｔ］）の４種の特徴量を入力した場合、従来方法のＲ２スコアが０．６３３９５であるのに対して、本実施形態に係る第１の構成方法のＲ２スコアは０．６４１７９２、第２の構成方法のＲ２スコアは０．６４３３８であり、Ｒ２スコアは、従来方法＜第１の構成方法＜第２の構成方法の順に改善されていることが確認できる。In Figure 11, when four types of features (X[t], Y[t], kz[t], w[t]) are input as input information, the R2 score for the conventional method is 0.63395, while the R2 score for the first configuration method of this embodiment is 0.641792 and the R2 score for the second configuration method is 0.64338. It can be seen that the R2 scores improve in the order of conventional method < first configuration method < second configuration method.

同様に、入力情報として（Ｘ［ｔ］、Ｙ［ｔ］、ｋｚ［ｔ］、ｗ［ｔ］、Ｖｘ［ｔ］、Ｖ_θ［ｔ］、Ｏｘ［ｔ］、Ｏ_θ［ｔ］）の８種の特徴量を入力した場合、従来方法のＲ２スコアが０．６４５６９であるのに対して、本実施形態に係る第１の構成方法のＲ２スコアは０．６５６０９７、第２の構成方法のＲ２スコアは０．６５９６４であり、Ｒ２スコアは、従来方法＜第１の構成方法＜第２の構成方法の順に改善されていることが確認できる。 Similarly, when eight types of features (X[t], Y[t], kz[t], w[t], Vx[t], V _θ [t], Ox[t], O _θ [t]) are input as input information, the R2 score of the conventional method is 0.64569, while the R2 score of the first configuration method of this embodiment is 0.656097, and the R2 score of the second configuration method is 0.65964, and it can be confirmed that the R2 scores improve in the order of conventional method < first configuration method < second configuration method.

（他の実現形態）
ここで、上述した各実施形態における基地局装置１０１または端末装置１０２の各ブロックまたは一部のブロックを構成要素として行う処理をコンピュータで実行するようにしてもよい。その場合、各ブロックまたは一部のブロックが行う処理を実現するためのプログラムをコンピュータで読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませて、コンピュータに実行させるようにしてもよい。 (Other implementations)
Here, the processing performed by each block or some blocks of the base station device 101 or the terminal device 102 in each of the above-mentioned embodiments as components may be executed by a computer. In this case, a program for realizing the processing performed by each block or some blocks may be recorded on a computer-readable recording medium, and the program recorded on this recording medium may be read into a computer system to be executed by the computer.

なお、「コンピュータ」とは、ＯＳや周辺機器等のハードウェアを含むものとする。また、「コンピュータで読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ－ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵または外部に接続されるハードディスク等の記憶装置である。 Note that the term "computer" includes hardware such as the OS and peripheral devices. Furthermore, "computer-readable recording media" refers to portable media such as flexible disks, optical magnetic disks, ROMs, and CD-ROMs, as well as storage devices such as hard disks built into or connected externally to a computer system.

さらに「コンピュータで読み取り可能な記録媒体」は、インターネット等のネットワークや電話回線等の通信回線を介して取得されるプログラムを含み、短時間だけ保持されるプログラム、動的に保持されるプログラム、なども含まれる。 Furthermore, "computer-readable recording medium" includes programs obtained via a network such as the Internet or a communication line such as a telephone line, as well as programs that are retained for only a short period of time and programs that are dynamically retained.

また、サーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものを含んでもよい。また、プログラムは、前述した構成要素の一部を実現するためのものであってもよく、さらに前述した構成要素をコンピュータシステムに既に記録されているプログラムとの組み合わせで実現できるものであってもよい。 It may also include something that holds a program for a certain period of time, such as a volatile memory inside a computer system that is a server or client. The program may also be for implementing some of the components described above, or may be capable of implementing the components described above in combination with a program already stored in the computer system.

また、プログラムは、ＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されるものであってもよい。 The program may also be implemented using hardware such as a PLD (Programmable Logic Device) or an FPGA (Field Programmable Gate Array).

以上説明したように、本発明に係る通信情報予測装置、通信情報予測方法、および通信情報予測プログラムは、時間的に相関の高いパラメータを入力情報として補助情報を生成し、補助情報からターゲットとなる未来の通信情報を出力する機械学習モデルを訓練して生成することにより、通信情報の予測を高精度に行うことができる。 As described above, the communication information prediction device, communication information prediction method, and communication information prediction program of the present invention generate auxiliary information using parameters that are highly temporally correlated as input information, and train and generate a machine learning model that outputs target future communication information from the auxiliary information, thereby making it possible to predict communication information with high accuracy.

なお、本発明の実施形態について図面を参照して説明してきたが、具体的な構成は上述の実施形態に限られるものではなく、本発明の基本的な特徴を逸脱しない範囲の設計等も含まれる。Although embodiments of the present invention have been described with reference to the drawings, the specific configuration is not limited to the above-described embodiments, and also includes designs that do not deviate from the basic features of the present invention.

１００・・・無線通信システム；１０１・・・基地局装置；１０２・・・端末装置；２０１・・・ＮＷ部；２０２・・・通信部；２０３・・・通信環境モデル生成部；２０４・・・環境情報生成部；２０５・・・モデル利用部；３０１・・・通信部；３０２・・・通信環境モデル生成部；３０３・・・環境情報生成部；３０４・・・モデル利用部；４０１・・・通信環境モデル；５０１・・・第１の機械学習ブロック；５０２・・・第２の機械学習ブロック；５０２’・・・第２’の機械学習ブロック；５０３・・・補助情報出力；５０４・・・ターゲット情報出力 100...Wireless communication system; 101...Base station device; 102...Terminal device; 201...NW unit; 202...Communication unit; 203...Communication environment model generation unit; 204...Environment information generation unit; 205...Model utilization unit; 301...Communication unit; 302...Communication environment model generation unit; 303...Environment information generation unit; 304...Model utilization unit; 401...Communication environment model; 501...First machine learning block; 502...Second machine learning block; 502'...Second' machine learning block; 503...Auxiliary information output; 504...Target information output

Claims

移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測する通信情報予測装置において、
前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成部と、
前記端末装置の無線通信に係る通信情報を生成する通信部と、
前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成部と、
前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用部と
を有し、
前記通信環境モデルは、
異なる前記入力情報に対して、ターゲット情報の未来の時間に対応する環境情報を、前記補助情報として出力する第１の機械学習ブロックと、
前記補助情報または前記補助情報と入力情報から、未来の時間の前記通信情報をターゲット情報として生成する第２の機械学習ブロックと
を有することを特徴とする通信情報予測装置。 1. A communication information prediction device that generates a communication environment model related to wireless communication of a moving terminal device and predicts communication information of the terminal device,
an environment information generating unit that generates environment information relating to a device environment of at least one of the terminal device and a wireless communication destination device of the terminal device;
a communication unit that generates communication information related to wireless communication of the terminal device;
a communication environment model generation unit that uses the environmental information or the environmental information and the communication information as input information to generate auxiliary information corresponding to a time of target information, and generates a communication environment model that outputs the communication information at a future time related to wireless communication of the terminal device as target information based on the auxiliary information or the auxiliary information and the input information;
a model utilization unit that predicts communication information of the terminal device by using the communication environment model ;
The communication environment model is
A first machine learning block that outputs, for each of the different input information, environmental information corresponding to a future time of the target information as the auxiliary information;
a second machine learning block that generates the communication information of a future time as target information from the auxiliary information or the auxiliary information and input information;
A communication information prediction device comprising :

請求項１に記載の通信情報予測装置において、
前記通信環境モデル生成部は、
前記入力情報、および前記補助情報を教師データとして用い、前記入力情報を前記第１の機械学習ブロックに入力した際に生成される前記補助情報の出力を、前記教師データの補助情報と比較して、その間の誤差から、より精度の高い前記補助情報を生成するように前記第１の機械学習ブロックの係数を更新する訓練を行い、かつ、
前記補助情報または前記補助情報と前記入力情報、および前記ターゲット情報を教師データとして用い、前記補助情報または前記補助情報と前記入力情報を前記第２の機械学習ブロックに入力した際に生成される前記ターゲット情報の出力を、前記教師データのターゲット情報と比較して、その間の誤差から、より精度の高い前記ターゲット情報を生成するように前記第２の機械学習ブロックの係数を更新する訓練を行って、前記通信環境モデルを生成する
ことを特徴とする通信情報予測装置。 2. The communication information prediction device according to claim 1 ,
The communication environment model generation unit
Using the input information and the auxiliary information as teacher data, the output of the auxiliary information generated when the input information is input to the first machine learning block is compared with the auxiliary information of the teacher data , and training is performed to update the coefficients of the first machine learning block so as to generate the auxiliary information with higher accuracy from the error therebetween; and
a communication information prediction device which uses the auxiliary information or the auxiliary information and the input information, and the target information as teacher data, compares the output of the target information generated when the auxiliary information or the auxiliary information and the input information are input into the second machine learning block with the target information of the teacher data, and trains to update the coefficients of the second machine learning block so as to generate the target information with higher accuracy from the error between them , thereby generating the communication environment model.

請求項２に記載の通信情報予測装置において、
前記補助情報は、前記入力情報の少なくとも一部の情報の時間的に未来に対応するものを含むこととする
ことを特徴とする通信情報予測装置。 3. The communication information prediction device according to claim 2 ,
The communication information prediction device, wherein the auxiliary information includes information corresponding to at least a portion of the input information that is temporally future.

請求項１から請求項３のいずれか一項に記載の通信情報予測装置において、
前記第２の機械学習ブロックは、
非通信情報である前記環境情報と前記通信情報とに分けて扱い、未来の時間の前記通信情報を第２の補助情報として出力する機械学習ブロックを有する
ことを特徴とする通信情報予測装置。 The communication information prediction device according to any one of claims 1 to 3 ,
The second machine learning block includes:
A communication information prediction device comprising a machine learning block that handles the environmental information, which is non-communication information, and the communication information separately, and outputs the communication information in the future as second auxiliary information.

移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測する通信情報予測方法であって、
前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成処理と、
前記端末装置の無線通信に係る通信情報を生成する通信処理と、
前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成処理と、
前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用処理と
を行い、
前記通信環境モデルは、
異なる前記入力情報に対して、ターゲット情報の未来の時間に対応する環境情報を、前記補助情報として出力する第１の機械学習ブロックと、
前記補助情報または前記補助情報と入力情報から、未来の時間の前記通信情報をターゲット情報として生成する第２の機械学習ブロックと
を有することを特徴とする通信情報予測方法。 A communication information prediction method for generating a communication environment model related to wireless communication of a moving terminal device and predicting communication information of the terminal device, comprising:
an environment information generating process for generating environment information relating to an environment of at least one of the terminal device and a wireless communication destination device of the terminal device;
A communication process for generating communication information related to wireless communication of the terminal device;
a communication environment model generation process in which auxiliary information corresponding to a time of target information is generated using the environmental information or the environmental information and the communication information as input information, and a communication environment model is generated in which the communication information at a future time related to wireless communication of the terminal device is output as target information based on the auxiliary information or the auxiliary information and the input information;
performing a model utilization process of predicting communication information of the terminal device using the communication environment model;
The communication environment model is
A first machine learning block that outputs, for each of the different input information, environmental information corresponding to a future time of the target information as the auxiliary information;
a second machine learning block that generates the communication information of a future time as target information from the auxiliary information or the auxiliary information and input information;
A communication information prediction method comprising the steps of :

請求項５に記載の通信情報予測方法において、The communication information prediction method according to claim 5,
前記通信環境モデル生成処理では、In the communication environment model generation process,
前記入力情報、および前記補助情報を教師データとして用い、前記入力情報を前記第１の機械学習ブロックに入力した際に生成される前記補助情報の出力を、前記教師データの補助情報と比較して、その間の誤差から、より精度の高い前記補助情報を生成するように前記第１の機械学習ブロックの係数を更新する訓練を行い、かつ、Using the input information and the auxiliary information as teacher data, the output of the auxiliary information generated when the input information is input to the first machine learning block is compared with the auxiliary information of the teacher data, and training is performed to update the coefficients of the first machine learning block so as to generate the auxiliary information with higher accuracy from the error therebetween; and
前記補助情報または前記補助情報と前記入力情報、および前記ターゲット情報を教師データとして用い、前記補助情報または前記補助情報と前記入力情報を前記第２の機械学習ブロックに入力した際に生成される前記ターゲット情報の出力を、前記教師データのターゲット情報と比較して、その間の誤差から、より精度の高い前記ターゲット情報を生成するように前記第２の機械学習ブロックの係数を更新する訓練を行って、前記通信環境モデルを生成するThe auxiliary information or the auxiliary information and the input information, and the target information are used as training data, and the output of the target information generated when the auxiliary information or the auxiliary information and the input information are input to the second machine learning block is compared with the target information of the training data, and training is performed to update the coefficients of the second machine learning block so as to generate the target information with higher accuracy based on the error therebetween, thereby generating the communication environment model.
ことを特徴とする通信情報予測方法。A communication information prediction method comprising:

移動する端末装置の無線通信に係る通信環境モデルを生成し、前記端末装置の通信情報を予測するコンピュータで実行可能な通信情報予測プログラムであって、
前記端末装置および前記端末装置の無線通信先装置の少なくとも一方の装置環境に係る環境情報を生成する環境情報生成処理と、
前記端末装置の無線通信に係る通信情報を生成する通信処理と、
前記環境情報、または前記環境情報および前記通信情報を入力情報として、ターゲット情報の時間に対応する補助情報を生成して、前記補助情報または前記補助情報と入力情報に基づいて前記端末装置の無線通信に係る未来の時間の前記通信情報をターゲット情報として出力する通信環境モデルを生成する通信環境モデル生成処理と、
前記通信環境モデルを用いて、前記端末装置の通信情報を予測するモデル利用処理と
をコンピュータで実行し、
前記通信環境モデルは、
異なる前記入力情報に対して、ターゲット情報の未来の時間に対応する環境情報を、前記補助情報として出力する第１の機械学習ブロックと、
前記補助情報または前記補助情報と入力情報から、未来の時間の前記通信情報をターゲット情報として生成する第２の機械学習ブロックと
を有することを特徴とする通信情報予測プログラム。 A communication information prediction program executable by a computer to generate a communication environment model related to wireless communication of a moving terminal device and predict communication information of the terminal device, comprising:
an environment information generating process for generating environment information relating to an environment of at least one of the terminal device and a wireless communication destination device of the terminal device;
A communication process for generating communication information related to wireless communication of the terminal device;
a communication environment model generation process for generating auxiliary information corresponding to a time of target information using the environmental information or the environmental information and the communication information as input information, and generating a communication environment model that outputs the communication information at a future time related to wireless communication of the terminal device as target information based on the auxiliary information or the auxiliary information and the input information;
A model utilization process for predicting communication information of the terminal device by using the communication environment model is executed by a computer ;
The communication environment model is
A first machine learning block that outputs, for each of the different input information, environmental information corresponding to a future time of the target information as the auxiliary information;
a second machine learning block that generates the communication information of a future time as target information from the auxiliary information or the auxiliary information and input information;
A communication information prediction program comprising :

請求項７に記載の通信情報予測プログラムにおいて、
前記通信環境モデル生成処理では、
前記入力情報、および前記補助情報を教師データとして用い、前記入力情報を前記第１の機械学習ブロックに入力した際に生成される前記補助情報の出力を、前記教師データの補助情報と比較して、その間の誤差から、より精度の高い前記補助情報を生成するように前記第１の機械学習ブロックの係数を更新する訓練を行い、かつ、
前記補助情報または前記補助情報と前記入力情報、および前記ターゲット情報を教師データとして用い、前記補助情報または前記補助情報と前記入力情報を前記第２の機械学習ブロックに入力した際に生成される前記ターゲット情報の出力を、前記教師データのターゲット情報と比較して、その間の誤差から、より精度の高い前記ターゲット情報を生成するように前記第２の機械学習ブロックの係数を更新する訓練を行って、前記通信環境モデルを生成する
ことを特徴とする通信情報予測プログラム。 8. The communication information prediction program according to claim 7 ,
In the communication environment model generation process,
Using the input information and the auxiliary information as teacher data, the output of the auxiliary information generated when the input information is input to the first machine learning block is compared with the auxiliary information of the teacher data, and training is performed to update the coefficients of the first machine learning block so as to generate the auxiliary information with higher accuracy from the error therebetween; and
a communication information prediction program which uses the auxiliary information or the auxiliary information and the input information, and the target information as teacher data, compares the output of the target information generated when the auxiliary information or the auxiliary information and the input information are input into the second machine learning block with the target information of the teacher data, and trains the second machine learning block to update its coefficients so as to generate more accurate target information from the error therebetween, thereby generating the communication environment model.