WO2024128090A1

WO2024128090A1 - Information processing device, control system, search method, and search program

Info

Publication number: WO2024128090A1
Application number: PCT/JP2023/043618
Authority: WO
Inventors: 翔太林; 俊平野原; 裕司白石
Original assignee: 日立造船株式会社
Priority date: 2022-12-12
Filing date: 2023-12-06
Publication date: 2024-06-20

Abstract

The objective of the present invention is to set a parameter relating to inference and control appropriately regardless of the technical skill of an individual. An information processing device (3) comprises: a predicted distribution calculating unit (303) for calculating a predicted distribution of a function representing a relationship between a parameter of a control system (7) that controls a controlled object (5) in accordance with the result of an inference obtained by an inference model, and an operating state of the controlled object (5) in a period in which the parameter is applied; and a search unit (304) for searching for a candidate of an optimal value of the parameter on the basis of the predicted distribution.

Description

情報処理装置、制御システム、探索方法、および探索プログラムInformation processing device, control system, search method, and search program

　本発明は、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおけるパラメータの設定に関する。 The present invention relates to setting parameters in a control system that controls a specified control object according to the results of inference using an inference model.

　機械学習モデルにより推論を行い、その推論の結果に基づいて制御対象を制御する技術が従来から知られている。例えば、下記の特許文献１には、ごみの焼却を行うと共に、焼却時に発生する熱を利用して発電を行うごみ焼却プラント設備において、機械学習したニューラルネットワークモデルを用いて所定時間後の発生蒸気量を予測し、その予測結果に基づいて運転制御を行うことが記載されている。 Technology has been known for some time that uses a machine learning model to make inferences and control a control target based on the results of the inferences. For example, the following Patent Document 1 describes a waste incineration plant that incinerates waste and generates electricity using the heat generated during incineration, where a machine-learned neural network model is used to predict the amount of steam generated after a specified time, and operation is controlled based on the prediction results.

日本国特開２００５－２４９３４９号公報Japanese Patent Publication No. 2005-249349

　上述のような従来技術では、例えば、発生蒸気量の低下が予測された場合に行われる、発生蒸気量の低下を回避するための運転制御の内容についてはオペレータが判断する必要がある。このため、発生蒸気量の低下を避けることができるか否かが、オペレータの運転技術等の個人の技能に左右されるという問題がある。 In the conventional technology described above, for example, when a decrease in the amount of steam generated is predicted, the content of the operational control to avoid the decrease in the amount of steam generated must be determined by the operator. This causes the problem that whether or not a decrease in the amount of steam generated can be avoided depends on the individual skills of the operator, such as their driving technique.

　また、ごみ焼却プラント設備を稼働させている期間中は、その制御システムも常に稼働させておく必要があり、その制御システムの制御内容はごみ焼却プラント設備の稼働状態に直接的な影響を与える。このため、幾度もトライアンドエラーを繰り返し、その結果に基づいて適切な運転制御の内容を自動で決定するような構成を採用することは難しく、上記のとおり個人の技能に頼らざるを得なかった。 In addition, while the waste incineration plant equipment is in operation, the control system must also be running at all times, and the control content of the control system has a direct impact on the operating state of the waste incineration plant equipment. For this reason, it is difficult to adopt a configuration that automatically determines the appropriate operational control content based on the results of repeated trial and error, and as mentioned above, it has been necessary to rely on individual skill.

　このような問題は、プラントで使用される制御システムに限られず、任意の推論モデルによる推論の結果に応じて任意の制御対象に対する制御を行う制御システムにおいて共通して生じる問題である。また、このような問題は、運転制御の内容を決定する場合に限られず、推論や制御に関する様々なパラメータの設定において共通して生じる問題点である。 This type of problem is not limited to control systems used in plants, but is a common problem that arises in control systems that control any control object according to the results of inference using an arbitrary inference model. Furthermore, this type of problem is not limited to determining the content of operational control, but is a common problem that arises when setting various parameters related to inference and control.

　本発明の一態様は、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおいて、個人の技能に左右されることなく、推論や制御に関するパラメータを適切に設定することを可能にする情報処理装置等を提供することを目的とする。 One aspect of the present invention aims to provide an information processing device or the like that enables appropriate setting of parameters related to inference and control, regardless of the skills of individuals, in a control system that controls a specified control target according to the results of inference using an inference model.

　上記の課題を解決するために、本発明の一態様に係る情報処理装置は、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出する予測分布算出部と、前記予測分布に基づいて前記パラメータの最適値の候補を探索する探索部と、を備える。 In order to solve the above problem, an information processing device according to one embodiment of the present invention includes a predictive distribution calculation unit that calculates a predictive distribution of a function indicating the relationship between parameters related to at least one of the inference and the control in a control system that controls a specific control object according to the result of inference by an inference model and the operating state of the control object during a period in which the parameters are applied, and a search unit that searches for candidates for the optimal value of the parameter based on the predictive distribution.

　また、本発明の一態様に係る探索方法は、少なくとも１つの情報処理装置が実行する探索方法であって、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出するステップと、前記予測分布に基づいて前記パラメータの最適値の候補を探索するステップと、を含む。 A search method according to one aspect of the present invention is a search method executed by at least one information processing device, and includes the steps of: calculating a predictive distribution of a function indicating a relationship between a parameter relating to at least one of an inference and control in a control system that controls a specified control object according to the result of inference by an inference model, and the operating state of the control object during a period in which the parameter is applied; and searching for a candidate optimal value for the parameter based on the predictive distribution.

　本発明の一態様によれば、個人の技能に左右されることなく、推論や制御に関するパラメータを適切に設定することが可能になる。 According to one aspect of the present invention, it becomes possible to appropriately set parameters related to inference and control, regardless of the skill of the individual.

本発明の一実施形態に係る制御システムの概要を示す図である。FIG. 1 is a diagram showing an overview of a control system according to an embodiment of the present invention. 本発明の一実施形態に係る情報処理装置（推論モデルの適合性を示す指標値を算出する装置）の要部構成の一例を示すブロック図である。A block diagram showing an example of the main configuration of an information processing device (a device that calculates an index value indicating the suitability of an inference model) according to one embodiment of the present invention. 本発明の一実施形態に係る情報処理装置（パラメータの最適値を探索する装置）の要部構成の一例を示すブロック図である。1 is a block diagram showing an example of a configuration of a main part of an information processing device (a device that searches for optimal parameter values) according to an embodiment of the present invention; 閾値の決定方法と入力データが外れ値であるか否かの判定方法を説明する図である。10A and 10B are diagrams illustrating a method for determining a threshold value and a method for determining whether or not input data is an outlier. 廃棄物を焼却することにより発生した熱を利用して発電を行う焼却発電プラントにおいて、算出した指標値と再現率の遷移を示す図である。FIG. 1 is a diagram showing the transition of the calculated index value and the recurrence rate in an incineration power plant that generates power by utilizing heat generated by incinerating waste. 探索結果の表示画面の例を示す図である。FIG. 13 is a diagram showing an example of a display screen of a search result. 上記情報処理装置（推論モデルの適合性を示す指標値を算出する装置）が実行する処理の一例を示すフローチャートである。A flowchart showing an example of processing performed by the information processing device (a device that calculates an index value indicating the suitability of an inference model). 上記情報処理装置（パラメータの最適値を探索する装置）が実行する処理の一例を示すフローチャートである。4 is a flowchart showing an example of a process executed by the information processing device (a device that searches for optimal values of parameters).

　〔システム概要〕
　図１は、本発明の一実施形態に係る制御システム７の概要を示す図である。図示のように、制御システム７には、学習装置１と、情報処理装置２と、情報処理装置３と、制御装置４と、制御対象５とが含まれている。制御システム７は、制御装置４により制御対象５の動作を制御するシステムであり、制御対象５に対する制御は、推論モデルによる推論の結果に基づいて行われる。 [System Overview]
1 is a diagram showing an overview of a control system 7 according to an embodiment of the present invention. As shown in the figure, the control system 7 includes a learning device 1, an information processing device 2, an information processing device 3, a control device 4, and a control target 5. The control system 7 is a system that controls the operation of the control target 5 by the control device 4, and the control of the control target 5 is performed based on the result of inference using an inference model.

　制御対象５は、制御装置４の制御対象であり機器５１を含む。また、制御対象５には計測装置５２が取り付けられている。計測装置５２は、機器５１に関するデータの計測を行うものであり、機器５１や計測したいデータに応じたものを適用すればよい。例えば、機器５１の内部あるいは外部の温度を計測したい場合には、計測装置５２として温度センサを用いればよい。 The controlled object 5 is an object controlled by the control device 4 and includes equipment 51. A measuring device 52 is also attached to the controlled object 5. The measuring device 52 measures data related to the equipment 51, and may be adapted to suit the equipment 51 and the data to be measured. For example, if it is desired to measure the internal or external temperature of the equipment 51, a temperature sensor may be used as the measuring device 52.

　制御対象に含まれる機器５１および計測装置５２は、各１つであってもよいし、複数であってもよい。例えば、制御対象５が１つのプラントである場合、機器５１はプラント内の機器である。この場合、計測装置５２は、機器５１に取り付けられていてもよいし、プラント内の他の場所に取り付けられてもよく、プラント外に設置されていてもよい。 The controlled object may include one of each of the equipment 51 and the measuring device 52, or multiple of each. For example, if the controlled object 5 is a plant, the equipment 51 is an equipment within the plant. In this case, the measuring device 52 may be attached to the equipment 51, or may be attached elsewhere within the plant, or may be installed outside the plant.

　学習装置１は、制御対象５に対する制御の内容を決定するために用いられる推論モデルを生成する。推論モデルは、訓練データを用いた機械学習により生成されるから機械学習モデルと呼ぶこともできる。また、学習装置１は、推論モデルの再学習を行い、当該推論モデルを更新する。なお、機械学習のアルゴリズムは任意であり、使用する訓練データや推論内容等に応じて適当なアルゴリズムを適用すればよい。 The learning device 1 generates an inference model used to determine the content of control for the control target 5. The inference model can also be called a machine learning model because it is generated by machine learning using training data. The learning device 1 also re-learns the inference model and updates the inference model. Note that any machine learning algorithm can be used, and an appropriate algorithm can be applied depending on the training data used, the inference content, etc.

　推論モデルは、制御対象５に対する制御の内容を決定する指針となるような事項についての推論を行うものであればよい。例えば、推論モデルは、制御対象５の将来の稼働状態が正常であるか否かを予測するものであってもよい。また、例えば、推論モデルは、将来の計測装置５２の測定値を予測するものであってもよいし、制御対象５に対する最適な制御の内容を予測するものであってもよい。 The inference model may be one that performs inference on matters that serve as guidelines for determining the content of control for the control object 5. For example, the inference model may predict whether the future operating state of the control object 5 will be normal or not. Also, for example, the inference model may predict future measurement values of the measuring device 52, or may predict the optimal content of control for the control object 5.

　情報処理装置２は、学習装置１が生成する推論モデルによる推論の際に当該推論モデルに入力された入力データに対する当該推論モデルの適合性を示す指標値を算出する。詳細は後述するが、情報処理装置２は、上記指標値の算出にあたり、推論モデルの生成に用いた訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を用いて、推論モデルに入力された複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する。そして、情報処理装置２は、その判定結果に基づいて上記指標値を算出する。これにより、入力データに対する推論モデルの適合性を的確に判定することが可能になる。この指標値は、学習装置１に推論モデルを更新させるタイミングを決定するために使用される。 The information processing device 2 calculates an index value indicating the suitability of the inference model for input data input to the inference model generated by the learning device 1 when making an inference using the inference model. Details will be described later, but in calculating the index value, the information processing device 2 determines whether or not each of the multiple input data input to the inference model is an outlier, using a threshold value for determining whether or not training data included in the training dataset used to generate the inference model is an outlier. The information processing device 2 then calculates the index value based on the determination result. This makes it possible to accurately determine the suitability of the inference model for the input data. This index value is used to determine the timing for causing the learning device 1 to update the inference model.

　情報処理装置３は、制御対象５の制御に関するパラメータの最適値の探索を行う。また、情報処理装置３は、制御対象５の制御に関するパラメータの代わりに、あるいは当該パラメータに加えて、推論モデルによる推論に関するパラメータの最適値の探索を行ってもよい。この探索により検出されたパラメータは制御装置４に提供され、制御装置４はこのパラメータを適用して制御対象５に対する制御を行う。 The information processing device 3 searches for optimal values of parameters related to the control of the control object 5. The information processing device 3 may also search for optimal values of parameters related to inference by an inference model instead of or in addition to the parameters related to the control of the control object 5. The parameters detected by this search are provided to the control device 4, and the control device 4 applies these parameters to control the control object 5.

　詳細は後述するが、情報処理装置３は、推論モデルによる推論および該推論の結果に応じた制御の少なくとも何れかに関する上述のパラメータと、当該制御後の制御対象５の稼働状態との関係を示す関数の予測分布を算出する。そして、情報処理装置３は、算出した予測分布に基づいてパラメータの最適値の候補を探索する。これにより、オペレータの技能に左右されることなくパラメータを適切に設定することが可能になる。 Although details will be described later, the information processing device 3 calculates a predictive distribution of a function that indicates the relationship between the above-mentioned parameters relating to at least one of the inference based on the inference model and the control according to the result of the inference, and the operating state of the control target 5 after the control.The information processing device 3 then searches for candidates for the optimal value of the parameters based on the calculated predictive distribution.This makes it possible to appropriately set the parameters without being influenced by the skill of the operator.

　制御装置４は、上述の推論モデルを用いて推論を行うと共に、当該推論の結果に基づいて制御対象５に対する制御内容を決定する。具体的には、制御装置４は、データ取得部４１と、推論部４２と、制御内容決定部４３とを備えており、これらの各部により上述の機能が実現される。 The control device 4 performs inference using the above-mentioned inference model, and determines the control content for the control target 5 based on the result of the inference. Specifically, the control device 4 includes a data acquisition unit 41, an inference unit 42, and a control content determination unit 43, and the above-mentioned functions are realized by each of these units.

　データ取得部４１は、推論に必要なデータを取得する。例えば、データ取得部４１は、計測装置５２により計測された計測データを取得してもよい。データ取得部４１が取得したデータは、そのままあるいは所定のデータ処理を施した上で、推論モデルの入力データとされる。そして、推論部４２は、上述の推論モデルに上述の入力データを入力し、出力データを得る。 The data acquisition unit 41 acquires data necessary for inference. For example, the data acquisition unit 41 may acquire measurement data measured by the measuring device 52. The data acquired by the data acquisition unit 41 is used as input data for the inference model either directly or after undergoing a specified data processing. The inference unit 42 then inputs the above-mentioned input data to the above-mentioned inference model to obtain output data.

　制御内容決定部４３は、推論部４２の推論の結果、すなわち上述の出力データに応じた制御内容を決定する。例えば、推論モデルが制御対象５の将来の稼働状態が正常であるか否かを予測するものである場合、制御内容決定部４３は、稼働状態が正常である可能性が高いことを示す出力データが出力されたときには制御内容を従前の内容から変更しないようにしてもよい。一方、制御内容決定部４３は、稼働状態が正常ではない可能性が高いことを示す出力データが出力されたときには、稼働状態を正常にするための制御内容を決定すればよい。制御内容は、情報処理装置３が検出したパラメータに基づいて決定される。 The control content determination unit 43 determines the control content according to the result of inference by the inference unit 42, i.e., the output data described above. For example, if the inference model predicts whether or not the future operating state of the control target 5 will be normal, the control content determination unit 43 may not change the control content from the previous content when output data indicating that the operating state is likely to be normal is output. On the other hand, when output data indicating that the operating state is likely to be abnormal is output, the control content determination unit 43 may determine the control content for normalizing the operating state. The control content is determined based on parameters detected by the information processing device 3.

　なお、学習装置１が推論モデルを更新したときには、データ取得部４１は更新後の推論モデルを取得し、推論部４２は更新後の推論モデルに入力データを入力することにより出力データを得て、制御内容決定部４３は当該出力データに基づいて制御対象５に対する制御内容を決定する。 When the learning device 1 updates the inference model, the data acquisition unit 41 acquires the updated inference model, the inference unit 42 inputs input data to the updated inference model to obtain output data, and the control content determination unit 43 determines the control content for the control target 5 based on the output data.

　以上のように、制御システム７は、訓練データが外れ値であるか否かを判定するための閾値を用いて推論モデルに入力された入力データが外れ値であるか否かを判定し、その判定結果に基づいて、当該入力データに対する推論モデルの適合性を示す指標値を算出する情報処理装置２と、算出された指標値に基づいて決定されたタイミングで推論モデルを更新する学習装置１と、更新後の推論モデルに入力データを入力することにより得られた出力データに基づいて制御対象５に対する制御内容を決定する制御装置４とを含む。このようにして推論モデルが更新されると、情報処理装置２は、更新後の推論モデルに入力された入力データを取得して、当該入力データに対する推論モデルの適合性を示す指標値を算出する。 As described above, the control system 7 includes an information processing device 2 that determines whether input data input to the inference model is an outlier using a threshold value for determining whether training data is an outlier, and calculates an index value indicating the suitability of the inference model for the input data based on the determination result, a learning device 1 that updates the inference model at a timing determined based on the calculated index value, and a control device 4 that determines the control content for the control target 5 based on output data obtained by inputting the input data to the updated inference model. When the inference model is updated in this manner, the information processing device 2 obtains the input data input to the updated inference model, and calculates an index value indicating the suitability of the inference model for the input data.

　このように、制御システム７では、推論モデルの再学習および更新と、推論モデルと入力データとの適合性を示す指標値の算出とを繰り返し行うことができる。これにより、適切なタイミングで再学習を行い、制御対象５に対する制御内容が妥当である状態を維持することが可能になる。 In this way, the control system 7 can repeatedly re-learn and update the inference model and calculate an index value that indicates the compatibility between the inference model and the input data. This makes it possible to perform re-learning at an appropriate time and maintain the appropriateness of the control content for the control object 5.

　また、以上のように、制御システム７は、あるパラメータが適用されている期間における、当該パラメータと制御対象５の稼働状態との関係を示す関数の予測分布を算出し、算出した予測分布に基づいて上記パラメータの最適値の候補を探索する情報処理装置３と、検出された最適値の候補を適用して制御対象５に対する制御を行う制御装置４と、を含む。なお、あるパラメータとは、制御対象５の制御と、推論モデルによる推論との少なくとも何れかに関するパラメータである。 As described above, the control system 7 includes an information processing device 3 that calculates a predictive distribution of a function that indicates the relationship between a certain parameter and the operating state of the control object 5 during a period in which the parameter is applied, and searches for a candidate optimal value for the parameter based on the calculated predictive distribution, and a control device 4 that applies the detected candidate optimal value to control the control object 5. Note that the certain parameter is a parameter related to at least either the control of the control object 5 or inference using an inference model.

　そして、情報処理装置３は、検出された候補が適用されている期間における稼働状態を示す結果データを取得し、当該結果データに基づいて予測分布を更新すると共に、更新後の予測分布に基づいてパラメータの最適値の新たな候補を探索する。これにより、制御システム７では、制御対象５に対する制御を行いながら、パラメータを最適値に近付けていくことができる。 Then, the information processing device 3 acquires result data indicating the operating state during the period in which the detected candidate is applied, updates the predictive distribution based on the result data, and searches for new candidates for the optimal value of the parameter based on the updated predictive distribution. This allows the control system 7 to bring the parameter closer to the optimal value while controlling the control target 5.

　〔情報処理装置２の構成〕
　図２は、情報処理装置２の要部構成の一例を示すブロック図である。図示のように、情報処理装置２は、情報処理装置２の各部を統括して制御する制御部２０と、情報処理装置２が使用する各種データを記憶する記憶部２１を備えている。また、情報処理装置２は、情報処理装置２が他の装置と通信するための通信部２２、情報処理装置２に対する各種データの入力を受け付ける入力部２３、および情報処理装置２が各種データを出力するための出力部２４を備えている。 [Configuration of information processing device 2]
2 is a block diagram showing an example of a main configuration of the information processing device 2. As shown in the figure, the information processing device 2 includes a control unit 20 that controls each unit of the information processing device 2, and a storage unit 21 that stores various data used by the information processing device 2. The information processing device 2 also includes a communication unit 22 that allows the information processing device 2 to communicate with other devices, an input unit 23 that accepts input of various data to the information processing device 2, and an output unit 24 that allows the information processing device 2 to output various data.

　また、制御部２０には、データ取得部２０１、平均距離算出部２０２、閾値決定部２０３、外れ値判定部２０４、指標値算出部２０５、再学習要否判定部２０６、再現率算出部２０７、および学習用データ抽出部２０８が含まれている。 The control unit 20 also includes a data acquisition unit 201, an average distance calculation unit 202, a threshold determination unit 203, an outlier determination unit 204, an index value calculation unit 205, a relearning necessity determination unit 206, a recall calculation unit 207, and a learning data extraction unit 208.

　データ取得部２０１は、情報処理装置２で使用する各種データを取得する。例えば、データ取得部２０１は、推論モデルの生成に用いた複数の訓練データを含む訓練データセットや、推論のために推論モデルに入力された入力データを取得する。例えば、データ取得部２０１は、学習装置１から訓練データセットを取得し、制御装置４から入力データを取得してもよい。 The data acquisition unit 201 acquires various data used by the information processing device 2. For example, the data acquisition unit 201 acquires a training dataset including multiple training data used to generate an inference model, and input data input to the inference model for inference. For example, the data acquisition unit 201 may acquire a training dataset from the learning device 1 and acquire input data from the control device 4.

　平均距離算出部２０２は、データ間の平均距離を算出する。具体的には、平均距離算出部２０２は、データ取得部２０１が取得する訓練データセットに含まれる訓練データの１つと、当該訓練データからの距離が最も近い所定数の訓練データのそれぞれとの距離の平均値を算出する処理を、各訓練データについて行う。また、平均距離算出部２０２は、データ取得部２０１が取得する複数の入力データのうちの１つと、当該入力データからの距離が最も近い所定数の他の入力データのそれぞれとの距離の平均値を算出する処理を、各入力データについて行う。 The average distance calculation unit 202 calculates the average distance between data. Specifically, the average distance calculation unit 202 performs a process for each training data to calculate the average value of the distance between one of the training data included in the training data set acquired by the data acquisition unit 201 and each of a predetermined number of training data that are closest to the training data. In addition, the average distance calculation unit 202 performs a process for each input data to calculate the average value of the distance between one of the multiple input data acquired by the data acquisition unit 201 and each of a predetermined number of other input data that are closest to the input data.

　閾値決定部２０３は、データ取得部２０１が取得する訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を決定する。より詳細には、閾値決定部２０３は、訓練データセットに含まれる各訓練データについて、当該訓練データが他の訓練データに対して乖離している程度を示す乖離度を算出し、算出した乖離度のうち所定の順位の乖離度を閾値に決定する。上記乖離度は、例えば、平均距離算出部２０２が各訓練データについて算出する平均値であってもよい。 The threshold determination unit 203 determines a threshold for determining whether or not training data included in the training data set acquired by the data acquisition unit 201 is an outlier. More specifically, the threshold determination unit 203 calculates a deviation indicating the degree to which the training data included in the training data set deviates from other training data, for each training data included in the training data set, and determines a deviation of a predetermined rank from among the calculated deviations as the threshold. The deviation may be, for example, an average value calculated by the average distance calculation unit 202 for each training data.

　外れ値判定部２０４は、データ取得部２０１が取得する複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する。この判定には、閾値決定部２０３が決定する閾値が用いられる。つまり、外れ値判定部２０４は、閾値決定部２０３が決定する閾値を基準として入力データが外れ値であるか否かを判定する。 The outlier determination unit 204 determines whether each of the multiple input data acquired by the data acquisition unit 201 is an outlier. This determination is made using a threshold determined by the threshold determination unit 203. In other words, the outlier determination unit 204 determines whether the input data is an outlier based on the threshold determined by the threshold determination unit 203.

　指標値算出部２０５は、外れ値判定部２０４の判定結果に基づき、入力データに対する推論モデルの適合性を示す指標値を算出する。指標値については後記〔指標値の例〕の項目で説明する。 The index value calculation unit 205 calculates an index value that indicates the suitability of the inference model for the input data based on the judgment result of the outlier judgment unit 204. The index value will be explained in the section [Examples of Index Values] below.

　再学習要否判定部２０６は、指標値算出部２０５が算出する指標値に基づいて推論モデルの再学習の要否を判定する。そして、再学習要否判定部２０６は、推論モデルの再学習が必要と判定したときには、学習装置１に指示して推論モデルの再学習を行わせる。つまり、再学習要否判定部２０６が、再学習が必要と判定したタイミングが、再学習を行うべきタイミングということになり、再学習要否判定部２０６は再学習のタイミングを決定しているともいえる。 The relearning necessity determination unit 206 determines whether or not the inference model needs to be re-learned based on the index value calculated by the index value calculation unit 205. When the relearning necessity determination unit 206 determines that the inference model needs to be re-learned, it instructs the learning device 1 to re-learn the inference model. In other words, the timing at which the relearning necessity determination unit 206 determines that re-learning is necessary is the timing at which re-learning should be performed, and it can be said that the relearning necessity determination unit 206 determines the timing of re-learning.

　再現率算出部２０７は、推論モデルに複数の入力データを入力して行った各推論における再現率を算出する。再現率は、真の値が正事例のものの中で、正事例と予測した割合を示す指標である。例えば、状態が正常であるか異常であるかを予測する推論モデルの場合、再現率は、（異常と正しく判定できた件数）／{（異常と正しく判定できた件数）＋（異常を正常と誤判定した件数）}との式で算出することができる。つまり、再現率算出部２０７は、異常と正しく判定できた件数を、異常と正しく判定できた件数と異常を正常と誤判定した件数との和で除することにより、当該判定における再現率を算出する。 The recall calculation unit 207 calculates the recall for each inference made by inputting multiple pieces of input data into the inference model. The recall is an index showing the proportion of cases where the true value is predicted as a positive case among positive cases. For example, in the case of an inference model that predicts whether a state is normal or abnormal, the recall can be calculated by the formula (number of cases correctly judged as abnormal) / {(number of cases correctly judged as abnormal) + (number of cases where an abnormality was erroneously judged as normal)}. In other words, the recall calculation unit 207 calculates the recall for the judgment by dividing the number of cases correctly judged as abnormal by the sum of the number of cases correctly judged as abnormal and the number of cases where an abnormality was erroneously judged as normal.

　学習用データ抽出部２０８は、指標値算出部２０５が算出する指標値に基づいて、データ取得部２０１が取得する複数の入力データの中から推論モデルの再学習に用いるものを抽出する。上述のように、指標値算出部２０５が算出する指標値は、外れ値判定部２０４の判定結果に基づいて算出されるから、学習用データ抽出部２０８は、外れ値判定部２０４の判定結果に基づいて再学習に用いる入力データを抽出しているともいえる。 The learning data extraction unit 208 extracts input data to be used for re-learning the inference model from among the multiple input data acquired by the data acquisition unit 201, based on the index value calculated by the index value calculation unit 205. As described above, the index value calculated by the index value calculation unit 205 is calculated based on the judgment result of the outlier determination unit 204, so it can be said that the learning data extraction unit 208 extracts input data to be used for re-learning based on the judgment result of the outlier determination unit 204.

　また、学習用データ抽出部２０８は、再現率算出部２０７が算出する再現率についても考慮して再学習に用いる入力データを抽出してもよい。学習用データ抽出部２０８は、抽出した入力データを、例えば再学習要否判定部２０６が推論モデルの再学習が必要と判定したときに、学習装置１に送信するようにしてもよい。 The learning data extraction unit 208 may also extract input data to be used for re-learning, taking into consideration the recall calculated by the recall calculation unit 207. The learning data extraction unit 208 may transmit the extracted input data to the learning device 1, for example, when the re-learning necessity determination unit 206 determines that re-learning of the inference model is necessary.

　以上のように、情報処理装置２は、推論モデルの生成に用いた訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を用いて、推論モデルに入力された複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する外れ値判定部２０４と、外れ値判定部２０４の判定結果に基づき、入力データに対する推論モデルの適合性を示す指標値を算出する指標値算出部２０５と、を備える。 As described above, the information processing device 2 includes an outlier determination unit 204 that determines whether each of a plurality of input data input to the inference model is an outlier using a threshold value for determining whether the training data included in the training dataset used to generate the inference model is an outlier, and an index value calculation unit 205 that calculates an index value indicating the suitability of the inference model for the input data based on the determination result of the outlier determination unit 204.

　上記の構成によれば、訓練データが外れ値であるか否かを判定するための閾値を用いて推論モデルに入力された入力データが外れ値であるか否かを判定している。この処理では、仮に訓練データセットに含まれていたとすると外れ値と判定されるような入力データ、つまり学習済みの範囲から乖離した入力データが外れ値と判定される。 With the above configuration, a threshold for determining whether training data is an outlier is used to determine whether input data input to the inference model is an outlier. In this process, input data that would be determined to be an outlier if it were included in the training data set, that is, input data that deviates from the learned range, is determined to be an outlier.

　そして、学習済みの範囲から乖離した入力データの数や割合が増えたとき、入力データに対する推論モデルの適合性は低くなっているといえる。また、このとき、推論モデルの再学習や交換の必要性は高くなっているといえる。よって、上記の判定結果に基づいて指標値を算出する上記の構成によれば妥当な指標値を算出することができる。 When the number or proportion of input data that deviates from the learned range increases, it can be said that the suitability of the inference model for the input data is decreasing. In addition, at this time, it can be said that there is a high need to re-learn or replace the inference model. Therefore, the above configuration, which calculates an index value based on the above judgment result, makes it possible to calculate a valid index value.

　また、この指標値の算出は、推論モデルによる推論結果を用いることなく行うことができる。このため、上記の構成によれば、推論結果の正否の検証が難しい場合にも妥当な指標値を算出すること、つまり推論に用いた入力データに対する推論モデルの適合性を的確に判定することが可能になる。そして、この指標値を用いることにより、推論モデルの更新に関する各種処理を適切に行うことも可能になる。 Furthermore, this index value can be calculated without using the inference results from the inference model. Therefore, with the above configuration, it is possible to calculate a valid index value even when it is difficult to verify the correctness of the inference result, that is, to accurately determine the suitability of the inference model for the input data used in the inference. Furthermore, by using this index value, it is also possible to appropriately perform various processes related to updating the inference model.

　また、以上のように、情報処理装置２は、推論モデルの生成に用いた訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を用いて、推論モデルに入力された複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する外れ値判定部２０４と、外れ値判定部２０４の判定結果に基づき、推論モデルに入力された複数の入力データの中から推論モデルの再学習に用いるものを抽出する学習用データ抽出部２０８とを備えている。 As described above, the information processing device 2 is equipped with an outlier determination unit 204 that determines whether each of the multiple input data input to the inference model is an outlier using a threshold value for determining whether the training data included in the training dataset used to generate the inference model is an outlier, and a learning data extraction unit 208 that extracts data to be used for re-learning the inference model from the multiple input data input to the inference model based on the determination result of the outlier determination unit 204.

　上述のように、学習済みの範囲から乖離した入力データの数や割合が増えたとき、入力データに対する推論モデルの適合性は低くなっているといえる。また、このとき、推論モデルの再学習や交換の必要性は高くなっているといえる。よって、外れ値判定部２０４の判定結果に基づいて推論モデルの再学習に用いる入力データを抽出する上記の構成によれば、再学習用のデータとして妥当なものを抽出することができる。 As described above, when the number or proportion of input data that deviates from the learned range increases, the suitability of the inference model for the input data is decreasing. In addition, at this time, the need to re-learn or replace the inference model is increasing. Therefore, with the above configuration that extracts input data to be used for re-learning the inference model based on the judgment result of the outlier judgment unit 204, it is possible to extract data that is appropriate for re-learning.

　なお、学習用データ抽出部２０８は、指標値算出部２０５が算出する指標値を用いて入力データを抽出してもよいし、当該指標値は用いず、外れ値判定部２０４の判定結果を用いて入力データを抽出してもよい。つまり、再学習に用いる入力データを抽出するにあたり、指標値算出部２０５を備えていることは必須ではない。 The learning data extraction unit 208 may extract input data using the index value calculated by the index value calculation unit 205, or may extract input data using the determination result of the outlier determination unit 204 without using the index value. In other words, it is not essential to have the index value calculation unit 205 when extracting input data to be used for re-learning.

　また、上記の構成によれば、再学習に用いる入力データを抽出するにあたり、推論モデルによる推論結果を用いる必要がない。このため、上記の構成によれば、推論結果の正否の検証が難しい場合にも、推論モデルの更新に関する処理である再学習用のデータを抽出する処理を適切に行うことが可能になる。 Furthermore, with the above configuration, when extracting input data to be used for re-learning, it is not necessary to use the inference results from the inference model. Therefore, with the above configuration, even when it is difficult to verify the accuracy of the inference results, it is possible to appropriately perform the process of extracting data for re-learning, which is a process related to updating the inference model.

　〔情報処理装置３の構成〕
　図３は、情報処理装置３の要部構成の一例を示すブロック図である。図示のように、情報処理装置３は、情報処理装置２と同様に、制御部３０、記憶部３１、通信部３２、入力部３３、および出力部３４を備えている。情報処理装置３の制御部３０には、データ取得部３０１、評価値算出部３０２、予測分布算出部３０３、探索部３０４、最適化制御部３０５、および表示制御部３０６が含まれている。 [Configuration of information processing device 3]
3 is a block diagram showing an example of a main configuration of the information processing device 3. As shown in the figure, the information processing device 3 includes a control unit 30, a storage unit 31, a communication unit 32, an input unit 33, and an output unit 34, similar to the information processing device 2. The control unit 30 of the information processing device 3 includes a data acquisition unit 301, an evaluation value calculation unit 302, a prediction distribution calculation unit 303, a search unit 304, an optimization control unit 305, and a display control unit 306.

　データ取得部３０１は、情報処理装置３において使用される各種データを取得する。具体的には、データ取得部３０１は、情報処理装置３（より正確には探索部３０４）が検出したパラメータを適用した制御が行われた期間における制御対象５の稼働状態を示す結果データを取得する。例えば、データ取得部３０１は、当該期間における計測装置５２の計測データを結果データとして取得してもよい。 The data acquisition unit 301 acquires various data used in the information processing device 3. Specifically, the data acquisition unit 301 acquires result data indicating the operating state of the control target 5 during the period in which control was performed using parameters detected by the information processing device 3 (more precisely, the search unit 304). For example, the data acquisition unit 301 may acquire measurement data from the measuring device 52 during that period as result data.

　評価値算出部３０２は、データ取得部３０１が取得する結果データを用いて、制御対象５の稼働状態を評価した評価値を算出する。評価値は、制御対象５の稼働状態の良し悪しを表すものであればよい。なお、この評価値は、適用したパラメータの良し悪しを表しているともいえる。 The evaluation value calculation unit 302 calculates an evaluation value that evaluates the operating state of the control target 5 using the result data acquired by the data acquisition unit 301. The evaluation value may be anything that represents the quality of the operating state of the control target 5. It should be noted that this evaluation value can also be said to represent the quality of the applied parameters.

　例えば、制御システム７において、制御対象５の将来の稼働状態を示す値を予測する推論モデルを用い、制御対象５の稼働状態が正常である状態を維持する制御を行う場合、評価値算出部３０２は、異常時間率を評価値としてもよい。異常時間率は、制御対象５の稼働時間に対する、制御対象５の稼働状態が異常であった時間の比である。制御対象５の稼働状態が異常であるか正常であるかはデータ取得部３０１が取得する結果データに基づいて判定することができる。 For example, in the control system 7, when an inference model that predicts a value indicating the future operating state of the controlled object 5 is used to perform control to maintain the operating state of the controlled object 5 in a normal state, the evaluation value calculation unit 302 may use the abnormal time rate as the evaluation value. The abnormal time rate is the ratio of the time during which the operating state of the controlled object 5 was abnormal to the operating time of the controlled object 5. Whether the operating state of the controlled object 5 is abnormal or normal can be determined based on the result data acquired by the data acquisition unit 301.

　予測分布算出部３０３は、上述のパラメータと、当該パラメータが適用されている期間における制御対象５の稼働状態との関係を示す関数の予測分布を算出する。制御対象５の稼働状態は、評価値算出部３０２の算出する評価値により表される。例えば、評価値算出部３０２が異常時間率を算出する場合、予測分布算出部３０３は、制御対象５の稼働状態を示す情報として異常時間率を用いる。この場合、異常時間率を低くすることができるような値にパラメータを設定することが可能になる。なお、評価値算出部３０２に評価値を算出させる代わりに、当該評価値を、入力部３３等を介して入力するようにし、評価値算出部３０２を省略してもよい。 The predictive distribution calculation unit 303 calculates the predictive distribution of a function indicating the relationship between the above-mentioned parameters and the operating state of the control object 5 during the period in which the parameters are applied. The operating state of the control object 5 is represented by an evaluation value calculated by the evaluation value calculation unit 302. For example, when the evaluation value calculation unit 302 calculates the abnormal time rate, the predictive distribution calculation unit 303 uses the abnormal time rate as information indicating the operating state of the control object 5. In this case, it becomes possible to set the parameters to values that can reduce the abnormal time rate. Note that instead of having the evaluation value calculation unit 302 calculate the evaluation value, the evaluation value may be input via the input unit 33 or the like, and the evaluation value calculation unit 302 may be omitted.

　探索部３０４は、予測分布算出部３０３が算出する予測分布に基づいてパラメータの最適値の候補を探索する。なお、予測分布の算出方法と候補の探索方法については後記〔予測分布の算出方法と最適値の候補の探索方法の詳細〕の項目で説明する。 The search unit 304 searches for candidates for the optimal parameter value based on the predictive distribution calculated by the predictive distribution calculation unit 303. Note that the method of calculating the predictive distribution and the method of searching for the candidates will be explained later in the section [Details of the method of calculating the predictive distribution and the method of searching for the optimal value candidates].

　最適化制御部３０５は、予測分布算出部３０３と探索部３０４によるパラメータの最適化に関する制御を行う。例えば、最適化制御部３０５は、所定の条件を満たした場合に、予測分布算出部３０３と探索部３０４に最適化を終了させる制御を行う。 The optimization control unit 305 controls the parameter optimization performed by the predictive distribution calculation unit 303 and the search unit 304. For example, when a predetermined condition is satisfied, the optimization control unit 305 controls the predictive distribution calculation unit 303 and the search unit 304 to end the optimization.

　表示制御部３０６は、探索部３０４が検出した候補を表示装置に表示出力させる。表示装置は情報処理装置３が備えているものであってもよい。例えば、出力部３４が表示装置である場合、表示制御部３０６は、探索部３０４が検出した候補を出力部３４に表示出力させてもよい。また、表示制御部３０６は、情報処理装置３に接続されている表示装置に表示出力させてもよいし、通信部３２を介して通信可能な他の装置を介して当該装置に接続された表示装置に表示出力させてもよい。 The display control unit 306 causes the candidates detected by the search unit 304 to be displayed on a display device. The display device may be a device provided in the information processing device 3. For example, if the output unit 34 is a display device, the display control unit 306 may cause the candidates detected by the search unit 304 to be displayed on the output unit 34. The display control unit 306 may also cause the candidates to be displayed on a display device connected to the information processing device 3, or to be displayed on a display device connected to that device via another device that can communicate via the communication unit 32.

　以上のように、情報処理装置３は、上述のパラメータと当該パラメータが適用されている期間における制御対象５の稼働状態との関係を示す関数の予測分布を算出する予測分布算出部３０３と、算出された予測分布に基づいてパラメータの最適値の候補を探索する探索部３０４と、を備える。 As described above, the information processing device 3 includes a predictive distribution calculation unit 303 that calculates the predictive distribution of a function that indicates the relationship between the above-mentioned parameters and the operating state of the control target 5 during the period in which the parameters are applied, and a search unit 304 that searches for candidates for the optimal value of the parameters based on the calculated predictive distribution.

　上記の構成によれば、当該パラメータと稼働状態との関係を定式化することが難しい場合であっても、妥当なパラメータを検出することが可能になる。よって、上記の構成によれば、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システム７において、個人の技能に左右されることなくパラメータを適切に設定することが可能になる。 The above configuration makes it possible to detect appropriate parameters even when it is difficult to formulate the relationship between the parameters and the operating state. Therefore, the above configuration makes it possible to appropriately set parameters without being influenced by individual skills in a control system 7 that controls a specific control target according to the results of inference using an inference model.

　また、予測分布算出部３０３は、探索部３０４が検出した候補と、当該候補が適用されている期間における制御対象５の稼働状態とに基づいて予測分布を更新する。そして、探索部３０４は、更新後の予測分布に基づいてパラメータの最適値の新たな候補を探索する。 The predictive distribution calculation unit 303 also updates the predictive distribution based on the candidates detected by the search unit 304 and the operating state of the control target 5 during the period in which the candidates are applied. The search unit 304 then searches for new candidates for the optimal parameter values based on the updated predictive distribution.

　この構成によれば、先に検出した最適値の候補を適用したときの稼働状態を踏まえて、最適値の新たな候補を探索するから、より妥当性の高い候補を検出することが可能になる。また、例えば季節変動や経年変化により制御対象の状態が変化した場合であっても、変化後の状態に適合した候補を検出することが可能になる。なお、この構成ではある程度のトライアンドエラーが発生し得るが、その回数は必要最小限に抑えられる。 With this configuration, a new candidate for the optimum value is searched for based on the operating state when the previously detected candidate for the optimum value is applied, making it possible to detect a candidate with a higher degree of validity. Furthermore, even if the state of the controlled object changes due to seasonal fluctuations or aging, for example, it is possible to detect a candidate that matches the post-change state. Although a certain amount of trial and error may occur with this configuration, the number of such trials is kept to a necessary minimum.

　〔閾値の決定方法と、外れ値であるか否かの判定方法〕
　図４は、閾値の決定方法と入力データが外れ値であるか否かの判定方法を説明する図である。本項目では、図４に基づいて、閾値決定部２０３による閾値の決定方法と、外れ値判定部２０４による外れ値であるか否かの判定方法について説明する。 [Method of determining threshold and method of determining whether or not a value is an outlier]
4 is a diagram for explaining a method for determining a threshold and a method for determining whether input data is an outlier. In this section, a method for determining a threshold by the threshold determination unit 203 and a method for determining whether input data is an outlier by the outlier determination unit 204 will be explained based on FIG.

　閾値決定部２０３は、推論モデルの機械学習に用いた訓練データセットを用いて閾値を決定する。閾値の決定にあたり、平均距離算出部２０２が、訓練データセットに含まれる訓練データの１つと、当該訓練データからの距離が最も近い所定数の訓練データのそれぞれとの距離の平均値を算出する処理を、各訓練データについて行う。上述のように、このようにして算出される平均値は、各訓練データが他の訓練データに対して乖離している程度を示すものであり、乖離度と呼ぶこともできる。 The threshold determination unit 203 determines the threshold using the training data set used for the machine learning of the inference model. When determining the threshold, the average distance calculation unit 202 performs a process for each training data to calculate the average value of the distance between one piece of training data included in the training data set and each of a predetermined number of training data that are closest to the training data. As described above, the average value calculated in this manner indicates the degree to which each training data deviates from the other training data, and can also be called the degree of deviation.

　図４の左側には、訓練データセットに含まれる各訓練データの特徴空間におけるプロットを示している。例えば、上述の所定数を５としたとする。この場合、平均距離算出部２０２は、訓練データの１つであるデータＤ１について、当該特徴空間内においてデータＤ１から距離が最も近い５つの訓練データのそれぞれとデータＤ１との距離を算出し、それらの平均値を算出する。同様に、平均距離算出部２０２は、データＤ２についても、データＤ２から距離が最も近い５つの訓練データのそれぞれとデータＤ２との距離の平均値を算出する。平均距離算出部２０２は、このような処理を訓練データセットに含まれる各訓練データについて行う。これらの処理は、データ分類の一手法であるｋ近傍法で行われる処理と同様の処理である。 The left side of FIG. 4 shows plots in the feature space of each training data included in the training dataset. For example, assume that the above-mentioned predetermined number is 5. In this case, for data D1, which is one of the training data, the average distance calculation unit 202 calculates the distance between data D1 and each of the five training data that are closest to data D1 in the feature space, and calculates the average value. Similarly, for data D2, the average distance calculation unit 202 calculates the average value of the distance between data D2 and each of the five training data that are closest to data D2. The average distance calculation unit 202 performs this process for each training data included in the training dataset. These processes are similar to the processes performed in the k-nearest neighbor method, which is a method of data classification.

　次に、閾値決定部２０３が、上述のようにして算出された平均値をその大きさの昇順に並べ、順位が所定の順位となる平均値を求め、当該平均値を閾値に決定する。算出された平均値が大きい値の訓練データほど、他の訓練データから乖離しており、訓練データセットの全体からみて外れ値である可能性が高いといえる。例えば、図４に示されるデータＤ２について算出される平均値は、データＤ１について算出される平均値よりも大きいため、データＤ２はデータＤ１よりも外れ値である可能性が高いといえる。 Then, the threshold determination unit 203 sorts the average values calculated as described above in ascending order, finds the average value that has a predetermined rank, and determines this average value as the threshold. The training data with a larger calculated average value deviates more from the other training data, and is more likely to be an outlier when viewed from the perspective of the entire training data set. For example, since the average value calculated for data D2 shown in FIG. 4 is larger than the average value calculated for data D1, data D2 is more likely to be an outlier than data D1.

　所定の順位は適宜設定すればよく、どのような値に設定するかは特に限定されない。例えば、訓練データセットにおける所定の割合の訓練データが当該順位より上位になるように所定の順位を設定してもよい。具体例を挙げれば、９６％の訓練データが当該順位より上位になるようにする場合、（訓練データの総数）×０．９６の値が所定の順位となる。この場合、訓練データの４％が外れ値となる。 The predetermined rank may be set as appropriate, and there is no particular limit to what value it is set to. For example, the predetermined rank may be set so that a predetermined percentage of training data in the training data set is ranked higher than the predetermined rank. To give a specific example, if 96% of the training data is to be ranked higher than the predetermined rank, the value of (total number of training data) x 0.96 will be the predetermined rank. In this case, 4% of the training data will be outliers.

　一方、図４の右側には、推論モデルに入力された各入力データの特徴空間におけるプロットを示している。以下では、外れ値判定部２０４による外れ値であるか否かの判定方法について説明する。 On the other hand, the right side of Figure 4 shows plots in the feature space of each piece of input data input to the inference model. Below, we will explain how the outlier determination unit 204 determines whether or not a given piece of data is an outlier.

　外れ値判定部２０４は、以上のようにして決定された閾値を用いて、推論モデルに入力された入力データが外れ値であるか否かを判定する。外れ値であるか否かの判定においても、まず、平均距離算出部２０２による平均値の算出が行われる。より詳細には、平均距離算出部２０２は、入力データの１つと、当該入力データからの距離が最も近い所定数の入力データのそれぞれとの距離の平均値を算出する処理を、各入力データについて行う。平均距離算出部２０２により算出される平均値は、各入力データが他の入力データに対して乖離している程度を示しており、乖離度と呼ぶこともできる。 The outlier determination unit 204 uses the threshold determined as described above to determine whether or not the input data input to the inference model is an outlier. In determining whether or not an input data is an outlier, the average value is first calculated by the average distance calculation unit 202. More specifically, the average distance calculation unit 202 performs a process for each input data to calculate the average value of the distance between one piece of input data and each of a predetermined number of input data that are closest to the input data. The average value calculated by the average distance calculation unit 202 indicates the degree to which each input data deviates from the other input data, and can also be called the degree of deviation.

　図４の右側の例においても、同図の左側の例と同様に所定数は５である。この場合、平均距離算出部２０２は、入力データの１つであるデータｄ１について、特徴空間内においてデータｄ１から距離が最も近い５つの入力データのそれぞれとデータｄ１との距離を算出し、それらの平均値を算出する。 In the example on the right side of FIG. 4, the predetermined number is 5, as in the example on the left side of the same figure. In this case, for data d1, which is one of the input data, the average distance calculation unit 202 calculates the distance between data d1 and each of the five input data that are closest to data d1 in the feature space, and calculates the average value of these.

　そして、外れ値判定部２０４は、算出された平均値と、閾値決定部２０３が決定した閾値とを比較し、それらの大小関係に基づいてデータｄ１が外れ値であるか否かを判定する。例えば、外れ値判定部２０４は、図示のように、データｄ１について算出された平均値が閾値以上であった場合に、データｄ１を外れ値と判定してもよい。同様にして外れ値判定部２０４は、他の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する。 Then, the outlier determination unit 204 compares the calculated average value with the threshold value determined by the threshold value determination unit 203, and determines whether or not the data d1 is an outlier based on the magnitude relationship between the two values. For example, as shown in the figure, the outlier determination unit 204 may determine that the data d1 is an outlier when the average value calculated for the data d1 is equal to or greater than the threshold value. In the same manner, the outlier determination unit 204 determines whether or not each of the other input data is an outlier.

　なお、外れ値の検方法は上述の例に限られない。例えば、ＬＯＦ（Local Outlier Factor）により外れ値を検出してもよい。この場合、閾値決定部２０３は、局所密度の閾値を決定し、外れ値判定部２０４は、局所密度が当該閾値以下である入力データを外れ値と判定すればよい。また、例えば、ホテリング理論により外れ値を検出してもよい。この場合、閾値決定部２０３は、異常度の閾値を決定し、外れ値判定部２０４は、異常度が当該閾値以上である入力データを外れ値と判定すればよい。 Note that the method of detecting outliers is not limited to the above example. For example, outliers may be detected by LOF (Local Outlier Factor). In this case, the threshold determination unit 203 determines a threshold for the local density, and the outlier determination unit 204 determines that input data whose local density is below the threshold is an outlier. Also, for example, outliers may be detected by Hotelling's theory. In this case, the threshold determination unit 203 determines a threshold for the degree of abnormality, and the outlier determination unit 204 determines that input data whose degree of abnormality is above the threshold is an outlier.

　以上のように、閾値決定部２０３は、訓練データセットに含まれる各訓練データについて、当該訓練データが他の訓練データに対して乖離している程度を示す乖離度を算出し、算出した乖離度のうち所定の順位の乖離度を閾値に決定してもよい。そして、外れ値判定部２０４は、複数の入力データに含まれる各入力データについて算出された、当該入力データが他の入力データに対して乖離している程度を示す乖離度を、閾値決定部２０３が決定した閾値と比較することにより、各入力データが外れ値であるか否かを判定してもよい。 As described above, the threshold determination unit 203 may calculate a deviation indicating the degree to which each piece of training data included in the training dataset deviates from other training data, and determine a deviation of a predetermined rank from among the calculated deviations as the threshold. The outlier determination unit 204 may then determine whether each piece of input data is an outlier by comparing the deviation indicating the degree to which each piece of input data deviates from other input data, calculated for each piece of input data included in the plurality of input data, with the threshold determined by the threshold determination unit 203.

　上記の判定により外れ値と判定される入力データの数が増えた場合、推論に用いた入力データの特徴が、訓練データの特徴とは異なるものに変化したといえる。よって、上記の構成によれば、入力データに対する推論モデルの適合性を的確に判定することが可能になる。 If the number of input data that are determined to be outliers increases, it can be said that the characteristics of the input data used for inference have changed to become different from the characteristics of the training data. Therefore, with the above configuration, it becomes possible to accurately determine the suitability of the inference model for the input data.

　なお、外れ値の検出方法と同様に、乖離度の算出方法も特に限定されない。例えば、上述のように近傍の所定数のデータからの平均距離を乖離度としてもよい。この場合、当該距離は、例えばユークリッド距離であってもよいし、マンハッタン距離やマハラノビス距離等の他の距離であってもよい。また、例えば、ホテリング理論における異常度を乖離度としてもよいし、ＬＯＦで用いられる局所密度を乖離度としてもよい。 As with the method of detecting outliers, the method of calculating the deviation is not particularly limited. For example, as described above, the deviation may be the average distance from a certain number of nearby data. In this case, the distance may be, for example, the Euclidean distance, or other distances such as the Manhattan distance or the Mahalanobis distance. In addition, for example, the degree of anomaly in Hotelling's theory may be used as the deviation, or the local density used in LOF may be used as the deviation.

　〔指標値の例〕
　指標値算出部２０５が算出する指標値は、入力データに対する推論モデルの適合性を示すものであればよい。例えば、指標値算出部２０５は、所定期間に外れ値判定部２０４が外れ値と判定した入力データの総数を上記指標値としてもよい。例えば、制御対象５を毎日稼働させる場合に、一日のうちに推論に用いられた複数の入力データのそれぞれについて当該入力データが外れ値であるか否かが判定されたとする。この場合、指標値算出部２０５は、ある日に外れ値であると判定された入力データの総数を、その日の終了時点における推論モデルの適合性を示す指標値として算出してもよい。 [Examples of index values]
The index value calculated by the index value calculation unit 205 may indicate the suitability of the inference model for the input data. For example, the index value calculation unit 205 may set the total number of input data determined to be outliers by the outlier determination unit 204 during a predetermined period as the index value. For example, when the control target 5 is operated every day, it is assumed that each of a plurality of input data used in inference during a day is determined to be an outlier or not. In this case, the index value calculation unit 205 may calculate the total number of input data determined to be outliers on a certain day as an index value indicating the suitability of the inference model at the end of that day.

　また、例えば、指標値算出部２０５は、外れ値判定部２０４が外れ値と判定した入力データの数を用いて指標値を算出してもよい。例えば、指標値算出部２０５は、所定期間に計測装置５２で計測された入力データのうち、外れ値判定部２０４が外れ値と判定した入力データの割合を指標値として算出してもよい。 Also, for example, the index value calculation unit 205 may calculate the index value using the number of input data that the outlier determination unit 204 has determined to be outliers. For example, the index value calculation unit 205 may calculate, as the index value, the proportion of input data that the outlier determination unit 204 has determined to be outliers among the input data measured by the measuring device 52 during a predetermined period.

　また、推論モデルが、所定の対象（例えば制御対象５）に関する入力データから、当該対象の将来の状態が正常であるか異常であるかを予測するモデルであったとする。この場合、指標値算出部２０５は、外れ値であると判定された入力データのうち、当該入力データを用いた予測後に上記対象の状態が異常であった入力データの数または当該入力データの数を用いて算出される値を指標値として算出してもよい。 Also, assume that the inference model is a model that predicts whether the future state of a specific object (e.g., a controlled object 5) will be normal or abnormal based on input data related to the object. In this case, the index value calculation unit 205 may calculate, as an index value, the number of pieces of input data that were determined to be outliers and for which the state of the object was abnormal after a prediction using the input data, or a value calculated using the number of pieces of input data.

　外れ値であると判定された入力データは、入力データを用いた予測後に対象の状態が異常であったものと正常であったものとに分類することができる。このうち、予測後に対象の状態が異常であったものについては、異常という状態に応じた特徴が含まれているといえる。 Input data that is determined to be an outlier can be classified into those in which the target state is abnormal after prediction using the input data and those in which the target state is normal. Of these, those in which the target state is abnormal after prediction can be said to contain characteristics that correspond to an abnormal state.

　一般に正常という状態は変わらないが、異常という状態には多様性があるから、異常という状態に応じた特徴が含まれている入力データに外れ値が多いときには、異常という状態が学習時から変化した可能性があり、再学習の必要性が高いといえる。よって、上記の構成によれば、異常という状態の性質が変化することを考慮した妥当な指標値を算出することができる。 Generally, normal states do not change, but abnormal states vary. Therefore, when there are many outliers in input data that contains characteristics corresponding to abnormal states, it is possible that the abnormal state has changed since learning, and re-learning is highly necessary. Therefore, with the above configuration, it is possible to calculate an appropriate index value that takes into account the changing nature of the abnormal state.

　〔再現率を考慮した再学習要否の判定〕
　上述のように、推論モデルは、所定の対象に関する入力データから、当該対象の将来の状態が正常であるか異常であるかを予測するモデルであってもよい。この場合、異常である状態を見逃すつまり実際には異常であるのに正常と判定してしまう可能性を低減するという観点から、推論モデルの精度は再現率で評価することが好ましい。 [Determining whether or not re-learning is necessary, taking recall into account]
As described above, the inference model may be a model that predicts whether a future state of a given object is normal or abnormal based on input data about the object. In this case, it is preferable to evaluate the accuracy of the inference model using the recall rate in order to reduce the possibility of overlooking an abnormal state, that is, of determining an abnormal state as normal when it is actually abnormal.

　上記のような推論モデルの推論結果の正否の検証は難しい場合があるが、再現率を単体で利用するのではなく、指標値算出部２０５が算出する指標値と共に利用することにより、効果的な再学習が可能になる。 It can be difficult to verify the accuracy of the inference results of the inference model described above, but by using the recall rate together with the index value calculated by the index value calculation unit 205, rather than using it alone, effective re-learning becomes possible.

　これについて、図５に基づいて説明する。図５は、廃棄物を焼却することにより発生した熱を利用して発電を行う廃棄物焼却発電プラントにおいて、指標値算出部２０５が算出した指標値と再現率の遷移を示す図である。つまり、ここでは制御対象５が廃棄物焼却発電プラントである例を示している。なお、指標値は、１日あたりの外れ値と判定された入力データの総数である。また、同図には、上記廃棄物焼却発電プラントにおける蒸気量の設定値の遷移についても併せて示している。同図の横軸は時間（日）であり、縦軸の値は０～１の値に正規化したものである。 This will be explained with reference to Figure 5. Figure 5 is a diagram showing the transition of the index value calculated by the index value calculation unit 205 and the recurrence rate in a waste incineration power plant that generates power by using heat generated by incinerating waste. In other words, this shows an example in which the control target 5 is a waste incineration power plant. The index value is the total number of input data items determined to be outliers per day. The figure also shows the transition of the set value of the steam volume in the waste incineration power plant. The horizontal axis of the figure is time (days), and the vertical axis is normalized to a value between 0 and 1.

　図５において、円Ｃ１で示した期間では、再現率は高い水準で推移している。一方、指標値については、直近まで低い水準であった値が急増し、その後、高い水準で推移している。指標値の急増の直前に設定値が大きい値に変化していることから、指標値の急増の原因は設定値が変更されて入力データの特徴が変化したことにあると考えられる。 In Figure 5, during the period indicated by circle C1, the recall rate remains at a high level. Meanwhile, the index value, which had been at a low level until recently, suddenly increased and has since remained at a high level. Since the setting value changed to a large value immediately before the sudden increase in the index value, it is believed that the cause of the sudden increase in the index value is that the setting value was changed, resulting in a change in the characteristics of the input data.

　一方、円Ｃ２で示した期間では、設定値に大きな変化がないにもかかわらず、指標値が増加し、再現率が低下している。この期間においては、設定値の変更以外の要因により、外れ値と判定された入力データが増えると共に、再現率が低下したと考えられる。このように指標値が増加し、かつ再現率が低下している期間は、推論モデルが異常と正常を分類する能力が低い期間であるといえる。 On the other hand, in the period indicated by circle C2, the index value increases and the recall rate decreases, even though there is no significant change in the setting value. During this period, it is believed that the amount of input data determined to be outliers increased and the recall rate decreased due to factors other than the change in the setting value. A period in which the index value increases and the recall rate decreases in this way can be said to be a period in which the inference model has a low ability to classify normal from abnormal.

　このように、所定の対象に関する入力データから、当該対象の将来の状態が正常であるか異常であるかを予測する推論モデルを用いる場合、再現率算出部２０７は、推論モデルに複数の入力データを入力して行った予測における再現率、すなわち、異常と正しく判定できた件数を、異常と正しく判定できた件数と異常を正常と誤判定した件数との和で除した値を算出してもよい。 In this way, when an inference model is used that predicts whether the future state of a specific target will be normal or abnormal based on input data about the target, the recall calculation unit 207 may calculate the recall in a prediction made by inputting multiple input data into the inference model, i.e., the number of cases that were correctly determined to be abnormal divided by the sum of the number of cases that were correctly determined to be abnormal and the number of cases where an abnormality was erroneously determined to be normal.

　再現率が低下している期間は、推論モデルによる正常であるか異常であるかの予測の精度が低下している期間であるといえるから、再現率は、推論モデルが異常と正常を分類する能力の高低を示す指標として利用できると考えられる。しかし、再現率の算出に用いられる、異常と正しく判定できた件数と異常を正常と誤判定した件数は、推論モデルの予測結果に基づく介入制御やオペレータの操作等の影響を受けるため、再現率を絶対的な評価指標とすることは好ましくない。一方で、上述の指標値を用いれば、使用された入力データが未学習の傾向を持つ期間を特定することが可能である。 A period when the recall rate is declining can be said to be a period when the accuracy of the inference model's predictions of whether something is normal or abnormal is declining, so recall rate can be used as an index showing the level of the inference model's ability to classify normal from abnormal. However, the number of cases correctly determined to be abnormal and the number of cases where an abnormality is incorrectly determined to be normal, which are used to calculate recall rate, are affected by intervention control based on the prediction results of the inference model and operator operations, so it is not desirable to use recall rate as an absolute evaluation index. On the other hand, by using the above index value, it is possible to identify periods when the input data used tends to be unlearned.

　このため、学習用データ抽出部２０８は、再現率算出部２０７が算出する再現率と、指標値算出部２０５が算出する指標値とに基づいて、複数の入力データの中から推論モデルの再学習に用いるものを抽出してもよい。これにより、特に学習すべき期間（入力データが未学習の傾向を持ち、かつ回避すべき異常となってしまった期間）を推定することができ、効率よく学習すべきデータ期間を特定し、当該期間の入力データを再学習用のデータとして抽出することができる。例えば、図５の例において、取得された入力データのうち、円Ｃ２で示した期間の入力データを用いて効果的な再学習を行うこともできる。 For this reason, the learning data extraction unit 208 may extract input data to be used for re-learning the inference model from among the multiple input data, based on the recall calculated by the recall calculation unit 207 and the index value calculated by the index value calculation unit 205. This makes it possible to estimate a period in particular that should be learned (a period in which the input data has a tendency to not be learned and has become an anomaly that should be avoided), to efficiently identify a data period that should be learned, and to extract input data from that period as data for re-learning. For example, in the example of Figure 5, effective re-learning can be performed using input data from the period indicated by circle C2 from among the acquired input data.

　〔予測分布の算出方法と最適値の候補の探索方法の詳細〕
　予測分布算出部３０３による予測分布の算出方法（更新も含む）の詳細と、探索部３０４による最適値の候補の探索方法の詳細を以下説明する。なお、以下の説明は、ベイズ最適化における予測分布の算出方法と探索方法に関するものである。ただし、予測分布を用いて最適値の候補を探索するものであれば、以下の説明に係る方法以外の方法を適用することも可能である。 [Details of how to calculate the predictive distribution and how to search for optimal value candidates]
The details of the method of calculating the predictive distribution (including updating) by the predictive distribution calculation unit 303 and the method of searching for optimal value candidates by the search unit 304 are described below. Note that the following description relates to the calculation method of the predictive distribution and the search method in Bayesian optimization. However, it is also possible to apply a method other than the method described below as long as it uses the predictive distribution to search for optimal value candidates.

　最適化の対象とするパラメータがＮ個ある場合、それらの制御パラメータは、 If there are N parameters to be optimized, the control parameters are:

と表され、
それに対する評価値は、 It is expressed as
The evaluation value for this is

と表される。 This is expressed as:

　予測分布算出部３０３は、データ取得部３０１が取得する結果データに基づいて、最適化の対象とするパラメータと制御後における制御対象の稼働状態との関係を示す関数の予測分布を算出する。なお、この関数を以下では評価関数ｆ（θ）と呼ぶ。また、新たな結果データ（例えば、探索部３０４が検出した候補を適用した制御の結果を示す結果データ）が取得されたときには、予測分布算出部３０３は、その結果データが反映されるように予測分布を更新する。 The predictive distribution calculation unit 303 calculates the predictive distribution of a function indicating the relationship between the parameters to be optimized and the operating state of the controlled object after control, based on the result data acquired by the data acquisition unit 301. Note that this function will be referred to as the evaluation function f(θ) below. Furthermore, when new result data (for example, result data indicating the result of control in which a candidate detected by the search unit 304 is applied) is acquired, the predictive distribution calculation unit 303 updates the predictive distribution so that the result data is reflected.

　パラメータと稼働状態との関係を、ガウスノイズε_ｎ～Ν（０，β）を用いて以下のように仮定する。 The relationship between the parameters and the operating state is assumed as follows using Gaussian noise ε _n ∼N(0, β):

これにより、ガウス過程による評価関数の予測分布として以下の分布が得られる。 This gives us the following distribution as the predictive distribution of the evaluation function using the Gaussian process:

　ここで、ｋ_＊＝ｋ（θ，θ）であり、Ｋ_Θは［Ｋ_Θ］_ｉ，ｊ＝ｋ（θ_ｉ，θ_ｊ）で得られるグラム行列である。また、ｆは以下のように表される。 Here, k _* =k(θ,θ), and K _Θ is the Gram matrix obtained by [K _Θ ] _i,j =k(θ _i ,θ _j ). Furthermore, f is expressed as follows.

ｋ_Θ，＊は、［ｋ_Θ，＊］_ｉ＝ｋ（θ_ｉ，θ）の縦ベクトルであり、ｋ（・，・）は、カーネル関数である。ここではカーネル関数のパラメータをα_ｋとする。 k _Θ,* is a column vector of [k _Θ,* ] _i =k(θ _i ,θ), and k(·,·) is a kernel function, where the parameter of the kernel function is α _k .

　平均関数μ（θ）は、結果データ（および当該結果データから評価値算出部３０２が算出した評価値）から予測される評価関数の平均値を示す。また、分散関数σ（θ）は、結果データ（および当該結果データから評価値算出部３０２が算出した評価値）から予測される評価関数の分散である。σ（θ）は、予測の不確実性を示し、結果データが不足している領域ではその値が大きくなる傾向がある。σが大きいと、予測が不確実であるといえ、予測の確実性を上げるために必要な結果データが不足しているといえる。数式（３）から明らかなように、分散関数σ（θ）に含まれるカーネル関数およびカーネル関数のパラメータα_ｋは、予測分布の算出に影響を与える。予測分布の算出の際には、パラメータα_ｋの最適化が行われる。最適化の方法は特に限定されず、例えば一般的なベイズ最適化で適用されている各種の最適化手法を適用することもできる。 The average function μ(θ) indicates the average value of the evaluation function predicted from the result data (and the evaluation value calculated by the evaluation value calculation unit 302 from the result data). The variance function σ(θ) is the variance of the evaluation function predicted from the result data (and the evaluation value calculated by the evaluation value calculation unit 302 from the result data). σ(θ) indicates the uncertainty of the prediction, and its value tends to be large in areas where result data is insufficient. If σ is large, it can be said that the prediction is uncertain, and that the result data required to increase the certainty of the prediction is insufficient. As is clear from the formula (3), the kernel function and the parameter α _k of the kernel function included in the variance function σ(θ) affect the calculation of the predictive distribution. When the predictive distribution is calculated, the parameter α _k is optimized. The optimization method is not particularly limited, and various optimization methods applied in general Bayesian optimization can be applied.

　探索部３０４は、最適な制御パラメータを求めるために、最適な制御パラメータの候補を探索する。具体的には、探索部３０４は、後記数式（４）、（５）に示されるように、平均関数μ（θ）と分散関数σ（θ）を用いて獲得関数ａ（θ）を最大にするパラメータを探索する。この探索で検出されたパラメータが、最適なパラメータの候補となる。この探索は、ＵＣＢ（Upper Confidence Bound）戦略に基づいている。 The search unit 304 searches for optimal control parameter candidates in order to obtain the optimal control parameters. Specifically, the search unit 304 searches for parameters that maximize the acquisition function a(θ) using the mean function μ(θ) and the variance function σ(θ), as shown in equations (4) and (5) below. The parameters detected in this search become optimal parameter candidates. This search is based on the UCB (Upper Confidence Bound) strategy.

　数式（５）におけるκは探索と利用を調節するためのパラメータである。受付部１０４が、フィードバック制御の結果が既知の値の周辺と、フィードバック制御の結果が未知の範囲と、の何れを重視して探索を行うかの選択を受け付けた場合、探索部３０４は、κを受け付けた選択に応じた値に設定し、候補の探索を行う。 In formula (5), κ is a parameter for adjusting search and utilization. When the reception unit 104 receives a selection as to whether to place emphasis on the search in the vicinity of values where the feedback control results are known, or in a range where the feedback control results are unknown, the search unit 304 sets κ to a value according to the received selection and searches for candidates.

　無論、他の方法で新たなパラメータを探索することも可能である。例えば、ＰＩ（Probability of Improvement）戦略や、ＥＩ（Expected Improvement）戦略で最適なパラメータの候補を探索してもよい。この他にも、例えば、ＰＴＲ（Probability in Target Range）あるいはＭＩ（Mutual Information）等の戦略を適用し、各戦略に応じた獲得関数を用いて最適なパラメータの候補を探索してもよい。 Of course, it is also possible to search for new parameters using other methods. For example, optimal parameter candidates may be searched for using the PI (Probability of Improvement) strategy or the EI (Expected Improvement) strategy. In addition, optimal parameter candidates may be searched for using an acquisition function appropriate for each strategy, for example, by applying a strategy such as PTR (Probability in Target Range) or MI (Mutual Information).

　なお、評価関数の値を最小化するパラメータを最適なパラメータとして求める場合（例えば稼働状態を示す情報として異常時間率を用いる場合等）には、獲得関数ａ（θ）を最小にするパラメータを探索すればよい。 In addition, when the optimal parameters are those that minimize the value of the evaluation function (for example, when the abnormal time rate is used as information indicating the operating state), it is sufficient to search for parameters that minimize the acquisition function a(θ).

　探索部３０４が検出したパラメータの最適値の候補は、表示制御部３０６が表示装置に表示させる等により情報処理装置３のオペレータに提示されてもよい。そして、制御装置４によりその候補を適用した制御対象５に対する制御が開始され、当該制御が行われた期間に計測装置５２により計測された計測データが制御装置４に取得され、蓄積される。そして、蓄積された計測データには適用されたパラメータが対応付けられ、新たな結果データとして情報処理装置３に入力される。 The candidate optimal values for the parameters detected by the search unit 304 may be presented to the operator of the information processing device 3 by the display control unit 306 displaying them on a display device, etc. Then, the control device 4 starts control of the control target 5 to which the candidate has been applied, and the measurement data measured by the measuring device 52 during the period in which the control is performed is acquired and stored by the control device 4. The applied parameters are then associated with the stored measurement data, and the data is input to the information processing device 3 as new result data.

　そして、予測分布算出部３０３は、新たな結果データに基づいて予測分布を更新し、探索部３０４は、予測分布算出部３０３による更新後の予測分布に基づいて構成される評価関数に基づき、パラメータの最適値を探索する。このように、予測分布の更新と最適値の探索とを繰り返すことにより、最適なパラメータを検出することが可能になる。 Then, the predictive distribution calculation unit 303 updates the predictive distribution based on the new result data, and the search unit 304 searches for the optimal value of the parameter based on an evaluation function constructed based on the predictive distribution updated by the predictive distribution calculation unit 303. In this way, by repeating the updating of the predictive distribution and the search for the optimal value, it becomes possible to detect the optimal parameter.

　〔表示画面例（結果表示）〕
　表示制御部３０６は、例えば図６のような表示画面を表示させて探索の結果をオペレータに提示してもよい。図６は、探索結果の表示画面の例を示す図である。より詳細には、図６は、探索するパラメータに含まれる要素ａ～ｄのそれぞれについて検出された候補の履歴を平行座標上に示した図である。 [Display screen example (result display)]
The display control unit 306 may present the search results to the operator by displaying a display screen such as that shown in Fig. 6. Fig. 6 is a diagram showing an example of a display screen showing the search results. More specifically, Fig. 6 is a diagram showing, on a parallel coordinate system, the history of candidates detected for each of elements a to d included in the parameters to be searched.

　このように、探索するパラメータには、複数の要素が含まれていてもよく、これにより複数の要素を並行して最適化することができる。例えば、上述の焼却発電プラントが制御対象５である場合、上記要素ａ～ｄは、自動燃焼制御に関する各種制御パラメータとしてもよい。自動燃焼制御に関する制御パラメータとしては、例えば、燃焼速度の調整、乾燥空気の供給流量の調整、燃焼空気の供給流量の調整、燃焼空気の温度の調整、および焼却する廃棄物層の高さの調整に関するパラメータ等が挙げられる。 In this way, the parameters to be searched may include multiple elements, which allows multiple elements to be optimized in parallel. For example, if the above-mentioned incineration power plant is the control object 5, the above elements a to d may be various control parameters related to automatic combustion control. Examples of control parameters related to automatic combustion control include parameters related to adjustment of the combustion speed, adjustment of the dry air supply flow rate, adjustment of the combustion air supply flow rate, adjustment of the combustion air temperature, and adjustment of the height of the waste layer to be incinerated.

　また、探索するパラメータには、例えば、推論モデルの推論の結果に応じた、制御対象５に対する制御の内容と制御量とを示すものが含まれていてもよい。これにより、推論モデルによる推論の結果に応じて制御の内容と制御量を適切に設定することが可能になる。 The parameters to be searched may also include, for example, parameters indicating the content and amount of control of the control target 5 according to the result of inference by the inference model. This makes it possible to appropriately set the content and amount of control according to the result of inference by the inference model.

　また、探索するパラメータには、制御対象５に対する制御の内容と制御量とを示す上記パラメータに加え、推論モデルによる推論に用いられる推論用パラメータが含まれていてもよい。この場合、探索部３０４は、予測分布算出部３０３が算出する予測分布に基づいて、制御対象５に対する制御の内容と制御量の最適値の候補と、推論用パラメータの最適値の候補の両方を探索する。 The parameters to be searched may include, in addition to the above parameters indicating the content and amount of control over the control object 5, inference parameters used for inference by the inference model. In this case, the search unit 304 searches for both candidates for the optimal values of the content and amount of control over the control object 5 and candidates for the optimal values of the inference parameters based on the predictive distribution calculated by the predictive distribution calculation unit 303.

　なお、推論用パラメータは、例えば、推論モデルがニューラルネットワークモデルである場合には、学習により求められる重み値のようなハイパーパラメータであってもよい。また、推論モデルが、複数の機械学習モデルの推論結果をアンサンブルして最終的な推論結果を出力するものである場合、上記推論用パラメータを、アンサンブル比（各推論結果に対する重み）としてもよい。 In addition, when the inference model is a neural network model, for example, the inference parameters may be hyperparameters such as weight values obtained by learning. Furthermore, when the inference model is an ensemble of inference results of multiple machine learning models to output a final inference result, the inference parameters may be the ensemble ratio (weight for each inference result).

　制御対象５の稼働状態は、推論モデルによる推論の結果や、その結果に応じた介入制御等の影響を受ける。このため、制御対象５に対する制御の内容と制御量の最適値と、推論用パラメータの最適値とを個別に算出した場合、全体として最適なパラメータの組み合わせとならない場合がある。この点、上記の構成によれば、制御対象５に対する制御の内容と制御量の最適値と、推論用パラメータの最適値の両方を探索するから、全体として最適なパラメータの組み合わせを検出することが可能になる。 The operating state of the control object 5 is affected by the results of inference using the inference model and by intervention control based on those results. For this reason, if the optimum value of the control content and control amount for the control object 5 and the optimum value of the inference parameters are calculated separately, the parameter combination that is optimal overall may not be obtained. In this regard, with the above configuration, both the optimum value of the control content and control amount for the control object 5 and the optimum value of the inference parameters are searched for, making it possible to detect the optimum parameter combination overall.

　また、制御対象５の将来の状態が正常であるか異常であるかを予測する推論モデルを用いる場合、当該推論モデルの出力値は、正常である確率および／または異常である確率を示すものとなる。このため、制御装置４において、制御内容決定部４３が、正常であるか異常であるかに応じた制御内容を決定するためには、上記の確率についての閾値を用いて、正常であるか異常であるかを判定する必要がある。 Furthermore, when an inference model is used that predicts whether the future state of the controlled object 5 is normal or abnormal, the output value of the inference model indicates the probability of normality and/or the probability of abnormality. Therefore, in order for the control content decision unit 43 in the control device 4 to decide the control content according to whether it is normal or abnormal, it is necessary to determine whether it is normal or abnormal using a threshold value for the above probability.

　そして、上記閾値をどのような値に設定するかは、状況に応じた妥当な制御内容を決定するために重要である。このため、上記要素ａ～ｄには、制御対象５の将来の状態が正常であるか異常であるかを判定する当該閾値が含まれていてもよい。これにより、正常であるか異常であるかを適切に判定し、その判定結果に応じた妥当な制御を行うことが可能になる。なお、最適化するパラメータに含まれる要素の数は任意であり、また、各要素の内容も上記の例に限られない。 The value at which the above thresholds are set is important in determining appropriate control content according to the situation. For this reason, the above elements a to d may include a threshold for determining whether the future state of the control target 5 is normal or abnormal. This makes it possible to properly determine whether it is normal or abnormal, and to carry out appropriate control according to the determination result. Note that the number of elements included in the parameters to be optimized is arbitrary, and the content of each element is not limited to the above example.

　ここで図６の説明に戻る。図６に示す平行座標は、縦軸がパラメータに含まれる各要素の値である。そして、当該平行座標上に、１回の探索で検出されたパラメータに含まれる要素を折れ線で結んだグラフを図示している。つまり、折れ線と縦軸との交点が各要素の値を示している。このようなグラフは並行座標プロットと呼ばれる。図６に示す並行座標プロットは、探索部３０４による各探索で検出された候補の値の遷移を示しており、遷移情報と呼ぶことができる。 Now, let us return to the explanation of Figure 6. In the parallel coordinate system shown in Figure 6, the vertical axis represents the value of each element included in the parameters. A graph is then shown on the parallel coordinate system, with broken lines connecting the elements included in the parameters detected in one search. In other words, the intersections of the broken lines and the vertical axis represent the values of each element. This type of graph is called a parallel coordinate plot. The parallel coordinate plot shown in Figure 6 shows the transitions in the values of the candidates detected in each search by search unit 304, and can be called transition information.

　このように、表示制御部３０６は、探索部３０４による各探索で検出された候補の値の遷移を示す遷移情報を表示してもよい。これにより、パラメータの最適値の候補が、探索を重ねることによりどのように遷移したかをオペレータに容易に認識させ、最適化が正常に進んでいるかを確認させることができる。 In this way, the display control unit 306 may display transition information indicating the transition of the candidate values detected in each search by the search unit 304. This allows the operator to easily recognize how the candidate optimal values for the parameters have transitioned through repeated searches, and to confirm whether the optimization is proceeding normally.

　特に、最適化の対象とするパラメータが複数の要素を含んでいる場合、表示制御部３０６は、平行座標プロットすなわち各探索において検出された候補の値を要素ごとに示すグラフを平行座標上に表示してもよい。これにより、パラメータに含まれる各要素について検出された各候補が、探索を重ねることによりどのように変化したかをオペレータに容易に認識させ、最適化が正常に進んでいるかを確認させることができる。 In particular, when the parameters to be optimized include multiple elements, the display control unit 306 may display a parallel coordinate plot, i.e., a graph showing the values of the candidates detected in each search for each element, on parallel coordinates. This allows the operator to easily recognize how each candidate detected for each element included in the parameters has changed as the search is repeated, and to confirm whether the optimization is proceeding normally.

　なお、平行座標プロットは遷移情報の表示態様の一例に過ぎず、遷移情報の表示態様は任意である。例えば、表示制御部３０６は、探索部３０４による各探索で検出された候補の値の遷移を示すパラレルセットグラフ、等高線プロット、あるいはコレログラフ等を遷移情報として表示してもよい。 Note that the parallel coordinate plot is merely one example of a display format for the transition information, and the display format for the transition information is arbitrary. For example, the display control unit 306 may display, as the transition information, a parallel set graph, a contour plot, a correlograph, or the like, which shows the transition of the values of the candidates detected in each search by the search unit 304.

　また、図６の例において、折れ線グラフを構成する線分には、実線のものと、破線のものと、一点鎖線のものとがある。これらの線分のパターンは、当該線分上の各要素の値を適用して制御対象５の制御を行った後の制御対象５の稼働状態に応じたものとなっている。具体的には、異常時間率が第１の閾値未満であったときの要素を結ぶ線分は実線、異常時間率が第１の閾値以上、第２の閾値未満であったときの要素を結ぶ線分は一点鎖線、異常時間率が第２の閾値以上であったときの要素を結ぶ線分は破線となっている。 In the example of FIG. 6, the lines constituting the line graph include solid lines, dashed lines, and dashed lines. The patterns of these lines correspond to the operating state of the control object 5 after the control object 5 is controlled by applying the values of each element on the line. Specifically, the lines connecting elements when the abnormal time rate is less than the first threshold are solid lines, the lines connecting elements when the abnormal time rate is equal to or greater than the first threshold and less than the second threshold are dashed lines, and the lines connecting elements when the abnormal time rate is equal to or greater than the second threshold are dashed lines.

　このように、表示制御部３０６は、検出された候補を適用して制御対象５の制御を行った後の制御対象５の稼働状態に応じて遷移情報に含まれる各候補の値の表示態様を異ならせてもよい。これにより、各候補の良し悪しを容易に認識させることができる。なお、稼働状態に応じた表示態様をどのようなものとするかは任意であり、図示の例に限られない。例えば、表示制御部３０６は、稼働状態に応じた色で各候補の値を表示させてもよい。この場合、稼働状態を評価した評価値（例えば異常時間率等）の値と、表示色との対応関係を予め定めておけばよい。これにより、表示制御部３０６は、評価値算出部３０２が算出する評価値に応じた表示色で各候補の値を表示させることができる。 In this way, the display control unit 306 may change the display mode of the value of each candidate included in the transition information depending on the operating state of the control target 5 after the control of the control target 5 is performed by applying the detected candidate. This allows the good and bad of each candidate to be easily recognized. Note that the display mode according to the operating state is arbitrary and is not limited to the example shown in the figure. For example, the display control unit 306 may display the value of each candidate in a color according to the operating state. In this case, it is sufficient to determine in advance the correspondence between the evaluation value (e.g., abnormal time rate) that evaluates the operating state and the display color. This allows the display control unit 306 to display the value of each candidate in a display color according to the evaluation value calculated by the evaluation value calculation unit 302.

　〔情報処理装置２が実行する処理の流れ〕
　情報処理装置２が実行する処理（指標値算出方法）の流れを図７に基づいて説明する。図７は、情報処理装置２が実行する処理の一例を示すフローチャートである。 [Flow of processing executed by information processing device 2]
The flow of the process (index value calculation method) executed by the information processing device 2 will be described with reference to Fig. 7. Fig. 7 is a flowchart showing an example of the process executed by the information processing device 2.

　Ｓ１１では、データ取得部２０１が、訓練データセットを取得する。訓練データセットの取得方法は任意である。例えば、データ取得部２０１は、入力部２３を介して入力された訓練データセットを取得してもよいし、通信部２２を介して学習装置１から訓練データセットを取得してもよい。 In S11, the data acquisition unit 201 acquires a training dataset. The method of acquiring the training dataset is arbitrary. For example, the data acquisition unit 201 may acquire a training dataset input via the input unit 23, or may acquire a training dataset from the learning device 1 via the communication unit 22.

　Ｓ１２では、平均距離算出部２０２が、Ｓ１１で取得された訓練データセットに含まれる各訓練データについて、当該訓練データの近傍の所定数の訓練データとの距離の平均値をそれぞれ算出する。なお、ある訓練データの近傍の所定数の訓練データとは、その訓練データからの距離が最も近い所定数の他の訓練データである。 In S12, the average distance calculation unit 202 calculates, for each training data included in the training data set acquired in S11, the average value of the distance between the training data and a predetermined number of training data in the vicinity of the training data. Note that the predetermined number of training data in the vicinity of a certain training data refers to a predetermined number of other training data that are closest in distance to the training data.

　Ｓ１３では、閾値決定部２０３が、Ｓ１２で算出された複数の平均値のうち所定の順位の平均値を、入力データが外れ値であるか否かを判定するための閾値に決定する。なお、Ｓ１１～Ｓ１３の処理は、遅くともＳ１６の処理が行われるまでに行っておけばよく、必ずしもＳ１４の直前に行う必要はない。 In S13, the threshold determination unit 203 determines the average value of a predetermined rank among the multiple average values calculated in S12 as the threshold value for determining whether the input data is an outlier. Note that the processes of S11 to S13 should be performed at the latest by the time the process of S16 is performed, and do not necessarily have to be performed immediately before S14.

　Ｓ１４では、データ取得部２０１が、推論モデルによる推論に用いられた入力データを取得する。入力データの取得方法は任意である。例えば、データ取得部２０１は、入力部２３を介して入力された入力データを取得してもよい。また、例えば、データ取得部２０１は、通信部２２を介して制御装置４から入力データを取得してもよいし、計測装置５２から計測データを取得して入力データを生成してもよい。 In S14, the data acquisition unit 201 acquires the input data used for inference by the inference model. The method of acquiring the input data is arbitrary. For example, the data acquisition unit 201 may acquire the input data input via the input unit 23. Also, for example, the data acquisition unit 201 may acquire the input data from the control device 4 via the communication unit 22, or may acquire measurement data from the measurement device 52 to generate the input data.

　なお、データ取得部２０１は、所定期間分の入力データをまとめて取得してもよいし、入力データをリアルタイムつまり当該入力データが推論モデルによる推論に用いられたあるいは用いられるタイミングで取得してもよい。 The data acquisition unit 201 may acquire input data for a predetermined period of time all at once, or may acquire the input data in real time, that is, at the time when the input data is used or will be used in inference by the inference model.

　Ｓ１５では、平均距離算出部２０２が、Ｓ１４で取得された入力データについて、当該入力データの近傍の所定数の入力データとの距離の平均値を算出する。なお、Ｓ１４で１つの入力データが取得された場合、平均距離算出部２０２は、それ以前に取得された複数の入力データとの距離の平均値を算出する。一方、Ｓ１４で複数の入力データが取得された場合、平均距離算出部２０２は、入力データと、当該入力データの近傍の所定数の入力データとの距離の平均値を算出する処理を、取得された複数の入力データのそれぞれについて行う。 In S15, the average distance calculation unit 202 calculates the average value of the distance between the input data acquired in S14 and a predetermined number of input data in the vicinity of the input data. If one input data is acquired in S14, the average distance calculation unit 202 calculates the average value of the distance between the input data and the multiple input data acquired previously. On the other hand, if multiple input data are acquired in S14, the average distance calculation unit 202 performs the process of calculating the average value of the distance between the input data and a predetermined number of input data in the vicinity of the input data for each of the multiple acquired input data.

　Ｓ１６（外れ値判定ステップ）では、外れ値判定部２０４が、Ｓ１５で算出された平均値に基づいて、Ｓ１４で取得された入力データが外れ値であるか否かを判定する。より詳細には、外れ値判定部２０４は、Ｓ１５で算出された平均値とＳ１３で決定された閾値とを比較し、その比較結果に基づいて入力データが外れ値であるか否かを判定する。なお、Ｓ１４で複数の入力データが取得された場合には、外れ値判定部２０４は、各入力データについて当該入力データが外れ値であるか否かを判定する。 In S16 (outlier determination step), the outlier determination unit 204 determines whether or not the input data acquired in S14 is an outlier based on the average value calculated in S15. More specifically, the outlier determination unit 204 compares the average value calculated in S15 with the threshold value determined in S13, and determines whether or not the input data is an outlier based on the comparison result. Note that if multiple pieces of input data are acquired in S14, the outlier determination unit 204 determines for each piece of input data whether or not the input data is an outlier.

　Ｓ１７（指標値算出ステップ）では、指標値算出部２０５が、Ｓ１６の判定結果に基づいて入力データに対する推論モデルの適合性を示す指標値を算出する。例えば、指標値算出部２０５は、所定期間に計測装置５２で計測された計測データに対応する入力データのうち、Ｓ１６で外れ値であると判定された入力データの数を指標値として算出してもよい。 In S17 (index value calculation step), the index value calculation unit 205 calculates an index value indicating the suitability of the inference model for the input data based on the determination result of S16. For example, the index value calculation unit 205 may calculate, as the index value, the number of input data that are determined to be outliers in S16 among the input data corresponding to the measurement data measured by the measuring device 52 during a predetermined period.

　Ｓ１８では、再学習要否判定部２０６が、Ｓ１７で算出された指標値に基づいて再学習の要否を判定する。例えば、再学習要否判定部２０６は、Ｓ１７で算出された指標値が所定の閾値以上であった場合に再学習要（Ｓ１８でＹＥＳ）と判定し、当該閾値未満であった場合に再学習不要（Ｓ１８でＮＯ）と判定してもよい。Ｓ１８でＮＯと判定された場合には図７の処理は終了する。なお、リアルタイムで入力データを取得する場合、Ｓ１８でＮＯと判定された場合にＳ１４に戻り、新たな入力データを取得するようにしてもよい。 In S18, the relearning necessity determination unit 206 determines whether relearning is necessary based on the index value calculated in S17. For example, the relearning necessity determination unit 206 may determine that relearning is necessary (YES in S18) if the index value calculated in S17 is equal to or greater than a predetermined threshold value, and may determine that relearning is not necessary (NO in S18) if the index value is less than the threshold value. If NO is determined in S18, the processing in FIG. 7 ends. Note that when input data is acquired in real time, if NO is determined in S18, the process may return to S14 and new input data may be acquired.

　Ｓ１８でＹＥＳと判定された場合にはＳ１９に進み、再学習要否判定部２０６は、学習装置１にＳ１４で取得された入力データを送信して推論モデルの再学習を行わせ、これにより図７の処理は終了する。 If the answer is YES in S18, the process proceeds to S19, where the re-learning necessity determination unit 206 transmits the input data acquired in S14 to the learning device 1 to re-learn the inference model, and the process in FIG. 7 ends.

　なお、再現率算出部２０７は、Ｓ１４で取得された入力データを用いて行われた推論の再現率を算出してもよい。そして、学習用データ抽出部２０８は、Ｓ１７で算出された指標値と、再現率算出部２０７が算出する再現率とに基づいて、Ｓ１４で取得された複数の入力データの中から推論モデルの再学習に用いるものを抽出してもよい。そして、学習用データ抽出部２０８は、Ｓ１８でＹＥＳと判定されたときに、再学習に用いる入力データを学習装置１に送信するようにしてもよい。 The reproducibility calculation unit 207 may calculate the reproducibility of the inference performed using the input data acquired in S14. The learning data extraction unit 208 may extract input data to be used for re-learning the inference model from the multiple input data acquired in S14 based on the index value calculated in S17 and the reproducibility calculated by the reproducibility calculation unit 207. The learning data extraction unit 208 may transmit the input data to be used for re-learning to the learning device 1 when the determination in S18 is YES.

　〔再学習を促す構成について〕
　また、Ｓ１９において、再学習要否判定部２０６は、再学習を行うように学習装置１を制御する代わりに、オペレータに推論モデルの再学習を促す処理を行ってもよい。この場合、再学習要否判定部２０６は、外れ値判定部２０４の判定結果に応じて推論モデルの再学習を促す報知部として機能する。なお、報知の態様および報知の対象は何れも特に限定されない。例えば、報知部２１０は、再学習が必要であることを示す情報を出力することによって報知してもよい。出力先は特に限定されず、例えば報知部２１０は、出力部２４に当該情報を出力させてもよいし、学習装置１に当該情報を出力させてもよく、オペレータの所持する端末装置等の他の装置に当該情報を出力させてもよい。 [Structure to encourage relearning]
In addition, in S19, the relearning necessity determination unit 206 may perform a process of prompting the operator to relearn the inference model instead of controlling the learning device 1 to perform relearning. In this case, the relearning necessity determination unit 206 functions as a notification unit that prompts the operator to relearn the inference model according to the determination result of the outlier determination unit 204. Note that the notification mode and the notification target are not particularly limited. For example, the notification unit 210 may notify by outputting information indicating that relearning is necessary. The output destination is not particularly limited, and for example, the notification unit 210 may output the information to the output unit 24, may output the information to the learning device 1, or may output the information to another device such as a terminal device owned by the operator.

　また、再学習を促す処理を行う場合、指標値算出部２０５は省略し、再学習要否判定部２０６が外れ値判定部２０４の判定結果を用いて再学習の要否を判定してもよい。例えば、再学習要否判定部２０６は、Ｓ１６で外れ値であると判定された入力データの数をカウントし、カウントした数が所定の閾値以上であった場合に再学習要と判定してもよい。 Also, when performing processing to prompt relearning, the index value calculation unit 205 may be omitted, and the relearning necessity determination unit 206 may determine whether relearning is necessary using the determination result of the outlier determination unit 204. For example, the relearning necessity determination unit 206 may count the number of input data items determined to be outliers in S16, and determine that relearning is necessary if the counted number is equal to or greater than a predetermined threshold value.

　このように、情報処理装置２は、推論モデルの生成に用いた訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を用いて、推論モデルに入力された複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する外れ値判定部２０４と、外れ値判定部２０４の判定結果に応じて推論モデルの再学習を促す再学習要否判定部（報知部）２０６と、を備える構成であってもよい。これにより、推論結果の正否の検証が難しい場合にも、推論モデルの更新に関する処理である再学習を促す処理を適切に行うことが可能になる。 In this way, the information processing device 2 may be configured to include an outlier determination unit 204 that determines whether or not each of a plurality of input data input to the inference model is an outlier using a threshold value for determining whether or not the training data included in the training dataset used to generate the inference model is an outlier, and a relearning necessity determination unit (alert unit) 206 that prompts relearning of the inference model depending on the determination result of the outlier determination unit 204. This makes it possible to appropriately perform processing to prompt relearning, which is processing related to updating the inference model, even when it is difficult to verify the correctness of the inference result.

　〔情報処理装置２で再学習を行う構成について〕
　また、情報処理装置２は、推論モデルの再学習を行う学習部を備えていてもよい。この場合、制御システム７から学習装置１を省略することができる。そして、この場合、図７のＳ１８において再学習要と判定されると、Ｓ１９では情報処理装置２が備える学習部により再学習が行われる。 [Configuration for performing re-learning in information processing device 2]
The information processing device 2 may also include a learning unit that performs re-learning of the inference model. In this case, the learning device 1 can be omitted from the control system 7. In this case, when it is determined in S18 of Fig. 7 that re-learning is required, in S19, re-learning is performed by the learning unit included in the information processing device 2.

　この場合も、上記〔再学習を促す構成について〕で説明したように、指標値算出部２０５は省略してもよい。指標値算出部２０５を省略する場合、外れ値判定部２０４の判定結果を用いて再学習のタイミングが決定される。例えば、再学習要否判定部２０６が外れ値判定部２０４の判定結果を用いて再学習の要否を判定してもよく、この場合、再学習要と判定された時点が再学習のタイミングとなる。また、再学習要否判定部２０６を省略し、学習部が再学習を行うか否かを判定してもよく、この場合、学習部が外れ値判定部２０４の判定結果に基づいて再学習のタイミングを決定することになる。 In this case, too, as explained above in [Configuration for encouraging re-learning], the index value calculation unit 205 may be omitted. When the index value calculation unit 205 is omitted, the timing of re-learning is determined using the judgment result of the outlier judgment unit 204. For example, the relearning necessity judgment unit 206 may judge whether or not re-learning is necessary using the judgment result of the outlier judgment unit 204, in which case the time when it is judged that re-learning is necessary becomes the timing of re-learning. Also, the relearning necessity judgment unit 206 may be omitted and the learning unit may judge whether or not to re-learn, in which case the learning unit will determine the timing of re-learning based on the judgment result of the outlier judgment unit 204.

　このように、情報処理装置２は、推論モデルの生成に用いた訓練データセットに含まれる訓練データが外れ値であるか否かを判定するための閾値を用いて、推論モデルに入力された複数の入力データのそれぞれについて、当該入力データが外れ値であるか否かを判定する外れ値判定部２０４と、外れ値判定部２０４の判定結果に基づいて決定されたタイミングで推論モデルの再学習を行う学習部と、を備える構成であってもよい。 In this way, the information processing device 2 may be configured to include an outlier determination unit 204 that determines whether or not each of a plurality of input data input to the inference model is an outlier using a threshold value for determining whether or not the training data included in the training dataset used to generate the inference model is an outlier, and a learning unit that re-learns the inference model at a timing determined based on the determination result of the outlier determination unit 204.

　そして、学習済みの範囲から乖離した入力データの数や割合が増えたとき、入力データに対する推論モデルの適合性は低くなっているといえる。また、このとき、推論モデルの再学習や交換の必要性は高くなっているといえる。よって、外れ値判定部２０４の判定結果に基づいて決定されたタイミングで推論モデルの再学習を行う上記の構成によれば、妥当なタイミングで再学習を行うことができる。 When the number or proportion of input data that deviates from the learned range increases, it can be said that the suitability of the inference model for the input data is decreasing. At this time, it can be said that there is a high need to re-learn or replace the inference model. Therefore, according to the above configuration in which the inference model is re-learned at a timing determined based on the judgment result of the outlier judgment unit 204, re-learning can be performed at an appropriate timing.

　また、上記の構成によれば、推論モデルによる推論結果を用いる必要がない。このため、上記の構成によれば、推論結果の正否の検証が難しい場合にも、推論モデルの更新に関する処理である再学習を適切に行うことが可能になる。 Furthermore, with the above configuration, there is no need to use the inference results from the inference model. Therefore, with the above configuration, even when it is difficult to verify the accuracy of the inference results, it is possible to appropriately perform re-learning, which is a process for updating the inference model.

　〔情報処理装置３が実行する処理の流れ〕
　情報処理装置３が実行する処理（探索方法）の流れを図８に基づいて説明する。図８は、情報処理装置３が実行する処理の一例を示すフローチャートである。なお、図８の処理を実行する契機は特に限定されない。例えば、所定期間おきに実行するようにしてもよいし、制御対象５の稼働状態を示す異常時間率等の情報を基に、稼働状態が悪化しているあるいは悪化傾向にあると判定されたタイミングで実行するようにしてもよい。 [Flow of processing executed by information processing device 3]
The flow of the process (search method) executed by the information processing device 3 will be described with reference to Fig. 8. Fig. 8 is a flowchart showing an example of the process executed by the information processing device 3. The timing for executing the process in Fig. 8 is not particularly limited. For example, the process may be executed at predetermined intervals, or may be executed at a timing when it is determined that the operating state of the control target 5 has deteriorated or is showing a tendency to deteriorate based on information such as an abnormal time rate indicating the operating state of the control target 5.

　Ｓ２１では、データ取得部３０１が、最適化の対象となるパラメータの初期値と、当該初期値に対応する結果データとを取得する。例えば、データ取得部３０１は、入力部３３を介して入力される初期値と結果データを取得してもよい。パラメータの初期値に対応する結果データは、当該初期値を適用した制御対象５の制御が行われた期間における制御対象５の稼働状態を示すものであり、例えば当該期間に計測装置５２により計測された計測値（計測データ）であってもよい。 In S21, the data acquisition unit 301 acquires the initial values of the parameters to be optimized and the result data corresponding to the initial values. For example, the data acquisition unit 301 may acquire the initial values and result data input via the input unit 33. The result data corresponding to the initial values of the parameters indicates the operating state of the control object 5 during the period in which the control of the control object 5 was performed using the initial values, and may be, for example, measurement values (measurement data) measured by the measuring device 52 during that period.

　また、データ取得部３０１は、Ｓ２１において、初期値とその結果データに加えて、パラメータの上限値およびその上限値を適用したときの結果データと、パラメータの下限値およびその下限値を適用したときの結果データとを取得していてもよい。 In addition, in S21, the data acquisition unit 301 may acquire, in addition to the initial value and the resulting data, an upper limit value of the parameter and the resulting data when the upper limit value is applied, and a lower limit value of the parameter and the resulting data when the lower limit value is applied.

　パラメータの初期値の決定方法は特に限定されない。例えば、記述統計的手法等によりパラメータの初期値を決定してもよい。この場合、例えば、異常発生前後のパラメータの値の分布を示す度数分布表などを作成し、それを基に最適と考えられるパラメータの値を特定し、特定した値を初期値としてもよい。初期値が決まれば、パラメータの値を初期値に設定して制御対象５の制御を所定期間行い、当該期間における稼働状態を示す結果データを取得することができる。 The method for determining the initial parameter values is not particularly limited. For example, the initial parameter values may be determined by descriptive statistical methods or the like. In this case, for example, a frequency distribution table showing the distribution of parameter values before and after the occurrence of an abnormality may be created, and based on that, a parameter value considered to be optimal may be identified, and the identified value may be used as the initial value. Once the initial value is determined, the parameter value may be set to the initial value, and control of the control target 5 may be performed for a predetermined period, and result data showing the operating state during that period may be obtained.

　また、パラメータの上限値と下限値の決定方法も特に限定されない。例えば、初期値と同様に記述統計的手法等により決定してもよいし、初期値を基準とし、初期値から所定幅だけ乖離した値を上限値および下限値としてもよい。後者の場合、例えば所定幅を１０％として、初期値に初期値の１０％の値を加えて上限値とし、初期値から初期値の１０％の値を差し引いて下限値としてもよい。上限値および下限値のそれぞれについても、初期値と同様にして結果データを取得することができる。 Furthermore, there is no particular limitation on the method of determining the upper and lower limits of the parameters. For example, they may be determined by descriptive statistical methods, as with the initial values, or the initial value may be used as a reference and values that deviate from the initial value by a specified range may be set as the upper and lower limits. In the latter case, for example, the specified range may be set to 10%, with the upper limit being determined by adding 10% of the initial value to the initial value, and the lower limit being determined by subtracting 10% of the initial value from the initial value. Result data can be obtained for each of the upper and lower limits in the same manner as for the initial values.

　Ｓ２２では、評価値算出部３０２が、Ｓ２１で取得された結果データを用いて、初期値を適用して制御対象５の制御を行った期間における制御対象５の稼働状態を評価した評価値を算出する。評価値としては、例えば上述した異常時間率を適用してもよい。なお、異常時間率を算出する際の「異常」の定義は、推論モデルの学習の際と同じとしてもよいし、より広い定義を適用してもよい。 In S22, the evaluation value calculation unit 302 uses the result data acquired in S21 to calculate an evaluation value that evaluates the operating state of the control target 5 during the period in which the control target 5 was controlled by applying the initial value. As the evaluation value, for example, the abnormal time rate described above may be applied. Note that the definition of "abnormality" when calculating the abnormal time rate may be the same as that when learning the inference model, or a broader definition may be applied.

　Ｓ２３（予測分布を算出するステップ）では、予測分布算出部３０３が、Ｓ２１で取得された初期値と、Ｓ２２で算出された評価値とに基づいて、最適化の対象とするパラメータと制御対象５の稼働状態との関係を示す関数の予測分布を算出する。なお、表示制御部３０６は、算出された予測分布を出力部３４に出力させる等してオペレータに提示してもよい。 In S23 (step of calculating the predictive distribution), the predictive distribution calculation unit 303 calculates the predictive distribution of a function indicating the relationship between the parameters to be optimized and the operating state of the control target 5 based on the initial values acquired in S21 and the evaluation values calculated in S22. The display control unit 306 may present the calculated predictive distribution to the operator by, for example, outputting it to the output unit 34.

　Ｓ２４（候補を探索するステップ）では、探索部３０４が、Ｓ２３で算出された予測分布に基づいてパラメータの最適値の候補を探索する。また、表示制御部３０６は、検出された候補を出力部３４に出力させる等して当該候補をオペレータに提示する。 In S24 (step of searching for candidates), the search unit 304 searches for candidates for the optimal parameter value based on the predictive distribution calculated in S23. In addition, the display control unit 306 presents the detected candidates to the operator by, for example, outputting the candidates to the output unit 34.

　この後、オペレータは、提示された候補を適用することに問題がないか確認し、問題ないと判断した場合には、制御装置４に当該候補を入力して、当該候補を適用した制御対象５の制御を開始させる。そして、オペレータは、当該候補を適用した制御が行われた期間に計測装置５２で計測された計測データ等を制御対象５の稼働状態を示す結果データとして、適用したパラメータの値（提示された候補の値）と共に情報処理装置３に入力する。これらの処理はオペレータを介さず自動で行うようにしてもよい。 Then, the operator checks whether there is any problem with applying the presented candidate, and if it is determined that there is no problem, inputs the candidate into the control device 4 and starts control of the control object 5 applying the candidate. The operator then inputs the measurement data etc. measured by the measuring device 52 during the period in which control applying the candidate was performed, together with the values of the applied parameters (the values of the presented candidates) into the information processing device 3 as result data indicating the operating state of the control object 5. These processes may be performed automatically without the intervention of the operator.

　なお、オペレータが、Ｓ２４の探索の結果として提示された候補を適用することに問題があると判断した場合、Ｓ２５以下の処理はスキップされる。この場合、探索部３０４は、先に検出した候補とは異なる候補を探索してもよい。また、予めパラメータの正常範囲を定めておいてもよい。この場合、オペレータの判断によらず、Ｓ２４で検出された候補が当該正常範囲外である場合に、Ｓ２５以下の処理がスキップされる。 If the operator determines that there is a problem with applying the candidates presented as a result of the search in S24, the processing from S25 onwards is skipped. In this case, the search unit 304 may search for a candidate different from the previously detected candidate. Also, a normal range for the parameters may be determined in advance. In this case, regardless of the operator's judgment, if the candidate detected in S24 is outside the normal range, the processing from S25 onwards is skipped.

　Ｓ２５では、データ取得部３０１が、上述のようにして入力された、適用されたパラメータの値と結果データとを取得する。なお、データ取得部３０１は、適用されたパラメータの値と結果データを制御装置４から取得してもよい。そして、Ｓ２６では、評価値算出部３０２が、Ｓ２５で取得された結果データを用いて評価値を算出する。 In S25, the data acquisition unit 301 acquires the applied parameter values and result data input as described above. The data acquisition unit 301 may acquire the applied parameter values and result data from the control device 4. Then, in S26, the evaluation value calculation unit 302 calculates an evaluation value using the result data acquired in S25.

　Ｓ２７では、最適化制御部３０５が、Ｓ２６で算出された評価値に基づいて最適化を終了するか否かを判定する。例えば、最適化制御部３０５は、評価値が所定の閾値以上である場合に最適化を終了する（Ｓ２７でＹＥＳ）と判定し、評価値が当該閾値未満である場合に最適化を続ける（Ｓ２７でＮＯ）と判定してもよい。Ｓ２７でＹＥＳと判定された場合には図８の処理は終了し、Ｓ２７でＮＯと判定された場合にはＳ２３の処理に戻る。Ｓ２７から遷移したＳ２３では、Ｓ２５で取得されたパラメータの値と結果データとを用いて予測分布が更新される。 In S27, the optimization control unit 305 determines whether or not to end the optimization based on the evaluation value calculated in S26. For example, the optimization control unit 305 may determine to end the optimization (YES in S27) if the evaluation value is equal to or greater than a predetermined threshold, and may determine to continue the optimization (NO in S27) if the evaluation value is less than the threshold. If YES is determined in S27, the processing of FIG. 8 ends, and if NO is determined in S27, the processing returns to S23. In S23, which is transitioned from S27, the predictive distribution is updated using the parameter values and result data acquired in S25.

　〔変形例〕
　上述の各実施形態で説明した各処理の実行主体は任意であり、上述の例に限られない。つまり、相互に通信可能な複数の情報処理装置（プロセッサということもできる）により、学習装置１、情報処理装置２、３、および制御装置４の機能を実現することができる。例えば、図７および図８のフローチャートに記載されている各処理を複数の情報処理装置に分担させることもできる。つまり、上述の各実施形態における制御方法の実行主体は、１つの情報処理装置であってもよいし、複数の情報処理装置であってもよい。 [Modifications]
The execution subject of each process described in each of the above-mentioned embodiments is arbitrary and is not limited to the above-mentioned examples. In other words, the functions of the learning device 1, the

information processing devices

2 and 3, and the control device 4 can be realized by a plurality of information processing devices (which can also be called processors) that can communicate with each other. For example, each process described in the flowcharts of Figures 7 and 8 can be shared among a plurality of information processing devices. In other words, the execution subject of the control method in each of the above-mentioned embodiments may be one information processing device or a plurality of information processing devices.

　また、上述のように、学習装置１の機能を情報処理装置２に持たせてこれらを１つの装置に統合してもよいし、学習装置１、情報処理装置２、および情報処理装置３の機能の一部または全部を制御装置４に持たせてもよい。このように、制御システム７をどのような装置で構成するかについても適宜変更することが可能である。 Furthermore, as described above, the functions of the learning device 1 may be provided in the information processing device 2 and these may be integrated into a single device, or some or all of the functions of the learning device 1, the information processing device 2, and the information processing device 3 may be provided in the control device 4. In this way, it is possible to appropriately change the types of devices that constitute the control system 7.

　〔ソフトウェアによる実現例〕
　学習装置１、情報処理装置２、３、および制御装置４（以下、「装置」と呼ぶ）の機能は、当該装置としてコンピュータを機能させるためのプログラムであって、当該装置の各制御ブロック（特に制御部２０および制御部３０に含まれる各部）としてコンピュータを機能させるためのプログラム（指標値算出プログラム／探索プログラム）により実現することができる。 [Software implementation example]
The functions of the learning device 1,

information processing devices

2, 3, and control device 4 (hereinafter referred to as "devices") can be realized by a program for causing a computer to function as the device, and a program (index value calculation program/search program) for causing a computer to function as each control block of the device (particularly each part included in the control unit 20 and the control unit 30).

　この場合、上記装置は、上記プログラムを実行するためのハードウェアとして、少なくとも１つの制御装置（例えばプロセッサ）と少なくとも１つの記憶装置（例えばメモリ）を有するコンピュータを備えている。この制御装置と記憶装置により上記プログラムを実行することにより、上記各実施形態で説明した各機能が実現される。 In this case, the device includes a computer having at least one control device (e.g., a processor) and at least one storage device (e.g., a memory) as hardware for executing the program. The functions described in each of the above embodiments are realized by executing the program using this control device and storage device.

　上記プログラムは、一時的ではなく、コンピュータ読み取り可能な、１または複数の記録媒体に記録されていてもよい。この記録媒体は、上記装置が備えていてもよいし、備えていなくてもよい。後者の場合、上記プログラムは、有線または無線の任意の伝送媒体を介して上記装置に供給されてもよい。 The program may be recorded on one or more computer-readable recording media, not on a temporary basis. The recording media may or may not be included in the device. In the latter case, the program may be supplied to the device via any wired or wireless transmission medium.

　また、上記各制御ブロックの機能の一部または全部は、論理回路により実現することも可能である。例えば、上記各制御ブロックとして機能する論理回路が形成された集積回路も本発明の範疇に含まれる。この他にも、例えば量子コンピュータにより上記各制御ブロックの機能を実現することも可能である。 Furthermore, some or all of the functions of each of the above control blocks can be realized by a logic circuit. For example, an integrated circuit in which a logic circuit that functions as each of the above control blocks is formed is also included in the scope of the present invention. In addition, it is also possible to realize the functions of each of the above control blocks by, for example, a quantum computer.

　〔まとめ〕
　本発明の態様１に係る情報処理装置は、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出する予測分布算出部と、前記予測分布に基づいて前記パラメータの最適値の候補を探索する探索部と、を備える。〔summary〕
An information processing device according to aspect 1 of the present invention includes a predictive distribution calculation unit that calculates a predictive distribution of a function indicating the relationship between parameters relating to at least one of an inference and the control in a control system that performs control on a specified controlled object in accordance with the result of an inference using an inference model and the operating state of the controlled object during a period in which the parameters are applied, and a search unit that searches for candidates for an optimal value of the parameter based on the predictive distribution.

　本発明の態様２に係る情報処理装置は、前記態様１において、前記パラメータは、前記推論の結果に応じた、前記制御対象に対する制御の内容と制御量とを示すものである。 In the information processing device according to aspect 2 of the present invention, in the above-mentioned aspect 1, the parameters indicate the content and amount of control of the control object according to the result of the inference.

　本発明の態様３に係る情報処理装置は、前記態様２において、前記パラメータには、前記推論モデルによる推論に用いられる推論用パラメータが含まれており、前記探索部は、前記予測分布に基づいて、前記制御対象に対する制御の内容と制御量の最適値の候補と、前記推論用パラメータの最適値の候補の両方を探索する。 In the information processing device according to aspect 3 of the present invention, in the above aspect 2, the parameters include inference parameters used for inference by the inference model, and the search unit searches for both the content of control and the optimal value of the control amount for the control object, and the optimal value of the inference parameters, based on the predictive distribution.

　本発明の態様４に係る情報処理装置は、前記態様１において、前記推論モデルは、前記制御対象の将来の稼働状態を示す値を予測するものであり、前記パラメータは、前記制御対象の将来の稼働状態が正常であるか否かを判定するための閾値である。 In the information processing device according to aspect 4 of the present invention, in the above aspect 1, the inference model predicts a value indicating the future operating state of the control object, and the parameter is a threshold value for determining whether the future operating state of the control object is normal or not.

　本発明の態様５に係る情報処理装置は、前記態様１から４の何れかにおいて、前記推論モデルは、前記制御対象の将来の稼働状態を示す値を予測するものであり、前記予測分布算出部は、前記制御対象の稼働状態を示す情報として、前記制御対象の稼働時間に対する、前記制御対象の稼働状態が異常であった時間の比を用いる。 In the information processing device according to aspect 5 of the present invention, in any one of aspects 1 to 4, the inference model predicts a value indicating the future operating state of the control object, and the predictive distribution calculation unit uses the ratio of the time during which the operating state of the control object was abnormal to the operating time of the control object as information indicating the operating state of the control object.

　本発明の態様６に係る情報処理装置は、前記態様１から５の何れかにおいて、前記予測分布算出部は、前記探索部が検出した前記候補と、当該候補が適用されている期間における前記制御対象の稼働状態とに基づいて前記予測分布を更新し、前記探索部は、更新後の前記予測分布に基づいて前記パラメータの最適値の新たな候補を探索する。 In the information processing device according to aspect 6 of the present invention, in any one of aspects 1 to 5, the predictive distribution calculation unit updates the predictive distribution based on the candidate detected by the search unit and the operating state of the control target during the period in which the candidate is applied, and the search unit searches for a new candidate for the optimal value of the parameter based on the updated predictive distribution.

　本発明の態様７に係る情報処理装置は、前記態様６において、前記探索部による各探索で検出された前記候補の値の遷移を示す遷移情報を表示する表示制御部を備える。 The information processing device according to aspect 7 of the present invention is the same as in aspect 6, but includes a display control unit that displays transition information indicating the transition of the candidate values detected in each search by the search unit.

　本発明の態様８に係る情報処理装置は、前記態様７において、前記表示制御部は、前記候補を適用して前記制御対象の制御を行った後の前記制御対象の稼働状態に応じて前記遷移情報に含まれる各候補の値の表示態様を異ならせる。 In the information processing device according to aspect 8 of the present invention, in the above-mentioned aspect 7, the display control unit changes the display mode of the value of each candidate included in the transition information depending on the operating state of the control object after applying the candidate to control the control object.

　本発明の態様９に係る制御システムは、前記態様１に記載の情報処理装置と、前記情報処理装置が検出する前記パラメータの最適値の候補を適用して前記制御対象に対する制御を行う制御装置と、を含み、前記情報処理装置は、前記候補が適用されている期間における前記稼働状態を示す結果データを取得し、当該結果データに基づいて前記予測分布を更新すると共に、更新後の予測分布に基づいて前記パラメータの最適値の新たな候補を探索する。 A control system according to aspect 9 of the present invention includes the information processing device according to aspect 1, and a control device that applies candidates for optimal values of the parameters detected by the information processing device to control the control target, and the information processing device obtains result data indicating the operating state during the period in which the candidates are applied, updates the predictive distribution based on the result data, and searches for new candidates for optimal values of the parameters based on the updated predictive distribution.

　本発明の態様１０に係る探索方法は、少なくとも１つの情報処理装置が実行する探索方法であって、推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出するステップと、前記予測分布に基づいて前記パラメータの最適値の候補を探索するステップと、を含む。 The search method according to aspect 10 of the present invention is a search method executed by at least one information processing device, and includes the steps of: calculating a predictive distribution of a function indicating the relationship between a parameter relating to at least one of the inference and the control in a control system that controls a specified control object according to the result of inference by an inference model, and the operating state of the control object during a period in which the parameter is applied; and searching for a candidate optimal value for the parameter based on the predictive distribution.

　本発明の態様１１に係る探索プログラムは、請求項１に記載の情報処理装置としてコンピュータを機能させるための探索プログラムであって、前記予測分布算出部および前記探索部としてコンピュータを機能させる。 The search program according to aspect 11 of the present invention is a search program for causing a computer to function as the information processing device described in claim 1, and causes the computer to function as the predictive distribution calculation unit and the search unit.

　本発明は上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 The present invention is not limited to the above-described embodiments, and various modifications are possible within the scope of the claims. The technical scope of the present invention also includes embodiments obtained by appropriately combining the technical means disclosed in the different embodiments.

　３　　　情報処理装置
　３０３　予測分布算出部
　３０４　探索部
　３０６　表示制御部
　４　　　制御装置
　７　　　制御システム 3 Information processing device 303 Prediction distribution calculation unit 304 Search unit 306 Display control unit 4 Control device 7 Control system

Claims

　推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出する予測分布算出部と、
　前記予測分布に基づいて前記パラメータの最適値の候補を探索する探索部と、を備える情報処理装置。 a predictive distribution calculation unit that calculates a predictive distribution of a function indicating a relationship between a parameter related to at least one of an inference and the control in a control system that controls a predetermined control object according to a result of an inference by an inference model and an operating state of the control object during a period in which the parameter is applied;
a search unit that searches for candidates for the optimal value of the parameter based on the predictive distribution.
　前記パラメータは、前記推論の結果に応じた、前記制御対象に対する制御の内容と制御量とを示すものである、請求項１に記載の情報処理装置。 The information processing device according to claim 1, wherein the parameters indicate the content and amount of control of the control object according to the result of the inference.
　前記パラメータには、前記推論モデルによる推論に用いられる推論用パラメータが含まれており、
　前記探索部は、前記予測分布に基づいて、前記制御対象に対する制御の内容と制御量の最適値の候補と、前記推論用パラメータの最適値の候補の両方を探索する、請求項２に記載の情報処理装置。 The parameters include inference parameters used for inference by the inference model,
The information processing apparatus according to claim 2 , wherein the search unit searches for both candidates for the optimum value of the control content and the control amount for the control object, and candidates for the optimum value of the inference parameter, based on the predictive distribution.
　前記推論モデルは、前記制御対象の将来の稼働状態を示す値を予測するものであり、
　前記パラメータは、前記制御対象の将来の稼働状態が正常であるか否かを判定するための閾値である、請求項１に記載の情報処理装置。 The inference model predicts a value indicating a future operating state of the control object,
The information processing apparatus according to claim 1 , wherein the parameter is a threshold value for determining whether a future operating state of the controlled object is normal or not.
　前記推論モデルは、前記制御対象の将来の稼働状態を示す値を予測するものであり、
　前記予測分布算出部は、前記制御対象の稼働状態を示す情報として、前記制御対象の稼働時間に対する、前記制御対象の稼働状態が異常であった時間の比を用いる、請求項１から４の何れか１項に記載の情報処理装置。 The inference model predicts a value indicating a future operating state of the control object,
The information processing device according to claim 1 , wherein the predictive distribution calculation unit uses a ratio of a time during which the operating state of the control object was abnormal to an operating time of the control object as the information indicating the operating state of the control object.
　前記予測分布算出部は、前記探索部が検出した前記候補と、当該候補が適用されている期間における前記制御対象の稼働状態とに基づいて前記予測分布を更新し、
　前記探索部は、更新後の前記予測分布に基づいて前記パラメータの最適値の新たな候補を探索する、請求項１から４の何れか１項に記載の情報処理装置。 the predictive distribution calculation unit updates the predictive distribution based on the candidate detected by the search unit and an operating state of the control target during a period in which the candidate is applied; and
The information processing device according to claim 1 , wherein the search unit searches for new candidates for the optimal values of the parameters based on the updated predictive distribution.
　前記探索部による各探索で検出された前記候補の値の遷移を示す遷移情報を表示する表示制御部を備える、請求項６に記載の情報処理装置。 The information processing device according to claim 6, further comprising a display control unit that displays transition information indicating the transition of the candidate values detected in each search by the search unit.
　前記表示制御部は、前記候補を適用して前記制御対象の制御を行った後の前記制御対象の稼働状態に応じて前記遷移情報に含まれる各候補の値の表示態様を異ならせる、請求項７に記載の情報処理装置。 The information processing device according to claim 7, wherein the display control unit changes the display mode of the value of each candidate included in the transition information depending on the operating state of the control object after the candidate is applied to control the control object.
　請求項１に記載の情報処理装置と、
　前記情報処理装置が検出する前記パラメータの最適値の候補を適用して前記制御対象に対する制御を行う制御装置と、を含み、
　前記情報処理装置は、前記候補が適用されている期間における前記稼働状態を示す結果データを取得し、当該結果データに基づいて前記予測分布を更新すると共に、更新後の予測分布に基づいて前記パラメータの最適値の新たな候補を探索する、制御システム。 The information processing device according to claim 1 ;
a control device that applies the candidate optimal values of the parameters detected by the information processing device to control the control target;
The information processing device obtains result data indicating the operating state during a period in which the candidate is applied, updates the predictive distribution based on the result data, and searches for new candidates for the optimal value of the parameter based on the updated predictive distribution.
　少なくとも１つの情報処理装置が実行する探索方法であって、
　推論モデルによる推論の結果に応じて所定の制御対象に対する制御を行う制御システムにおける当該推論および当該制御の少なくとも何れかに関するパラメータと、当該パラメータが適用されている期間における前記制御対象の稼働状態との関係を示す関数の予測分布を算出するステップと、
　前記予測分布に基づいて前記パラメータの最適値の候補を探索するステップと、を含む探索方法。 A search method executed by at least one information processing device, comprising:
A step of calculating a predictive distribution of a function showing a relationship between parameters related to at least one of an inference and the control in a control system that controls a specified controlled object according to the result of an inference by an inference model and an operating state of the controlled object during a period in which the parameters are applied;
searching for candidates for the optimal value of the parameter based on the predictive distribution.
　請求項１に記載の情報処理装置としてコンピュータを機能させるための探索プログラムであって、前記予測分布算出部および前記探索部としてコンピュータを機能させるための探索プログラム。 A search program for causing a computer to function as the information processing device according to claim 1, the search program causing a computer to function as the predictive distribution calculation unit and the search unit.