JP2023079180A

JP2023079180A - Information processing system, information processing method and information processing program

Info

Publication number: JP2023079180A
Application number: JP2022183471A
Authority: JP
Inventors: 隼人久米村; Hayato Kumemura
Original assignee: Datafluct; Datafluct Inc
Current assignee: Datafluct; Datafluct Inc
Priority date: 2021-11-26
Filing date: 2022-11-16
Publication date: 2023-06-07

Abstract

To provide an information processing system, an information processing method, and an information processing program that enable selection of an appropriate learner. SOLUTION: An information processing system comprises a control unit. The control unit is configured to perform the following steps. In a data reception step, input of first input data is received. In a learner identification step, a plurality of learners are identified based on the received first input data. In a model display step, model information on the learning models generated by the identified learners is displayed, based on the first input data, in a mode enabling comparison for each learner. SELECTED DRAWING: Figure 5

Description

The present invention relates to an information processing system, an information processing method, and an information processing program.

The following document on learners can be cited as prior art.

Japanese Patent Application Laid-Open No. 2021-177428

A learning model generated by a learner may differ depending on the learner's algorithm and other factors, even when the same input data is used. A user therefore needs to select a learner appropriate to the input data in order to improve the accuracy of predictions made with the learning model. However, selecting an appropriate learner can require a relatively high level of data science knowledge.

(1) According to one aspect of the present invention, an information processing system is provided. The information processing system includes a control unit. The control unit is configured to execute the following steps. In a data reception step, input of first input data is received. In a learner identification step, a plurality of learners are identified according to the received first input data. In a model display step, based on the first input data, model information on the learning models generated by the identified learners is displayed in a manner that allows comparison for each learning model.

Such an information processing system can lower the level of data science knowledge required of the user compared with before.

FIG. 1 is a configuration diagram showing the information processing system 1.
FIG. 2 is a block diagram showing the hardware configuration of the information processing device 2.
FIG. 3 is a block diagram showing the hardware configuration of the user terminal 3.
FIG. 4 shows an example of the functional units included in the control unit 23.
FIG. 5 is an activity diagram showing an example of the flow of information processing executed in the information processing system 1.
FIG. 6 is an example of the reception window 4 displayed on the display unit 34.
FIG. 7 is an example of the data window 5 and the conversion processing window 6 displayed on the display unit 34.
FIG. 8 is a diagram showing an example of the conversion processing window 6 displayed on the display unit 34 in the second display mode 6b.
FIG. 9 is a diagram showing an example of the model information display window 7 displayed on the display unit 34.
FIG. 10 is an example of the model search window 8 and the model comparison window 9 displayed on the display unit 34.

Embodiments of the present invention are described below with reference to the drawings. The various features shown in the embodiments below can be combined with one another.

The program for realizing the software that appears in this embodiment may be provided as a non-transitory computer-readable medium, may be provided so as to be downloadable from an external server, or may be provided so that the program is launched on an external computer and its functions are realized on a client terminal (so-called cloud computing).

In this embodiment, a "unit" may also include, for example, a combination of hardware resources implemented by circuits in a broad sense and the software information processing that can be concretely realized by those hardware resources. This embodiment handles various kinds of information. Such information is represented by, for example, physical values of signal values representing voltage or current, high and low signal values as a set of binary bits consisting of 0 or 1, or quantum superposition (so-called qubits), and communication and computation on it can be executed on a circuit in a broad sense.

A circuit in a broad sense is a circuit realized by at least appropriately combining a circuit, circuitry, a processor, a memory, and the like. That is, it includes an application-specific integrated circuit (ASIC), a programmable logic device (for example, a simple programmable logic device (SPLD), a complex programmable logic device (CPLD), or a field-programmable gate array (FPGA)), and the like.

1. Hardware configuration
This section describes the hardware configuration.

<Information processing system 1>
FIG. 1 is a configuration diagram showing the information processing system 1. The information processing system 1 includes an information processing device 2, a user terminal 3, and a database DB1. The information processing device 2, the user terminal 3, and the database DB1 are configured to communicate with one another over a telecommunication line. In one embodiment, the information processing system 1 consists of one or more devices or components. For example, if it consists of the information processing device 2 alone, the information processing system 1 can be the information processing device 2 itself. These components are described below.

<Information processing device 2>
FIG. 2 is a block diagram showing the hardware configuration of the information processing device 2. The information processing device 2 includes a communication unit 21, a storage unit 22, and a control unit 23, and these components are electrically connected inside the information processing device 2 via a communication bus 20. Each component is described further below.

The communication unit 21 is preferably a wired communication means such as USB, IEEE 1394, Thunderbolt (registered trademark), or wired LAN network communication, but may also include wireless LAN network communication, mobile communication such as 3G/LTE/5G, BLUETOOTH (registered trademark) communication, and the like as needed. That is, it is more preferably implemented as a set of these communication means. The information processing device 2 may thus exchange various information with the outside via the communication unit 21 and a network.

The storage unit 22 stores the various information defined by the foregoing description. It can be implemented, for example, as a storage device such as a solid state drive (SSD) that stores the various programs of the information processing device 2 executed by the control unit 23, or as a memory such as a random access memory (RAM) that stores information temporarily needed for program computation (arguments, arrays, etc.). The storage unit 22 stores the various programs, variables, and the like of the information processing device 2 that are executed by the control unit 23.

The control unit 23 processes and controls the overall operation of the information processing device 2. The control unit 23 is, for example, a central processing unit (CPU), not shown. The control unit 23 realizes the various functions of the information processing device 2 by reading a predetermined program stored in the storage unit 22. That is, information processing by the software stored in the storage unit 22 is concretely realized by the control unit 23, which is an example of hardware, and can thereby be executed as the functional units included in the control unit 23. These are described in more detail in the next section. The control unit 23 is not limited to a single unit; a plurality of control units 23 may be provided, one for each function, or a combination of the two arrangements may be used.

<User terminal 3>
FIG. 3 is a block diagram showing the hardware configuration of the user terminal 3. The user terminal 3 includes a communication unit 31, a storage unit 32, a control unit 33, a display unit 34, and an input unit 35, and these components are electrically connected inside the user terminal 3 via a communication bus 30. Descriptions of the communication unit 31, the storage unit 32, and the control unit 33 are omitted because they are the same as those of the corresponding units in the information processing device 2.

The display unit 34 may be included in the housing of the user terminal 3 or may be externally attached. The display unit 34 displays the screen of a graphical user interface (GUI) that the user can operate. It is preferably implemented with a display device such as a CRT display, a liquid crystal display, an organic EL display, or a plasma display, chosen according to the type of the user terminal 3.

The input unit 35 may be included in the housing of the user terminal 3 or may be externally attached. For example, the input unit 35 may be integrated with the display unit 34 and implemented as a touch panel. With a touch panel, the user can input tap operations, swipe operations, and the like. A switch button, a mouse, a QWERTY keyboard, or the like may of course be used instead of a touch panel. That is, the input unit 35 receives operation input made by the user. The input is transferred as a command signal to the control unit 33 via the communication bus 30, and the control unit 33 can execute predetermined control or computation as needed.

<Database DB1>
The database DB1 stores external data D0. The external data D0 may be, for example, data accessible to the public or data accessible only to specific users. The external data D0 may also be data accessible only to users of the information processing system 1. The database DB1 may be realized by a single storage device or by a plurality of storage devices. The content represented by the external data D0 is arbitrary, for example satellite observation results, climate observation results, statistical data, or calendar information.

2. Functional configuration
FIG. 4 shows an example of the functional units included in the control unit 23. As shown in FIG. 4, the control unit 23 includes a data reception unit 231, a learner identification unit 232, a learner selection reception unit 233, an analysis method selection reception unit 234, a data processing unit 235, a model display unit 236, a processing display unit 237, and a processing condition display unit 238.

The data reception unit 231 receives input of first input data D1. The first input data D1 is data input to the information processing device 2. The first input data D1 can include a plurality of data points and can also be called a data set. The first input data D1 includes at least data held by the user. The first input data D1 may also include data held by someone other than the user, for example data stored in the database DB1. The first input data D1 includes at least structured data. Structured data is data standardized to have a predetermined structure. The first input data D1 may also include data other than structured data, for example unstructured data or semi-structured data. Unstructured data is data in any format that does not have a standardized structure of the kind structured data has. Semi-structured data consists of a combination of unstructured data and tags that can identify that unstructured data. Formats of semi-structured data include, for example, graph, key-value, document, and column types. The number of items of first input data D1 received by the data reception unit 231 is not limited to one and may be more than one.

The learner identification unit 232 identifies a plurality of learners ML according to the received first input data D1.
A learner generates a learning model M1 using the data input to it. The learning model M1 generates at least one output y1 based on at least one input x1. The input x1 is also called an explanatory variable. The output y1 is also called an evaluation function.

The learner selection reception unit 233 receives, from the user, a selection among the identified learners ML.

The analysis method selection reception unit 234 receives a selection of at least one of a plurality of analysis methods to be used for generating the learning model M1. The analysis method is arbitrary, and includes, for example, at least one of classification analysis, regression analysis, time series analysis, recommendation analysis, anomaly detection, clustering, image analysis, and text analysis. An analysis method based on any algorithm, such as supervised learning, unsupervised learning, or reinforcement learning, can be adopted.

The data processing unit 235 executes a conversion process. The conversion process converts the first input data D1 input to the data reception unit 231 into second input data D2, which is in a form that can be input to the identified learners ML. The conversion process can include any processing, for example deleting part of the first input data D1, filling in missing values, deleting outliers, and normalizing the first input data D1. The conversion process is also called data shaping.
The conversion process may include combining, separating, or correcting a plurality of inputs x1 included in the first input data D1. For example, when the first input data D1 has the year, month, and day of a time series as separate inputs x1, the conversion process can include combining these inputs x1 into a single input. When a plurality of items of first input data D1 are received by the data reception unit 231, the conversion process may also combine those items of first input data D1. As another example, the conversion process may include adding an input x1 or adding a feature. For example, the data processing unit 235 may acquire arbitrary external data D0 from the database DB1 according to the first input data D1 and add that external data D0 to the first input data D1. The data processing unit 235 may then add features based on the added external data D0.
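The combination of separate year, month, and day inputs can be sketched as follows. This is a minimal illustration, not the implementation described in the publication: the function name `combine_date_columns`, the column names, and the row-as-dictionary layout are all assumptions made for the example.

```python
from datetime import date


def combine_date_columns(rows, year_key="year", month_key="month",
                         day_key="day", out_key="date"):
    """Merge separate year/month/day inputs into one date input per row.

    `rows` is a list of dicts (a hypothetical stand-in for the first
    input data D1); the returned list stands in for part of D2.
    """
    combined = []
    for row in rows:
        # Keep every input except the three being merged.
        merged = {k: v for k, v in row.items()
                  if k not in (year_key, month_key, day_key)}
        merged[out_key] = date(int(row[year_key]),
                               int(row[month_key]),
                               int(row[day_key]))
        combined.append(merged)
    return combined


rows = [
    {"year": "2022", "month": "11", "day": "16", "sales": 120},
    {"year": "2022", "month": "11", "day": "17", "sales": 135},
]
result = combine_date_columns(rows)
```

Each resulting row carries a single `date` input in place of the three original inputs, which is the kind of combined input a time series learner could consume.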
The conversion process includes processing that the control unit 23 can execute automatically based on the content of the first input data D1. The conversion process may also include processing that is executed based on a designation by the user. In this embodiment, the conversion process includes an automatic conversion process that is specified by comparing the first input data D1 with predetermined conversion conditions. The conversion conditions are, for example, the size of the first input data D1, whether the variance in the statistical information of the first input data D1 is below a threshold, and whether outliers are present according to the statistical information of the first input data D1. The conversion conditions also indicate whether a conversion process needs to be performed.
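A check against conversion conditions of this kind might look like the sketch below. The thresholds, the z-score rule for outliers, and the name `check_conversion_conditions` are assumptions chosen for illustration; the publication does not state how the conditions are evaluated.

```python
from statistics import mean, pstdev, pvariance


def check_conversion_conditions(values, max_rows=1000,
                                variance_threshold=1e-6, z_limit=3.0):
    """Compare one numeric input against hypothetical conversion conditions.

    `values` must be a non-empty list of numbers. Returns a dict of flags;
    a True flag suggests that some conversion process may be needed.
    """
    mu = mean(values)
    sigma = pstdev(values)
    return {
        # Size condition: too many data points.
        "too_large": len(values) > max_rows,
        # Variance condition: the input is (almost) constant.
        "low_variance": pvariance(values) < variance_threshold,
        # Outlier condition: some point lies more than z_limit
        # standard deviations away from the mean.
        "has_outlier": sigma > 0 and any(
            abs(v - mu) / sigma > z_limit for v in values
        ),
    }


flags = check_conversion_conditions([10.0] * 20 + [1000.0])
```

In this example only the outlier flag is raised, which could lead the system to propose an outlier-deletion step while skipping the others.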
The conversion process also includes converting the format of the first input data D1 input to the data reception unit 231 into a format suited to each of the identified learners ML. The format of the first input data D1 includes, for example, the name of the first input data D1, its character encoding, line break code, written language, and delimiter.
For example, when the character encoding of the input data that can be input to an identified learner ML is UTF-8 but the character encoding of the first input data D1 is Shift-JIS, the data processing unit 235 executes a conversion process that converts the first input data D1 into second input data D2 whose character encoding has been changed to UTF-8.
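The character encoding conversion can be sketched with Python's built-in codecs. The function name `convert_encoding` and the sample text are assumptions for illustration; only the Shift-JIS to UTF-8 direction comes from the description.

```python
def convert_encoding(raw: bytes, source: str = "shift_jis",
                     target: str = "utf-8") -> bytes:
    """Re-encode raw file bytes from the source encoding to the target one."""
    return raw.decode(source).encode(target)


# Hypothetical first input data D1: a CSV header stored as Shift-JIS bytes.
d1_bytes = "売上,日付".encode("shift_jis")
d2_bytes = convert_encoding(d1_bytes)
```

The text content is unchanged by the conversion; only the byte representation differs, so `d2_bytes` can be read by a learner that expects UTF-8.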
As another example, the conversion process may include assigning a name to each item of second input data D2 that can be input to the identified learners ML. In that case, the conversion process preferably assigns the second input data D2 a name by which the control unit 23 can uniquely identify it. This makes the second input data D2 easier to manage. The name may be assigned by changing the name of the first input data D1.
As another example, when the first input data D1 includes a BOM (byte order mark), the conversion process may include removing the BOM. This makes unintended errors easier to suppress.
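BOM removal for UTF-8 data can be sketched as below. The function name `strip_utf8_bom` is an assumption, and the sketch handles only the UTF-8 BOM; other encodings have their own byte order marks.

```python
import codecs


def strip_utf8_bom(raw: bytes) -> bytes:
    """Return the bytes without a leading UTF-8 byte order mark, if any."""
    if raw.startswith(codecs.BOM_UTF8):
        return raw[len(codecs.BOM_UTF8):]
    return raw


with_bom = codecs.BOM_UTF8 + b"date,sales\n"
cleaned = strip_utf8_bom(with_bom)
```

Without this step, the three BOM bytes would typically end up glued to the first column name, which is one way the unintended errors mentioned above can arise.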
As another example, the conversion process may include removing, from the inputs x1 included in the first input data D1, those that are not used to generate the learning model M1. The data processing unit 235 may determine whether a given input x1 is used to generate the learning model M1 based on, for example, the format and distribution of that input x1. This reduces the size of the second input data D2 and therefore shortens the time needed to generate the learning model M1. Specifying the conversion process may include specifying that there is no conversion process to perform, for example when no conversion is needed.
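One simple distribution-based rule is to drop inputs that take only a single value, since they carry no information for model generation. The sketch below assumes that rule and the row-as-dictionary layout; the publication does not specify the criterion the data processing unit 235 applies.

```python
def drop_uninformative_columns(rows):
    """Remove inputs whose value is identical in every row.

    `rows` is a list of dicts sharing the same keys (a hypothetical
    stand-in for the first input data D1).
    """
    if not rows:
        return []
    keep = [key for key in rows[0]
            if len({row[key] for row in rows}) > 1]
    return [{key: row[key] for key in keep} for row in rows]


rows = [
    {"store": "A", "country": "JP", "sales": 120},
    {"store": "B", "country": "JP", "sales": 135},
]
reduced = drop_uninformative_columns(rows)
```

Here `country` is constant and is removed, so the second input data is smaller than the first without losing any variation a learner could use.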
The control unit 23 may further specify the conversion process based on the specified or selected analysis method. For example, when the selected analysis method is time series analysis, the conversion processes specified by the control unit 23 include: a conversion process that combines a plurality of inputs x1 representing the time series into one; a conversion process that, when the time intervals between data points differ, fills in, deletes, or corrects data points so as to adjust the time intervals; and a conversion process that joins calendar information, weather information, or demographic information, as external data D0, with the first input data D1 and associates it with the input x1 representing the time series.
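Adjusting uneven time intervals can be illustrated by filling missing days with the last observed value. Forward filling on a daily grid is an assumption made for this sketch; the description only says that data points are filled in, deleted, or corrected.

```python
from datetime import date, timedelta


def fill_daily_gaps(series):
    """Return (date, value) pairs on a daily grid, forward-filling gaps.

    `series` maps dates to observed values (a hypothetical time series
    input x1 with uneven spacing).
    """
    if not series:
        return []
    days = sorted(series)
    current, last_day = days[0], days[-1]
    last_value = series[current]
    filled = []
    while current <= last_day:
        # Use the observation if one exists, otherwise carry the last one.
        last_value = series.get(current, last_value)
        filled.append((current, last_value))
        current += timedelta(days=1)
    return filled


observed = {date(2022, 11, 1): 10, date(2022, 11, 4): 40}
filled = fill_daily_gaps(observed)
```

After this step every consecutive pair of data points is exactly one day apart, which many time series learners require.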

Based on the first input data D1, the model display unit 236 causes the display unit 34 to display model information IF1 in a manner that allows comparison for each learning model. Specifically, the model display unit 236 causes the display unit 34 to display the model information IF1 generated using the learners ML that were selected from among the identified learners ML.

The model information IF1 is information about the learning model M1. For example, the model information IF1 can include the name and size of the first input data D1 used to generate the learning model M1 and the date and time at which the learning model M1 was generated. The name of the first input data D1 is, for example, its file name. In this embodiment, the model information IF1 includes at least accuracy information on the prediction accuracy of the learning model M1.
When the analysis method is regression analysis, the accuracy information includes indices such as the coefficient of determination (R2 score), mean squared error (MSE), mean absolute error (MAE), root mean squared error (RMSE), root mean squared logarithmic error (RMSLE), and mean absolute percentage error (MAPE). When the analysis method is classification analysis, the accuracy information includes indices such as accuracy, recall, precision, specificity, F-measure, weighted F-measure, Matthews correlation coefficient (MCC), kappa coefficient, log loss, area under the curve (AUC), and area under the precision-recall curve (PR-AUC). The accuracy information is not limited to indices used for binary classification and may be indices used for multi-class classification with more than two classes. When the analysis method is time series analysis, the accuracy information includes indices such as the coefficient of variation, mean absolute error under dynamic time warping (Dynamic Time Warping MAE), MAPE, symmetric mean absolute percentage error (SMAPE), weighted SMAPE, mean absolute scaled error (MASE), mean absolute ranged relative error (MARRE), overall percentage error, R2, and rho-risk RMSLE. The accuracy information may include statistics of each index, for example the median of the MAE (MAE median) and the mean of the MAE (MAE mean). An index listed for one analysis method may also be used as an index for another analysis method.
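Several of the regression indices named above follow directly from their standard definitions, as in the sketch below. The function name `regression_metrics` and the sample values are assumptions; the sketch also assumes that no true value is zero (for MAPE) and that the true values are not all equal (for R2).

```python
import math


def regression_metrics(y_true, y_pred):
    """Compute R2, MSE, MAE, RMSE, and MAPE (in percent) for two lists."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    squared_error_sum = sum(e * e for e in errors)
    mse = squared_error_sum / n
    mean_true = sum(y_true) / n
    total_sum_of_squares = sum((t - mean_true) ** 2 for t in y_true)
    return {
        "R2": 1 - squared_error_sum / total_sum_of_squares,
        "MSE": mse,
        "MAE": sum(abs(e) for e in errors) / n,
        "RMSE": math.sqrt(mse),
        "MAPE": 100 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n,
    }


metrics = regression_metrics([2.0, 4.0, 6.0, 8.0], [2.5, 3.5, 6.5, 7.5])
```

Computing the same set of indices for every generated learning model is what makes the per-model comparison in the model information IF1 meaningful.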

The processing display unit 237 causes the display unit 34 to display the differences between the first input data D1 and the second input data D2 in a recognizable manner. For example, the processing display unit 237 displays the differences 512 between the first input data D1 and the second input data D2 in a manner different from the common points 511 between them. For example, the processing display unit 237 displays the two so that they differ in at least one of color, shape, and pattern. As another example, the processing display unit 237 may display a predetermined mark, such as an arrow, in association with the differences 512 between the first input data D1 and the second input data D2.
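Marking differences between the data before and after conversion can be sketched as a cell-by-cell comparison. The asterisk marker, the function names, and the assumption that D1 and D2 have the same shape are illustrative choices; an actual display would use color, shape, or pattern as described.

```python
def mark_differences(d1_rows, d2_rows):
    """Pair each D2 cell with a flag telling whether it differs from D1.

    Both arguments are lists of equally sized rows (lists of cells).
    """
    marked = []
    for before, after in zip(d1_rows, d2_rows):
        marked.append([(new, new != old) for old, new in zip(before, after)])
    return marked


def render(marked):
    """Render changed cells wrapped in asterisks, unchanged cells as-is."""
    return "\n".join(
        " | ".join(f"*{value}*" if changed else str(value)
                   for value, changed in row)
        for row in marked
    )


d1 = [["2022-11-01", None, "10"], ["2022-11-02", "5", "12"]]
d2 = [["2022-11-01", "0", "10"], ["2022-11-02", "5", "12"]]
text = render(mark_differences(d1, d2))
```

In this example only the filled-in missing value is marked, so the user can see at a glance what the conversion process changed.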

The processing condition display unit 238 displays, in a recognizable manner, the conditions under which the conversion process is performed, based on at least the input first input data D1 and the identified learners ML. In this embodiment, the conditions under which the conversion process is performed correspond to the conversion conditions described above.

Based on the first input data D1, the model display unit 236 causes the display unit 34 to display the model information IF1 on the learning models M1 generated by the identified learners ML in a manner that allows comparison for each learning model M1. The model display unit 236 of this embodiment displays at least the accuracy information of each generated learning model M1 so that it can be compared. For example, the model display unit 236 causes the display unit 34 to display the model information IF1 of the respective learning models M1 as a list.
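A list view that allows comparison across learning models can be sketched as a sorted text table. The learner names and accuracy values below are made-up illustration data, and `comparison_table` is a hypothetical helper, not part of the described system.

```python
def comparison_table(model_infos, metric="RMSE"):
    """Format model information as a table sorted by an error metric.

    Lower values are listed first, so the best model for an error-type
    metric appears at the top.
    """
    ordered = sorted(model_infos, key=lambda info: info[metric])
    lines = [f"{'learner':<20}{metric:>10}"]
    for info in ordered:
        lines.append(f"{info['learner']:<20}{info[metric]:>10.3f}")
    return "\n".join(lines)


# Illustrative values only; not results from the publication.
model_infos = [
    {"learner": "learner_a", "RMSE": 0.82},
    {"learner": "learner_b", "RMSE": 0.47},
    {"learner": "learner_c", "RMSE": 0.65},
]
table = comparison_table(model_infos)
```

For metrics where higher is better, such as accuracy or R2, the sort order would need to be reversed.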

3. Details of information processing
This section describes the information processing executed in the information processing system 1 described above. The information processing may include any exception handling not shown in the activity diagram. Exception handling includes interrupting the information processing and skipping individual processes. A selection or input made during the information processing may be based on a user operation or may be made automatically without a user operation.

図５は、情報処理システム１において実行される情報処理の流れの一例を示すアクティビティ図である。図５に示すように、アクティビティＡ００１にて、データ受付部２３１は、第１の入力データＤ１の入力を受け付ける。 FIG. 5 is an activity diagram showing an example of the flow of information processing executed in the information processing system 1. As shown in FIG. As shown in FIG. 5, in activity A001, the data reception unit 231 receives input of first input data D1.

次にアクティビティＡ００２にて、学習器特定部２３２は、第１の入力データＤ１に応じて複数の学習器ＭＬを特定する。 Next, in activity A002, the learner identifying unit 232 identifies a plurality of learners ML according to the first input data D1.

次にアクティビティＡ００３にて、学習器選択受付部２３３は、アクティビティＡ００２にて特定された学習器ＭＬの選択を受け付ける。 Next, in activity A003, the learner selection accepting unit 233 accepts selection of the learner ML identified in activity A002.

After the selection of the learner ML is accepted, the process proceeds to activity A004, and the analysis method selection accepting unit 234 accepts selection of an analysis method. The analysis method may be predetermined, or may be identified according to the identified or selected learner ML.

After the selection of the analysis method is accepted, the process proceeds to activity A005, and the control unit 23 identifies the conversion process according to the learner ML selected by the analysis method selection accepting unit 234. More specifically, the control unit 23 also identifies the conversion process based on the identified or selected analysis method. For example, when the analysis method selected in activity A004 is time-series analysis, the conversion processes identified by the control unit 23 include: a conversion process that combines a plurality of inputs x1 representing a time series into one; a conversion process that, when the time intervals between data points differ, interpolates, deletes, or corrects data points so as to adjust the time intervals; and a conversion process that combines calendar information, weather information, or demographic information serving as external data D0 with the first input data D1 and associates it with the input x1 representing the time series.
Identifying the conversion process may include identifying that there is no conversion process to perform, for example when no conversion is needed.
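As a non-limiting illustration of the time-interval adjustment mentioned above, the following sketch resamples a time series onto a uniform grid by linear interpolation. The function name `regularize`, the choice of linear interpolation, and the assumption that the series has at least two points with distinct timestamps are all introduced here for illustration and are not specified in the embodiment.

```python
from datetime import datetime, timedelta


def regularize(points, step):
    """Resample (timestamp, value) pairs onto a uniform grid.

    Values between known data points are filled by linear interpolation.
    Assumes at least two points with distinct timestamps (an assumption
    of this sketch, not of the embodiment).
    """
    points = sorted(points)
    grid = []
    i = 0
    t = points[0][0]
    while t <= points[-1][0]:
        # Advance to the segment [points[i], points[i + 1]] that contains t.
        while points[i + 1][0] < t:
            i += 1
        (t0, v0), (t1, v1) = points[i], points[i + 1]
        w = (t - t0) / (t1 - t0)  # timedelta / timedelta yields a float
        grid.append((t, v0 + w * (v1 - v0)))
        t += step
    return grid


base = datetime(2022, 1, 1)
series = [
    (base, 0.0),
    (base + timedelta(hours=1), 10.0),
    (base + timedelta(hours=3), 30.0),  # the 2-hour data point is missing
]
uniform = regularize(series, timedelta(hours=1))
```

In this usage, the gap at the 2-hour mark is filled so that the resulting series has one data point per hour. Deleting or correcting data points, which the embodiment also mentions, would be handled by separate logic.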

The control unit 23 may identify the condition under which the conversion process is performed, that is, the conversion condition. For example, the control unit 23 may identify the conversion condition based on at least the first input data D1 and the learner ML identified in activity A002 or selected in activity A005. For example, the control unit 23 identifies the conversion condition based on statistical information on the first input data D1. The statistical information on the first input data D1 includes, for example, the distribution of data points, mean, variance, standard deviation, maximum, minimum, median, mode, maximum likelihood, covariance, correlation coefficient, and R² value. When the conversion process is outlier removal, the control unit 23 judges a data point to be an outlier if the absolute value of the difference between that data point and the mean is at least twice the standard deviation. In this case, the conversion condition is that the absolute value of the difference between a data point and the mean is at least twice the standard deviation.
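The outlier condition above can be sketched as follows. This is a minimal illustration, not the implementation of the control unit 23: the function name `split_outliers` and the parameter `k` are hypothetical, and the use of the population standard deviation is an assumption, since the embodiment does not state which standard deviation is used.

```python
from statistics import mean, pstdev


def split_outliers(values, k=2.0):
    """Split values into (kept, outliers) using the condition |x - mean| >= k * std.

    k = 2.0 mirrors the "twice the standard deviation" condition in the text.
    pstdev (population standard deviation) is an assumption of this sketch.
    """
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All values are identical, so nothing is treated as an outlier.
        return list(values), []
    kept = [v for v in values if abs(v - mu) < k * sigma]
    outliers = [v for v in values if abs(v - mu) >= k * sigma]
    return kept, outliers


kept, outliers = split_outliers([10, 11, 9, 10, 12, 10, 50])
```

Here the mean is 16 and the population standard deviation is about 13.9, so only the value 50, which deviates from the mean by 34, satisfies the conversion condition.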

次に処理がアクティビティＡ００６に進み、制御部２３は、アクティビティＡ００５にて特定された変換処理の選択を受け付ける。 Next, the process proceeds to activity A006, and the control unit 23 accepts selection of the conversion process specified in activity A005.

次に処理がアクティビティＡ００７に進み、処理表示部２３７は、アクティビティＡ００６にて特定された変換処理に関する情報を表示部３４に表示させる。変換処理に関する情報とは、例えば、変換処理の具体的内容、変換処理による第１の入力データＤ１の変化、変換処理によって生成される第２の入力データＤ２などである。例えば、処理表示部２３７は、特定された変換処理に基づき、第１の入力データＤ１と第２の入力データＤ２との差異点を、ユーザが認識可能な態様で表示部３４に表示させる。これにより、ユーザは、第１の入力データＤ１に対して行われる変換処理の内容を直感的に認識しやすくなる。なお、この段階では第２の入力データＤ２は、実際に生成されている必要はなく、例えば第１の入力データＤ１と変換処理とに基づいて生成されることが予想されるものでもよい。 Next, the process proceeds to activity A007, and the process display unit 237 causes the display unit 34 to display information regarding the conversion process specified in activity A006. The information about the conversion process includes, for example, specific contents of the conversion process, changes in the first input data D1 due to the conversion process, second input data D2 generated by the conversion process, and the like. For example, the processing display unit 237 causes the display unit 34 to display the differences between the first input data D1 and the second input data D2 in a user-recognizable manner based on the identified conversion processing. This makes it easier for the user to intuitively recognize the contents of the conversion process performed on the first input data D1. It should be noted that the second input data D2 does not have to be actually generated at this stage, and may be expected to be generated based on the first input data D1 and conversion processing, for example.

アクティビティＡ００７では、さらに処理条件表示部２３８が、変換処理が行われる条件を認識可能な態様で表示部３４に表示させてもよい。これにより、変換処理のブラックボックス化が抑制される。 In activity A007, the processing condition display unit 238 may further display the conditions for conversion processing on the display unit 34 in a recognizable manner. This suppresses conversion processing from becoming a black box.

次に処理がアクティビティＡ００８に進み、制御部２３は、第１の入力データＤ１に対して、アクティビティＡ００６にて選択された変換処理を実行する。これにより、第２の入力データＤ２が生成される。 Next, the process proceeds to activity A008, and the control unit 23 executes the conversion process selected in activity A006 on the first input data D1. Thereby, the second input data D2 is generated.

次に処理がアクティビティＡ００９に進み、制御部２３は、第２の入力データＤ２を、アクティビティＡ００３にて選択された学習器ＭＬのそれぞれに入力する。このとき、制御部２３は、選択された分析手法に基づき、学習器ＭＬでの学習アルゴリズムを指定してもよい。これにより、学習器ＭＬのそれぞれは、第１の入力データＤ１に基づいて学習モデルＭ１を生成する。詳細には、学習器ＭＬは、第２の入力データＤ２を用いて学習モデルＭ１を生成する。なお、学習器ＭＬは、情報処理システム１に含まれる任意の部材に保存されているものでも、情報処理システム１と電気通信回線を介して通信可能な外部装置に保存されているものでもよい。なお、外部装置の図示は省略されている。 Next, the process proceeds to activity A009, and the control unit 23 inputs the second input data D2 to each of the learners ML selected in activity A003. At this time, the control unit 23 may designate a learning algorithm in the learning device ML based on the selected analysis method. Thereby, each learning device ML generates a learning model M1 based on the first input data D1. Specifically, the learning device ML uses the second input data D2 to generate the learning model M1. Note that the learning device ML may be stored in an arbitrary member included in the information processing system 1 or may be stored in an external device that can communicate with the information processing system 1 via an electric communication line. Note that the illustration of the external device is omitted.

次に処理がアクティビティＡ０１０に進み、制御部２３は、学習器ＭＬのそれぞれから生成される学習モデルＭ１を取得する。 Next, the process proceeds to activity A010, and the control unit 23 acquires the learning model M1 generated from each of the learning devices ML.

次に処理がアクティビティＡ０１１に進み、モデル表示部２３６は、モデル情報ＩＦ１を表示部３４に表示させる。 Next, the process proceeds to activity A011, and the model display unit 236 causes the display unit 34 to display the model information IF1.
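Activities A009 to A011 can be sketched as follows: the same input data is given to each selected learner, a learning model is obtained from each, and accuracy information is collected per model so that it can be compared. The two toy learners (`fit_mean`, `fit_line`) and the use of root mean squared error as the accuracy information are assumptions made only for this illustration; the embodiment does not prescribe particular learners or metrics.

```python
from statistics import mean


def fit_mean(xs, ys):
    """Toy learner: always predicts the mean of the training targets."""
    m = mean(ys)
    return lambda x: m


def fit_line(xs, ys):
    """Toy learner: least-squares line through (xs, ys)."""
    mx, my = mean(xs), mean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    return lambda x: my + slope * (x - mx)


def rmse(model, xs, ys):
    """Root mean squared error of a model on (xs, ys)."""
    return (sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)) ** 0.5


def compare(learners, xs, ys):
    """Train each learner on the same data and return (name, rmse), best first."""
    rows = [(name, rmse(fit(xs, ys), xs, ys)) for name, fit in learners.items()]
    return sorted(rows, key=lambda row: row[1])


rows = compare({"mean": fit_mean, "line": fit_line}, [1, 2, 3, 4], [2, 4, 6, 8])
for name, error in rows:
    print(f"{name:<6} RMSE={error:.3f}")
```

For brevity the error is measured on the training data itself; a practical comparison would normally evaluate each model on held-out data.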

4. Example of Content Displayed on Display Unit 34
This section describes an example of the content displayed on the display unit 34 based on the information processing described above. The description uses a scenario in which the user predicts the selling price of a product using the first input data D1. In the present embodiment, the display unit 34 can display a reception window 4, a data window 5, a conversion processing window 6, a model information display window 7, a model search window 8, and a model comparison window 9.

4-1. Example of Reception Window 4
First, the reception window 4 is described in detail. FIG. 6 is an example of the reception window 4 displayed on the display unit 34. As shown in FIG. 6, the reception window 4 includes an input data reception area 41, a learner selection area 42, an analysis method selection area 43, and a reception operation display area 44.

入力データ受付エリア４１には、第１の入力データＤ１の入力を受け付けるユーザインタフェースが表示される。以下、説明の便宜上、ユーザインタフェースを単にＵＩという。入力データ受付エリア４１は、インポートボタン４１１と、データ名表示エリア４１２と、を含む。 The input data reception area 41 displays a user interface for receiving input of the first input data D1. Hereinafter, for convenience of explanation, the user interface is simply referred to as UI. The input data reception area 41 includes an import button 411 and a data name display area 412 .

The user inputs the first input data D1 to the data reception unit 231 by operating the import button 411. At this time, the data reception unit 231 accepts the user's input of the first input data D1 based on the user's operation of the import button 411.

データ名表示エリア４１２には、入力された第１の入力データＤ１の名称が表示される。 The data name display area 412 displays the name of the input first input data D1.

The learner selection area 42 displays a UI for selecting the learners ML used to generate the learning models M1. The learners ML displayed in the learner selection area 42 are identified by the learner identifying unit 232 according to the first input data D1 accepted by the data reception unit 231. For example, the learner identifying unit 232 identifies the learners ML to be displayed in the learner selection area 42 according to the capacity, format, and identifier of the first input data D1. The learner selection area 42 of the present embodiment enters an active state, in which a learner ML can be selected, when the data reception unit 231 has accepted input of the first input data D1.
The learner selection area 42 includes a prediction target selection area 421, a plurality of learner display areas 422, a learner selection display area 423, and a first reception operation button 424.
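One way to picture how learners might be identified from the capacity and format of the first input data D1 is a simple rule-based filter over a catalog of learners. Everything in the sketch below is hypothetical: the `InputData` fields, the catalog entries, their size limits, and the function `identify_learners` are illustrative assumptions, and the embodiment does not disclose the actual identification rules. Matching on the identifier is omitted for brevity.

```python
from dataclasses import dataclass


@dataclass
class InputData:
    name: str        # identifier of the data set
    fmt: str         # file format, e.g. "csv"
    size_bytes: int  # capacity of the data set


# Hypothetical catalog: each learner lists the formats and the maximum size it accepts.
CATALOG = [
    {"name": "linear_regression", "formats": {"csv", "tsv"}, "max_bytes": 10**9},
    {"name": "gradient_boosting", "formats": {"csv"}, "max_bytes": 10**8},
    {"name": "image_classifier", "formats": {"png", "jpeg"}, "max_bytes": 10**7},
]


def identify_learners(data, catalog=CATALOG):
    """Return the names of catalog learners compatible with the input data."""
    return [
        entry["name"]
        for entry in catalog
        if data.fmt in entry["formats"] and data.size_bytes <= entry["max_bytes"]
    ]


candidates = identify_learners(InputData("sales_2022", "csv", 5 * 10**8))
```

In this usage a 500 MB CSV file matches only the first catalog entry, so a single learner would be offered for selection.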

The prediction target selection area 421 is configured so that the parameter to be used as the output y1 of the learning model M1 can be designated. The prediction target selection area 421 can be implemented in any form, such as a pull tab, a list, or buttons. In FIG. 6, the selling price is designated as the output y1. The number of designated outputs y1 is not limited to one and may be more than one.

学習器表示エリア４２２には、アクティビティＡ００２にて特定された学習器ＭＬを選択可能なＵＩが表示される。例えば、学習器表示エリア４２２には、複数の学習器ＭＬを区別可能な情報が表示される。当該区別可能な情報とは、学習器ＭＬの名称、種類、アルゴリズムなど、任意の情報を含みうる。なお、特定された学習器ＭＬの数が学習器表示エリア４２２の数より小さい場合、学習器表示エリア４２２の一部には、学習器ＭＬの情報がないことが表示されてもよい。 A learning device display area 422 displays a UI for selecting the learning device ML identified in activity A002. For example, in the learner display area 422, information that can distinguish between a plurality of learners ML is displayed. The distinguishable information may include arbitrary information such as the name, type, and algorithm of the learning device ML. If the number of identified learners ML is smaller than the number of learner display areas 422, a part of the learner display area 422 may display that there is no information on the learners ML.

学習器選択表示エリア４２３には、学習器表示エリア４２２のそれぞれに対応する学習器ＭＬが選択されているか否かが表示される。学習器選択表示エリア４２３の具体的態様はユーザが視覚的に把握可能であれば任意である。例えば、学習器選択表示エリア４２３には、チェックボックスでのチェックの有無、色彩の変化、濃淡の変化、枠線の変化などが表示される。 The learning device selection display area 423 displays whether or not the learning device ML corresponding to each of the learning device display areas 422 is selected. A specific aspect of the learning device selection display area 423 is arbitrary as long as the user can visually grasp it. For example, in the learning device selection display area 423, presence/absence of a check in a check box, change in color, change in shade, change in frame line, and the like are displayed.

The first reception operation button 424 is a UI with which the user can decide, by operation, whether to proceed to the selection of an analysis method (described later) with the selected learners ML. When an operation declining to select an analysis method is performed, for example, the input data reception area 41 becomes active in place of the learner selection area 42, and the first input data D1 can be accepted again.

一方、第１の受付操作ボタン４２４に対して分析手法の選択を行うための操作が行われた場合、分析手法選択エリア４３がアクティブになる。分析手法選択エリア４３には、ユーザが分析手法を選択可能なＵＩが表示される。分析手法選択エリア４３は、分析手法選択ボタン４３１と、モデル名表示エリア４３２と、第２の受付操作ボタン４３３と、を含む。 On the other hand, when an operation for selecting an analysis method is performed on the first reception operation button 424, the analysis method selection area 43 becomes active. The analysis method selection area 43 displays a UI that allows the user to select an analysis method. The analysis method selection area 43 includes an analysis method selection button 431 , a model name display area 432 and a second reception operation button 433 .

分析手法選択ボタン４３１は、ユーザによる操作に応じて、分析手法の選択を受付可能に構成されている。分析手法選択ボタン４３１は、例えばユーザのクリック操作、タップ操作、フリック操作を受付可能に構成されている。本実施形態では、分析手法選択ボタン４３１は、設定されている分析手法の数に応じて複数存在する。選択されている分析手法選択ボタン４３１の表示態様は、選択されていない分析手法選択ボタン４３１の表示態様と異なっていてもよい。これにより、ユーザは、どの分析手法が選択されているかを把握しやすくなる。 The analysis method selection button 431 is configured to accept selection of an analysis method according to user's operation. The analysis method selection button 431 is configured to accept, for example, a user's click operation, tap operation, and flick operation. In this embodiment, a plurality of analysis method selection buttons 431 exist according to the number of set analysis methods. The display mode of the selected analysis method selection button 431 may be different from the display mode of the analysis method selection button 431 that is not selected. This makes it easier for the user to grasp which analysis method is selected.

The model name display area 432 is configured to display the name of the learning model M1 generated based on the first input data D1 accepted in the input data reception area 41, the learner ML selected in the learner selection area 42, and the analysis method selected in the analysis method selection area 43. The model name display area 432 may also be configured so that the user can enter the name of the learning model M1.

第２の受付操作ボタン４３３は、ユーザが上記学習モデルＭ１の生成を行うか否かを決定可能に構成されている。第２の受付操作ボタン４３３の操作に基づき学習モデルＭ１の生成を行わない決定がされた場合、分析手法選択エリア４３に代えて、入力データ受付エリア４１又は学習器選択エリア４２がアクティブとなる。 The second reception operation button 433 is configured so that the user can decide whether or not to generate the learning model M1. When it is determined not to generate the learning model M1 based on the operation of the second reception operation button 433, instead of the analysis method selection area 43, the input data reception area 41 or the learning device selection area 42 becomes active.

4-2. Example of Data Window 5
On the other hand, when it is decided, based on operation of the second reception operation button 433, to generate the learning model M1, the data window 5 and the conversion processing window 6 are displayed on the display unit 34. FIG. 7 is an example of the data window 5 and the conversion processing window 6 displayed on the display unit 34.

The data window 5 is configured to display information on the first input data D1. The information on the first input data D1 is, for example, the name of the first input data D1, the number of data points in the first input data D1, its capacity, and the contents of the data points included in the first input data D1. The data window 5 includes a variable name display area 50, an aggregation graph display area 51, an aggregation information display area 52, and an individual information display area 53.

The variable name display area 50 displays information that identifies the inputs x1 included in the first input data D1. For example, the variable name display area 50 displays, from the first input data D1, information corresponding to the name of each input x1.

The aggregation graph display area 51 displays input data visual information. The input data visual information is a visual representation of information on the first input data D1. It is displayed using, for example, a histogram, a line graph, a pie chart, or a bubble chart, or a combination of these. For example, the input data visual information includes statistical information on the first input data D1, more specifically statistical information on the first input data D1 for each input x1. In the present embodiment, the input data visual information is displayed in the aggregation information display area 52 as a histogram of the distribution of the data points of the first input data D1. The input data visual information may also be displayed in the aggregation information display area 52 as a combination of these display forms. For example, the histogram and the line graph may be displayed together in a single view in the aggregation information display area 52, or may be superimposed on each other there.
In the aggregation graph display area 51, the differences 512 between the first input data D1 and the second input data D2 are displayed in a recognizable manner. The differences 512 can also be said to correspond to the delta between the first input data D1 and the second input data D2. The differences 512 are displayed in a form different from that of the common points 511 between the first input data D1 and the second input data D2. For example, the processing display unit 237 displays the two so that they differ in at least one of color, shape, and pattern. As another example, the processing display unit 237 may display a predetermined mark such as an arrow in association with the differences 512.
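The distinction between the common points 511 and the differences 512 can be illustrated with a small comparison routine. This is only a sketch under assumptions: the function name `compare_data`, the representation of each data set as a mapping from a data point ID to a value, and the split of differences into "changed" and "removed" are all introduced here and are not part of the embodiment.

```python
def compare_data(d1, d2):
    """Classify each data point of d1 against d2.

    d1 and d2 map a data point ID to its value; None stands for a missing value.
    "common" corresponds to the common points, while "changed" and "removed"
    together correspond to the differences.
    """
    common = sorted(k for k in d1 if k in d2 and d1[k] == d2[k])
    changed = sorted(k for k in d1 if k in d2 and d1[k] != d2[k])
    removed = sorted(k for k in d1 if k not in d2)
    return {"common": common, "changed": changed, "removed": removed}


first = {1: 10, 2: None, 3: 500}  # point 2 is missing, point 3 is an outlier
second = {1: 10, 2: 12}           # after imputation and outlier removal
result = compare_data(first, second)
```

A display layer could then render the IDs in `result["common"]` in one color and those in the other two lists in another, in line with the color, shape, or pattern distinction described above.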

The aggregation information display area 52 displays statistical information on the first input data D1. The statistical information displayed in the aggregation information display area 52 is, for example, the maximum, minimum, mean, and standard deviation. The statistical information may also include the number of missing values in the first input data D1. The statistical information may be displayed as numerical values or character strings, or as visual information such as a histogram.
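The statistical information listed above can be computed per input as in the following sketch. The function name `summarize`, the use of `None` to represent a missing value, and the choice of the population standard deviation are assumptions made for illustration.

```python
from statistics import mean, pstdev


def summarize(column):
    """Return summary statistics for one input, counting None as a missing value."""
    present = [v for v in column if v is not None]
    return {
        "max": max(present),
        "min": min(present),
        "mean": mean(present),
        "std": pstdev(present),  # population standard deviation (an assumption)
        "missing": len(column) - len(present),
    }


stats = summarize([1, None, 3])
```

The sketch assumes that each input contains at least one non-missing value; an input consisting only of missing values would need separate handling.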

個別情報表示エリア５３では、第１の入力データＤ１に含まれるデータ点の情報が表示される。詳細には、個別情報表示エリア５３では、入力ｘ１ごとのデータ点の情報が表示される。個別情報表示エリア５３での表示態様は任意であるが、例えば、入力ｘ１ごとのデータ点の情報が、テーブル形式で表示される。 In the individual information display area 53, information on data points included in the first input data D1 is displayed. Specifically, in the individual information display area 53, information on data points for each input x1 is displayed. Although the display mode in the individual information display area 53 is arbitrary, for example, data point information for each input x1 is displayed in a table format.

4-3. Conversion Processing Window 6
The conversion processing window 6 displays at least information on the conversion process to be performed on the first input data D1. In the present embodiment, the conversion processing window 6 is displayed together with the data window 5 in a single view, but it may be displayed separately from the data window 5. The display modes of the conversion processing window 6 include a first display mode 6a and a second display mode 6b. In the first display mode 6a, the conversion processing window 6 includes a first input data information display area 61, a generation condition display area 62, an automatic conversion processing display area 63, a processing condition display area 64, a first process execution button 65, a manual conversion transition button 66, and a process save button 67.

The first input data information display area 61 displays information on the first input data D1 or the second input data D2. In the present embodiment, it displays information on the second input data D2 generated by the conversion process. The information on the second input data D2 is, for example, the capacity of the second input data D2, the size of the second input data D2, and the difference in capacity between the first input data D1 and the second input data D2.

生成条件表示エリア６２には、学習モデルＭ１の生成条件が表示される。学習モデルＭ１の生成条件とは、例えば、予測対象選択エリア４２１にて選択された予測対象、学習器表示エリア４２２にて選択された学習器ＭＬ、学習器ＭＬで用いられるアルゴリズムなど任意である。 The generation condition display area 62 displays the generation conditions of the learning model M1. The conditions for generating the learning model M1 are, for example, the prediction target selected in the prediction target selection area 421, the learning device ML selected in the learning device display area 422, the algorithm used in the learning device ML, and the like.

The automatic conversion processing display area 63 displays the contents of the automatic conversion process identified by the processing display unit 237. The contents of the conversion process are, for example, deletion of part of the first input data D1, imputation of missing values, deletion of outliers, and normalization of the first input data D1. Deleting part of the first input data D1 means deleting an input x1 that is unlikely to be used as an explanatory variable, such as the ID number of each data point. The portion of the first input data D1 displayed in the data window 5 that is changed by the conversion process is indicated by an indicator L1. The indicator L1 indicates the changed portion by, for example, a difference in color, shape, or pattern. The indicator L1 may also mark the region corresponding to the changed portion with an outline different from that of other regions.
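Several of the conversion processes named above (deleting an ID-like input, imputing missing values, and normalization) can be combined as in the following sketch. The function name `auto_convert`, the explicit `id_columns` parameter, mean imputation, and min-max scaling to the range 0 to 1 are assumptions chosen for illustration; the embodiment does not specify how each step is carried out. Outlier removal is omitted here.

```python
from statistics import mean


def auto_convert(table, id_columns=("id",)):
    """Apply a simple conversion to a table given as {input name: list of values}.

    Steps (each an illustrative assumption):
      1. drop inputs listed in id_columns,
      2. replace missing values (None) with the mean of the input,
      3. scale each input to the range [0, 1] (min-max normalization).
    """
    converted = {}
    for name, column in table.items():
        if name in id_columns:
            continue  # unlikely to be useful as an explanatory variable
        present = [v for v in column if v is not None]
        fill = mean(present)
        filled = [fill if v is None else v for v in column]
        low, high = min(filled), max(filled)
        span = high - low
        converted[name] = [(v - low) / span if span else 0.0 for v in filled]
    return converted


first_input = {"id": [1, 2, 3], "price": [100, None, 300]}
second_input = auto_convert(first_input)
```

In this usage the `id` input is removed and the missing price is filled with 200 before scaling, so the result corresponds to what the text calls the second input data D2.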

処理条件表示エリア６４には、自動変換処理表示エリア６３にて表示される自動変換処理の変換条件が表示される。詳細には、処理条件表示エリア６４には、自動変換処理のそれぞれに対応する変換条件が、当該自動変換処理ごとに表示される。 The processing condition display area 64 displays conversion conditions for the automatic conversion processing displayed in the automatic conversion processing display area 63 . Specifically, in the processing condition display area 64, the conversion conditions corresponding to each automatic conversion process are displayed for each automatic conversion process.

第１の処理実行ボタン６５は、制御部２３に自動変換処理表示エリア６３に表示された自動変換処理を実行させるためのＵＩである。ユーザは、第１の処理実行ボタン６５を操作することによって制御部２３に当該自動変換処理を実行させることができる。 The first process execution button 65 is a UI for causing the control unit 23 to execute the automatic conversion process displayed in the automatic conversion process display area 63 . The user can cause the control unit 23 to execute the automatic conversion process by operating the first process execution button 65 .

The manual conversion transition button 66 switches the display mode of the conversion processing window 6 from the first display mode 6a to the second display mode 6b in response to a user operation. The second display mode 6b is a display mode in which the user can manually designate a conversion process. FIG. 8 is a diagram showing an example of the conversion processing window 6 displayed on the display unit 34 in the second display mode 6b. When the manual conversion transition button 66 is operated, the conversion processing window 6 transitions to a display mode that includes a manual conversion processing designation area 661 and a manual conversion processing saving area 662. At this time, the processing condition display area 64 and the first process execution button 65 may be hidden. This reduces the display of items that matter little when the user designates a conversion process manually, making operation more convenient.

手動変換処理指定エリア６６１は、自動変換処理と異なる変換処理をユーザが指定可能に構成されている。以下、説明の便宜上、手動変換処理指定エリア６６１にて指定された変換処理を、手動変換処理という。例えば、変換処理が外れ値の除去の場合、手動変換処理指定エリア６６１には外れ値の候補が表示される。ユーザは、当該候補のなかから変換処理で除去されるものを指定する。また、変換処理が欠損値の補完の場合、手動変換処理指定エリア６６１には、自動変換処理で補完される欠損値の候補が表示される。ユーザは、当該候補のなかから変換処理で補完されるものを指定する。これらの指定は、例えば、手動変換処理保存エリア６６２に含まれるチェックボックス、スライダー、ボタンなどの視覚情報に対する操作によって実現可能である。 The manual conversion process designation area 661 is configured so that the user can designate a conversion process different from the automatic conversion process. Hereinafter, for convenience of explanation, the conversion process specified in the manual conversion process specification area 661 will be referred to as manual conversion process. For example, if the conversion process is removal of outliers, outlier candidates are displayed in the manual conversion process designation area 661 . The user designates those candidates to be removed in the conversion process. If the conversion process is to complement missing values, the manual conversion process designation area 661 displays missing value candidates to be complemented by the automatic conversion process. The user designates one to be complemented by the conversion process from among the candidates. These designations can be realized, for example, by operating visual information such as check boxes, sliders, and buttons included in the manual conversion processing storage area 662 .

The second input data D2 generated by a conversion process that includes the designated manual conversion process may change. In this case, the common points 511 and differences 512 between the first input data D1 and the second input data D2 generated by the conversion process including the manual conversion process may be displayed in the aggregation graph display area 51. The aggregation graph display area 51 may also display the differences in the second input data D2 before and after the manual conversion process in a form different from that of the common points 511 and differences 512 described above. This allows the user to visually grasp the contents of the designated manual conversion process. This display is preferably updated in conjunction with the designation of the manual conversion process, which makes it easier to grasp the effect that the designation has on the first input data D1.

手動変換処理保存エリア６６２は、ユーザによる操作に基づき、手動変換処理を保存するか否かを決定可能なＵＩである。手動変換処理を保存しない決定が行われた場合、当該手動変換処理が破棄される。その後、変換処理ウィンドウ６の表示モードが第２の表示モード６ｂから第１の表示モード６ａに遷移する。一方、手動変換処理を保存する決定が行われた場合、当該手動変換処理が変換処理として更新される。その後、変換処理ウィンドウ６の表示モードが第２の表示モード６ｂから第１の表示モード６ａに遷移する。 The manual conversion process saving area 662 is a UI that allows the user to decide whether or not to save the manual conversion process based on the user's operation. If a decision is made not to save the manual conversion process, the manual conversion process is discarded. After that, the display mode of the conversion processing window 6 transitions from the second display mode 6b to the first display mode 6a. On the other hand, if a decision is made to save the manual conversion process, the manual conversion process is updated as the conversion process. After that, the display mode of the conversion processing window 6 transitions from the second display mode 6b to the first display mode 6a.

処理保存ボタン６７は、ユーザの操作に基づいて変換処理の内容を保存するか否かを決定可能なＵＩである。変換処理の内容を保存しない決定がされた場合、制御部２３は、第１の入力データＤ１に対して変換処理を行わず、情報処理を終了する。このとき、制御部２３は、表示部３４に、再度受付ウィンドウ４を表示させてもよい。 The process save button 67 is a UI that allows the user to decide whether or not to save the contents of the conversion process based on the user's operation. When it is determined not to save the contents of the conversion process, the control unit 23 does not perform the conversion process on the first input data D1 and ends the information processing. At this time, the control unit 23 may cause the display unit 34 to display the reception window 4 again.

一方、変換処理の内容を保存する決定がされた場合、第１の入力データＤ１に対して変換処理が実行される。この場合、手動変換処理の指定が行われていた場合、第１の入力データＤ１に対して手動変換処理が行われる。一方、手動変換処理の指定が行われていない場合、第１の入力データＤ１に対して自動変換処理が行われる。これにより、第２の入力データＤ２が生成される。なお、生成された第２の入力データＤ２は、記憶部２２に保存されてもよい。その後、学習条件に基づいて第２の入力データＤ２を、特定された学習器ＭＬのそれぞれに入力することで、学習器ＭＬのそれぞれから学習モデルＭ１が生成される。その後、学習モデルＭ１についてのモデル情報表示ウィンドウ７が表示部３４に表示される。 On the other hand, when it is decided to save the contents of the conversion process, the conversion process is executed on the first input data D1. If a manual conversion process has been specified, the manual conversion process is performed on the first input data D1; if not, the automatic conversion process is performed on the first input data D1. The second input data D2 is thereby generated. The generated second input data D2 may be stored in the storage unit 22. The second input data D2 is then input to each of the identified learners ML in accordance with the learning conditions, so that a learning model M1 is generated from each learner ML. After that, the model information display window 7 for the learning model M1 is displayed on the display unit 34.
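The flow described above (manual conversion if specified, otherwise automatic conversion, followed by one learning model per identified learner) can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the names `auto_convert`, `build_d2`, `mean_learner`, and `generate_models`, the row format, and the choice of "drop rows with missing values" as the automatic conversion are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Optional

Row = Dict[str, Optional[float]]
Converter = Callable[[List[Row]], List[Row]]
Model = Callable[[Row], float]


def auto_convert(d1: List[Row]) -> List[Row]:
    """Hypothetical automatic conversion: drop rows that contain missing values."""
    return [row for row in d1 if all(v is not None for v in row.values())]


def build_d2(d1: List[Row], manual_convert: Optional[Converter] = None) -> List[Row]:
    """Use the manual conversion if one was specified, otherwise the automatic one."""
    convert = manual_convert if manual_convert is not None else auto_convert
    return convert(d1)


def mean_learner(d2: List[Row]) -> Model:
    """Toy learner (assumption): always predicts the mean of the 'y' column."""
    mean_y = sum(row["y"] for row in d2) / len(d2)
    return lambda row: mean_y


def generate_models(d2: List[Row], learners: Dict[str, Callable[[List[Row]], Model]]) -> Dict[str, Model]:
    """Feed the same D2 to every identified learner and collect one model per learner."""
    return {name: fit(d2) for name, fit in learners.items()}


d1 = [{"x": 1.0, "y": 2.0}, {"x": None, "y": 5.0}, {"x": 3.0, "y": 4.0}]
d2 = build_d2(d1)  # no manual conversion specified, so the automatic one runs
models = generate_models(d2, {"mean": mean_learner})
```

Passing a function as `manual_convert` corresponds to the case in which the user specified a manual conversion process.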

４－４．モデル情報表示ウィンドウ７について 4-4. Model Information Display Window 7
次に、表示部３４に表示されるモデル情報表示ウィンドウ７の一例について説明する。図９は、表示部３４に表示されるモデル情報表示ウィンドウ７の一例を示す図である。モデル情報表示ウィンドウ７には、生成された学習モデルＭ１に関する情報が表示される。本実施形態では、生成された学習モデルＭ１の１つに関する情報が表示される。モデル情報表示ウィンドウ７は、第２の入力データ情報表示エリア７１と、モデル情報表示エリア７２と、シミュレーション実行ボタン７３と、を含む。 Next, an example of the model information display window 7 displayed on the display unit 34 will be described. FIG. 9 is a diagram showing an example of the model information display window 7 displayed on the display unit 34. The model information display window 7 displays information about the generated learning models M1. In this embodiment, information about one of the generated learning models M1 is displayed. The model information display window 7 includes a second input data information display area 71, a model information display area 72, and a simulation execution button 73.

第２の入力データ情報表示エリア７１には、学習器ＭＬに入力された入力データに関する情報が表示される。本実施形態では、第２の入力データＤ２に関する情報が表示される。例えば、第２の入力データ情報表示エリア７１には、第２の入力データＤ２の容量、サイズなどが表示される。 The second input data information display area 71 displays information about the input data input to the learning device ML. In this embodiment, information about the second input data D2 is displayed. For example, the second input data information display area 71 displays the capacity, size, etc. of the second input data D2.

モデル情報表示エリア７２には、生成された学習モデルＭ１に関するモデル情報ＩＦ１が表示される。本実施形態では、モデル情報表示エリア７２には、少なくとも学習モデルＭ１の予測精度に関する精度情報を含む。モデル情報表示エリア７２は、複数の精度情報表示エリア７２１と、寄与度表示エリア７２２と、寄与度一覧表示ボタン７２５と、を含む。 The model information display area 72 displays model information IF1 regarding the generated learning model M1. In this embodiment, the model information display area 72 includes at least accuracy information about the prediction accuracy of the learning model M1. The model information display area 72 includes a plurality of accuracy information display areas 721 , a contribution display area 722 and a contribution list display button 725 .

精度情報表示エリア７２１には、学習モデルＭ１の精度情報が表示される。本実施形態では、精度情報表示エリア７２１のそれぞれには、異なる精度情報が表示される。具体的には、精度情報表示エリア７２１のそれぞれには、決定係数、平均二乗誤差、平均二乗偏差が、個別に表示されている。精度情報表示エリア７２１には、各精度情報の数値、各精度情報の意味、各精度情報の評価方法、各精度情報の改善方法などが表示されうる。 Accuracy information of the learning model M1 is displayed in the accuracy information display area 721 . In this embodiment, different accuracy information is displayed in each of the accuracy information display areas 721 . Specifically, the coefficient of determination, the mean square error, and the mean square deviation are individually displayed in each of the accuracy information display areas 721 . The accuracy information display area 721 can display a numerical value of each accuracy information, a meaning of each accuracy information, an evaluation method of each accuracy information, an improvement method of each accuracy information, and the like.
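As an illustration of the accuracy information named above, the following sketch computes the coefficient of determination and the mean squared error from observed and predicted outputs. Treating the third metric as the square root of the mean squared error is an assumption, as are the function name `accuracy_info` and the dictionary keys. The sketch also assumes the observed values are not all identical, since the coefficient of determination is undefined in that case.

```python
import math
from typing import Dict, Sequence


def accuracy_info(y_true: Sequence[float], y_pred: Sequence[float]) -> Dict[str, float]:
    """Compute R^2, MSE and RMSE for paired observed/predicted values."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)             # total sum of squares
    mse = ss_res / n
    return {"r2": 1.0 - ss_res / ss_tot, "mse": mse, "rmse": math.sqrt(mse)}


info = accuracy_info([1.0, 2.0, 3.0], [1.0, 2.0, 4.0])
```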

寄与度表示エリア７２２には、学習モデルＭ１の説明変数、すなわち、学習モデルＭ１の生成に用いられた入力ｘ１ごとの出力ｙ１への寄与度が表示される。寄与度は、例えば学習モデルＭ１における入力ｘ１ごとの係数に基づいて導出される。寄与度は、増加寄与度と減少寄与度とを含む。増加寄与度は、入力ｘ１の寄与度のうち出力ｙ１の増加に関与する成分である。減少寄与度は、入力ｘ１の寄与度のうち出力ｙ１の減少に関与する成分である。この場合、学習モデルＭ１における入力ｘ１ごとの係数は、増加寄与度に対応する成分と、減少寄与度に対応する成分と、を含みうる。寄与度表示エリア７２２は、増加寄与度が表示される増加寄与度表示エリア７２３と、減少寄与度が表示される減少寄与度表示エリア７２４と、を含む。 The contribution degree display area 722 displays the explanatory variable of the learning model M1, that is, the degree of contribution to the output y1 for each input x1 used to generate the learning model M1. The contribution is derived, for example, based on the coefficient for each input x1 in the learning model M1. The contribution includes an increase contribution and a decrease contribution. The increase contribution is a component of the contribution of the input x1 that contributes to the increase of the output y1. The decrease contribution is a component of the contribution of the input x1 that contributes to the decrease of the output y1. In this case, the coefficient for each input x1 in the learning model M1 may include a component corresponding to increased contribution and a component corresponding to decreased contribution. The contribution display area 722 includes an increased contribution display area 723 in which the increased contribution is displayed and a decreased contribution display area 724 in which the decreased contribution is displayed.

増加寄与度表示エリア７２３及び減少寄与度表示エリア７２４には、増加寄与度と減少寄与度とが区別可能に表示される。また、増加寄与度表示エリア７２３及び減少寄与度表示エリア７２４には、増加寄与度と減少寄与度とが比較可能に表示される。例えば、増加寄与度表示エリア７２３及び減少寄与度表示エリア７２４には、増加寄与度と減少寄与度とが横棒グラフとして比較可能かつ一覧可能に表示される。本実施形態では、寄与度表示エリア７２２には、全部の入力ｘ１のうちの一部の寄与度が表示される。具体的には、寄与度表示エリア７２２には、全部の入力ｘ１のうち、寄与度が高いものから順に所定の序数、例えば５番目、までのものが表示される。これにより、ユーザは、が出力ｙ１に影響を与えやすい入力ｘ１を認識しやすくなる。 In the increased contribution display area 723 and the decreased contribution display area 724, the increased contribution and the decreased contribution are displayed so that they can be distinguished from each other and compared with each other. For example, in the increased contribution display area 723 and the decreased contribution display area 724, the increased contribution and the decreased contribution are displayed as horizontal bar graphs that can be compared and viewed at a glance. In this embodiment, the contribution display area 722 displays the contributions of only some of the inputs x1. Specifically, of all the inputs x1, the contribution display area 722 displays those ranked from the highest contribution down to a predetermined rank, for example the fifth. This makes it easier for the user to recognize the inputs x1 that are likely to affect the output y1.
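The selection and grouping of contributions can be sketched as follows. This is a simplified illustration: it assumes each input x1 has a single signed coefficient, treats positive coefficients as increase contributions and negative ones as decrease contributions, and ranks by absolute value. The description also allows a coefficient to carry both components, which this sketch does not model. The function name and sample coefficients are hypothetical.

```python
from typing import Dict, List, Tuple

Contribution = Tuple[str, float]


def split_contributions(
    coefficients: Dict[str, float], top_n: int = 5
) -> Tuple[List[Contribution], List[Contribution]]:
    """Keep the top_n inputs by absolute coefficient and split them by sign."""
    ranked = sorted(coefficients.items(), key=lambda kv: abs(kv[1]), reverse=True)[:top_n]
    increase = [(name, c) for name, c in ranked if c > 0]  # pushes the output y1 up
    decrease = [(name, c) for name, c in ranked if c < 0]  # pushes the output y1 down
    return increase, decrease


coefs = {"price": -2.0, "ads": 3.0, "temp": 0.5, "rain": -0.1, "stock": 1.0, "day": 0.2}
increase, decrease = split_contributions(coefs)
```

Calling `split_contributions(coefs, top_n=len(coefs))` corresponds to showing every input, as the contribution list display button does.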

寄与度一覧表示ボタン７２５は、全部の入力ｘ１のうちの一部の寄与度のみが表示されている場合に、ユーザによる操作に基づいて入力ｘ１の寄与度の表示数を増加させるＵＩである。寄与度一覧表示ボタン７２５の操作に基づいて、全部の入力ｘ１の寄与度が表示されてもよい。 The contribution degree list display button 725 is a UI for increasing the display number of the contribution degree of the input x1 based on the user's operation when only a part of the contribution degree of all the inputs x1 is displayed. Based on the operation of the contribution list display button 725, the contribution of all the inputs x1 may be displayed.

シミュレーション実行ボタン７３は、ユーザによる操作に基づいて、学習モデルＭ１を用いた予測シミュレーションを実行するためのＵＩである。例えば、予測シミュレーションは、所定の条件を満たす出力ｙ１に対応する入力ｘ１の探索である。所定の条件とは、例えば、出力ｙ１が予め定められた閾値以上となることや、所定の試行回数において出力ｙ１が最大又は最大となること、などである。入力ｘ１が学習モデルＭ１に入力されると、出力ｙ１が得られる。このとき、予め定められた定義域内で入力ｘ１を変化させることで、入力ｘ１の変化に応じて出力ｙ１が変化する。これにより、出力ｙ１が所定の条件を満たす場合における入力ｘ１が導出される。例えば、出力ｙ１が売上価格である場合で、所定の条件が売上価格の最大化の場合、ユーザは、当該予測シミュレーションにより、売上価格が最大となる入力ｘ１を得ることができる。 The simulation execution button 73 is a UI for executing a predictive simulation using the learning model M1 based on the user's operation. For example, the predictive simulation is a search for an input x1 that corresponds to an output y1 satisfying a predetermined condition. The predetermined condition is, for example, that the output y1 is equal to or greater than a predetermined threshold, or that the output y1 is at its maximum within a predetermined number of trials. When the input x1 is input to the learning model M1, the output y1 is obtained. Varying the input x1 within a predetermined domain causes the output y1 to change accordingly, and the input x1 for which the output y1 satisfies the predetermined condition is thereby derived. For example, if the output y1 is the sales price and the predetermined condition is maximization of the sales price, the user can obtain, through the predictive simulation, the input x1 that maximizes the sales price.
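The search described above can be sketched as an evenly spaced scan over a one-dimensional domain with a fixed number of trials. The single scalar input, the grid strategy, the function name `search_input`, and the toy model are all assumptions made for illustration; the description does not specify how the search is carried out.

```python
from typing import Callable, Tuple


def search_input(
    model: Callable[[float], float], lower: float, upper: float, trials: int
) -> Tuple[float, float]:
    """Scan [lower, upper] with `trials` evenly spaced inputs (trials >= 2)
    and return the input x1 with the largest output y1, plus that output."""
    best_x, best_y = lower, float("-inf")
    for i in range(trials):
        x = lower + (upper - lower) * i / (trials - 1)
        y = model(x)
        if y > best_y:
            best_x, best_y = x, y
    return best_x, best_y


def toy_model(x: float) -> float:
    """Hypothetical learned model: predicted sales price peaks at x = 3."""
    return -(x - 3.0) ** 2 + 10.0


best_x, best_y = search_input(toy_model, lower=0.0, upper=6.0, trials=7)
```

A threshold condition could be handled the same way by returning the first input whose output reaches the threshold instead of tracking the maximum.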

４－５．モデル検索ウィンドウ８及びモデル比較ウィンドウ９について 4-5. Model Search Window 8 and Model Comparison Window 9
制御部２３は、表示部３４にモデル検索ウィンドウ８及びモデル比較ウィンドウ９を表示させてもよい。図１０は、表示部３４に表示されたモデル検索ウィンドウ８及びモデル比較ウィンドウ９の一例である。 The control unit 23 may cause the display unit 34 to display the model search window 8 and the model comparison window 9. FIG. 10 shows an example of the model search window 8 and the model comparison window 9 displayed on the display unit 34.

モデル検索ウィンドウ８は、過去に生成された学習モデルＭ１を検索可能なＵＩを含む。具体的には、モデル検索ウィンドウ８は、検索条件入力エリア８１と、検索結果表示エリア８２と、検索ウィンドウ終了ボタン８３と、を含む。 The model search window 8 includes a UI that allows searching for previously generated learning models M1. Specifically, the model search window 8 includes a search condition input area 81 , a search result display area 82 , and a search window exit button 83 .

検索条件入力エリア８１は、検索に用いられる検索条件を受付可能に構成されている。検索条件は、例えば、学習モデルＭ１の名称、アルゴリズム、出力ｙ１の名称などのキーワード、学習モデルＭ１の学習条件、学習モデルＭ１が生成された時期など、任意である。また、検索条件入力エリア８１は、ユーザによる操作に基づいて、受け付けられた検索条件をもとに学習モデルＭ１の検索を実行可能に構成されている。 The search condition input area 81 is configured to accept search conditions used for searching. The search condition is arbitrary, for example, the name of the learning model M1, the algorithm, keywords such as the name of the output y1, the learning condition of the learning model M1, the time when the learning model M1 was generated, and the like. In addition, the search condition input area 81 is configured to be able to search for the learning model M1 based on the received search condition based on the user's operation.

検索結果表示エリア８２には、検索条件入力エリア８１が受け付けた検索条件に基づく検索結果が表示される。検索結果表示エリア８２には、検索条件に適合する過去の学習モデルＭ１が一覧可能に表示される。検索結果表示エリア８２には、当該過去の学習モデルＭ１のモデル情報ＩＦ１の少なくとも一部がユーザに視認可能に表示されていてもよい。これにより、検索結果の一覧性が向上する。検索結果表示エリア８２に表示される過去の学習モデルＭ１は、ユーザにより指定可能に構成されている。ユーザによる学習モデルＭ１の指定は、チェックボックス等のインジケータにより視認可能に表示される。以下、説明の便宜上、検索結果表示エリア８２にて指定された学習モデルＭ１を、指定学習モデルＭ２という。 The search result display area 82 displays search results based on the search conditions received by the search condition input area 81 . In the search result display area 82, past learning models M1 that match the search conditions are displayed in a listable manner. At least part of the model information IF1 of the past learning model M1 may be displayed in the search result display area 82 so as to be visible to the user. This improves the listability of search results. The past learning model M1 displayed in the search result display area 82 is configured to be designated by the user. Designation of the learning model M1 by the user is visibly displayed by an indicator such as a check box. Hereinafter, for convenience of explanation, the learning model M1 specified in the search result display area 82 will be referred to as a specified learning model M2.
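A search over previously generated learning models by keyword and creation date might look like the following sketch. The record fields, the function name `search_models`, and the sample records are hypothetical; the description only states that conditions such as name, algorithm, output name, learning conditions, and creation time may be used.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class ModelRecord:
    name: str
    algorithm: str
    output_name: str
    created: date
    r2: float  # one piece of model information shown in the result list


def search_models(
    records: List[ModelRecord], keyword: str = "", created_after: Optional[date] = None
) -> List[ModelRecord]:
    """Return records whose name, algorithm or output name contains the keyword
    and which were created on or after the given date (if one is given)."""
    kw = keyword.lower()
    hits = []
    for rec in records:
        text = " ".join([rec.name, rec.algorithm, rec.output_name]).lower()
        if kw not in text:
            continue
        if created_after is not None and rec.created < created_after:
            continue
        hits.append(rec)
    return hits


records = [
    ModelRecord("sales-v1", "linear regression", "sales", date(2022, 1, 10), 0.71),
    ModelRecord("sales-v2", "gradient boosting", "sales", date(2022, 6, 1), 0.83),
    ModelRecord("churn-v1", "logistic regression", "churn", date(2022, 3, 5), 0.64),
]
recent_sales = search_models(records, keyword="sales", created_after=date(2022, 3, 1))
```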

検索ウィンドウ終了ボタン８３は、ユーザの操作に基づき過去の学習モデルＭ１の検索を終了するＵＩである。 The search window end button 83 is a UI for ending the search of the past learning model M1 based on the user's operation.

モデル比較ウィンドウ９では、指定学習モデルＭ２のモデル情報ＩＦ１を比較可能に表示される。モデル比較ウィンドウ９は、比較モデル表示エリア９１と、パラメータ選択エリア９２と、比較結果表示エリア９３と、シミュレーション実行ボタン９４と、を含む。 In the model comparison window 9, the model information IF1 of the designated learning model M2 is displayed so as to be comparable. The model comparison window 9 includes a comparison model display area 91 , a parameter selection area 92 , a comparison result display area 93 and a simulation execution button 94 .

比較モデル表示エリア９１では、検索結果表示エリア８２にて指定された学習モデルＭ１のモデル情報ＩＦ１の少なくとも一部が表示される。比較モデル表示エリア９１では、ユーザが、比較モデル表示エリア９１に表示される指定学習モデルＭ２のうちの１つを指定可能に構成されている。 In the comparison model display area 91, at least part of the model information IF1 of the learning model M1 specified in the search result display area 82 is displayed. The comparison model display area 91 is configured so that the user can designate one of the designated learning models M2 displayed in the comparison model display area 91 .

パラメータ選択エリア９２では、ユーザが指定学習モデルＭ２の生成及び評価に用いられるパラメータを選択可能に構成されている。本実施形態では、２つのパラメータを選択可能に構成されている。指定学習モデルＭ２の生成及び評価に用いられるパラメータは、モデル情報ＩＦ１に含まれる各種精度情報や、指定学習モデルＭ２の学習条件などが含まれる。以下、説明の便宜上、パラメータ選択エリア９２にて選択されるパラメータを、選択パラメータという。 The parameter selection area 92 is configured so that the user can select parameters used for generating and evaluating the designated learning model M2. In this embodiment, two parameters are selectable. Parameters used for generating and evaluating the designated learning model M2 include various accuracy information included in the model information IF1, learning conditions for the designated learning model M2, and the like. Hereinafter, for convenience of explanation, the parameter selected in the parameter selection area 92 will be referred to as the selected parameter.

比較結果表示エリア９３では、指定学習モデルＭ２のそれぞれの選択パラメータを一覧可能な視覚情報が表示される。視覚情報とは、例えば、散布図、ヒストグラム、相関図、三次元プロット図など、任意である。これにより、指定学習モデルＭ２の精度比較を容易に行うことができる。 In the comparison result display area 93, visual information is displayed that allows a list of selected parameters for each of the designated learning models M2. Visual information is arbitrary, for example, a scatter diagram, a histogram, a correlation diagram, a three-dimensional plot diagram, or the like. This makes it possible to easily compare the accuracy of the designated learning model M2.

シミュレーション実行ボタン９４は、ユーザによる操作に基づき、指定学習モデルＭ２を用いた予測シミュレーションを実行するＵＩである。予測シミュレーションに用いられる指定学習モデルＭ２は、例えば比較モデル表示エリア９１にて指定される指定学習モデルＭ２である。 The simulation execution button 94 is a UI for executing a predictive simulation using the designated learning model M2 based on the user's operation. The specified learning model M2 used in the predictive simulation is the specified learning model M2 specified in the comparative model display area 91, for example.

本実施形態では、モデル比較ウィンドウ９は、モデル検索ウィンドウ８と一覧可能に表示されている。これにより、検索結果と指定学習モデルＭ２との比較が容易となる。 In this embodiment, the model comparison window 9 is displayed so as to be viewable with the model search window 8 . This facilitates comparison between the search result and the designated learning model M2.

５．その他 5. Others
前述の実施形態に係る情報処理システム１に関して、以下のような態様を採用してもよい。 Regarding the information processing system 1 according to the above-described embodiment, the following aspects may be adopted.

第１の入力データＤ１及び第２の入力データＤ２は、それぞれ外部データＤ０としてデータベースＤＢ１に記憶されてもよい。これらの外部データＤ０は、所定の条件のもと、他のユーザに提供可能であってもよい。 The first input data D1 and the second input data D2 may each be stored in the database DB1 as the external data D0. These external data D0 may be provided to other users under predetermined conditions.

制御部２３は、データウィンドウ５、変換処理ウィンドウ６、モデル情報表示ウィンドウ７の少なくとも１つに、第１の入力データＤ１に対して行われた変換処理の履歴、いわゆる変換処理のバージョン、を表示させてもよい。これにより、変換処理と精度情報との関係性の類推が容易となる。また、制御部２３は、変換処理のバージョンの管理を行ってもよい。 The control unit 23 displays the history of conversion processing performed on the first input data D1, the so-called conversion processing version, in at least one of the data window 5, the conversion processing window 6, and the model information display window 7. You may let This makes it easy to analogize the relationship between the conversion process and the accuracy information. Also, the control unit 23 may manage versions of conversion processing.

制御部２３は、例えば、第１の入力データＤ１が所定の品質条件を満たさない場合、表示部３４に警告を表示させてもよい。品質条件とは、例えば、第１の入力データＤ１のデータ点の数、容量、外れ値の割合などである。品質条件を満たさない場合とは、例えば、第１の入力データＤ１のデータ点の数が所定の値未満である場合、第１の入力データＤ１の外れ値が所定の基準数より多い場合などである。当該警告は、受付ウィンドウ４、データウィンドウ５、変換処理ウィンドウ６、モデル情報表示ウィンドウ７、モデル検索ウィンドウ８、及びモデル比較ウィンドウ９のうちの少なくとも１つでも、それ以外のウィンドウでもよい。なお、当該警告は、表示部３４に表示されるものに限られず、音、振動、光など任意の態様で実現可能である。 For example, when the first input data D1 does not satisfy a predetermined quality condition, the control unit 23 may cause the display unit 34 to display a warning. The quality condition relates to, for example, the number of data points, the capacity, or the proportion of outliers of the first input data D1. The quality condition is not satisfied when, for example, the number of data points in the first input data D1 is less than a predetermined value, or the number of outliers in the first input data D1 exceeds a predetermined reference number. The warning may be displayed in at least one of the reception window 4, the data window 5, the conversion processing window 6, the model information display window 7, the model search window 8, and the model comparison window 9, or in another window. Note that the warning is not limited to one displayed on the display unit 34 and can be realized in any form, such as sound, vibration, or light.
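The quality check can be sketched as follows. The thresholds, the function name `quality_warnings`, and in particular the outlier definition (a value more than `z_threshold` population standard deviations from the mean) are assumptions; the description does not say how outliers are identified.

```python
import statistics
from typing import List, Sequence


def quality_warnings(
    values: Sequence[float], min_points: int = 30, max_outliers: int = 0, z_threshold: float = 2.0
) -> List[str]:
    """Return warning messages for a single numeric column of the input data D1."""
    warnings = []
    if len(values) < min_points:
        warnings.append(f"only {len(values)} data points (fewer than {min_points})")
    if len(values) >= 2:
        mean = statistics.fmean(values)
        sd = statistics.pstdev(values)
        # Assumed outlier rule: z-score above the threshold.
        outliers = sum(1 for v in values if sd > 0 and abs(v - mean) / sd > z_threshold)
        if outliers > max_outliers:
            warnings.append(f"{outliers} outliers (more than {max_outliers})")
    return warnings


column = [10.0] * 9 + [1000.0]
messages = quality_warnings(column)
```

The returned messages could be shown in any of the windows or mapped to a non-visual alert.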

情報処理装置２は、オンプレミス形態であってもよく、クラウド形態であってもよい。クラウド形態の情報処理装置２としては、例えば、ＳａａＳ（ＳｏｆｔｗａｒｅａｓａＳｅｒｖｉｃｅ）、クラウドコンピューティングという形態で、上述の機能や処理を提供してもよい。 The information processing device 2 may be in an on-premise form or in a cloud form. The cloud-type information processing apparatus 2 may provide the above functions and processes in the form of, for example, SaaS (Software as a Service) or cloud computing.

以上の実施形態では、情報処理装置２が種々の記憶・制御を行ったが、情報処理装置２に代えて、複数の外部装置が用いられてもよい。すなわち、ブロックチェーン技術等を用いて、第１の入力データＤ１、第２の入力データＤ２、学習モデルＭ１を分散して複数の外部装置に記憶させてもよい。 In the above embodiment, the information processing device 2 performs various types of storage and control, but instead of the information processing device 2, a plurality of external devices may be used. That is, the first input data D1, the second input data D2, and the learning model M1 may be distributed and stored in a plurality of external devices using blockchain technology or the like.

次に記載の各態様で提供されてもよい。 It may be provided in each aspect described below.

（２）前記情報処理システムにおいて、前記モデル情報は、少なくとも前記学習モデルの予測精度に関する精度情報を含み、前記モデル表示ステップでは、生成される前記学習モデルごとの前記精度情報を比較可能に表示させる、もの。 (2) In the information processing system, the model information includes at least accuracy information about the prediction accuracy of the learning model, and in the model display step, the accuracy information for each of the generated learning models is displayed in a comparable manner.

このような構成によれば、ユーザは第１の入力データに適した学習モデルを、精度情報に基づき判断することが可能となる。したがって、ユーザに要求されるデータサイエンスに関する知見の水準を、さらに下げることができる。 With such a configuration, the user can determine a learning model suitable for the first input data based on the accuracy information. Therefore, the level of knowledge about data science required of users can be further lowered.

（３）前記情報処理システムにおいて、さらに、学習器選択受付ステップでは、特定された前記学習器に対する選択をユーザより受け付け、前記モデル表示ステップでは、特定された前記学習器のうち、前記選択により選択された前記学習器を用いて生成される前記モデル情報を表示させる、もの。 (3) In the information processing system, further, in a learner selection accepting step, a selection from among the identified learners is accepted from the user, and in the model display step, the model information generated using the learner chosen by that selection, from among the identified learners, is displayed.

このような構成によれば、ユーザは、ユーザ自身の利用態様に合わせて、学習モデルを生成させる学習器を選択することができるため、利便性の向上を図ることができる。 According to such a configuration, the user can select a learning device for generating a learning model in accordance with the user's usage mode, thereby improving convenience.

（４）前記情報処理システムにおいて、さらに、データ処理ステップでは、入力された前記第１の入力データを、特定された前記学習器に入力可能な態様である第２の入力データに変換する変換処理を実行し、前記モデル情報は、前記学習器によって前記第２の入力データを用いて生成される前記学習モデルに関する、もの。 (4) In the information processing system, further, in a data processing step, a conversion process is executed that converts the input first input data into second input data in a form that can be input to the identified learners, and the model information relates to the learning models generated by the learners using the second input data.

このような構成によれば、ユーザが第１の入力データを、特定された学習器のそれぞれに入力可能な第２の入力データに変換する労力を軽減することができる。 According to such a configuration, it is possible to reduce the user's effort to convert the first input data into the second input data that can be input to each of the identified learners.

（５）前記情報処理システムにおいて、さらに、処理表示ステップでは、前記第１の入力データと前記第２の入力データとの差異点を認識可能な態様で表示させる、もの。 (5) In the information processing system, further, in the process display step, a difference between the first input data and the second input data is displayed in a recognizable manner.

このような構成によれば、ユーザは、第１の入力データと第２の入力データとの差異点に基づき、変換処理を把握することができる。したがって、ユーザにとって学習モデルがブラックボックス化する可能性を低減することができる。 According to such a configuration, the user can understand the conversion process based on the difference between the first input data and the second input data. Therefore, it is possible to reduce the possibility that the learning model becomes a black box for the user.

（６）前記情報処理システムにおいて、さらに、処理条件表示ステップでは、少なくとも入力された前記第１の入力データと、特定された前記学習器と、に基づき、前記変換処理が行われる条件を認識可能な態様で表示させる、もの。 (6) In the information processing system, further, in the processing condition display step, the conditions for performing the conversion processing can be recognized based on at least the input first input data and the specified learning device. A thing that is displayed in a suitable manner.

このような構成によれば、ユーザは、変換処理が行われる根拠を条件として認識することができるため、学習モデルがブラックボックス化する可能性をさらに低減することができる。 According to such a configuration, the user can recognize the grounds for the conversion process as a condition, thereby further reducing the possibility of the learning model becoming a black box.

（７）前記情報処理システムにおいて、さらに、分析手法選択受付ステップでは、複数の分析手法のうちの前記学習モデルの生成に用いられる少なくとも１つの選択を受け付ける、もの。 (7) In the information processing system, further, the analysis method selection receiving step receives selection of at least one of the plurality of analysis methods to be used for generating the learning model.

このような構成によれば、ユーザが学習モデルの利用態様に応じて分析手法を選択することができるため、さらなる利便性の向上を図ることができる。 According to such a configuration, the user can select the analysis method according to the usage mode of the learning model, so it is possible to further improve the convenience.

（８）前記情報処理システムにおいて、前記分析手法は、分類分析、回帰分析、及び時系列分析のうちの少なくとも１つを含む、もの。 (8) In the information processing system, the analysis method includes at least one of classification analysis, regression analysis, and time series analysis.

このような構成によれば、分析手法のなかでも特に汎用性の高い、分類分析、回帰分析、及び時系列分析のうちの少なくとも１つを用いることが可能となるため、さらなる利便性の向上を図ることができる。 According to such a configuration, at least one of classification analysis, regression analysis, and time series analysis, which are particularly versatile among analysis methods, can be used, so convenience can be further improved.

（９）前記情報処理システムにおいて、前記第１の入力データは、少なくともユーザが保有する保有データを含む、もの。 (9) In the information processing system, the first input data includes at least held data held by a user.

このような構成によれば、保有データに含まれるユーザ固有の条件が学習モデルに反映可能となるため、さらなる予測精度の向上を図ることができる。 According to such a configuration, the user-specific conditions included in the held data can be reflected in the learning model, so that the prediction accuracy can be further improved.

（１０）前記情報処理システムにおいて、前記第１の入力データは、少なくとも構造化データを含む、もの。 (10) In the information processing system, the first input data includes at least structured data.

このような構成によれば、第１の入力データの構造に基づき、学習モデルの予測精度のさらなる向上を図ることができる。 According to such a configuration, it is possible to further improve the prediction accuracy of the learning model based on the structure of the first input data.

（１１）情報処理方法であって、前記情報処理システムの各ステップを含む、もの。 (11) An information processing method, comprising steps of the information processing system.

（１２）情報処理プログラムであって、コンピュータに、前記情報処理システムの各ステップを実行させる、もの。 (12) An information processing program that causes a computer to execute each step of the information processing system.
もちろん、この限りではない。 Of course, the aspects are not limited to these.

さらに、以下のような観点にも留意されたい。 Furthermore, the following points should also be noted.

深層学習（ＤｅｅｐＬｅａｒｎｉｎｇ、ＤＬ）をはじめとする機械学習（ＭａｃｈｉｎｅＬｅａｒｎｉｎｇ、ＭＬ）の技術を様々な局面で利用しようとする動きが加速し、一種のブームとも言える状況が生まれている。しかしこのような盛り上がりに反し、ＭＬ導入のプロジェクトの８５％が失敗し、ＭＬやＡＩ（ＡｒｔｉｆｉｃｉａｌＩｎｔｅｌｌｉｇｅｎｃｅ、人工知能）技術を活用できている企業は１０％、情報系企業ですら１７％にとどまると言われる。 The movement to use machine learning (ML) technologies, including deep learning (DL), in a wide range of situations is accelerating, creating what could be called a kind of boom. Contrary to this excitement, however, it is said that 85% of ML introduction projects fail, that only 10% of companies are able to make use of ML and AI (artificial intelligence) technology, and that even among information technology companies the figure is only 17%.

これには様々な原因がある。第１にＭＬやＡＩがいかなる問題に対して有効かの理解が簡単ではないこと、第２にＭＬを使うためにはどういうデータを用意すればよいのか、どのようにデータの加工と前処理をすればよいのかが経験と勘に依存すること、第３にデータを大量に準備することが容易ではないこと、第４にＭＬやＡＩのモデルをどう構築したらよいのかの理解が簡単ではなく、しかも経験と勘に依存すること、第５にＭＬの一手法であるＤＬからなぜ欲する出力を得られるのかの理解が困難なこと、第６に以上のように理解が進まない結果として満足できる性能を得ることができないことなどが挙げられる。 There are various causes for this. First, it is not easy to understand which problems ML and AI are effective for. Second, deciding what data to prepare in order to use ML, and how to process and preprocess that data, depends on experience and intuition. Third, it is not easy to prepare a large amount of data. Fourth, it is not easy to understand how to build an ML or AI model, and doing so again depends on experience and intuition. Fifth, it is difficult to understand why the desired output can be obtained from DL, which is one ML technique. Sixth, as a result of this lack of understanding, satisfactory performance cannot be obtained.

上述のとおり、ＭＬを成功裏に活用するためには様々な障害が存在する反面、インターネット上には多くのＭＬサービスやＡＩサービスが存在し、どれを使えばよいのか分からないというカオス的状況にもある。 As described above, there are various obstacles to using ML successfully; at the same time, many ML services and AI services exist on the Internet, creating a chaotic situation in which it is unclear which one to use.

その上、上記のＭＬサービス、ＡＩサービスを使いこなすためにはたくさんのパラメータを入力しなければならず、パラメータの意味の理解も難しく、ＭＬやＡＩの専門家でなければ使いこなせないという現実も存在する。いわば、ＭＬサービスやＡＩサービスは専門家以外にも使える民主化されたサービスとはなっていなかった。 In addition, in order to use the above ML and AI services, many parameters must be entered, and it is difficult to understand the meaning of the parameters, and there is a reality that only ML and AI experts can use them. . In other words, ML services and AI services were not democratized services that could be used by non-specialists.

前述の状況を鑑み、専門的な知識を有していなくても使いこなすことができ、入力データを準備さえすれば３ステップでＭＬサービスを使うことができ、得られた結果に対する解析を提供し、さらには予測も行うことのできる技術を提供することにより、誰でもＭＬサービスを利用できる環境を創出することが本発明の目的である。これによりＭＬサービスが民主化される。 In view of the above-mentioned situation, it is possible to use ML services without specialized knowledge, and if you prepare input data, you can use ML services in 3 steps, and provide analysis of the obtained results, Furthermore, it is an object of the present invention to create an environment in which anyone can use ML services by providing a technique that can also perform prediction. This democratizes ML services.

上記課題を解決するための技術的思想は、インターネット上に存在する多くのＭＬ（以下ＡｕｔｏＭＬと呼ぶ）サービスへ接続するためのラッピング・インターフェースシステムを提供することである。これにより、データの収集、前処理、アップロードなどのデータ準備（ステップ１）、モデル構築と複数のＭＬの並行的実行（ステップ２）、各ＭＬの性能比較と実業務への導入（ステップ３）の３ステップでＭＬの導入が可能となる。 The technical idea for solving the above problems is to provide a wrapping interface system for connecting to the many ML services (hereinafter referred to as AutoML services) that exist on the Internet. This makes it possible to introduce ML in three steps: data preparation such as data collection, preprocessing, and uploading (step 1); model construction and parallel execution of multiple MLs (step 2); and performance comparison of each ML and introduction into actual work (step 3).
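One way to picture such a wrapping interface is an adapter per service that converts a common row format into whatever that service expects. The sketch below is purely illustrative: the class names, the service names `service_a` and `service_b`, and the CSV and JSON payload shapes are hypothetical and do not correspond to any real AutoML service, and no network call is made.

```python
import json
from abc import ABC, abstractmethod
from typing import Dict, List

Row = Dict[str, float]


class AutoMLAdapter(ABC):
    """Common interface; each subclass hides one service's input format."""

    name: str

    @abstractmethod
    def to_service_format(self, rows: List[Row]) -> str:
        ...


class CsvAdapter(AutoMLAdapter):
    name = "service_a"  # hypothetical service that accepts CSV text

    def to_service_format(self, rows: List[Row]) -> str:
        header = list(rows[0].keys())
        lines = [",".join(header)]
        lines += [",".join(str(row[col]) for col in header) for row in rows]
        return "\n".join(lines)


class JsonAdapter(AutoMLAdapter):
    name = "service_b"  # hypothetical service that accepts a JSON document

    def to_service_format(self, rows: List[Row]) -> str:
        return json.dumps({"instances": rows})


def prepare_payloads(adapters: List[AutoMLAdapter], rows: List[Row]) -> Dict[str, str]:
    """Convert one common data set into a payload for every connected service."""
    return {adapter.name: adapter.to_service_format(rows) for adapter in adapters}


rows = [{"x": 1.0, "y": 2.0}]
payloads = prepare_payloads([CsvAdapter(), JsonAdapter()], rows)
```

Adding a further service then only requires one more adapter class, which is the point of unifying the entry procedure in front of the format conversion.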

そのためにまずアカウント設定やパラメータ入力手順等を一元化し、次に各ＡｕｔｏＭＬへとフォーマット変換を施す。これにより１０～１５ステップが必要であったアカウント作成を３ステップで行うことが可能となる。 For this purpose, the account setting, parameter input procedure, etc. are first unified, and then format conversion is applied to each AutoML. This makes it possible to create an account in 3 steps, which used to take 10 to 15 steps.

次に、社内外のデータを収集する。このために必要な社内外データへのアクセスポイントに対して自動的に、あるいはユーザーの介入と補助を得ながら接続が行なわれ、データが収集される。 Next, collect internal and external data. The necessary access points to internal and external data for this are connected automatically or with user intervention and assistance, and the data is collected.

続いて、入力データの加工を行う。以下に限られないが、これにはデータのクレンジングとして日付データなどの形式の一元的形式への変換、欠損の多いデータ項目の処理などを行い、原データから統計的処理を含む前処理を適用して目的に適したデータに変換すること、クエリを使用してデータ抽出やデータ結合などを行うことなどが含まれる。 Next, the input data is processed. This includes, but is not limited to: data cleansing, such as converting date data and the like into a single unified format and handling data items with many missing values; applying preprocessing, including statistical processing, to the original data to convert it into data suited to the purpose; and using queries to perform data extraction, data joining, and the like.
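Two of the cleansing operations mentioned above, unifying date formats and handling data items with many missing values, can be sketched as follows. The accepted date formats, the 50% missing-value threshold, the choice to drop sparse columns, and the function names are assumptions made for the example.

```python
from datetime import datetime
from typing import Dict, List, Optional

# Assumed set of input date formats; a real data set may need others.
DATE_FORMATS = ("%Y-%m-%d", "%Y/%m/%d", "%d.%m.%Y")


def normalize_date(text: str) -> Optional[str]:
    """Convert a date string in any known format to ISO 8601, or None if unparseable."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def drop_sparse_columns(rows: List[Dict[str, object]], max_missing_ratio: float = 0.5) -> List[Dict[str, object]]:
    """Remove columns whose share of missing (None) values exceeds the threshold."""
    columns = list(rows[0].keys())
    keep = [
        col for col in columns
        if sum(row.get(col) is None for row in rows) / len(rows) <= max_missing_ratio
    ]
    return [{col: row.get(col) for col in keep} for row in rows]


raw = [
    {"date": "2021/11/26", "a": 1.0, "b": None},
    {"date": "26.11.2021", "a": 2.0, "b": None},
    {"date": "2021-11-26", "a": None, "b": 3.0},
]
cleaned = drop_sparse_columns(raw)
```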

このとき、必要に応じて加工後のデータを表示して確認と修正を行ってもよい。 At this time, if necessary, the processed data may be displayed for confirmation and correction.

次に、ＭＬモデルの準備を行う。インターネット上に存在する各種ＭＬサービスの利用に限られず、ＧＵＩ（ＧｒａｐｈｉｃａｌＵｓｅｒＩｎｔｅｒｆａｃｅ）ベースによってプログラミングを行わずに独自モデルを構築する方法、インターネット上に存在する各種ＭＬサービスの修正を行って独自モデルを構築する方法、および既に構築されているがインターネット上には公開されていないＭＬモデルを本発明システムへインポートを行う方法等によって行われる。 Next, prepare the ML model. It is not limited to the use of various ML services that exist on the Internet, but also a method of constructing an original model without programming based on a GUI (Graphical User Interface), and modifying various ML services that exist on the Internet to create an original model. This is done by a method of constructing and a method of importing an ML model that has already been constructed but has not been published on the Internet into the system of the present invention.

さらには、プログラミングすることなく、ＭＬや統計分析を可能にする機能も提供される。加えて、どのようなテンプレートでモデル構築を行えば精度の高いモデルが構築できるかについて、入力データからリコメンドする機能も提供される。 It also provides functionality that enables ML and statistical analysis without programming. In addition, a function is also provided to recommend from the input data what kind of template should be used to build a model with high accuracy.

ＭＬに入力されるデータを学習データと予測データに分割し、学習データによって学習したＭＬに予測データを入力してもよい。予測データはＭＬ性能比較等のためにこれ以降使われる。 The data input to the ML may be divided into learning data and prediction data, and the prediction data may be input to the ML that has been learned using the learning data. The prediction data is used hereafter for ML performance comparisons, etc.
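A minimal sketch of dividing the data into learning data and prediction data is shown below. The 80/20 ratio, the shuffling, the fixed seed, and the function name `split_data` are assumptions; the description does not specify how the division is made.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_data(rows: Sequence[T], prediction_ratio: float = 0.2, seed: int = 0) -> Tuple[List[T], List[T]]:
    """Shuffle the rows reproducibly and split them into (learning, prediction)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_pred = max(1, int(len(shuffled) * prediction_ratio))  # keep at least one row for prediction
    return shuffled[n_pred:], shuffled[:n_pred]


learning, prediction = split_data(list(range(10)))
```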

入力データとＭＬモデルの準備が終了したら、学習データを用いて学習が開始される。 Once the input data and the ML model have been prepared, training is started using the learning data.
このとき複数のＭＬを並行的に実行させてもよい。 At this time, multiple MLs may be executed in parallel.
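Running several learners in parallel on the same learning data can be sketched with a thread pool, which suits the case where each learner is a remote service call that mostly waits on the network. The toy learners `fit_mean` and `fit_median` and the function name `train_in_parallel` are hypothetical stand-ins for real ML services.

```python
import statistics
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Sequence


def fit_mean(data: Sequence[float]) -> float:
    """Toy learner (assumption): the 'model' is just the mean."""
    return statistics.fmean(data)


def fit_median(data: Sequence[float]) -> float:
    """Toy learner (assumption): the 'model' is just the median."""
    return statistics.median(data)


def train_in_parallel(
    learners: Dict[str, Callable[[Sequence[float]], float]], learning_data: Sequence[float]
) -> Dict[str, float]:
    """Submit every learner at once and collect the results by learner name."""
    with ThreadPoolExecutor(max_workers=len(learners)) as pool:
        futures = {name: pool.submit(fit, learning_data) for name, fit in learners.items()}
        return {name: future.result() for name, future in futures.items()}


results = train_in_parallel({"mean": fit_mean, "median": fit_median}, [1.0, 2.0, 3.0, 10.0])
```

Because all results come back keyed by learner name, they can be compared as soon as the last learner finishes.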

学習が終わったら、予測データが自動的あるいはユーザーの指示によって各ＭＬサービスに投入され、結果を得る。このとき複数のＡｕｔｏＭＬが並行的に実行されていれば、即座に性能比較ができる。 After training, predictive data is injected into each ML service automatically or by user's instruction to obtain results. If multiple AutoMLs are executed in parallel at this time, the performance can be compared immediately.

各ＡｕｔｏＭＬサービスの結果を表示する。これにはグラフィカルな可視化を含み、各ＭＬサービスの予測精度レベル（決定係数）、項目の寄与度の比較などが表示される。 View the results for each AutoML service. This includes graphical visualizations showing the level of prediction accuracy (coefficient of determination) for each ML service, comparison of item contributions, etc.

上記のデータ収集～結果の表示と比較までを繰り返し、実業務に投入が可能だとユーザーによって判断されたら、運用が開始される。 The above data collection, display and comparison of results are repeated, and when the user decides that it can be put into actual work, the operation is started.

運用において、用意されたＡＰＩ（ＡｐｐｌｉｃａｔｉｏｎＰｒｏｇｒａｍＩｎｔｅｒｆａｃｅ）によってアプリケーションプログラムから本システムへ問い合わせを行うことによって結果がアプリケーションプログラムによって活用すること、あるいは本システムから直接結果を表示することも本発明の範囲である。 In operation, it is within the scope of the present invention that the results are utilized by the application program by inquiring from the application program to this system using a prepared API (Application Program Interface), or that the results are displayed directly from this system. .

運用の自動化のためにＫｕｂｅｆｌｏｗを含むＭＬプラットフォームを利用しても良い。 ML platforms including Kubeflow may be used for automation of operations.

上記一連の操作をパイプライン化し、操作の単純化を図っても良い。パイプライン化することにより自由度は下がるが全体の見通しが良くなり、専門家でなくても扱うことが可能となる。もし自由度を上げる必要があるときには、詳細画面を開く等によって専門的な項目設定を行っても良い。 The above series of operations may be pipelined to simplify the operations. Pipelining reduces the degree of freedom, but improves the overall outlook and allows non-experts to handle it. If it is necessary to increase the degree of freedom, specialized item settings may be performed by opening a detailed screen or the like.

Furthermore, changes and operation histories can be recorded and viewed along the flow of the pipeline, which makes it possible to show the grounds for the choice of ML service or model.
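
As one possible sketch, assuming nothing about the actual implementation, a pipeline that records an operation history for each stage might look as follows in Python. The class name `Pipeline`, the stage names, and the sample data are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Callable

@dataclass
class Pipeline:
    """Runs named stages in order and records what each stage did."""
    stages: list = field(default_factory=list)
    history: list = field(default_factory=list)

    def add(self, name: str, func: Callable[[Any], Any]) -> "Pipeline":
        """Append a stage and return the pipeline so calls can be chained."""
        self.stages.append((name, func))
        return self

    def run(self, data: Any) -> Any:
        """Apply every stage in order, logging the input and output of each."""
        for name, func in self.stages:
            before = data
            data = func(data)
            self.history.append({"stage": name, "input": before, "output": data})
        return data

# Hypothetical two-stage pipeline: drop missing values, then rescale.
pipeline = (
    Pipeline()
    .add("drop_missing", lambda rows: [r for r in rows if r is not None])
    .add("scale", lambda rows: [r / 10 for r in rows])
)
output = pipeline.run([10, None, 30])
```

After the run, `pipeline.history` holds one entry per stage, which is the kind of record that could later be shown to justify why a given preprocessing step or model was chosen.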

After the ML model to be adopted and the data preprocessing method it requires have been decided, the model is introduced into actual business. For example, when a sales forecast is wanted as the ML output, the user may view the screen displayed by the system of the present invention directly, or, as needed, an application program may issue a request for information through the API provided by the system of the present invention and display the result on the screen of the application program.

Therefore, in order to solve the above problems, an information processing method according to a first aspect comprises: a first step of unifying initial setup work including at least one of account setup and parameter input procedure setup; a second step of applying format conversion for connecting to automated machine learning services that may exist on the Internet; a third step of collecting data from inside and outside the company; a fourth step of processing the collected data; a fifth step of preparing the automated machine learning service to be used; and a sixth step of splitting the data processed in the fourth step into learning data and prediction data and training on the learning data with the automated machine learning service prepared in the fifth step.

In order to solve the above problems, an information processing apparatus according to a second aspect comprises: a unification unit capable of unifying initial setup work including at least one of account setup and parameter input procedure setup; a format conversion unit that applies format conversion for connecting to automated machine learning services that may exist on the Internet; a data collection unit that collects data from inside and outside the company; a data processing unit that processes the collected data; a preparation unit that prepares the automated machine learning service to be used; and a learning unit that splits the data processed by the data processing unit into learning data and prediction data and trains on the learning data with the automated machine learning service prepared by the preparation unit.

According to the two aspects above, even a user who is not an expert in machine learning techniques such as deep learning can, simply by preparing learning data, select and/or build learning models, compare performance based on the results of multiple learning models, select the most suitable of those models, put it into actual business use, and receive support for operation after deployment.

As a third aspect, in the second aspect, the collected and prepared input data may be converted to suit each of the many automated machine learning services that exist on the Internet. According to this aspect, the process of preparing different input data for each machine learning service can be omitted. This third aspect can also be applied in combination with the first aspect.
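
The per-service conversion of this aspect might be sketched as follows. This is an illustration under stated assumptions only: the service names `service_a` and `service_b` and the formats attributed to them (CSV and JSON Lines) are hypothetical, since the specification does not state which formats any particular service expects.

```python
import csv
import io
import json

def to_csv(rows):
    """Serialize a list of dicts as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()

def to_json_lines(rows):
    """Serialize a list of dicts as JSON Lines (one JSON object per line)."""
    return "\n".join(json.dumps(row) for row in rows) + "\n"

# Hypothetical mapping from service name to the converter for its input format.
CONVERTERS = {"service_a": to_csv, "service_b": to_json_lines}

def convert_for_services(rows):
    """Produce one payload per service from a single prepared data set."""
    return {name: convert(rows) for name, convert in CONVERTERS.items()}

payloads = convert_for_services(
    [{"store": "A", "sales": 120}, {"store": "B", "sales": 80}]
)
```

The user prepares the data once, and the mapping in `CONVERTERS` absorbs the differences between services.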

As a fourth aspect, in the second aspect, at least one of the following may be executed: simple format conversion of the input data; data cleansing, including handling of missing data or of duplicate and unnecessary data; extraction of features from the original data; conversion into data suited to the purpose by applying preprocessing that includes statistical processing; and data transformation, including data extraction or data joining using queries. According to this aspect, each of these operations can be executed by giving simple instructions. This fourth aspect can also be applied in combination with the first aspect.
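
A minimal sketch of three of the operations listed above (data cleansing, simple format conversion, and statistical preprocessing) is given below. The function names, the record layout, and the choice of standardization as the statistical step are assumptions made for illustration and are not taken from the specification.

```python
import statistics

def cleanse(rows, required):
    """Drop rows with missing required fields and exact duplicate rows."""
    seen = set()
    cleaned = []
    for row in rows:
        if any(row.get(key) in (None, "") for key in required):
            continue  # missing data
        fingerprint = tuple(sorted(row.items()))
        if fingerprint in seen:
            continue  # duplicate data
        seen.add(fingerprint)
        cleaned.append(row)
    return cleaned

def standardize(values):
    """Shift values to zero mean and scale them to unit (population) variance."""
    mean = statistics.fmean(values)
    spread = statistics.pstdev(values)
    return [(value - mean) / spread for value in values]

# Hypothetical raw records: one duplicate row and one row with a missing value.
raw = [
    {"store": "A", "sales": "120"},
    {"store": "A", "sales": "120"},
    {"store": "B", "sales": ""},
    {"store": "C", "sales": "80"},
]
cleaned = cleanse(raw, required=["store", "sales"])
sales = [float(row["sales"]) for row in cleaned]  # simple format conversion
scaled = standardize(sales)
```

Note that `standardize` would divide by zero if all values were equal; a fuller implementation would need to handle that case.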

As a fifth aspect, the second aspect may further comprise: a listing unit that can list multiple machine learning services or machine learning models existing on the Internet; a selection unit with which connection to any of the machine learning services or machine learning models listed by the listing unit is selected; and an execution unit that, upon selection by the selection unit, executes in a batch at least one of data submission to, parallel execution of, and acquisition and comparison of results from multiple machine learning services or machine learning models. This fifth aspect can also be applied in combination with the first aspect.

As a sixth aspect, in the second aspect, the operations of the learning unit and/or the preparation unit may be carried out through graphical user interface means. According to this aspect, in addition to selecting the machine learning services described above, the user can build an original machine learning model using a technique based on a graphical user interface and/or import a machine learning model that is available as public information. This sixth aspect can also be applied in combination with the first aspect.

As a seventh aspect, in the second aspect, when the learning unit splits the processed data into the learning data and the prediction data, the input data for machine learning may be split into data for learning and data for performance comparison and/or prediction. According to this aspect, the input data for machine learning can be split into data for learning and data for performance comparison and/or prediction and used accordingly. This seventh aspect can also be applied in combination with the first aspect.
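
For illustration, such a split might be implemented as below. The 80/20 ratio, the fixed random seed, and the function name `split_data` are assumptions. The specification does not state how the split is performed.

```python
import random

def split_data(rows, learn_ratio=0.8, seed=0):
    """Shuffle a copy of rows and split it into (learning_data, prediction_data)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible
    cut = int(len(shuffled) * learn_ratio)
    return shuffled[:cut], shuffled[cut:]

learning_data, prediction_data = split_data(range(10))
```

Random shuffling suits independent records. For time-series data, a chronological split (earlier records for learning, later records for prediction) would usually be more appropriate.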

As an eighth aspect, the second aspect may further comprise an index providing unit that provides indices for comparing the performance of multiple machine learning services or machine learning models run on the same input data. This eighth aspect can also be applied in combination with the first aspect.

As a ninth aspect, in the eighth aspect, indices for comparing the performance of machine learning services and machine learning models may be presented, the indices including at least one of: coefficient of determination, mean absolute error, mean squared deviation, item contribution, comparison of model predictions with actual values, and residual histogram. This ninth aspect can also be applied in combination with the first aspect as combined with the eighth aspect.
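
Three of the indices named above can be computed as in the following sketch. It assumes the standard definitions: the coefficient of determination as 1 minus the ratio of the residual sum of squares to the total sum of squares, and the mean squared deviation as the mean of the squared residuals. The specification does not spell out its own definitions, and the sample values are illustrative.

```python
def regression_metrics(actual, predicted):
    """Return R^2, mean absolute error, and mean squared deviation."""
    n = len(actual)
    residuals = [a - p for a, p in zip(actual, predicted)]
    mean_actual = sum(actual) / n
    ss_res = sum(r * r for r in residuals)                 # residual sum of squares
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)   # total sum of squares
    return {
        "r2": 1 - ss_res / ss_tot,
        "mae": sum(abs(r) for r in residuals) / n,
        "msd": ss_res / n,
    }

# Toy values: the last prediction is off by 2.
metrics = regression_metrics(actual=[2, 4, 6, 8], predicted=[2, 4, 6, 10])
```

Computing the same dictionary for every service on the same prediction data gives directly comparable figures. The list `residuals` is also the input from which a residual histogram would be drawn.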

As a tenth aspect, the second aspect may further comprise a selection unit that selects from among multiple machine learning services and the results of their respective machine learning models. According to this aspect, the most suitable candidate can be selected from among the multiple machine learning services and the results of their respective machine learning models and put into actual business use. This tenth aspect can also be applied in combination with the first aspect.
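
Where the selection is driven by a single index, it could be as simple as the sketch below. The service names and scores are made-up illustrative values, and selection by a single index is itself an assumption. In the system described, the final decision rests with the user.

```python
def select_best(scores, higher_is_better=True):
    """Return (name, score) of the best candidate from a {name: score} mapping."""
    pick = max if higher_is_better else min
    name = pick(scores, key=scores.get)
    return name, scores[name]

# Illustrative coefficients of determination for three hypothetical services.
r2_by_service = {"service_a": 0.81, "service_b": 0.93, "service_c": 0.88}
best_name, best_r2 = select_best(r2_by_service)

# For error-type indices such as mean absolute error, lower is better.
best_by_mae, _ = select_best({"service_a": 1.2, "service_b": 0.7},
                             higher_is_better=False)
```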

As an eleventh aspect, the second aspect may further comprise a maintenance management unit capable of maintaining and/or managing the accuracy of the machine learning service and the machine learning model. According to this aspect, a function is provided for maintaining and/or managing the accuracy of the machine learning service and the machine learning model after deployment. This eleventh aspect can also be applied in combination with the first aspect.

As a twelfth aspect, the second aspect may further comprise a pipeline unit that pipelines the operations of collecting and preparing the data, running multiple machine learning services and machine learning models in parallel, comparing the performance of the machine learning services and machine learning models, and deploying to actual business. According to this aspect, these operations are pipelined, making the overall flow easier to follow. This twelfth aspect can also be applied in combination with the first aspect.

As a thirteenth aspect, the twelfth aspect may further comprise a user intervention unit that allows the user to intervene as needed at various intermediate stages of the pipelined processing. According to this aspect, the user may intervene as needed at various intermediate stages of the pipelined processing to make detailed settings and perform detailed operations. This thirteenth aspect can also be applied in combination with the first aspect as combined with the twelfth aspect.

As a fourteenth aspect, the second aspect may further comprise a data request unit through which an application program requests the data via an application program interface in order to obtain processing results of the machine learning service or the machine learning model. According to this aspect, in order to obtain processing results of a machine learning service or machine learning model deployed to actual business, an application program may request data from the system of the present invention via the application program interface, and each application program may perform processing, including display, on that data. This fourteenth aspect can also be applied in combination with the first aspect.

As a fifteenth aspect, in the second aspect, the screens of at least one of the unification unit, the format conversion unit, the data collection unit, the data processing unit, the preparation unit, and the learning unit may have screen transitions that include at least one of: a screen for collecting and preparing data; a screen for selecting, building, and running machine learning models; a screen for comparing the performance of the learning models; and a screen for finalizing the selection of a machine learning model and introducing it into actual business. According to this aspect, information is exchanged with the user through multiple screens at each step, from preparing and uploading data to comparing performance across multiple ML services and introducing a model into actual business. Because these screens form screen transitions that include a screen for collecting and preparing data (preprocessing, upload, and the like), a screen for selecting, building, and running machine learning models (model building, ML execution), a screen for comparing the performance of the learning models, and a screen for finalizing the selection of a machine learning model and introducing it into actual business, the learning process can be designed within the screen transition definition. This fifteenth aspect can also be applied in combination with the first aspect.

Further, in order to solve the above problems, a program according to a sixteenth aspect causes a computer to function, without the intervention of an expert, as: a data processing and conversion unit that converts collected learning data to match each machine learning service or machine learning model; a data cleansing unit that handles missing data and duplicate and unnecessary data; a feature extraction unit that extracts features from the original data; a data joining and splitting unit that applies preprocessing including statistical processing to convert the data into data suited to the purpose and performs data transformation including data extraction and data joining using queries; a normalization and standardization unit that normalizes and standardizes data; a service and model selection unit that selects multiple machine learning services and machine learning models; a no-code development unit for building machine learning models; a simulation unit that runs multiple machine learning services and machine learning models in parallel; a model evaluation unit that displays and compares results; a model selection unit that selects the most suitable machine learning service or machine learning model; a deployment and operation unit that deploys the selected model to actual business and operates it; and a support unit that supports the functions of each of the above units.

According to the above aspect, the computer can function, without the intervention of an expert, as each of the following: the data processing and conversion unit that converts collected learning data to match each machine learning service or machine learning model; the data cleansing unit that handles missing data and duplicate and unnecessary data; the feature extraction unit that extracts features from the original data; the data joining and splitting unit that applies preprocessing including statistical processing to convert the data into data suited to the purpose and performs data transformation including data extraction and data joining using queries; the normalization and standardization unit that normalizes and standardizes data; the service and model selection unit that selects multiple machine learning services and machine learning models; the no-code development unit that builds an original machine learning model without programming; the simulation unit that runs multiple machine learning services and machine learning models in parallel; the model evaluation unit that displays and compares results; the model selection unit that selects the most suitable machine learning service or machine learning model; the deployment and operation unit that deploys the selected model to actual business and operates it; and the support unit that supports this series of units.

A seventeenth aspect can be realized as a recording medium storing the program according to the sixteenth aspect.

Finally, although various embodiments of the present disclosure have been described, they are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, substitutions, and changes can be made without departing from the gist of the invention. The embodiments and their modifications are included in the scope and gist of the invention, and in the invention described in the claims and its equivalents.

1: information processing system
2: information processing device
3: user terminal
4: reception window
5: data window
6: conversion processing window
6a: first display mode
6b: second display mode
7: model information display window
8: model search window
9: model comparison window
20: communication bus
21: communication unit
22: storage unit
23: control unit
30: communication bus
31: communication unit
32: storage unit
33: control unit
34: display unit
35: input unit
41: input data reception area
42: learner selection area
43: analysis method selection area
44: reception operation display area
50: variable name display area
51: aggregate graph display area
52: aggregate information display area
53: individual information display area
61: first input data information display area
62: generation condition display area
63: automatic conversion processing display area
64: processing condition display area
65: first processing execution button
66: manual conversion transition button
67: processing save button
71: second input data information display area
72: model information display area
73: simulation execution button
81: search condition input area
82: search result display area
83: search window close button
91: comparison model display area
92: parameter selection area
93: comparison result display area
94: simulation execution button
231: data reception unit
232: learner identification unit
233: learner selection reception unit
234: analysis method selection reception unit
235: data processing unit
236: model display unit
237: processing display unit
238: processing condition display unit
411: import button
412: data name display area
421: prediction target selection area
422: learner display area
423: learner selection display area
424: first reception operation button
431: analysis method selection button
432: model name display area
433: second reception operation button
511: common point
512: difference point
661: manual conversion processing designation area
662: manual conversion processing save area
721: accuracy information display area
722: contribution display area
723: increase contribution display area
724: decrease contribution display area
725: contribution list display button
A001: activity
A002: activity
A003: activity
A004: activity
A005: activity
A006: activity
A007: activity
A008: activity
A009: activity
A010: activity
A011: activity
D0: external data
D1: first input data
D2: second input data
DB1: database
IF1: model information
L1: indicator
M1: learning model
M2: designated learning model
ML: learner
x1: input
y1: output

Claims

An information processing system comprising a control unit, wherein
the control unit is configured to execute the following steps:
in a data reception step, receiving input of first input data;
in a learner identification step, identifying a plurality of learners according to the received first input data; and
in a model display step, causing model information on the learning models generated by the identified learners based on the first input data to be displayed in a manner that allows comparison for each learning model.

The information processing system according to claim 1, wherein
the model information includes at least accuracy information on the prediction accuracy of the learning model, and
in the model display step, the accuracy information for each generated learning model is displayed in a comparable manner.

The information processing system according to claim 1, wherein
further, in a learner selection reception step, a selection from among the identified learners is received from a user, and
in the model display step, the model information generated using the learner selected by the selection, from among the identified learners, is displayed.

The information processing system according to claim 1, wherein
further, in a data processing step, a conversion process is executed that converts the input first input data into second input data in a form that can be input to the identified learners, and
the model information relates to the learning model generated by the learner using the second input data.

The information processing system according to claim 4, wherein
further, in a processing display step, differences between the first input data and the second input data are displayed in a recognizable manner.

The information processing system according to claim 5, wherein
further, in a processing condition display step, the conditions under which the conversion process is performed are displayed in a recognizable manner, based on at least the input first input data and the identified learners.

The information processing system according to claim 1, wherein
further, in an analysis method selection reception step, a selection of at least one analysis method to be used for generating the learning model, from among a plurality of analysis methods, is received.

The information processing system according to claim 7, wherein
the analysis methods include at least one of classification analysis, regression analysis, and time series analysis.

The information processing system according to claim 1, wherein
the first input data includes at least data held by a user.

The information processing system according to claim 1, wherein
the first input data includes at least structured data.

An information processing method
comprising each step of the information processing system according to any one of claims 1 to 10.

An information processing program
that causes a computer to execute each step of the information processing system according to any one of claims 1 to 10.