JP7278716B2

JP7278716B2 - Adjustment device, adjustment method and adjustment program

Info

Publication number: JP7278716B2
Application number: JP2018096575A
Authority: JP
Inventors: 直行角田; 晃平菅原
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2018-05-18
Filing date: 2018-05-18
Publication date: 2023-05-22
Anticipated expiration: 2038-05-18
Also published as: JP2019200736A

Description

本発明は、調整装置、調整方法および調整プログラムに関する。 The present invention relates to an adjusting device, an adjusting method and an adjusting program.

従来、複数の機能ブロックを有し、製造後に利用者が機能ブロックの構成を設定できるプログラマブルロジックデバイス（ＰＬＤ）の技術が知られている。このようなＰＬＤが有する構成の設定を容易にするため、高レベル言語からＰＬＤが有する機能ブロックの構成をコンパイルする技術が知られている。 2. Description of the Related Art Conventionally, a programmable logic device (PLD) technology is known that has a plurality of functional blocks and allows a user to set the configuration of the functional blocks after manufacturing. In order to facilitate the setting of the configuration of such PLD, a technology is known that compiles the configuration of functional blocks of PLD from a high-level language.

特開２０１３－１６５４９０号公報JP 2013-165490 A

“Assignment Decision Diagram for High-Level Synthesis”，Viraphol Chaiyakul, Daniel D. Gajski ＜インターネット＞http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.9420&rep=rep1&type=pdf（平成３０年５月１日検索）“Assignment Decision Diagram for High-Level Synthesis,” Viraphol Chaiyakul, Daniel D. Gajski <Internet> http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.9420&rep=rep1&type=pdf Searched on May 1, 2018) “A deeper look into the LLVM code generator, Part 1, Eli Bendersky”＜インターネット＞https://eli.thegreenplace.net/2013/02/25/a-deeper-look-into-the-llvm-code-generator-part-1（平成３０年５月１日検索）“A deeper look into the LLVM code generator, Part 1, Eli Bendersky” <Internet> https://eli.thegreenplace.net/2013/02/25/a-deeper-look-into-the-llvm-code-generator -part-1 (searched on May 1, 2018) “QuickDough: A Rapid FPGA Accelerator Generation Framework using Soft Coarse-Grained Reconfigurable Array Overlay” Hayden Kwok-Hay So, ＜インターネット＞https://www.eee.hku.hk/~hso/olaf2013/so_olaf.pdf（平成３０年５月１日検索）“QuickDough: A Rapid FPGA Accelerator Generation Framework using Soft Coarse-Grained Reconfigurable Array Overlay” Hayden Kwok-Hay So, <Internet> https://www.eee.hku.hk/~hso/olaf2013/so_olaf.pdf (Heisei 30 (searched on May 1, 2015) “パーシステントホモロジーと機械学習” 平岡裕章＜インターネット＞http://ibisml.org/archive/ibis2016/Hiraoka_IBIS2016.pdf（平成３０年５月１日検索）“Persistent Homology and Machine Learning” Hiroaki Hiraoka <Internet> http://ibisml.org/archive/ibis2016/Hiraoka_IBIS2016.pdf (searched May 1, 2018) “グラフ分解と固有値問題”，川本達郎, 日本神経回路学会誌Ｖｏｌ．２１，Ｎｏ．４（２０１４），１６２－１６９＜インターネット＞https://www.jstage.jst.go.jp/article/jnns/21/4/21_162/_pdf（平成３０年５月１日検索）“Graph Decomposition and Eigenvalue Problem”, Tatsuro Kawamoto, Journal of Japanese Neural Network Society Vol. 21, No. 4 (2014), 162-169 <Internet> https://www.jstage.jst.go.jp/article/jnns/21/4/21_162/_pdf (searched May 1, 2018)

しかしながら、上述した技術では、ＰＬＤにおける処理の効率を改善する余地がある。 However, the techniques described above leave room for improving the efficiency of processing in PLDs.

例えば、上述した技術は、ＰＬＤに所定の処理を実行させるための構成をコンパイルしているに過ぎず、ＰＬＤが有する機能を効率的に利用する構成を実現しているとは言えない場合がある。 For example, the above-described technology merely compiles a configuration for causing the PLD to execute a predetermined process, and may not realize a configuration that efficiently uses the functions of the PLD. .

本願は、上記に鑑みてなされたものであって、ＰＬＤにおける処理の効率を改善することを目的とする。 The present application has been made in view of the above, and aims to improve the efficiency of processing in PLDs.

本願に係る調整装置は、処理に用いる論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、前記演算装置に所定の処理を実行させる機械語を生成するコンパイラが、前記機械語を生成するための前記所定の処理の内容から生成する第２グラフとを取得する取得部と、前記取得部により取得された前記第１グラフの構造と前記第２グラフの構造、または、グラフ構造のパターンが類似するように、前記コンパイラ若しくは前記演算装置が発揮する機能を調整する調整部とを有することを特徴とする。 The adjustment device according to the present application includes a plurality of first graphs showing the contents of functions exhibited by each logic circuit of an arithmetic device capable of changing the combination of logic circuits used for processing, and a machine that causes the arithmetic device to execute a predetermined process. A compiler for generating a word acquires a second graph generated from the content of the predetermined processing for generating the machine language; an acquisition unit for acquiring a structure of the first graph acquired by the acquisition unit; It is characterized by comprising an adjustment unit that adjusts the function exhibited by the compiler or the arithmetic unit so that the structure of the second graph or the pattern of the graph structure is similar.

実施形態の一態様によれば、ＰＬＤにおける処理の効率を改善することができる。 According to one aspect of an embodiment, efficiency of processing in a PLD can be improved.

図１は、実施形態に係る調整装置が実行する処理の一例を示す図である。FIG. 1 is a diagram illustrating an example of processing executed by an adjustment device according to an embodiment; 図２は、実施形態に係る調整装置の構成例を示す図である。FIG. 2 is a diagram illustrating a configuration example of the adjustment device according to the embodiment; 図３は、実施形態に係るＤＡＧデータベースに登録される情報の一例を示す図である。FIG. 3 is a diagram illustrating an example of information registered in a DAG database according to the embodiment; 図４は、実施形態に係る機能分解グラフデータベースに登録される情報の一例を示す図である。FIG. 4 is a diagram illustrating an example of information registered in a functional decomposition graph database according to the embodiment; 図５は、実施形態に係る調整処理の流れの一例を説明するフローチャートである。FIG. 5 is a flowchart illustrating an example of the flow of adjustment processing according to the embodiment. 図６は、ハードウェア構成の一例を示す図である。FIG. 6 is a diagram illustrating an example of a hardware configuration;

以下に、本願に係る調整装置、調整方法および調整プログラムを実施するための形態（以下、「実施形態」と記載する。）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る調整装置、調整方法および調整プログラムが限定されるものではない。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 EMBODIMENT OF THE INVENTION Below, the form (it describes as "embodiment" hereafter.) for implementing the adjusting device, the adjusting method, and the adjusting program concerning this application is demonstrated in detail, referring drawings. Note that the adjusting device, adjusting method, and adjusting program according to the present application are not limited to this embodiment. Also, in each of the following embodiments, the same parts are denoted by the same reference numerals, and overlapping descriptions are omitted.

［実施形態］
〔１－１．演算装置の一例〕
まず、図１を用いて、調整装置が実行する調整処理の一例について説明する。なお、以下の説明では、調整処理により、所定の処理を示すコードを、所定の処理を演算装置１００に実行させるための機械語へとコンパイルするコンパイラの調整を行う処理の一例について説明する。 [Embodiment]
[1-1. Example of computing device]
First, an example of adjustment processing executed by the adjustment device will be described with reference to FIG. In the following description, an example of a process of adjusting a compiler that compiles a code indicating a predetermined process into a machine language for causing the arithmetic device 100 to execute the predetermined process will be described.

図１は、実施形態に係る調整装置が実行する処理の一例を示す図である。図１では、調整装置１０は、以下に説明する調整処理を実行する情報処理装置であり、例えば、サーバ装置やクラウドシステム等により実現される。 FIG. 1 is a diagram illustrating an example of processing executed by an adjustment device according to an embodiment; In FIG. 1, the adjustment device 10 is an information processing device that executes adjustment processing described below, and is realized by, for example, a server device, a cloud system, or the like.

演算装置１００は、以下に説明する調整処理を実行する演算装置である。ここで、演算装置１００は、製造後に利用者が内部の論理回路を定義あるいは変更することができる集積回路であり、所謂ＰＬＤ（Programmable Logic Device）である。より具体的には、演算装置１００は、ＦＰＧＡ（Field-Programmable Gate Array）により実現される。また、演算装置１００は、調整装置１０および主記憶装置２００と接続されている。 The arithmetic device 100 is an arithmetic device that executes adjustment processing described below. Here, the arithmetic device 100 is an integrated circuit in which a user can define or change an internal logic circuit after manufacturing, and is a so-called PLD (Programmable Logic Device). More specifically, arithmetic device 100 is implemented by an FPGA (Field-Programmable Gate Array). Arithmetic device 100 is also connected to adjustment device 10 and main storage device 200 .

主記憶装置２００は、各種データを記憶する記憶装置であり、例えば、ＲＡＭ（Random Access Memory)、フラッシュメモリ（Flash Memory）等の半導体メモリ素子等の記憶装置によって実現される。 The main storage device 200 is a storage device that stores various data, and is realized by a storage device such as a semiconductor memory device such as a RAM (Random Access Memory) and a flash memory.

例えば、演算装置１００は、プロセッサ１１０、入出力装置１２０、メモリコントローラ１３０、ＦＰＧＡ１５０およびＦＰＧＡ１５０を有する。プロセッサ１１０は、演算装置１００が有するプロセッサであり、例えば、ＡＲＭアーキテクチャやＰＯＷＥＲアーキテクチャを採用したプロセッサ若しくはマイクロプロセッサである。そして、プロセッサ１１０は、ＦＰＧＡと連携することで、各種の演算処理を実行する。 For example, computing device 100 has processor 110 , input/output device 120 , memory controller 130 , FPGA 150 and FPGA 150 . The processor 110 is a processor included in the computing device 100, and is, for example, a processor or microprocessor that employs the ARM architecture or the POWER architecture. And the processor 110 performs various arithmetic processing by cooperating with FPGA.

例えば、プロセッサ１１０は、プロセッサコア１１１とキャッシュメモリ１１２とを有する。プロセッサコア１１１は、論理演算や四則演算を実現する所謂コアであり、ＡＬＵ（Arithmetic Logic Unit）から構成される算術論理演算装置により実現される。キャッシュメモリ１１２は、プロセッサ１１０が有する補助記憶装置である。より具体的には、キャッシュメモリ１１２は、主記憶装置２００よりもプロセッサコア１１１が高速にアクセスすることができる記憶装置であり、所謂キャッシュメモリである。 For example, processor 110 has processor core 111 and cache memory 112 . The processor core 111 is a so-called core that implements logic operations and four arithmetic operations, and is implemented by an arithmetic logic unit configured from an ALU (Arithmetic Logic Unit). A cache memory 112 is an auxiliary storage device that the processor 110 has. More specifically, the cache memory 112 is a storage device that can be accessed by the processor core 111 at a higher speed than the main storage device 200, and is a so-called cache memory.

入出力装置１２０は、演算装置１００と調整装置１０といった任意の外部装置との間の通信を中継する装置であり、所謂Ｉ／Ｏ（Input Output）装置である。例えば、入出力装置１２０は、ＵＳＢ（Universal Serial Bus）、イーサーネット、ＳＤ（Secure Digital）、ＵＡＲＴ（Universal Asynchronous Receiver/Transmitter）、ＳＰＩ（Serial Peripheral Interface）、Ｉ２Ｃ、ＧＰＩＯ（General-purpose input/output）等、各種の通信規格に沿って外部装置との間の通信を制御する各種の入出力装置により実現される。 The input/output device 120 is a device that relays communication between the arithmetic device 100 and any external device such as the adjustment device 10, and is a so-called I/O (Input Output) device. For example, the input/output device 120 includes a USB (Universal Serial Bus), Ethernet, SD (Secure Digital), UART (Universal Asynchronous Receiver/Transmitter), SPI (Serial Peripheral Interface), I2C, GPIO (General-purpose input/output ), etc., are implemented by various input/output devices that control communication with external devices according to various communication standards.

メモリコントローラ１３０は、演算装置１００による主記憶装置２００へのメモリアクセスを制御する。例えば、メモリコントローラ１３０は、ページング方式により主記憶装置２００に格納されたデータの読み出しや書込みを行う。より具体的な例を挙げると、メモリコントローラ１３０は、プロセッサ１１０やＦＰＧＡ１５０が処理を実行する際に用いる仮想ページ番号と、主記憶装置２００が有する記憶領域を示す物理ページ番号とを対応付けたページテーブルを有し、ページテーブルを参照しながら、プロセッサ１１０やＦＰＧＡ１５０が処理に用いる情報を主記憶装置２００から読み出したり、主記憶装置２００に対する情報の書込みを実現する。 The memory controller 130 controls memory access to the main storage device 200 by the arithmetic device 100 . For example, the memory controller 130 reads and writes data stored in the main memory device 200 by paging. To give a more specific example, the memory controller 130 creates a page in which a virtual page number used when the processor 110 or FPGA 150 executes processing and a physical page number indicating a storage area of the main memory device 200 are associated with each other. It has a table, and reads information used for processing by the processor 110 and FPGA 150 from the main storage device 200 and writes information to the main storage device 200 while referring to the page table.

例えば、図１に示す構成を有する場合、プロセッサコア１１１は、キャッシュメモリ１１２にアクセスし、演算対象となるデータの取得を試行する。そして、プロセッサコア１１１は、データがキャッシュメモリ１１２に格納されていない場合（すなわち、キャッシュミス）は、メモリコントローラ１３０に対して、演算対象となるデータが格納されている記憶領域のアドレスを通知する。より具体的には、プロセッサコア１１１は、演算処理に用いる仮想ページ番号をメモリコントローラ１３０に通知する。 For example, in the configuration shown in FIG. 1, the processor core 111 accesses the cache memory 112 and tries to acquire data to be operated. Then, if the data is not stored in the cache memory 112 (that is, a cache miss), the processor core 111 notifies the memory controller 130 of the address of the storage area where the data to be operated is stored. . More specifically, the processor core 111 notifies the memory controller 130 of the virtual page number used for arithmetic processing.

このような場合、メモリコントローラ１３０は、ページテーブルを参照し、アクセス対象となる物理ページ番号を特定する。そして、メモリコントローラ１３０は、主記憶装置２００が有する記憶領域のうち、特定した物理ページ番号が示す記憶領域に格納されたデータを読出す。その後、メモリコントローラ１３０は、読み出したデータと仮想ページ番号とを対応付けてキャッシュメモリ１１２に登録する。 In such a case, the memory controller 130 refers to the page table and identifies the physical page number to be accessed. Memory controller 130 then reads the data stored in the storage area indicated by the identified physical page number among the storage areas of main memory device 200 . After that, the memory controller 130 associates the read data with the virtual page number and registers them in the cache memory 112 .

また、メモリコントローラ１３０は、キャッシュメモリ１１２に登録されたデータを主記憶装置２００に書き戻す場合（ライトバック）は、ページテーブルを参照し、書き戻しの対象となる情報と対応付けられた仮想ページ番号と対応する物理ページ番号を特定する。そして、メモリコントローラ１３０は、主記憶装置２００の記憶領域のうち、特定した物理ページ番号が示す記憶領域に書き戻しの対象となる情報を格納する。 In addition, when writing back data registered in the cache memory 112 to the main storage device 200 (write back), the memory controller 130 refers to the page table and stores the virtual page associated with the information to be written back. Identify the number and the corresponding physical page number. Then, the memory controller 130 stores the information to be written back in the storage area indicated by the specified physical page number among the storage areas of the main memory device 200 .

ＦＰＧＡ１５０は、調整装置１０から提供される機械語に従って、回路構成を変更可能な演算装置である。例えば、ＦＰＧＡ１５０は、デジタル回路要素とアナログ回路要素とを含む。例えば、ＦＰＧＡ１５０には、所定の処理を実行可能な論理コンポーネントである複数の論理ブロックを有し、論理ブロック間が再構成可能な配線により相互接続されている。そして、ＦＰＧＡ１５０は、調整装置１０から提供されるハードウェア記述言語（ＨＤＬ：Hardware Description language）により、論理コンポーネント間の接続を変更することで、各種の処理をハードウェアにより実現することができる。すなわち、ＦＰＧＡ１５０は、処理に用いる論理回路の組み合わせを変更可能な演算装置として動作する。 The FPGA 150 is an arithmetic device whose circuit configuration can be changed according to the machine language provided by the adjusting device 10 . For example, FPGA 150 includes digital circuitry and analog circuitry. For example, the FPGA 150 has a plurality of logic blocks, which are logic components capable of executing predetermined processing, and the logic blocks are interconnected by reconfigurable wiring. The FPGA 150 can implement various types of processing by hardware by changing connections between logical components using a hardware description language (HDL) provided by the adjustment device 10 . That is, the FPGA 150 operates as an arithmetic device capable of changing the combination of logic circuits used for processing.

〔１－２．調整処理の一例〕
ここで、ＦＰＧＡ１５０が有する各論理コンポーネントは、入力された情報に対して様々な処理を実行することができる。このような論理コンポーネントが実行可能な処理は、例えば、入力に対する出力の真理値表から生成される論理式を構成する回路により表すことができる。 [1-2. Example of adjustment processing]
Here, each logic component of the FPGA 150 can execute various processes on input information. The processing that such logic components can perform can be represented, for example, by a circuit that constructs a logic equation generated from a truth table of outputs with respect to inputs.

一方、このようなＦＰＧＡ１５０に所定の処理を実行させるため、処理を記述したコードからＦＰＧＡ１５０に処理を実行させるためのＨＤＬを生成するコンパイラが知られている。このようなコンパイラは、例えば、仮想機械をターゲットとした中間コード（ビットコード）を生成し、その仮想機械向けコードを特定のマシンの機械語に変換するＬＬＶＭ（低レベル仮想機械）を用いて構成される（例えば、非特許文献３参照）。このようなＬＬＶＭにより構成されるコンパイラは、処理を記述したコードから処理のスケジューリングを行う。例えば、コンパイラは、処理を記述したコードから処理の流れを示すＤＡＧ（Directed Acyclic Graph）を生成し、生成したＤＡＧを用いてスケジューリングを行う。 On the other hand, in order to cause the FPGA 150 to execute a predetermined process, a compiler is known that generates HDL for causing the FPGA 150 to execute the process from a code describing the process. Such a compiler, for example, is configured using LLVM (low-level virtual machine) that generates intermediate code (bitcode) targeting a virtual machine and converts the code for the virtual machine into machine language for a specific machine. (See, for example, Non-Patent Document 3). A compiler configured with such LLVM schedules processing from a code describing the processing. For example, the compiler generates a DAG (Directed Acyclic Graph) indicating the flow of processing from code describing processing, and performs scheduling using the generated DAG.

ここで、ＤＡＧが有する構造と、ＦＰＧＡが有する各論理コンポーネントが発揮可能な機能の構造とが類似するように、コンパイラにコードをコンパイルさせた場合は、ＦＰＧＡ１５０の構成を処理に対して最適化することができるとも考えられる。より具体的には、所定の処理を実行する際にＦＰＧＡが構成する論理コンポーネントの機能の構造と、ＤＡＧが有する構造とが類似する場合は、ＨＤＬの最適化を実現できるとも考えられる。そこで、調整装置１０は、以下の調整処理を実行する。 Here, if the compiler compiles the code so that the structure of the DAG and the structure of the functions that can be exhibited by each logic component of the FPGA are similar, the configuration of the FPGA 150 is optimized for processing. It is also conceivable that More specifically, if the structure of the functions of the logic components that the FPGA configures when executing a given process is similar to the structure that the DAG has, it is conceivable that the HDL can be optimized. Therefore, the adjustment device 10 executes the following adjustment processing.

まず、調整装置１０は、処理に用いる論理回路の組み合わせを変更可能な演算装置、すなわち、ＦＰＧＡ１５０の各論理回路が発揮する機能の内容を示す複数の第１グラフを取得する。例えば、調整装置１０は、所定の処理を実行する際に、ＦＰＧＡ１５０上に構成されうる論理モジュールの構造を示す第１グラフを取得する。例えば、ＦＰＧＡ１５０は、接続を変更可能な複数の論理回路を有する。そこで、調整装置１０は、ＦＰＧＡ１５０が所定の処理を実行する際に構成する１つまたは複数の機能を示す複数の第１グラフを生成する。この第１グラフは、グラフ構造だけではなく、顔認識処理、音声認識処理、言語処理、行列演算のような数学演算処理等のようなパッケージ化されたＦＰＧＡ回路を表すグラフ構造のパターンでもよい。 First, the adjustment device 10 acquires a plurality of first graphs showing the content of the function exhibited by each logic circuit of the arithmetic device, that is, the FPGA 150, which can change the combination of logic circuits used for processing. For example, the coordinator 10 acquires a first graph showing the structure of logic modules that can be configured on the FPGA 150 when executing predetermined processing. For example, the FPGA 150 has multiple logic circuits that can change connections. Therefore, the adjustment device 10 generates a plurality of first graphs showing one or more functions configured when the FPGA 150 executes predetermined processing. This first graph may be not only a graph structure, but also a graph structure pattern representing a packaged FPGA circuit such as face recognition processing, voice recognition processing, language processing, mathematical operations such as matrix operations, and the like.

例えば、調整装置１０は、所定の処理を実行するために第１処理～第５処理が必要となる場合、各処理のそれぞれをＦＰＧＡ１５０が実現するために、ＦＰＧＡ１５０が１つまたは複数の論理回路を用いて実現する処理の論理を処理ごとに特定する。そして、調整装置１０は、特定した各論理を示すグラフを第１グラフとして生成する。 For example, when the adjustment device 10 requires the first to fifth processes to execute a predetermined process, the FPGA 150 implements one or more logic circuits so that the FPGA 150 implements each process. The logic of the processing to be implemented by using is specified for each processing. Then, the adjustment device 10 generates a graph indicating each identified logic as a first graph.

また、調整装置１０は、ＦＰＧＡ１５０に所定の処理を実行させる機械語を生成するコンパイラが、機械語を生成するための所定の処理の内容から生成する第２グラフを取得する。すなわち、調整装置１０は、コンパイラが生成するＤＡＧを第２グラフとして取得する。 Further, the adjustment device 10 acquires a second graph generated by a compiler that generates a machine language for causing the FPGA 150 to execute a predetermined process from the content of the predetermined process for generating the machine language. That is, the adjustment device 10 acquires the DAG generated by the compiler as the second graph.

そして、調整装置１０は、第１グラフの構造と第２グラフの構造とが類似するように、コンパイラ、若しくは、ＦＰＧＡ１５０が発揮する機能を調整する。例えば、調整装置１０は、コードからコンパイラが生成するＤＡＧの構造が、ＦＰＧＡ１５０の機能を示す第１グラフの構造に近づくように、コンパイラを再構成する。このようなコンパイラの再構成は、例えば、コンパイラを構成するプログラムモジュールのうち、ＤＡＧを生成するためのプログラムモジュールを修正することで実現される。すなわち、調整装置１０は、ＦＰＧＡ１５０用のコンパイラコンパイラとしての処理を実行する際に、第１グラフの構造と第２グラフの構造とが類似するように、コンパイラを構成する。 Then, the adjustment device 10 adjusts the functions exhibited by the compiler or the FPGA 150 so that the structure of the first graph and the structure of the second graph are similar. For example, the adjustment device 10 reconfigures the compiler so that the structure of the DAG generated by the compiler from the code approaches the structure of the first graph representing the function of the FPGA 150 . Such reconfiguration of the compiler is realized, for example, by correcting a program module for generating a DAG among the program modules that make up the compiler. That is, the adjustment device 10 configures the compiler so that the structure of the first graph and the structure of the second graph are similar when executing processing as a compiler for the FPGA 150 .

例えば、コンパイラがＬＬＶＭにより実現される場合、コンパイラは、ＩＲ（Intermediate Representation）ビルダを用いてコードから中間表現を生成し、オプティマイザを用いて中間表現からＨＤＬを生成する。ここで、オプティマイザは、中間表現からＨＤＬを生成する際に、ＤＡＧを生成し、生成したＤＡＧを用いてＨＤＬを生成する（例えば、非特許文献２参照）。そこで、調整装置１０は、中間表現から生成されるＤＡＧの構造が第１グラフに近づくように、ＩＲビルダやオプティマイザの論理を修正すればよい。 For example, when the compiler is implemented by LLVM, the compiler uses an IR (Intermediate Representation) builder to generate an intermediate representation from the code, and an optimizer to generate HDL from the intermediate representation. Here, the optimizer generates a DAG when generating HDL from the intermediate representation, and generates HDL using the generated DAG (see Non-Patent Document 2, for example). Therefore, the adjustment device 10 should modify the logic of the IR builder and optimizer so that the structure of the DAG generated from the intermediate representation approaches the first graph.

このような処理の結果、調整装置１０は、ＦＰＧＡ１５０が有する機能の構造に類似する構造を有するＤＡＧを用いて、コードをＨＤＬにコンパイルするコンパイラを生成することができる。このようなコンパイラにより生成されたＨＤＬは、ＦＰＧＡ１５０が有する論理コンポーネントの効率的な構成を実現することができる。この結果、調整装置１０は、ＦＰＧＡ１５０等、ＰＬＤにおける処理の効率を改善することができる。 As a result of such processing, the coordinator 10 can generate a compiler that compiles the code into HDL using a DAG having a structure similar to that of the functions of the FPGA 150 . HDL generated by such a compiler can implement efficient configuration of the logic components of the FPGA 150 . As a result, the adjustment device 10 can improve the efficiency of processing in PLDs such as the FPGA 150 .

〔１－３．グラフの比較について〕
ここで、調整装置１０は、第１グラフの構造と第２グラフの構造とが類似するように、コンパイラを調整するのであれば、任意の手法により、第１グラフの構造と第２グラフの構造とを比較してよい。例えば、調整装置１０は、グラフが有するトポロジを比較する各種の手法を用いて、第１グラフの構造と第２グラフの構造とを比較し、比較結果に基づいて、第１グラフの構造と第２グラフの構造、または、グラフ構造のパターンが類似するように、コンパイラを調整すればよい。 [1-3. About graph comparison]
Here, as long as the compiler is adjusted so that the structure of the first graph and the structure of the second graph are similar, the adjustment device 10 uses any method to adjust the structure of the first graph and the structure of the second graph. can be compared with For example, the adjustment device 10 compares the structure of the first graph with the structure of the second graph using various techniques for comparing the topologies of the graphs, and based on the comparison result, compares the structure of the first graph with the structure of the second graph. The compiler should be adjusted so that the structures of the two graphs or the patterns of the graph structures are similar.

例えば、複数のノードの集合で示される情報が有する構造を示す手法として、パーシステントホモロジーの技術が知られている。このようなパーシステントホモロジーの技術においては、所定のｎ次元空間上に配置された各ノードの大きさを変化させ、生じた穴の数、穴の大きさ、消滅した穴の数を特定する。そして、このような技術においては、特定した情報から穴の発生パラメータと穴の消滅パラメータとを特定し、特定した発生パラメータと消滅パラメータとから生成されるパーシステント図を、ノードの集合の構造が有する特徴として用いる（例えば、特許文献４参照）。 For example, a technique of persistent homology is known as a technique for indicating the structure of information indicated by a set of multiple nodes. In such a technique of persistent homology, the size of each node arranged on a predetermined n-dimensional space is changed to specify the number of holes created, the size of the holes, and the number of holes eliminated. In such a technique, a hole generation parameter and a hole disappearance parameter are specified from the specified information, and a persistent diagram generated from the specified hole occurrence parameter and hole disappearance parameter is generated based on the structure of the set of nodes. It is used as a feature to have (for example, see Patent Document 4).

ここで、第１グラフから生成したパーシステント図と、第２グラフから生成したパーシステント図とが類似する場合（例えば、パーシステント図に現れる線形関数の傾きが類似する場合）は、第１グラフの構造と第２グラフの構造とが類似すると推定される。そこで、調整装置１０は、第１グラフから生成したパーシステント図と、第２グラフから生成したパーシステント図とを生成し、生成したパーシステント図の比較結果に基づいて、第１グラフと第２グラフとが類似するか否かを判定してもよい。また、調整装置１０は、第１グラフから生成したパーシステント図と、第２グラフから生成したパーシステント図とが類似するように、コンパイラの調整を行ってもよい。 Here, when the persistent diagram generated from the first graph and the persistent diagram generated from the second graph are similar (for example, when the slopes of the linear functions appearing in the persistent diagram are similar), the first graph is assumed to be similar to the structure of the second graph. Therefore, the adjustment device 10 generates a persistence diagram generated from the first graph and a persistence diagram generated from the second graph, and compares the generated persistence diagrams with the first graph and the second graph. It may be determined whether or not the graphs are similar. Further, the adjustment device 10 may adjust the compiler so that the persistent diagram generated from the first graph and the persistent diagram generated from the second graph are similar.

また、上述した処理に係わらず、調整装置１０は、他の手法を用いて、第１グラフの構造と第２グラフの構造とが類似するかを判定して良い。例えば、調整装置１０は、第１グラフの位相と第２グラフの位相とを比較し、各位相が類似するようにコンパイラの調整を行ってもよい。 Moreover, regardless of the above-described processing, the adjustment device 10 may use another technique to determine whether the structure of the first graph and the structure of the second graph are similar. For example, the adjustment device 10 may compare the phase of the first graph and the phase of the second graph and adjust the compiler so that the phases are similar.

また、調整装置１０は、第２グラフを複数の分解グラフに分解し、分解した分解グラフと第１グラフの構造とが類似するように、コンパイラの調整を行ってもよい。例えば、調整装置１０は、ＦＰＧＡ１５０が発揮可能な機能を示す第１グラフであって、それぞれ異なる機能を示す複数の第１グラフを取得する。また、調整装置１０は、グラフラプラシアンに基づいて、第２グラフを複数の分解グラフに分解する。そして、調整装置１０は、複数の分解グラフの構造と、複数の第１グラフの構造とが類似するように、コンパイラを調整してもよい。 Further, the adjustment device 10 may decompose the second graph into a plurality of decomposed graphs, and adjust the compiler so that the decomposed decomposed graph and the structure of the first graph are similar. For example, the adjusting device 10 acquires a plurality of first graphs showing functions that the FPGA 150 can exhibit, each showing a different function. Also, the adjustment device 10 decomposes the second graph into a plurality of decomposed graphs based on the graph Laplacian. Then, the adjustment device 10 may adjust the compiler so that the structures of the multiple decomposition graphs and the structures of the multiple first graphs are similar.

例えば、適切にグラフの分解を行うため、分解グラフ間のエッジの数がなるべく小さくなり、かつ、各分解グラフの大きさ（ノードの数）がなるべく等しくなるように、グラフを複数の分解グラフに分解するための様々な手法が提案されている。このような分解グラフに分解するための手法として、グラフ全体を示す隣接行列を生成し、生成した隣接行列に基づいたグラフラプラシアンを設定し、グラフラプラシアンに基づいて、グラフを分解するエッジを決定するスペクトラルクラスタリングの手法が知られている（例えば、非特許文献５参照）。 For example, in order to properly decompose a graph, divide the graph into multiple decomposed graphs so that the number of edges between decomposed graphs is as small as possible and the size (number of nodes) of each decomposed graph is as equal as possible. Various techniques have been proposed for decomposition. As a method for decomposing into such a decomposed graph, an adjacency matrix indicating the entire graph is generated, a graph Laplacian is set based on the generated adjacency matrix, and an edge for decomposing the graph is determined based on the graph Laplacian. A method of spectral clustering is known (see, for example, Non-Patent Document 5).

そこで、調整装置１０は、第２グラフのグラフラプラシアンに基づき、第２グラフを複数の分解グラフに分解する。また、調整装置１０は、ＦＰＧＡ１５０が発揮可能な各機能を示す複数の第１グラフを取得する。そして、調整装置１０は、複数の分解グラフと、複数の第１グラフとを比較し、各グラフの構造が類似するように、コンパイラの調整を行う。例えば、調整装置１０は、いずれかの第１グラフとの類似度が所定の範囲内に収まる分解グラフの数が最大化するように、コンパイラの調整を行ってもよい。また、調整装置１０は、各分解グラフと、各第１グラフとの類似度の和が最大化するように、コンパイラの調整を行ってもよい。すなわち、調整装置１０は、分解グラフの構造と第１グラフの構造との類似度を指標として、コンパイラの調整を実行する。 Therefore, the adjustment device 10 decomposes the second graph into a plurality of decomposed graphs based on the graph Laplacian of the second graph. Further, the adjustment device 10 acquires a plurality of first graphs showing each function that the FPGA 150 can exhibit. Then, the adjustment device 10 compares the multiple decomposed graphs with the multiple first graphs, and adjusts the compiler so that the structures of the graphs are similar. For example, the adjustment device 10 may adjust the compiler so as to maximize the number of decomposed graphs whose similarity to any first graph falls within a predetermined range. Further, the adjustment device 10 may adjust the compiler so that the sum of similarities between each decomposition graph and each first graph is maximized. That is, the adjustment device 10 adjusts the compiler using the degree of similarity between the structure of the decomposition graph and the structure of the first graph as an index.

なお、調整装置１０は、グラフ構造そのものの類似性のみならず、グラフ構造のパターンが類似するように、コンパイラやＦＰＧＡ１５０の調整を行ってもよい。例えば、調整装置１０は、各種のパターン解析技術を用いて、グラフ構造が有するパターンを特定し、特定したパターンが類似するように、コンパイラやＦＰＧＡ１５０の調整を行ってもよい。 Note that the adjustment device 10 may adjust the compiler and the FPGA 150 so that not only the similarity of the graph structures themselves but also the patterns of the graph structures are similar. For example, the adjustment device 10 may use various pattern analysis techniques to identify patterns that the graph structures have, and adjust the compiler and FPGA 150 so that the identified patterns are similar.

〔１－４．調整処理の一例について〕
次に、図１を用いて、調整装置１０が実行する調整処理の一例を説明する。まず、調整装置１０は、ＦＰＧＡ１５０が発揮可能な機能を示すグラフ（すなわち、第１グラフ）を機能分解グラフとして取得する。例えば、調整装置１０は、所定の処理を実行するためにＦＰＧＡ１５０が論理回路を用いて構成する各論理の内容を示す複数のグラフを機能分解グラフとして取得する。 [1-4. About an example of adjustment processing]
Next, an example of adjustment processing executed by the adjustment device 10 will be described with reference to FIG. First, the adjustment device 10 acquires a graph (that is, the first graph) indicating the functions that the FPGA 150 can exhibit as a functional decomposition graph. For example, the adjustment device 10 acquires, as functional decomposition graphs, a plurality of graphs showing the content of each logic configured by the FPGA 150 using logic circuits in order to execute predetermined processing.

また、調整装置１０は、所定の処理を示すソースコードからコンパイラが生成したＤＡＧを取得し、ＤＡＧの構造と機能分解グラフの構造とを比較する（ステップＳ２）。例えば、調整装置１０は、所定の処理を示すソースコードをＬＬＶＭに入力し、ＬＬＶＭが生成したＤＡＧを取得する。また、調整装置１０は、グラフラプラシアンに基づいて、ＤＡＧを複数の分解グラフＤＡＧ１～ＤＡＧ４に分解する。そして、調整装置１０は、分解グラフＤＡＧ１～ＤＡＧ４と、機能分解グラフのそれぞれとを比較し、類似するか否かを判定する。 Further, the adjustment device 10 acquires a DAG generated by the compiler from the source code indicating the predetermined processing, and compares the structure of the DAG with the structure of the functional decomposition graph (step S2). For example, the coordinator 10 inputs a source code indicating predetermined processing to LLVM, and obtains a DAG generated by LLVM. Further, the adjustment device 10 decomposes the DAG into a plurality of decomposed graphs DAG1 to DAG4 based on the graph Laplacian. Then, the adjustment device 10 compares the decomposition graphs DAG1 to DAG4 with each of the functional decomposition graphs, and determines whether or not they are similar.

そして、調整装置１０は、ＤＡＧと機能分解グラフとが相互に類似するように、コンパイラの調整を行う。例えば、調整装置１０は、機能分解グラフと類似するＤＡＧを生成するように、ＬＬＶＭが有するＩＲビルダやオプティマイザの調整を行う。そして、調整装置１０は、調整したコンパイラがソースコードから生成したＨＤＬを用いて、ＦＰＧＡ１５０の設定を行う（ステップＳ４）。 Then, the adjusting device 10 adjusts the compiler so that the DAG and the functional decomposition graph are similar to each other. For example, the adjustment device 10 adjusts the IR builder and optimizer of LLVM so as to generate a DAG similar to the functional decomposition graph. Then, the adjusting device 10 sets the FPGA 150 using the HDL generated from the source code by the adjusted compiler (step S4).

〔１－５．ＦＰＧＡ側の調整について〕
ここで、上述した説明では、調整装置１０は、コンパイラ側の調整を行うことで、ＦＰＧＡ１５０による処理の最適化を図った。しかしながら、実施形態は、これに限定されるものではない。例えば、調整装置１０は、所定の処理を実行する際にＦＰＧＡ１５０が発揮する機能を調整することで、処理の最適化を図ってもよい。 [1-5. Adjustment on the FPGA side]
Here, in the above description, the adjustment device 10 optimizes the processing by the FPGA 150 by adjusting the compiler. However, embodiments are not so limited. For example, the adjusting device 10 may optimize the processing by adjusting the functions exhibited by the FPGA 150 when executing the predetermined processing.

例えば、調整装置１０は、所定の処理を実現するためにＦＰＧＡ１５０に発揮させる機能の組み合わせを複数特定する。また、調整装置１０は、特定した組み合わせごとに、機能分解グラフとの構造とＤＡＧの構造とを比較する。そして、調整装置１０は、特定した組み合わせのうち、機能分解グラフの構造がＤＡＧの構造に最も近い機能分解グラフを特定する。その後、調整装置１０は、特定した機能分解グラフの組が示す機能を発揮するように、ＦＰＧＡ１５０の設定を行ってもよい。また、調整装置１０は、ＤＡＧの構造が、特定した組に含まれる機能分解グラフの構造と類似するように、コンパイラを調整してもよい。 For example, the adjusting device 10 identifies a plurality of combinations of functions to be exhibited by the FPGA 150 in order to implement predetermined processing. Further, the adjustment device 10 compares the structure of the functional decomposition graph and the structure of the DAG for each specified combination. Then, the adjustment device 10 identifies a functional decomposition graph whose structure is closest to the structure of the DAG from among the identified combinations. After that, the adjustment device 10 may set the FPGA 150 so as to exhibit the function indicated by the identified set of functional decomposition graphs. Also, the adjustment device 10 may adjust the compiler so that the structure of the DAG is similar to the structure of the functional decomposition graph included in the specified set.

すなわち、調整装置１０は、ＬＬＶＭの中間言語に合わせた最適化が行われるように、ＦＰＧＡ１５０の回路設計を調整し、ＦＰＧＡ１５０の回路設計に合わせた中間言語の最適化が行われるように、ＬＬＶＭの調整を行う。より具体的には、調整装置１０は、ＬＬＶＭがスケジューリングにより生成するＩｎｓｔｒｕｃｔｉｏｎ（中間表現）と、ＦＰＧＡ１５０が発揮する機能を示すＰｒｅＩｍｐｌｅｍｅｎｔｅｄＢｉｔｓｔｒｅａｍとが相互に類似するように、ＬＬＶＭやＦＰＧＡ１５０の回路設計を調整する。このような処理の結果、ＦＰＧＡ１５０の回路設計をＬＬＶＭの中間言語、すなわち、抽象的なバックエンドに近づけることができる。この結果、調整装置１０は、ＬＬＶＭおよびＦＰＧＡ１５０のボトルネックを相互に解消することができる。 That is, the adjustment device 10 adjusts the circuit design of the FPGA 150 so that optimization is performed according to the intermediate language of the LLVM, and optimizes the intermediate language according to the circuit design of the FPGA 150. make adjustments. More specifically, the adjustment device 10 designs the circuits of the LLVM and the FPGA 150 so that the Instruction (intermediate representation) generated by the scheduling of the LLVM and the Pre-Implemented Bitstream indicating the function exhibited by the FPGA 150 are similar to each other. adjust. As a result of such processing, the circuit design of FPGA 150 can be brought closer to the intermediate language of LLVM, that is, the abstract backend. As a result, the coordinator 10 can eliminate the bottlenecks of the LLVM and the FPGA 150 mutually.

〔２．生成装置の構成〕
以下、上記した調整処理を実現する調整装置１０が有する機能構成の一例について説明する。図２は、実施形態に係る調整装置の構成例を示す図である。図２に示すように、調整装置１０は、通信部２０、記憶部３０、および制御部４０を有する。 [2. Configuration of generation device]
An example of the functional configuration of the adjustment device 10 that implements the adjustment process described above will be described below. FIG. 2 is a diagram illustrating a configuration example of the adjustment device according to the embodiment; As shown in FIG. 2 , the adjustment device 10 has a communication section 20 , a storage section 30 and a control section 40 .

通信部２０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。そして、通信部２０は、所定のネットワークを介して演算装置１００と接続され、演算装置１００との間で各種通信を行う。 The communication unit 20 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 20 is connected to the arithmetic device 100 via a predetermined network, and performs various communications with the arithmetic device 100 .

記憶部３０は、例えば、ＲＡＭ（Random Access Memory)、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。また、記憶部３０は、ＤＡＧデータベース３１、機能分解グラフデータベース３２、およびＬＬＶＭ３３を記憶する。 The storage unit 30 is implemented by, for example, a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. The storage unit 30 also stores a DAG database 31 , a functional decomposition graph database 32 and an LLVM 33 .

ＤＡＧデータベース３１には、ＬＬＶＭ３３が生成するＤＡＧの情報が登録される。例えば、図３は、実施形態に係るＤＡＧデータベースに登録される情報の一例を示す図である。図３に示すように、ＤＡＧデータベース３１には、「ノードＩＤ」、「機能」、「エッジ情報」、「条件情報」および「接続先」といった情報が対応付けて登録されている。 DAG information generated by the LLVM 33 is registered in the DAG database 31 . For example, FIG. 3 is a diagram showing an example of information registered in the DAG database according to the embodiment. As shown in FIG. 3, in the DAG database 31, information such as "node ID", "function", "edge information", "condition information", and "connection destination" are associated and registered.

ここで、「ノードＩＤ」とは、ＤＡＧに含まれるノードを示す識別子である。また、「機能」とは、「ノードＩＤ」が示すノードと対応する機能（すなわち、処理の内容）を示す情報である。また、「エッジ情報」とは、ノード間を接続するエッジを示す識別子である。また、「条件情報」とは、対応付けられた「エッジ情報」が示すエッジにより接続された他のノードが示す機能を発揮させるための条件を示す情報である。また、「接続先」とは、対応付けられた「エッジ情報」が示すエッジにより接続された他のノードのノードＩＤである。 Here, "node ID" is an identifier that indicates a node included in the DAG. A "function" is information indicating a function (that is, the content of processing) corresponding to a node indicated by the "node ID". "Edge information" is an identifier that indicates an edge that connects nodes. "Condition information" is information indicating a condition for exhibiting a function indicated by another node connected by an edge indicated by associated "edge information". The "connection destination" is the node ID of another node connected by the edge indicated by the associated "edge information".

例えば、図３に示す例では、ＤＡＧデータベース３１には、ノードＩＤ「Ｎ１」、機能「機能＃１」、エッジ情報「Ｅ１２」、条件情報「Ｃ１」および接続先「Ｎ２」が対応付けて登録されている。このような情報は、ノードＩＤ「Ｎ１」が示す処理の内容が「機能＃１」であり、処理結果が条件情報「Ｃ１」を満たした場合に、エッジ情報「Ｅ１２」が示すエッジにより接続されたノードＩＤ「Ｎ２」が示すノードの処理が実行される旨を示す。なお、図３に示す例では、「機能＃１」や「Ｃ１」といった概念的な値を記載したが、実際には、実行する処理の内容を示す関数や文字列、条件を示す関数等が登録されることとなる。 For example, in the example shown in FIG. 3, the node ID "N1", the function "function #1", the edge information "E12", the condition information "C1", and the connection destination "N2" are registered in the DAG database 31 in association with each other. It is Such information is connected by the edge indicated by the edge information "E12" when the content of the process indicated by the node ID "N1" is "function #1" and the processing result satisfies the condition information "C1". This indicates that the process of the node indicated by the node ID "N2" will be executed. Note that in the example shown in FIG. 3, conceptual values such as "function #1" and "C1" are described, but in reality, there are functions, character strings, and functions that indicate the conditions of the processing to be executed. to be registered.

図２に戻り、説明を続ける。機能分解グラフデータベース３２には、機能分解グラフが登録される。例えば、図４は、実施形態に係る機能分解グラフデータベースに登録される情報の一例を示す図である。図４に示すように、機能分解グラフデータベース３２には、「ノードＩＤ」、「機能」、「エッジ情報」、「条件情報」および「接続先」といった情報が登録される。すなわち、機能分解グラフデータベース３２には、ＤＡＧデータベース３１と同様に、ＦＰＧＡ１５０が構成可能な論理モジュールにおける処理の内容を示すグラフの情報が登録される。 Returning to FIG. 2, the description is continued. Functional decomposition graphs are registered in the functional decomposition graph database 32 . For example, FIG. 4 is a diagram showing an example of information registered in the functional decomposition graph database according to the embodiment. As shown in FIG. 4, the functional decomposition graph database 32 registers information such as "node ID", "function", "edge information", "condition information", and "connection destination". That is, in the functional decomposition graph database 32, similarly to the DAG database 31, graph information indicating the contents of processing in logic modules that can be configured by the FPGA 150 is registered.

図２に戻り、説明を続ける。ＬＬＶＭ３３は、所定のソースコードが示す処理を演算装置１００のＦＰＧＡ１５０に実行させるためのＨＤＬを生成するコンパイラである。 Returning to FIG. 2, the description is continued. The LLVM 33 is a compiler that generates HDL for causing the FPGA 150 of the arithmetic device 100 to execute processing indicated by a given source code.

制御部４０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）、ＭＰＵ（Micro Processing Unit）等のプロセッサによって、調整装置１０内部の記憶装置に記憶されている各種プログラムがＲＡＭ等を作業領域として実行されることにより実現される。また、制御部４０は、コントローラ（controller）であり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現されてもよい。 The control unit 40 is a controller, and various programs stored in a storage device inside the adjustment device 10 are executed by a processor such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit) in a RAM or the like. It is realized by being executed as a work area. Also, the control unit 40 is a controller, and may be implemented by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

図２に示すように、制御部４０は、取得部４１、分解部４２、比較部４３、調整部４４、および設定部４５を有する。 As shown in FIG. 2 , the control unit 40 has an acquisition unit 41 , a decomposition unit 42 , a comparison unit 43 , an adjustment unit 44 and a setting unit 45 .

取得部４１は、処理に用いる論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、演算装置に所定の処理を実行させる機械語を生成するコンパイラが、機械語を生成するための前記所定の処理の内容から生成する第２グラフとを取得する。例えば、取得部４１は、利用者が利用する端末装置（図示は省略）から、所定の処理を示すソースコードを受付ける。このような場合、取得部４１は、ＦＰＧＡ１５０が所定の処理を実行する際に構成する論理コンポーネントの組み合わせを特定する。そして、取得部４１は、生成した論理コンポーネントにおける処理の内容を示すグラフを機能分解グラフとして生成する。すなわち、取得部４１は、複数の機能分解グラフを生成する。そして、取得部４１は、生成した機能分解グラフを機能分解グラフデータベース３２に登録する。 Acquisition unit 41 generates a plurality of first graphs showing details of functions exhibited by each logic circuit of an arithmetic device capable of changing the combination of logic circuits used for processing, and machine language for causing the arithmetic device to execute predetermined processing. and a second graph generated from the content of the predetermined processing for generating the machine language by a compiler. For example, the acquisition unit 41 receives a source code indicating predetermined processing from a terminal device (not shown) used by the user. In such a case, the acquisition unit 41 identifies a combination of logic components that are configured when the FPGA 150 executes predetermined processing. Then, the acquisition unit 41 generates a graph indicating the content of processing in the generated logic component as a functional decomposition graph. That is, the acquisition unit 41 generates a plurality of functional decomposition graphs. The acquisition unit 41 then registers the generated functional decomposition graph in the functional decomposition graph database 32 .

また、取得部４１は、ＬＬＶＭ３３を読出し、受付けたソースコードをＬＬＶＭ３３に入力する。このような場合、ＬＬＶＭ３３は、処理を実行するためのＨＤＬを生成するが、その際に実行するスケジューリングにおいて中間表現を生成する。そこで、取得部４１は、ＬＬＶＭ３３がソースコードから生成した中間表現を取得し、取得した中間表現を示すグラフ、すなわち、ＤＡＧを取得する。そして、取得部４１は、取得したＤＡＧをＤＡＧデータベース３１に登録する。 The acquisition unit 41 also reads the LLVM 33 and inputs the received source code to the LLVM 33 . In such a case, the LLVM 33 generates an HDL for executing processing, but generates an intermediate representation in scheduling for execution at that time. Therefore, the acquisition unit 41 acquires the intermediate representation generated from the source code by the LLVM 33, and acquires a graph indicating the acquired intermediate representation, that is, the DAG. The acquisition unit 41 then registers the acquired DAG in the DAG database 31 .

分解部４２は、グラフラプラシアンに基づいて、第２グラフを複数の分解グラフに分解する。例えば、分解部４２は、ＤＡＧデータベース３１を参照し、ＤＡＧを取得する。このような場合、分解部４２は、ＤＡＧのグラフラプラシアンを生成し、生成したグラフラプラシアンを用いて、ＤＡＧを複数の分解グラフに分解する。例えば、分解部４２は、非特許文献５に開示された手法を用いて、分解グラフを生成する。 The decomposing unit 42 decomposes the second graph into a plurality of decomposed graphs based on the graph Laplacian. For example, the decomposition unit 42 refers to the DAG database 31 and acquires DAGs. In such a case, the decomposing unit 42 generates a graph Laplacian of the DAG, and uses the generated graph Laplacian to decompose the DAG into a plurality of decomposition graphs. For example, the decomposition unit 42 uses the technique disclosed in Non-Patent Document 5 to generate a decomposition graph.

比較部４３は、ＤＡＧと機能分解グラフとの構造を比較する。例えば、比較部４３は、分解部４２がＤＡＧを分解することで生成した複数の分解グラフを取得する。このような場合、比較部４３は、機能分解グラフデータベース３２を参照し、各機能分解グラフを取得する。そして、比較部４３は、各分解グラフの構造と、各機能分解グラフの構造とを比較し、類似度を算出する。 The comparison unit 43 compares the structures of the DAG and the functional decomposition graph. For example, the comparing unit 43 acquires a plurality of decomposed graphs generated by decomposing the DAG by the decomposing unit 42 . In such a case, the comparison unit 43 refers to the functional decomposition graph database 32 and acquires each functional decomposition graph. Then, the comparison unit 43 compares the structure of each decomposition graph with the structure of each functional decomposition graph, and calculates the degree of similarity.

例えば、比較部４３は、非特許文献４に開示された技術を用いて、各グラフのパーシステント図を生成し、生成したパーシステント図の類似度を算出する。例えば、比較部４３は、パーシステント図に現れる線形関数の傾きの値の類似度を算出する。 For example, the comparison unit 43 uses the technology disclosed in Non-Patent Document 4 to generate a persistent diagram of each graph and calculate the similarity of the generated persistent diagrams. For example, the comparison unit 43 calculates the similarity of the slope values of the linear functions appearing in the persistent diagram.

調整部４４は、第１グラフの構造と第２グラフの構造、または、グラフ構造のパターンが類似するように、コンパイラ若しくは演算装置が発揮する機能を調整する。例えば、調整部４４は、比較部４３による各分解グラフの構造と、各機能分解グラフの構造とを比較結果を取得する。より具体的には、調整部４４は、各分解グラフの構造と、各機能分解グラフの構造との類似度を取得する。 The adjustment unit 44 adjusts the functions performed by the compiler or the arithmetic device so that the structure of the first graph and the structure of the second graph or the patterns of the graph structures are similar. For example, the adjustment unit 44 acquires the result of comparing the structure of each decomposition graph and the structure of each functional decomposition graph by the comparison unit 43 . More specifically, the adjusting unit 44 acquires the degree of similarity between the structure of each decomposition graph and the structure of each functional decomposition graph.

例えば、調整部４４は、各分解グラフの構造と各機能分解グラフの構造との類似度の総和が高くなるように、ＬＬＶＭ３３が有するＩＲビルダやオプティマイザを修正する。なお、調整部４４は、ＩＲビルダやオプティマイザの修正内容と、修正した結果新たにＬＬＶＭ３３が生成した中間表現に基づくＤＡＧの分解グラフの構造と、各機能分解グラフの構造との類似度との間の関係に基づいて、ＩＲビルダやオプティマイザの修正方針を決定してもよい。すなわち、調整部４４は、各分解グラフの構造と各機能分解グラフの構造との類似度の総和が高くなるように、ＬＬＶＭ３３から新たなＬＬＶＭを生成する。 For example, the adjustment unit 44 modifies the IR builder and optimizer of the LLVM 33 so that the sum of similarities between the structure of each decomposition graph and the structure of each functional decomposition graph increases. Note that the adjustment unit 44 determines the degree of similarity between the content of corrections made by the IR builder or optimizer, the structure of the DAG decomposition graph based on the intermediate representation newly generated by the LLVM 33 as a result of the correction, and the structure of each functional decomposition graph. A correction policy for the IR builder or optimizer may be determined based on the relationship of . That is, the adjustment unit 44 generates a new LLVM from the LLVM 33 so that the sum of similarities between the structure of each decomposition graph and the structure of each functional decomposition graph is high.

なお、調整部４４は、各分解グラフの構造と各機能分解グラフの構造との類似度の総和が高くなるように、ＦＰＧＡ１５０が構成する論理モジュールの構造を修正してもよい。例えば、調整部４４は、ＦＰＧＡ１５０に構成させる論理モジュールを、各分解グラフの構造と類似する構造を有する論理モジュールに限定させてもよい。その後、調整部４４は、新たなＬＬＶＭを記憶部３０に登録する。 Note that the adjustment unit 44 may modify the structure of the logic modules configured by the FPGA 150 so that the sum of similarities between the structure of each decomposition graph and the structure of each function decomposition graph increases. For example, the adjustment unit 44 may limit the logic modules configured in the FPGA 150 to logic modules having a structure similar to the structure of each decomposition graph. The adjustment unit 44 then registers the new LLVM in the storage unit 30 .

設定部４５は、調整部４４により調整されたＬＬＶＭを用いて、ＦＰＧＡ１５０の設定を行う。例えば、設定部４５は、ＬＬＶＭを用いて、ソースコードをＨＤＬに変換し、変換後のＨＤＬを演算装置１００へと提供することで、ＦＰＧＡ１５０に所定の処理を実行させるための論理コンポーネントを設定する。 The setting unit 45 uses the LLVM adjusted by the adjustment unit 44 to set the FPGA 150 . For example, the setting unit 45 converts the source code into HDL using LLVM, and provides the converted HDL to the arithmetic device 100 to set logic components for causing the FPGA 150 to execute predetermined processing. do.

〔３．生成装置が実行する処理の流れの一例〕
次に、図５を用いて、調整装置１０が実行する提供処理の流れの一例について説明する。図５は、実施形態に係る調整処理の流れの一例を説明するフローチャートである。まず、調整装置１０は、ＦＰＧＡの機能分解グラフを取得する（ステップＳ１０１）。また、調整装置１０は、コンパイラであるＬＬＶＭの中間表現に基づくＤＡＧが有する構造と機能分解グラフが有する構造とを比較する（ステップＳ１０２）。 [3. Example of flow of processing executed by generation device]
Next, an example of the flow of provision processing executed by the adjustment device 10 will be described with reference to FIG. 5 . FIG. 5 is a flowchart illustrating an example of the flow of adjustment processing according to the embodiment. First, the adjustment device 10 acquires a functional decomposition graph of FPGA (step S101). Further, the adjustment device 10 compares the structure of the DAG based on the intermediate representation of LLVM, which is the compiler, with the structure of the functional decomposition graph (step S102).

そして、調整装置１０は、各グラフの構造が類似するようにコンパイラを構成する（ステップＳ１０３）。その後、調整装置１０は、調整したコンパイラを用いて、ソースコードを機械語に変換し（ステップＳ１０４）、変換後の機械語を用いてＦＰＧＡの設定を行い（ステップＳ１０５）、処理を終了する。 Then, the adjustment device 10 configures the compiler so that the structures of the graphs are similar (step S103). After that, the adjustment device 10 converts the source code into machine language using the adjusted compiler (step S104), configures the FPGA using the converted machine language (step S105), and ends the process.

〔４．変形例〕
上記では、調整装置１０による提供処理の一例について説明した。しかしながら、実施形態は、これに限定されるものではない。以下、調整装置１０が実行する調整処理のバリエーションについて説明する。 [4. Modification]
An example of the providing process by the adjustment device 10 has been described above. However, embodiments are not so limited. Variations of the adjustment process executed by the adjustment device 10 will be described below.

〔４－１．装置構成〕
上述した例では、調整装置１０は、調整装置１０内で調整処理を実行した。しかしながら、実施形態は、これに限定されるものではない。例えば、調整装置１０は、演算装置１００の設定を行うフロントエンドサーバと、調整処理を実行するバックエンドサーバとにより実現されてもよい。このような場合、例えば、フロントエンドサーバは、図２に示す設定部４５を有し、バックエンドサーバは、図２に示す取得部４１、分解部４２、比較部４３、および調整部４４を有する。また、調整装置１０は、ＤＡＧデータベース３１や機能分解グラフデータベース３２を外部のストレージサーバに記憶させてもよい。 [4-1. Device configuration〕
In the example described above, the adjustment device 10 performed adjustment processing within the adjustment device 10 . However, embodiments are not so limited. For example, the adjustment device 10 may be implemented by a front-end server that configures the arithmetic device 100 and a back-end server that executes adjustment processing. In such a case, for example, the front-end server has a setting unit 45 shown in FIG. 2, and the back-end server has an acquisition unit 41, a decomposition unit 42, a comparison unit 43, and an adjustment unit 44 shown in FIG. . Further, the adjustment device 10 may store the DAG database 31 and the functional decomposition graph database 32 in an external storage server.

〔４－２．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文章中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [4-2. others〕
Further, among the processes described in the above embodiments, all or part of the processes described as being automatically performed can be manually performed, or the processes described as being performed manually can be performed manually. All or part of this can also be done automatically by known methods. In addition, information including processing procedures, specific names, and various data and parameters shown in the above text and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each drawing is not limited to the illustrated information.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Also, each component of each device illustrated is functionally conceptual, and does not necessarily need to be physically configured as illustrated. In other words, the specific form of distribution and integration of each device is not limited to the one shown in the figure, and all or part of them can be functionally or physically distributed and integrated in arbitrary units according to various loads and usage conditions. Can be integrated and configured.

また、上記してきた各実施形態は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Moreover, each of the embodiments described above can be appropriately combined within a range that does not contradict the processing contents.

〔４－３．プログラム〕
また、上述してきた実施形態に係る調整装置１０は、例えば図６に示すような構成のコンピュータ１０００によって実現される。図６は、ハードウェア構成の一例を示す図である。コンピュータ１０００は、出力装置１０１０、入力装置１０２０と接続され、演算装置１０３０、一次記憶装置１０４０、二次記憶装置１０５０、出力ＩＦ（Interface）１０６０、入力ＩＦ１０７０、ネットワークＩＦ１０８０がバス１０９０により接続された形態を有する。 [4-3. program〕
Moreover, the adjusting device 10 according to the above-described embodiments is implemented by a computer 1000 configured as shown in FIG. 6, for example. FIG. 6 is a diagram illustrating an example of a hardware configuration; A computer 1000 is connected to an output device 1010 and an input device 1020, and an arithmetic device 1030, a primary storage device 1040, a secondary storage device 1050, an output IF (Interface) 1060, an input IF 1070, and a network IF 1080 are connected via a bus 1090. have

演算装置１０３０は、一次記憶装置１０４０や二次記憶装置１０５０に格納されたプログラムや入力装置１０２０から読み出したプログラム等に基づいて動作し、各種の処理を実行する。一次記憶装置１０４０は、ＲＡＭ等、演算装置１０３０が各種の演算に用いるデータを一次的に記憶するメモリ装置である。また、二次記憶装置１０５０は、演算装置１０３０が各種の演算に用いるデータや、各種のデータベースが登録される記憶装置であり、ＲＯＭ(Read Only Memory)、ＨＤＤ、フラッシュメモリ等により実現される。 Arithmetic device 1030 operates based on programs stored in primary storage device 1040 and secondary storage device 1050, programs read from input device 1020, and the like, and executes various types of processing. The primary storage device 1040 is a memory device such as a RAM that temporarily stores data used by the arithmetic device 1030 for various calculations. The secondary storage device 1050 is a storage device in which data used for various calculations by the arithmetic device 1030 and various databases are registered, and is implemented by a ROM (Read Only Memory), HDD, flash memory, or the like.

出力ＩＦ１０６０は、モニタやプリンタといった各種の情報を出力する出力装置１０１０に対し、出力対象となる情報を送信するためのインタフェースであり、例えば、ＵＳＢ（Universal Serial Bus）やＤＶＩ（Digital Visual Interface）、ＨＤＭＩ（登録商標）（High Definition Multimedia Interface）といった規格のコネクタにより実現される。また、入力ＩＦ１０７０は、マウス、キーボード、およびスキャナ等といった各種の入力装置１０２０から情報を受信するためのインタフェースであり、例えば、ＵＳＢ等により実現される。 The output IF 1060 is an interface for transmitting information to be output to the output device 1010 that outputs various types of information such as a monitor and a printer. It is realized by a connector conforming to a standard such as HDMI (registered trademark) (High Definition Multimedia Interface). Also, the input IF 1070 is an interface for receiving information from various input devices 1020 such as a mouse, keyboard, scanner, etc., and is realized by, for example, USB.

なお、入力装置１０２０は、例えば、ＣＤ（Compact Disc）、ＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等から情報を読み出す装置であってもよい。また、入力装置１０２０は、ＵＳＢメモリ等の外付け記憶媒体であってもよい。 Note that the input device 1020 includes, for example, optical recording media such as CDs (Compact Discs), DVDs (Digital Versatile Discs), PDs (Phase change rewritable discs), magneto-optical recording media such as MOs (Magneto-Optical discs), and tapes. It may be a device that reads information from a medium, a magnetic recording medium, a semiconductor memory, or the like. Also, the input device 1020 may be an external storage medium such as a USB memory.

ネットワークＩＦ１０８０は、ネットワークＮを介して他の機器からデータを受信して演算装置１０３０へ送り、また、ネットワークＮを介して演算装置１０３０が生成したデータを他の機器へ送信する。 Network IF 1080 receives data from other devices via network N and sends the data to arithmetic device 1030, and also transmits data generated by arithmetic device 1030 via network N to other devices.

演算装置１０３０は、出力ＩＦ１０６０や入力ＩＦ１０７０を介して、出力装置１０１０や入力装置１０２０の制御を行う。例えば、演算装置１０３０は、入力装置１０２０や二次記憶装置１０５０からプログラムを一次記憶装置１０４０上にロードし、ロードしたプログラムを実行する。 The arithmetic device 1030 controls the output device 1010 and the input device 1020 via the output IF 1060 and the input IF 1070 . For example, arithmetic device 1030 loads a program from input device 1020 or secondary storage device 1050 onto primary storage device 1040 and executes the loaded program.

例えば、コンピュータ１０００が調整装置１０として機能する場合、コンピュータ１０００の演算装置１０３０は、一次記憶装置１０４０上にロードされたプログラムまたはデータを実行することにより、制御部４０の機能を実現する。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムまたはデータを記録媒体１８００から読み取って実行するが、他の例として、他の装置からネットワークＮを介してこれらのプログラムを取得してもよい。 For example, when computer 1000 functions as adjustment device 10 , arithmetic device 1030 of computer 1000 implements the functions of control unit 40 by executing programs or data loaded on primary storage device 1040 . CPU 1100 of computer 1000 reads these programs or data from recording medium 1800 and executes them, but as another example, these programs may be acquired via network N from another device.

〔５．効果〕
上述したように、調整装置１０は、処理に用いる論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、演算装置に所定の処理を実行させる機械語を生成するコンパイラが、機械語を生成するための所定の処理の内容から生成する第２グラフとを取得する。そして、調整装置１０は、取得された第１グラフの構造と第２グラフの構造、または、グラフ構造のパターンが類似するように、コンパイラ若しくは演算装置が発揮する機能を調整する。このように、調整装置１０は、ＰＬＤの論理設計と、ＰＬＤの設定を行うコンパイラが中間表現を生成する際に用いる論理設計とを近づけることで、ＰＬＤの設定を最適化するので、ＰＬＤにおける処理の効率を改善できる。 [5. effect〕
As described above, the adjustment device 10 has a plurality of first graphs showing the content of the function exhibited by each logic circuit of the arithmetic device capable of changing the combination of the logic circuits used for processing, and the arithmetic device executes a predetermined process. A second graph generated from the content of a predetermined process for generating the machine language is obtained by a compiler that generates a machine language for generating the machine language. Then, the adjustment device 10 adjusts the function exhibited by the compiler or the arithmetic device so that the structure of the acquired first graph and the structure of the second graph or the patterns of the graph structures are similar. In this way, the adjustment device 10 optimizes the PLD setting by bringing the logic design of the PLD closer to the logic design used by the compiler that sets the PLD to generate the intermediate representation. can improve the efficiency of

また、調整装置１０は、グラフラプラシアンに基づいて、第２グラフを複数の分解グラフに分解し、複数の第１グラフの構造と複数の分解グラフの構造、または、グラフ構造のパターンが類似するように、コンパイラ若しくは演算装置が発揮する機能を調整する。また、調整装置１０は、複数の第１グラフを取得し、複数の分解グラフの構造と複数の第１グラフの構造、または、グラフ構造のパターンが類似するように、コンパイラ若しくは演算装置が発揮する機能を調整する。また、調整装置１０は、スケジューリングのためにコンパイラが中間表現から生成する第２グラフを取得する。また、調整装置１０は、低レベル仮想機械を用いて構成されたコンパイラが生成する第２グラフを取得する。このため、調整装置１０は、ＰＬＤの設定を最適化するので、ＰＬＤにおける処理の効率を改善できる。 Further, the adjustment device 10 decomposes the second graph into a plurality of decomposed graphs based on the graph Laplacian so that the structures of the plurality of first graphs and the structures of the decomposed graphs or the patterns of the graph structures are similar. In addition, it adjusts the functions that the compiler or the arithmetic unit exhibits. In addition, the adjustment device 10 acquires a plurality of first graphs, and the compiler or the arithmetic device performs such that the structures of the plurality of decomposed graphs and the structures of the plurality of first graphs, or the patterns of the graph structures are similar. Adjust functions. The coordinator 10 also obtains a second graph that the compiler generates from the intermediate representation for scheduling. The coordinator 10 also obtains a second graph generated by a compiler configured using a low-level virtual machine. Therefore, the adjustment device 10 optimizes the settings of the PLD, thereby improving the efficiency of processing in the PLD.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 As described above, some of the embodiments of the present application have been described in detail based on the drawings. It is possible to carry out the invention in other forms with modifications.

また、上記してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、生成部は、生成手段や生成回路に読み替えることができる。 Also, the "section, module, unit" described above can be read as "means" or "circuit". For example, the generating unit can be read as generating means or a generating circuit.

１０調整装置
２０通信部
３０記憶部
３１ＤＡＧデータベース
３２機能分解グラフデータベース
３３ＬＬＶＭ
４０制御部
４１取得部
４２分解部
４３比較部
４４調整部
４５設定部
１００演算装置
１５０ＦＰＧＡ REFERENCE SIGNS LIST 10 adjustment device 20 communication unit 30 storage unit 31 DAG database 32 functional decomposition graph database 33 LLVM
40 control unit 41 acquisition unit 42 decomposition unit 43 comparison unit 44 adjustment unit 45 setting unit 100 arithmetic unit 150 FPGA

Claims

論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、前記演算装置に所定の処理を実行させる機械語をソースコードから生成するＬＬＶＭを用いたコンパイラが前記ソースコードから生成する第２グラフとを取得する取得部と、
前記第２グラフを複数の分解グラフに分解する分解部と、
前記取得部により取得された複数の前記第１グラフの構造と前記分解部により分解された複数の前記分解グラフの構造とが類似するように、前記ＬＬＶＭを調整する調整部と
を有することを特徴とする調整装置。 A plurality of first graphs showing contents of functions exhibited by each logic circuit of an arithmetic unit capable of changing the combination of logic circuits, and LLVM for generating machine language for causing the arithmetic unit to execute predetermined processing from source code. an acquisition unit that acquires a second graph generated from the source code by the compiler used;
a decomposition unit that decomposes the second graph into a plurality of decomposition graphs;
an adjusting unit that adjusts the LLVM so that structures of the plurality of first graphs obtained by the obtaining unit and structures of the plurality of decomposed graphs decomposed by the decomposing unit are similar to each other; adjustment device.

前記分解部は、グラフラプラシアンに基づいて、前記第２グラフを複数の分解グラフに分解する
ことを特徴とする請求項１に記載の調整装置。 The decomposing unit decomposes the second graph into a plurality of decomposed graphs based on graph Laplacian.
The adjusting device according to claim 1, characterized in that:

前記取得部は、スケジューリングのために前記コンパイラが中間表現から生成する第２グラフを取得する
ことを特徴とする請求項１又は２に記載の調整装置。 3. The coordinator according to claim 1 , wherein the acquisition unit acquires the second graph generated from the intermediate representation by the compiler for scheduling.

調整装置が実行する調整方法であって、
論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、前記演算装置に所定の処理を実行させる機械語をソースコードから生成するＬＬＶＭを用いたコンパイラが前記ソースコードから生成する第２グラフとを取得する取得工程と、
前記第２グラフを複数の分解グラフに分解する分解工程と、
前記取得工程により取得された複数の前記第１グラフの構造と前記分解工程により分解された複数の前記分解グラフの構造とが類似するように、前記ＬＬＶＭを調整する調整工程と
を含むことを特徴とする調整方法。 An adjustment method performed by an adjustment device,
A plurality of first graphs showing contents of functions exhibited by each logic circuit of an arithmetic unit capable of changing the combination of logic circuits, and LLVM for generating machine language for causing the arithmetic unit to execute predetermined processing from source code. a obtaining step of obtaining a second graph generated from the source code by the compiler used;
a decomposition step of decomposing the second graph into a plurality of decomposition graphs;
an adjusting step of adjusting the LLVM so that structures of the plurality of first graphs obtained by the obtaining step and structures of the plurality of decomposed graphs decomposed by the decomposing step are similar to each other. adjustment method.

論理回路の組み合わせを変更可能な演算装置の各論理回路が発揮する機能の内容を示す複数の第１グラフと、前記演算装置に所定の処理を実行させる機械語をソースコードから生成するＬＬＶＭを用いたコンパイラが前記ソースコードから生成する第２グラフとを取得する取得手順と、
前記第２グラフを複数の分解グラフに分解する分解手順と、前記取得手順により取得された複数の前記第１グラフの構造と前記分解手順により分解された複数の前記分解グラフの構造とが類似するように、前記ＬＬＶＭを調整する調整手順と
をコンピュータに実行させるための調整プログラム。 A plurality of first graphs showing contents of functions exhibited by each logic circuit of an arithmetic unit capable of changing the combination of logic circuits, and LLVM for generating machine language for causing the arithmetic unit to execute predetermined processing from source code. an acquisition procedure for acquiring a second graph generated from the source code by the compiler used;
A decomposition procedure for decomposing the second graph into a plurality of decomposition graphs, and structures of the plurality of first graphs obtained by the obtaining procedure and structures of the plurality of decomposition graphs obtained by the decomposition procedure are similar. and an adjustment program for causing a computer to execute: