JP2000137692A

JP2000137692A - Inter-distributed node load distribution system

Info

Publication number: JP2000137692A
Application number: JP10311316A
Authority: JP
Inventors: Akifumi Murata; 明文村田
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1998-10-30
Filing date: 1998-10-30
Publication date: 2000-05-16

Abstract

PROBLEM TO BE SOLVED: To execute the distribution of loads when loads are centralized or distribution is requested without generating any waste in a memory resource. SOLUTION: This inter-node load distribution system is provided with a load table 1 for storing the present load value of each node N1-Nn and a performance value table 3 for storing the processing performance value of each node N1-Nn. At the time of detecting the excess load of its own node by referring to the load table 1, a movement instructing part 4 selects a node being the destination of movement which is capable of increasing loads by referring to the performance value table 3, and generates a load movement instruction. A load distributing part 5 moves the load of its own node to the node being the destination of movement for each prescribed unit based on the load movement instruction generated by the movement instructing part 4.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、分散ノードコンピ
ューティング環境における分散ノード間負荷分散方式に
関する。The present invention relates to a load distribution method between distributed nodes in a distributed node computing environment.

【０００２】[0002]

【従来の技術】一般に、複数のノードが分散配置された
分散ノード間では、複数のプロセス（サービス）を実行
しつつ各ノードの負荷分散を行なう際の分散ノード間負
荷分散方式が知られている。2. Description of the Related Art In general, among distributed nodes in which a plurality of nodes are arranged in a distributed manner, a load distribution method among distributed nodes is known in which a load of each node is distributed while executing a plurality of processes (services). .

【０００３】この種の分散ノード間負荷分散方式として
は、例えば、ＲＰＣ（リモートプロシージャコール）に
よるＲＰＣサーバの分散方式がある。このＲＰＣサーバ
の分散方式は、予め負荷分散用に全サーバにＲＰＣサー
バを立上げておき、全サーバに負荷を分散する方式であ
る。また、分散ノード間負荷分散方式には、予めプロセ
ス立上げ時に各サーバに負荷を分散する方式もある。As this type of load distribution method between distributed nodes, for example, there is a distribution method of an RPC server by RPC (remote procedure call). This RPC server distribution method is a method in which RPC servers are set up in all servers in advance for load distribution, and the load is distributed to all servers. Further, among the load distribution methods among the distributed nodes, there is a method in which the load is distributed to each server in advance when the process is started.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら以上のよ
うな分散ノード間負荷分散方式では、例えばＲＰＣサー
バの分散方式の場合、予め負荷分散用に全サーバにＲＰ
Ｃサーバを立上げる必要があるので、メモリリソースに
無駄を生じさせる問題があり、また、負荷集中時又は分
散要求時に負荷を分散し得ない問題がある。However, in the above-described load distribution method between distributed nodes, for example, in the case of the distribution method of the RPC server, the RP is distributed to all servers in advance for load distribution.
Since it is necessary to start the C server, there is a problem that the memory resource is wasted, and there is a problem that the load cannot be distributed at the time of load concentration or distribution request.

【０００５】一方、プロセス立上げ時の分散方式の場
合、負荷の集中したサーバに新たな負荷をかけないもの
の、この負荷の集中したサーバの負荷を分散し得ない問
題がある。[0005] On the other hand, in the case of the distribution method at the time of starting the process, although a new load is not applied to the server on which the load is concentrated, there is a problem that the load on the server with the concentrated load cannot be distributed.

【０００６】本発明は上記実情を考慮してなされたもの
で、メモリリソースに無駄を生じさせず、負荷集中時や
分散要求時に負荷分散を実行し得る分散ノード間負荷分
散方式を提供することを目的とする。The present invention has been made in view of the above circumstances, and provides a load distribution method between distributed nodes that can execute load distribution at the time of load concentration or distribution request without wasting memory resources. Aim.

【０００７】[0007]

【課題を解決するための手段】請求項１に対応する発明
は、分散配置された複数のノードが互いに接続され、前
記各ノードの有する負荷を各ノード間で分散させるため
の分散ノード間負荷分散方式であって、前記各ノードの
現在の負荷値が記憶される負荷記憶手段と、前記各ノー
ドの処理性能値が記憶される性能値記憶手段と、前記負
荷記憶手段を参照して自ノードの過負荷を検出したと
き、前記性能値記憶手段を参照して負荷を増加可能な移
動先ノードを選択し、負荷移動指示を生成する移動指示
手段と、前記移動指示手段により生成された負荷移動指
示に基づいて、自ノードの負荷を所定単位毎に前記移動
先ノードに移動させる負荷分散手段とを備えた分散ノー
ド間負荷分散方式である。According to a first aspect of the present invention, there is provided an inter-node load distribution for connecting a plurality of nodes arranged in a distributed manner to each other and distributing a load of each node among the nodes. A load storage unit in which a current load value of each node is stored; a performance value storage unit in which a processing performance value of each node is stored; and When an overload is detected, a destination node capable of increasing the load is selected with reference to the performance value storage unit, and a migration instruction unit that generates a load migration instruction; and a load migration instruction generated by the migration instruction unit. And a load distributing means for distributing the load of the own node to the destination node in predetermined units on the basis of.

【０００８】また、請求項２に対応する発明は、請求項
１に対応する分散ノード間負荷分散方式において、前記
移動指示手段としては、予め複数の度合のいずれかに負
荷が分類され、前記各度合毎に、移動対象の負荷が先頭
にあり、移動された負荷が末尾に接続されるキューを備
えた分散ノード間負荷分散方式である。According to a second aspect of the present invention, in the load sharing method between distributed nodes according to the first aspect, the movement instructing means is configured to classify loads in advance into any of a plurality of degrees, This is a distributed node-to-node load distribution method having a queue in which the load to be moved is at the top and the moved load is connected at the end for each degree.

【０００９】さらに、請求項３に対応する発明は、請求
項１に対応する分散ノード間負荷分散方式において、前
記負荷テーブルとしては、前記各負荷値と、前記各負荷
値における時系列的な平均値と、前記各負荷値の平均２
乗誤差とが記憶されており、前記移動指示手段として
は、前記自ノードの過負荷を検出したとき、前記負荷テ
ーブル内の平均値及び平均２乗誤差に基づいて、現在の
負荷値と前記平均値とが所定値以上離れており、且つ前
記平均２乗誤差の小さい負荷を移動対象に選択しない分
散ノード間負荷分散方式である。Further, according to a third aspect of the present invention, in the load distribution method between distributed nodes according to the first aspect, the load table includes the load values and a time-series average of the load values. Value and the average of each load value 2
And the movement instructing means detects a current load value and the average value based on an average value and an average square error in the load table when detecting an overload of the own node. This is a load sharing method between distributed nodes in which a load whose value is more than a predetermined value and whose mean square error is small is not selected as a movement target.

【００１０】また、請求項４に対応する発明は、請求項
１に対応する分散ノード間負荷分散方式において、前記
移動指示手段としては、移動先対象のノードが先頭にあ
り、移動先にされたノードが末尾に接続されるキューを
備えた分散ノード間負荷分散方式である。According to a fourth aspect of the present invention, in the load balancing method between distributed nodes according to the first aspect, the movement instruction means is such that a node to be moved is at the head and the movement destination is set as the movement destination. This is a load sharing method between distributed nodes including a queue to which a node is connected at the end.

【００１１】さらに、請求項５に対応する発明は、請求
項１に対応する分散ノード間負荷分散方式において、前
記負荷テーブルとしては、前記負荷値がプロセス毎に記
憶され、且つ１個以上のプロセスからなる集合が前記所
定単位として登録されている分散ノード間負荷分散方式
である。According to a fifth aspect of the present invention, in the load distribution method between distributed nodes according to the first aspect, the load value is stored for each process as the load table, and one or more process values are stored. Is a load distribution method between distributed nodes, in which a set of distributed nodes is registered as the predetermined unit.

【００１２】また、請求項６に対応する発明は、請求項
１に対応する分散ノード間負荷分散方式において、前記
性能値テーブルとしては、ＭＩＰＳ（１００万命令／
秒）に基づいた前記処理性能値が記憶される分散ノード
間負荷分散方式である。According to a sixth aspect of the present invention, in the load distribution method between distributed nodes according to the first aspect, the performance value table includes MIPS (1,000,000 instructions /
(Second) based on the processing performance value.

【００１３】さらに、請求項７に対応する発明は、請求
項１に対応する分散ノード間負荷分散方式において、前
記移動指示手段としては、自ノードにおける移動対象の
負荷の負荷値、前記移動先ノードの処理性能値及び前記
自ノードの処理性能値に基づいて、前記移動対象の負荷
を移動した場合に前記移動先ノードで増加する負荷値を
算出し、前記増加する負荷値が前記移動先ノードでのＣ
ＰＵのアイドル量よりも小のとき、前記負荷移動指示を
生成する分散ノード間負荷分散方式である。According to a seventh aspect of the present invention, in the load distribution method between distributed nodes according to the first aspect, the movement instructing means includes: a load value of a load to be moved in the own node; Based on the processing performance value of the own node and the processing performance value of the own node, when the load of the movement target is moved, a load value that increases at the destination node is calculated, and the increasing load value is calculated at the destination node. C
This is a load sharing method between distributed nodes that generates the load transfer instruction when the amount is smaller than the idle amount of the PU.

【００１４】また、請求項８に対応する発明は、請求項
１に対応する分散ノード間負荷分散方式において、前記
移動指示手段としては、予め高負荷、中負荷又は小負荷
のいずれかの度合に負荷が分類され、前記各度合毎に、
移動対象の負荷が先頭にあり、移動された負荷が末尾に
接続されるキューを備えた分散ノード間負荷分散方式で
ある。（作用）従って、請求項１に対応する発明は以上のよう
な手段を講じたことにより、各ノードの現在の負荷値が
記憶される負荷記憶手段と、各ノードの処理性能値が記
憶される性能値記憶手段とを有し、移動指示手段が、負
荷記憶手段を参照して自ノードの過負荷を検出したと
き、性能値記憶手段を参照して負荷を増加可能な移動先
ノードを選択し、負荷移動指示を生成し、負荷分散手段
が、移動指示手段により生成された負荷移動指示に基づ
いて、自ノードの負荷を所定単位毎に移動先ノードに移
動させるので、従来とは異なり、メモリリソースに無駄
を生じさせず、負荷集中時や分散要求時に負荷分散を実
行させることができる。According to an eighth aspect of the present invention, in the load sharing method between distributed nodes according to the first aspect, the movement instructing means includes a high load, a medium load and a small load in advance. The load is classified, and for each of the above-mentioned degrees,
This is a load sharing method between distributed nodes including a queue in which the load to be moved is at the top and the moved load is connected to the end. (Operation) Therefore, in the invention corresponding to claim 1, by taking the above means, the load storage means for storing the current load value of each node and the processing performance value of each node are stored. When the movement instruction means detects an overload of the own node by referring to the load storage means, the movement instruction means selects a destination node capable of increasing the load by referring to the performance value storage means. Since the load distribution instruction is generated, and the load distribution unit moves the load of the own node to the destination node for each predetermined unit based on the load migration instruction generated by the movement instruction unit. It is possible to execute load distribution at the time of load concentration or distribution request without causing waste of resources.

【００１５】また、請求項２に対応する発明は、移動指
示手段としては、予め複数の度合のいずれかに負荷が分
類され、各度合毎に、移動対象の負荷が先頭にあり、移
動された負荷が末尾に接続されるキューを備えたので、
請求項１に対応する作用に加え、特定の負荷のみが順番
に移動するたらい回し動作を阻止することができる。According to a second aspect of the present invention, as the movement instructing means, the loads are classified in advance into any one of a plurality of degrees, and for each degree, the load to be moved is at the top and moved. With a queue where the load is connected at the end,
In addition to the operation corresponding to the first aspect, it is possible to prevent the swirling operation in which only a specific load moves sequentially.

【００１６】さらに、請求項３に対応する発明は、負荷
テーブルとしては、各負荷値と、各負荷値における時系
列的な平均値と、各負荷値の平均２乗誤差とが記憶され
ており、移動指示手段としては、自ノードの過負荷を検
出したとき、負荷テーブル内の平均値及び平均２乗誤差
に基づいて、現在の負荷値と平均値とが所定値以上離れ
ており、且つ平均２乗誤差の小さい負荷を移動対象に選
択しない。Further, in the invention corresponding to claim 3, the load table stores each load value, a time-series average value of each load value, and an average square error of each load value. When the overload of the own node is detected, based on the average value and the mean square error in the load table, the current load value and the average value are separated from each other by a predetermined value or more. A load having a small square error is not selected as a moving object.

【００１７】これにより、請求項１に対応する作用に加
え、通常は低負荷値で一時的に高負荷値となる負荷の移
動を阻止できるので、移動によるオーバヘッドを抑制す
ることができる。In this way, in addition to the action corresponding to the first aspect, the movement of the load, which normally has a low load value and temporarily becomes a high load value, can be prevented, so that the overhead due to the movement can be suppressed.

【００１８】また、請求項４に対応する発明は、移動指
示手段としては、移動先対象のノードが先頭にあり、移
動先にされたノードが末尾に接続されるキューを備えた
ので、請求項１に対応する作用に加え、負荷の移動先を
選択する際に、他の全ノードのうち、一部のノードを検
索すればよいので、サービス移動のオーバヘッドを抑制
することができる。According to a fourth aspect of the present invention, the movement instructing means includes a queue in which the destination node is located at the top and the destination node is connected at the end. In addition to the action corresponding to 1, when selecting a destination of the load, it is only necessary to search some of the other nodes, so that the overhead of service movement can be suppressed.

【００１９】さらに、請求項５に対応する発明は、負荷
テーブルとしては、負荷値がプロセス毎に記憶され、且
つ１個以上のプロセスからなる集合が所定単位として登
録されているので、請求項１に対応する作用を容易且つ
確実に奏することができる。In the invention corresponding to claim 5, the load value is stored for each process in the load table, and a set of one or more processes is registered as a predetermined unit. Can be easily and reliably performed.

【００２０】また、請求項６に対応する発明は、性能値
テーブルとしては、ＭＩＰＳ（１００万命令／秒）に基
づいた処理性能値が記憶されるので、請求項１に対応す
る作用を容易且つ確実に奏することができる。In the invention according to claim 6, since the processing value based on MIPS (million instructions / second) is stored as the performance value table, the operation corresponding to claim 1 can be performed easily and easily. Can be played reliably.

【００２１】さらに、請求項７に対応する発明は、移動
指示手段としては、自ノードにおける移動対象の負荷の
負荷値、移動先ノードの処理性能値及び自ノードの処理
性能値に基づいて、移動対象の負荷を移動した場合に移
動先ノードで増加する負荷値を算出し、増加する負荷値
が移動先ノードでのＣＰＵのアイドル量よりも小のと
き、負荷移動指示を生成するので、請求項１に対応する
作用を容易且つ確実に奏することができる。Further, according to a seventh aspect of the present invention, as the movement instructing means, the movement instructing means based on the load value of the load to be moved in the own node, the processing performance value of the destination node and the processing performance value of the own node. When a load value to be increased is calculated at the destination node when the target load is moved, a load movement instruction is generated when the increased load value is smaller than the idle amount of the CPU at the destination node. The operation corresponding to 1 can be easily and reliably performed.

【００２２】また、請求項８に対応する発明は、移動指
示手段としては、予め高負荷、中負荷又は小負荷のいず
れかの度合に負荷が分類され、各度合毎に、移動対象の
負荷が先頭にあり、移動された負荷が末尾に接続される
キューを備えたので、請求項１に対応する作用に加え、
特定の負荷のみが順番に移動するたらい回し動作を阻止
することができる。In the invention corresponding to claim 8, the movement instructing means may classify loads in advance into any of high load, medium load and small load, and determine the load to be moved for each degree. In addition to the function corresponding to claim 1, a queue is provided at the head of which the moved load is connected to the tail.
It is possible to prevent the turning operation in which only a specific load moves sequentially.

【００２３】[0023]

【発明の実施の形態】以下、本発明の各実施形態につい
て図面を参照しながら説明する。（第１の実施形態）図１は本発明の第１の実施形態に係
る分散ノード間負荷分散方式の適用された計算機システ
ムの構成を示す模式図である。この計算機システムは、
複数のノード（計算機本体）Ｎ１〜Ｎｎが互いに接続さ
れている。ここで、各ノードＮ１〜Ｎｎは、実行中のプ
ロセス（プログラム）が異なるものの、互いに同一構成
のため、ノードＮ１を例に挙げて説明する。Embodiments of the present invention will be described below with reference to the drawings. (First Embodiment) FIG. 1 is a schematic diagram showing a configuration of a computer system to which a distributed node load balancing method according to a first embodiment of the present invention is applied. This computer system is
A plurality of nodes (computer bodies) N1 to Nn are connected to each other. Here, the nodes N1 to Nn have different processes (programs) being executed, but have the same configuration, so that the node N1 will be described as an example.

【００２４】ノードＮ１は、実行の有無によらずに保持
する複数のプロセスＰ１〜Ｐｍの他、負荷テーブル１、
負荷管理部２、性能値テーブル３、移動指示部４、負荷
分散部５、プロセスファイル６及びバックアップファイ
ル７を備えている。The node N1 includes a plurality of processes P1 to Pm which are held irrespective of whether or not they are executed, a load table 1,
A load management unit 2, a performance value table 3, a movement instruction unit 4, a load distribution unit 5, a process file 6, and a backup file 7 are provided.

【００２５】負荷テーブル１は、図２に示すように、負
荷管理部２によって、各ノードＮ１〜Ｎｎにおけるプロ
セスＰ１〜Ｐｍ毎の現在のＣＰＵ負荷値、メモリ負荷値
及びディスク負荷値が読出／書込可能に記憶されるテー
ブルであり、１以上の任意のプロセスＰをまとめた集合
が、１つのサービスＳ（負荷分散の単位）として取扱わ
れる。As shown in FIG. 2, the load management unit 2 reads / writes the current CPU load value, memory load value, and disk load value for each of the processes P1 to Pm in each of the nodes N1 to Nn. A set of one or more arbitrary processes P is handled as one service S (unit of load distribution).

【００２６】なお、負荷テーブル１は、以上の値に加
え、サービスＳ１〜Ｓｊ毎の負荷値（各サービス内の各
プロセス負荷の合計値）が記憶されてもよく、ノードＮ
１〜Ｎｎ毎の負荷値（ノード内の各サービス負荷の合計
値）が記憶されてもよい。The load table 1 may store, in addition to the above values, load values for each of the services S1 to Sj (total value of each process load in each service).
A load value (a total value of each service load in the node) for each of 1 to Nn may be stored.

【００２７】また、ノードＮ１全体のメモリ負荷値は、
ノードＮ１のスワップ(swap)の使用量（スワップの残り
容量あるいは一定時間のスワップin/out、ページin/out
の量）で規定可能である。ノードＮ１全体のＣＰＵ負荷
値は、ＣＰＵのアイドル量あるいはＣＰＵ割当て待ちプ
ロセス数で規定可能である。The memory load value of the entire node N1 is:
Swap usage of node N1 (remaining capacity of swap or swap in / out for a certain time, page in / out
Amount). The CPU load value of the entire node N1 can be defined by the amount of idle CPU or the number of processes waiting for CPU allocation.

【００２８】負荷管理部２は、定期的に各ノードＮ１〜
ＮｎにおけるプロセスＰ１〜Ｐｍ毎の現在のＣＰＵ負荷
値、メモリ負荷値及びディスク負荷値を収集し、これら
ＣＰＵ負荷値、メモリ負荷値及びディスク負荷値を負荷
テーブル１に書込む機能をもっている。The load management unit 2 periodically checks each of the nodes N1 to N1.
It has a function of collecting the current CPU load value, memory load value, and disk load value for each of the processes P1 to Pm in Nn, and writing these CPU load value, memory load value, and disk load value in the load table 1.

【００２９】性能値テーブル３は、ノードＮ１からＮｎ
毎に予めＣＰＵ性能値及びメモリ性能値が読出可能に登
録されたテーブルであり、ノードＮ１〜Ｎｎ内の共有領
域（共有メモリでもファイルでも可）に設けられてい
る。ここで、ＣＰＵ性能値としては、１／（ＭＩＰＳ＊
ＭＰＵ）が使用可能となっている。なお、ＭＩＰＳ（１
００万命令／秒）は、１つのＣＰＵの性能値であり、Ｍ
ＰＵは、ＣＰＵの個数である。一方、メモリ性能値とし
ては、ノード内の各メモリ容量の合計値が使用可能とな
っている。The performance value table 3 includes nodes N1 to Nn
This is a table in which a CPU performance value and a memory performance value are registered in advance so as to be readable for each of them, and are provided in a shared area (can be a shared memory or a file) in the nodes N1 to Nn. Here, the CPU performance value is 1 / (MIPS *
MPU) can be used. Note that MIPS (1
(Million instructions / sec) is the performance value of one CPU, and M
PU is the number of CPUs. On the other hand, as the memory performance value, the total value of each memory capacity in the node can be used.

【００３０】移動指示部４は、定期的に負荷テーブル１
を参照して自ノードＮ１の負荷状況を調査し、負荷が所
定値を越えた旨（過負荷）を検出すると、自ノードＮ１
の各プロセスＰをサービスＳ単位で低負荷のノードＮｉ
（ｉは１〜ｎまでの任意の自然数（但し、自ノードの番
号を除く））に移動させる旨の指示を負荷分散部５に与
える機能をもっている。The movement instructing unit 4 periodically loads the load table 1
, The load status of the own node N1 is checked, and when it is detected that the load exceeds a predetermined value (overload), the own node N1 is checked.
Of each process P in the service S unit with a low load node Ni
(I is an arbitrary natural number from 1 to n (excluding the number of the own node)), and has a function of giving an instruction to the load distribution unit 5 to move the load.

【００３１】同様に、移動指示部４は、ノードＮ自体又
はプロセスＰの障害発生時に、障害発生により実行不可
能となったプロセスＰを含むサービスＳを低負荷のノー
ドＮｉに移動させる旨の指示を負荷分散部５に与える機
能をもっている。Similarly, when the failure of the node N itself or the process P occurs, the movement instructing unit 4 issues an instruction to move the service S including the process P which has become unexecutable due to the failure to the node Ni with a low load. Is provided to the load distribution unit 5.

【００３２】負荷分散部５は、移動指示部４から受けた
負荷移動指示に基づいて、自ノードＮ１の移動対象の負
荷（プロセス）をサービスＳ単位で他のノードＮｉに移
動させる移動機能をもっている。The load distribution unit 5 has a movement function of moving the load (process) to be moved by the own node N1 to another node Ni in service S units based on the load movement instruction received from the movement instruction unit 4. .

【００３３】ここで、移動機能は、自ノードＮ１に移動
対象のプロセスＰｋ（ｋは１〜ｍまでの任意の自然数）
があるとき、自ノードＮ１における移動対象のプロセス
Ｐｋを停止させ（障害により既に停止していれば不
要）、移動対象のプロセスＰｋを立上げるための再開実
行指示を移動先の他ノードＮｉに与え、他ノードＮｉに
おける移動対象のプロセスＰｋを再開実行させることに
より、結果としてプロセスＰｋを自ノードＮ１から他ノ
ードＮｉに移動させるものである。Here, the transfer function is a process Pk (k is an arbitrary natural number from 1 to m) to be transferred to its own node N1.
When there is, the process Pk to be moved in the own node N1 is stopped (it is unnecessary if the process Pk has already been stopped due to a failure), and a restart execution instruction for starting up the process Pk to be moved is given to the other node Ni of the destination. By restarting the process Pk to be moved in the other node Ni, the process Pk is moved from the own node N1 to the other node Ni as a result.

【００３４】この移動機能として、負荷分散部５は、移
動指示部４から受ける指示により、実行途中のプロセス
ＰをサービスＳ単位で他ノードＮｉへ移送して低負荷の
他ノードＮｉで継続実行する技術（特願平９−２３２９
３０号）を用いている。As the transfer function, the load distribution unit 5 transfers the process P being executed to another node Ni in units of service S in accordance with an instruction received from the transfer instruction unit 4 and continuously executes the process P on another node Ni with a low load. Technology (Japanese Patent Application No. 9-2329)
No. 30).

【００３５】係る技術を用いる負荷分散部５は、プロセ
ス実行により更新されるプロセスファイル６の更新内容
の記録（以下、ログという）を採取して他の全ノードＮ
２〜Ｎｎに分散するためのジャケットルーチン８と、ジ
ャケットルーチン８から受けたログを未確定キュー９ａ
として保持すると共に、チェックポイント毎に未確定キ
ュー９ａを確定キュー９ｂとして該確定キュー９ｂ内の
各ログに基づいてプロセスのバックアップファイル７を
更新可能なデーモン９とを備えている。The load distribution unit 5 using this technique collects a record (hereinafter referred to as a log) of the update contents of the process file 6 updated by the execution of the process, and collects all other nodes N
The jacket routine 8 for distributing the data into 2 to Nn, and the log received from the jacket routine 8
And a daemon 9 that can update the backup file 7 of the process based on each log in the confirmed queue 9b as a confirmed queue 9b for each checkpoint.

【００３６】ここで、ログは、プロセス状態を示すもの
であり、例えば、データ等のレジスタ情報及びファイル
の更新等のシステムコール発行結果が使用可能である。
チェックポイントとしては、一定時間が経過した時点、
あるいはＯＳのシステムコール発行コールをフェッチし
た時点が使用可能である。Here, the log indicates a process state, and for example, register information such as data and a result of issuing a system call such as updating of a file can be used.
The checkpoints are as follows:
Alternatively, the time when the system call issuance call of the OS is fetched can be used.

【００３７】次に、以上のように構成された計算機シス
テムの動作を説明する。ノードＮ１における負荷分散部
５のジャケットルーチン８は、図３に示すように、実行
中のプロセスＰ１から各ログを採取し、これら各ログを
プロセスファイルに更新記憶させると共に少なくともチ
ェックポイントＣＰまでに他の各ノードＮ２〜Ｎｎに送
信する。各ノードＮ２〜Ｎｎでは、デーモン９がこのロ
グを受けて未確定キューとして保持し、チェックポイン
トＣＰ毎にログ内のシステムコール発行結果を反映させ
て処理を実行する。例えばシステムコール発行結果がバ
ックアップファイル７の更新を示すとき、デーモン９に
よりバックアップファイル７を更新する。Next, the operation of the computer system configured as described above will be described. As shown in FIG. 3, the jacket routine 8 of the load distribution unit 5 in the node N1 collects each log from the running process P1, updates and stores each log in a process file, and at least stores the other logs until the checkpoint CP. To each of the nodes N2 to Nn. In each of the nodes N2 to Nn, the daemon 9 receives this log, holds it as an undetermined queue, and executes processing by reflecting the system call issuance result in the log for each checkpoint CP. For example, when the system call issuance result indicates that the backup file 7 is updated, the backup file 7 is updated by the daemon 9.

【００３８】一方、ノードＮ１では、移動指示部４が、
定期的に負荷テーブル１の情報を監視し、一定条件（自
ノードＮ１の負荷が所定値を越えた時点）を満たすと、
負荷分散を開始する。On the other hand, in the node N1, the movement instructing section 4
The information in the load table 1 is periodically monitored, and when a certain condition (when the load of the own node N1 exceeds a predetermined value) is satisfied,
Start load balancing.

【００３９】すなわち、移動指示部４は、図４に示すよ
うに、一定時間スリープ(sleep) し（ＳＴ１）、しかる
後、負荷テーブル１を参照して自ノードＮ１の負荷状況
を調査する（ＳＴ２）。That is, as shown in FIG. 4, the movement instructing unit 4 sleeps for a certain period of time (ST1), and then checks the load condition of the own node N1 with reference to the load table 1 (ST2). ).

【００４０】この調査において、自ノードＮ１について
過負荷か否かを判定し（ＳＴ３）、過負荷でないときに
はステップＳＴ１へ戻る。In this investigation, it is determined whether or not the own node N1 is overloaded (ST3). If not, the process returns to step ST1.

【００４１】なお、ステップＳＴ３の判定は、メモリ負
荷の場合、前述したノードＮ１全体のメモリ負荷値が所
定値を越えたときに過負荷とし、ＣＰＵ負荷の場合、前
述したノードＮ１全体のＣＰＵ負荷値が所定値を越えた
ときに過負荷とする。It should be noted that the determination in step ST3 is that when the memory load is over, the above-mentioned memory load value of the entire node N1 exceeds a predetermined value, and the CPU load is overloaded. When the value exceeds a predetermined value, it is overloaded.

【００４２】自ノードＮ１を過負荷と判定したとき、自
ノードＮ１から高負荷のサービスＳを他ノードＮｉへの
転送対象として選択する（ＳＴ４）。When it is determined that the own node N1 is overloaded, the high load service S from the own node N1 is selected as a transfer target to another node Ni (ST4).

【００４３】続いて、最低の負荷の例えばノードＮ２を
選択し（ＳＴ５）、負荷テーブル１及び性能値テーブル
３を参照しつつ、サービスＳを移動可能か否かを判定す
る（ＳＴ６）。Subsequently, the node N2 having the lowest load, for example, is selected (ST5), and it is determined whether or not the service S can be moved while referring to the load table 1 and the performance value table 3 (ST6).

【００４４】ステップＳＴ６の判定は、メモリ負荷と、
ＣＰＵ負荷との２通りが実行される。メモリ負荷の判定
は、移動先ノードＮ２のメモリ性能値をＭtotal とし、
移動先ノードＮ２で現在使用中のメモリ負荷値をＭusin
g とし、移動するサービスＳのメモリ負荷値をＭservと
した場合、次の（１）式を満たすときに移動可能とされ
る。Ｍtotal − Ｍusing ＞Ｍserv …（１）なお、メモリ性能値はノードＮ２内の各メモリ容量の合
計値である。The determination in step ST6 is based on the memory load,
CPU load is executed in two ways. The memory load is determined by setting the memory performance value of the destination node N2 to Mtotal,
Musin is the memory load value currently used in the destination node N2.
g, and the memory load value of the service S to be moved is Mserv, the movement is possible when the following expression (1) is satisfied. Mtotal-Musing> Mserv (1) The memory performance value is the total value of each memory capacity in the node N2.

【００４５】次に、ＣＰＵ負荷の判定は、移動先ノード
Ｎ２の現在のＣＰＵアイドル（遊休）量を先ＣＰＵidol
とし、移動前のノードＮ１でのサービスＳのＣＰＵ負荷
値を前ＣＰＵloadとし、移動先ノードＮ２のＣＰＵ性能
値を先ＣＰＵperfとし、移動前のノードＮ１でのＣＰＵ
性能値を前ＣＰＵperfとした場合、次の（２）式を満た
すときに移動可能とされる。Next, the CPU load is determined by comparing the current CPU idle (idle) amount of the destination node N2 with the destination CPUidol.
The CPU load value of the service S in the node N1 before the movement is set as the previous CPUload, the CPU performance value of the destination node N2 is set as the first CPU perf, and the CPU in the node N1 before the movement is changed.
When the performance value is the previous CPU perf, the movement is possible when the following expression (2) is satisfied.

【００４６】[0046]

【数１】なお、ＣＰＵアイドル量は、新たに使用可能なＣＰＵ負
荷値を意味している。すなわち、（２）式は、右辺の移
動前のＣＰＵ負荷から換算される移動先のＣＰＵ負荷よ
りも、左辺の移動先のＣＰＵアイドル量が大である関係
を意味している。(Equation 1) The CPU idle amount means a newly available CPU load value. That is, equation (2) means a relationship in which the CPU idle amount of the destination on the left side is larger than the CPU load of the destination on the right side calculated from the CPU load before the movement on the right side.

【００４７】また、サービス移動後の移動先ノードＮｉ
の評価において、ＣＰＵアイドル量を後ＣＰＵidolと
し、メモリ負荷値を後Ｍuse とした場合、各ノードＮ２
〜Ｎｎのうち、次の（３）式の値が最高のノードＮ２
が、最低の負荷のノードＮ２として判定される。The destination node Ni after the service is moved
In the evaluation of the above, when the CPU idle amount is post CPUidol and the memory load value is post Muse, each node N2
To Nn, the node N2 having the highest value of the following equation (3)
Is determined as the node N2 having the lowest load.

【００４８】[0048]

【数２】すなわち、（３）式は、サービス移動後において、ＣＰ
Ｕ負荷の余裕分と、メモリ負荷の余裕分とを合計した値
を示している。(Equation 2) That is, equation (3) indicates that after the service transfer, the CP
The figure shows the sum of the U load allowance and the memory load allowance.

【００４９】ステップＳＴ６においては、（１）式，
（２）式を共に満たした場合、すなわち、メモリ負荷及
びＣＰＵ負荷を共に移動可能と判定したときのみ、サー
ビスＳを移動可能と判定し、負荷分散部５にサービス移
動の指示を出し（ＳＴ７）、ステップＳＴ１へ戻る。In step ST6, equation (1)
Only when both the expressions (2) are satisfied, that is, when it is determined that both the memory load and the CPU load can be moved, it is determined that the service S can be moved, and a service transfer instruction is issued to the load distribution unit 5 (ST7). The process returns to step ST1.

【００５０】なお、ステップ６において、高負荷のサー
ビスＳを移動できないとき、中程度の負荷のサービスＳ
を選択し（ＳＴ８）、前述同様にサービスＳを移動可能
か否かを判定する（ＳＴ９）。In step 6, if the high-load service S cannot be moved, the medium-load service S
Is selected (ST8), and it is determined whether the service S can be moved as described above (ST9).

【００５１】また、ステップＳＴ９において移動可能な
ときにはステップＳＴ７に行くが、中程度の負荷のサー
ビスＳが移動不可のとき、低負荷のサービスＳを選択し
（ＳＴ１０）、前述同様にサービスＳを移動可能か否か
を判定する（ＳＴ１１）。If it is possible to move in step ST9, the process goes to step ST7. If the service S having a medium load cannot be moved, the service S having a low load is selected (ST10), and the service S is moved as described above. It is determined whether or not it is possible (ST11).

【００５２】ステップＳＴ１１においても、移動可能な
ときにはステップＳＴ７に行くが、低負荷のサービスＳ
が移動不可のとき、ステップＳＴ１へ戻る。（具体例１）次に、以上のような各ステップＳＴ１〜Ｓ
Ｔ１１において、１つのプロセスＰ１のみを有する１つ
のサービスＳ１の移動に際し、ＣＰＵ負荷のみを検討す
る場合について説明する。Also in step ST11, when it is possible to move, the process goes to step ST7.
Returns to step ST1 when cannot be moved. (Specific Example 1) Next, each of the above steps ST1 to S1
At T11, a case will be described in which only the CPU load is considered when one service S1 having only one process P1 is moved.

【００５３】具体的には、図５に示す負荷状況におい
て、ノードＮ１のＣＰＵ負荷値が９０％を越えた際に、
サービスＳ１を移動させる場合の移動指示部４の動作を
述べる。Specifically, in the load condition shown in FIG. 5, when the CPU load value of the node N1 exceeds 90%,
The operation of the movement instruction unit 4 when moving the service S1 will be described.

【００５４】ステップＳＴ３において、ノードＮ１のＣ
ＰＵ負荷は、サービスＳ１〜Ｓ３を足して９０％である
ため、ノードＮ１が過負荷と判定される。また、ステッ
プＳＴ４において、ノードＮ１で最も高負荷のサービス
Ｓ１が転送対象として選択される。In step ST3, C of node N1
Since the PU load is 90% by adding the services S1 to S3, it is determined that the node N1 is overloaded. In step ST4, the service S1 with the highest load on the node N1 is selected as a transfer target.

【００５５】次いで、ステップＳＴ５において、最低の
負荷のノードＮ２が選択される。例えば、サービスＳ１
を他ノードＮ２又はＮ３へ移動した場合を仮定し、ノー
ドＮ２，Ｎ３にてサービスＳ１を実行する場合のＣＰＵ
負荷値を試算する。その試算結果は、ノードＮ２が２５
％（＝５０％＊（１／２００）／（１／１００））であ
り、ノードＮ３が１６％（＝５０％＊（１／３００）／
（１／１００））である。Next, in step ST5, the node N2 having the lowest load is selected. For example, service S1
Is moved to another node N2 or N3, and the CPU when the service S1 is executed in the nodes N2 and N3.
Calculate the load value. The calculation result shows that node N2 has 25
% (= 50% * (1/200) / (1/100)), and the node N3 has 16% (= 50% * (1/300) /
(1/100)).

【００５６】ここで、サービスＳ１を移動すると、最終
的なＣＰＵ負荷値は、ノードＮ２では３５％（＝１０％
＋２５％）となり、ノードＮ３では４６％（＝３０％＋
１６％）となる。従って、最終的なＣＰＵ負荷値の小さ
いノードＮ２は、ステップＳＴ５により最低の負荷のノ
ードＮ２として選択され、ステップＳＴ６の（２）式に
より（先ＣＰＵidol９０％＞２５％）移動可能と判定さ
れ、ステップＳＴ７によりサービスＳ１が移動される。（具体例２）また、具体例と同一のＣＰＵ性能値におい
て、他のサービスが移動される場合について説明する。
図６に示す負荷状況において、サービスＳ１の移動を仮
定した場合、各ノードＮ２，Ｎ３の負荷は、ノードＮ２
が１１０％（＝８５％＋２５％）となり、ノードＮ３が
９１％（＝７５％＋１６％）となる。この場合、ノード
Ｎ２，Ｎ３の負荷が高いので、サービスＳ１を移動でき
ない。Here, when the service S1 is moved, the final CPU load value becomes 35% (= 10%) in the node N2.
+ 25%), and 46% (= 30% +) at the node N3.
16%). Therefore, the node N2 having the final small CPU load value is selected as the node N2 having the lowest load in step ST5, and it is determined that the node N2 can be moved (90%> 25%) in step ST6 according to the equation (2) in step ST6. The service S1 is moved by ST7. (Specific Example 2) A case where another service is moved at the same CPU performance value as that of the specific example will be described.
In the load situation shown in FIG. 6, when it is assumed that the service S1 moves, the load on each of the nodes N2 and N3 is
Becomes 110% (= 85% + 25%), and the node N3 becomes 91% (= 75% + 16%). In this case, since the load on the nodes N2 and N3 is high, the service S1 cannot be moved.

【００５７】よって、中程度の負荷であるサービスＳ２
の移動を検討する。サービスＳ２を移動した場合の計算
は、ノードＮ２が１００％（＝８５％＋１５％）とな
り、ノードＮ３が８５％（＝７５％＋１０％）となるの
で、サービスＳ２をノードＣへ移動させる。Therefore, the service S2 having a medium load
Consider moving. In the calculation when the service S2 is moved, the node N2 becomes 100% (= 85% + 15%) and the node N3 becomes 85% (= 75% + 10%), so the service S2 is moved to the node C.

【００５８】このように負荷が高い順から、サービスＳ
１，…の移動を計算し、移動可能なノードＮ３にサービ
スを移動させる。但し、全てのサービスＳ１〜Ｓ３が移
動不可能（他の全ノードＮ２，Ｎ３が高負荷状態）のと
き、サービスＳ１〜Ｓ３の移動をあきらめる。As described above, the service S
The movement of 1,... Is calculated, and the service is moved to the movable node N3. However, when all the services S1 to S3 cannot be moved (all the other nodes N2 and N3 are in a high load state), the services S1 to S3 are given up.

【００５９】このように、ノードＮ１では、ノードＮ１
自体又はプロセスＰｋにて障害発生あるいは高負荷の発
生により、プロセスＰｋの実行が困難になると、（高負
荷の発生時には予め当該プロセスＰｋを停止させた
後、）低負荷の例えばノードＮ２にプロセス移動を指示
して負荷分散を実行する。ノードＮ２では、プロセスの
再開を実行する。As described above, in the node N1, the node N1
If it becomes difficult to execute the process Pk due to the occurrence of a fault or a high load in itself or the process Pk, the process is moved to a low-load node, for example, the node N2 (after the process Pk is stopped in advance when a high load occurs). And execute load distribution. In the node N2, the process is restarted.

【００６０】ノードＮ２は、再開の実行時に、プロセス
Ｐｋのmain（プログラムとしてのスタート）をフェッチ
し、チェックポイントＣＰのログからスタック積上げ／
レジスタ情報設定等を実行し、ノードＮ１で中止された
プロセスＰｋを最新のチェックポイントＣＰ時点から再
開して実行する。The node N2 fetches the main (start as a program) of the process Pk at the time of execution of the restart, and stacks / stacks the log from the checkpoint CP.
By executing register information setting and the like, the process Pk suspended at the node N1 is restarted from the latest checkpoint CP and executed.

【００６１】上述したように本実施形態によれば、各ノ
ードＮ１〜Ｎｎの現在の負荷値が記憶される負荷テーブ
ル１と、各ノードＮ１〜Ｎｎの処理性能値が記憶される
性能値テーブル３とを有し、移動指示部４が、負荷テー
ブル１を参照して自ノードの過負荷を検出したとき、性
能値テーブル３を参照して負荷を増加可能な移動先ノー
ドを選択し、負荷移動指示を生成し、負荷分散部５が、
移動指示部４により生成された負荷移動指示に基づい
て、自ノードの負荷を所定単位毎に移動先ノードに移動
させるので、従来とは異なり、メモリリソースに無駄を
生じさせず、負荷集中時や分散要求時に負荷分散を実行
させることができる。As described above, according to this embodiment, the load table 1 storing the current load values of the nodes N1 to Nn and the performance value table 3 storing the processing performance values of the nodes N1 to Nn When the movement instructing unit 4 detects an overload of the own node with reference to the load table 1, the movement instructing unit 4 selects a destination node capable of increasing the load with reference to the performance value table 3, and An instruction is generated, and the load distribution unit 5
Based on the load movement instruction generated by the movement instruction unit 4, the load of the own node is moved to the destination node for each predetermined unit. Load distribution can be executed when a distribution request is made.

【００６２】また、プロセスの実行中の移動により、サ
ービスの継続性を保ちつつ、プログラミングによる負荷
分散の意識をせずに、分散ノード間での負荷分散システ
ムを構築することができる。（第２の実施形態）次に、本発明の第２の実施形態に係
る分散ノード間負荷分散方式の適用された計算機システ
ムについて説明する。Further, by moving the process during execution, it is possible to construct a load distribution system between distributed nodes without consciousness of load distribution by programming while maintaining service continuity. (Second Embodiment) Next, a description will be given of a computer system to which a load sharing method between distributed nodes according to a second embodiment of the present invention is applied.

【００６３】本実施形態は、第１の実施形態中、各ノー
ドＮ１〜Ｎｎの平均の負荷よりも高負荷のサービスＳが
ある場合、この高負荷のサービスＳのみが各ノードＮ１
〜Ｎｎを順番に移動する（たらい回しされる）場合があ
ることを考慮し、このたらい回し動作の阻止を図るもの
である。In the present embodiment, when there is a service S having a higher load than the average load of each of the nodes N1 to Nn in the first embodiment, only the service S having a higher load is applied to each node N1.
ＮNn are sequentially moved (turned around) in order to prevent this turning operation.

【００６４】具体的には、移動指示部４は、前述した機
能に加え、図７に示すように、例えば各サービスＳ１〜
Ｓ９が負荷の程度に応じて配列される高、中、低の３段
階のキューＱ１〜Ｑ３を有し、各段階の負荷のサービス
を選択（ＳＴ４，８，１０）する際に、各段階のキュー
の先頭にあるサービスＳ７（Ｓ８又はＳ９）を移動対象
として選択する機能と、サービスＳ７（Ｓ８又はＳ９）
が移動されたときにはこのサービスＳ７（Ｓ８又はＳ
９）を該当する段階のキューＱ１（Ｑ２又はＱ３）の末
尾に接続する機能とをもっている。More specifically, in addition to the above-described functions, the movement instructing unit 4 includes, for example, each of the services S1 to S5 as shown in FIG.
S9 has three stages of queues Q1 to Q3 of high, medium and low arranged according to the degree of load, and when selecting a service of each stage of load (ST4, 8, 10), A function of selecting the service S7 (S8 or S9) at the head of the queue as a movement target, and a function of selecting the service S7 (S8 or S9)
Is moved to the service S7 (S8 or S8).
9) is connected to the end of the queue Q1 (Q2 or Q3) at the corresponding stage.

【００６５】なお、図７中のサービスＳの添字及び各Ｑ
１〜Ｑ３内のサービスＳの個数は、単なる一例であり、
適宜変更可能なことは言うまでもない。次に、以上のよ
うに構成された計算機システムの動作を説明する。な
お、この説明は、第１の実施形態と比較して述べる。It should be noted that the suffix of the service S in FIG.
The number of services S in 1 to Q3 is merely an example,
Needless to say, it can be changed as appropriate. Next, the operation of the computer system configured as described above will be described. This description will be made in comparison with the first embodiment.

【００６６】前述した第１の実施形態の場合、図５と同
一のＣＰＵ性能値のノードＮ１において、図８に示すＣ
ＰＵ負荷状況であるとする。この場合、ノードＮ１は、
サービスＳ１をノードＮ２に移動させる。ここで、サー
ビスＳ１のＣＰＵ負荷値が３５％〜４５％の範囲内で上
下すると、ノードＮ２は、サービスＳ１を他のノードＮ
３に移動させる可能性がある。また、ノードＮ３はサー
ビスＳ１をさらに他のノードＮ４に移動させる。以下同
様に、サービス１のみが各ノードＮ５〜Ｎｎを順番に移
動する可能性がある。In the case of the first embodiment described above, the node N1 having the same CPU performance value as that of FIG.
It is assumed that the state is a PU load state. In this case, the node N1
The service S1 is moved to the node N2. Here, when the CPU load value of the service S1 rises and falls within the range of 35% to 45%, the node N2 switches the service S1 to another node N
There is a possibility to move to 3. The node N3 moves the service S1 to another node N4. Similarly, there is a possibility that only the service 1 sequentially moves through the nodes N5 to Nn.

【００６７】一方、本実施形態では、サービス選択用の
キューを設けた構成により、移動対象のサービスＳ１〜
Ｓ９が図９に示すようにキューＱ１〜Ｑ３に接続され
る。On the other hand, in this embodiment, the services S1 to S1 to be moved are configured by providing a queue for service selection.
S9 is connected to queues Q1 to Q3 as shown in FIG.

【００６８】ここで、ノードＮ１がキューＱ１の先頭の
サービスＳ１をノードＮ２に移動させると、ノードＮ２
では、図１０に示すように、このサービスＳ１がキュー
Ｑ１の最後に接続される。Here, when the node N1 moves the service S1 at the head of the queue Q1 to the node N2, the node N2
Then, as shown in FIG. 10, the service S1 is connected to the end of the queue Q1.

【００６９】これにより、ノードＮ２がサービスＳを移
動させる場合、高負荷のキューＱ１の先頭であるサービ
スＳ６が移動対象となる。従って、第１の実施形態とは
異なり、サービスＳ１のみが順番に移動するたらい回し
動作を阻止することができる。Thus, when the node N2 moves the service S, the service S6, which is the head of the high-load queue Q1, is to be moved. Therefore, unlike the first embodiment, it is possible to prevent the swirling operation in which only the service S1 moves sequentially.

【００７０】上述したように本実施形態によれば、第１
の実施形態の効果に加え、あるサービス（例えば全ノー
ド中で一番負荷の高いサービス）のみがたらい回しにさ
れる動作を阻止することができる。（第３の実施形態）次に、本発明の第３の実施形態に係
る分散ノード間負荷分散方式の適用された計算機システ
ムについて説明する。As described above, according to the present embodiment, the first
In addition to the effects of the embodiment, it is possible to prevent an operation in which only a certain service (for example, a service with the highest load among all nodes) is circulated. (Third Embodiment) Next, a description will be given of a computer system to which a load sharing method between distributed nodes according to a third embodiment of the present invention is applied.

【００７１】本実施形態は、第１の実施形態中、通常は
低負荷で一時的に高負荷になるが直ぐに低負荷に復帰す
るサービスＳがある場合、このサービスＳを移動させる
場合があることを考慮し、この一時的に高負荷となるサ
ービスＳの移動の阻止を図るものである。This embodiment is different from the first embodiment in that when there is a service S that normally temporarily becomes high at low load but temporarily returns to low load, this service S may be moved. In consideration of the above, it is intended to prevent the movement of the service S which temporarily becomes a high load.

【００７２】具体的には、負荷テーブル１は、前述した
現在の負荷状況に加え、過去の負荷状況の平均値及び平
均２乗誤差が記憶されるものである。More specifically, the load table 1 stores, in addition to the above-described current load status, an average value and a mean square error of the past load status.

【００７３】負荷管理部２は、前述した機能に加え、過
去の負荷状況の平均値及び平均２乗誤差を負荷テーブル
１に書込む機能をもっている。The load management unit 2 has a function of writing the average value and the mean square error of the past load status into the load table 1 in addition to the above-mentioned functions.

【００７４】移動指示部４は、前述した機能に加え、ス
テップＳＴ４でノードが過負荷か否かを判定する際に、
現在の負荷値と負荷値の平均値とが著しく離れており、
且つ平均２乗誤差が小さいノードＮを選択しない機能を
有している。In addition to the above-described functions, the movement instructing unit 4 determines whether or not the node is overloaded in step ST4.
The current load value is significantly different from the average load value,
In addition, it has a function of not selecting a node N having a small mean square error.

【００７５】次に、以上のように構成された計算機シス
テムの動作を説明する。なお、この説明は、第１の実施
形態と比較して述べる。Next, the operation of the computer system configured as described above will be described. This description will be made in comparison with the first embodiment.

【００７６】いま、図１１に示すように、通常は低負荷
で一瞬だけ高負荷になるサービスＳ１があるとする。こ
のサービスＳ１は、一瞬だけ負荷が上昇したが、通常は
低負荷であるので、サービスＳ１を移動せずに時間の経
過を待つ方がよい。Now, as shown in FIG. 11, it is assumed that there is a service S1 in which the load is normally low and the load is high for a moment. Although the load of the service S1 increases for a moment, the load is usually low. Therefore, it is better to wait for the passage of time without moving the service S1.

【００７７】しかし、第１実施形態では、高負荷となる
時間Ａにおいて、移動対象のサービスＳ１を選択する場
合、このサービスＳ１を移動対象とする可能性がある。However, in the first embodiment, when the service S1 to be moved is selected at the time A when the load is high, there is a possibility that the service S1 is set as the movement target.

【００７８】一方、本実施形態では、負荷の平均値と平
均２乗誤差とを管理する構成により、移動対象のサービ
スＳｋを選択する際に、現在の負荷と負荷の平均値とが
著しく離れており、且つ平均２乗誤差が小さいサービス
Ｓ１を移動対象に選択しない。On the other hand, in the present embodiment, when the service Sk to be moved is selected, the current load and the average value of the load are significantly different from each other when the average value of the load and the average square error are managed. The service S1 which has a small mean square error is not selected as a movement target.

【００７９】これにより、通常は低負荷で一時的に高負
荷となるサービスＳ１の移動を阻止できるので、移動に
よるオーバヘッドを抑制することができる。Thus, it is possible to prevent the movement of the service S1, which normally has a low load and temporarily becomes a high load, so that the overhead due to the movement can be suppressed.

【００８０】上述したように本実施形態によれば、第１
の実施形態の効果に加え、通常は低負荷値で一時的に高
負荷値となる負荷の移動を阻止できるので、移動による
オーバヘッドを抑制することができる。（第４の実施形態）次に、本発明の第４の実施形態に係
る分散ノード間負荷分散方式の適用された計算機システ
ムについて説明する。As described above, according to the present embodiment, the first
In addition to the effects of the first embodiment, it is possible to prevent the movement of the load, which normally has a low load value and temporarily becomes a high load value, so that the overhead due to the movement can be suppressed. (Fourth Embodiment) Next, a description will be given of a computer system to which a load sharing method between distributed nodes according to a fourth embodiment of the present invention is applied.

【００８１】本実施形態は、第１の実施形態中、多数の
ノードＮ１〜Ｎｎを有する計算機システムの場合、最低
の負荷をもつノードＮｉを選択する際に、ノード数ｎに
比例してノードＮ１〜Ｎｎの負荷を算出する処理のオー
バヘッドを増大させることを考慮し、このオーバヘッド
の抑制を図るものである。In this embodiment, in the case of the computer system having a large number of nodes N1 to Nn in the first embodiment, when selecting the node Ni having the lowest load, the node N1 is proportional to the node number n. This overhead is to be suppressed in consideration of increasing the overhead of the process of calculating the loads N to Nn.

【００８２】具体的には、移動指示部４は、前述した機
能に加え、移動対象の例えばノードＮ１〜Ｎ４を順番に
配列したキューＱｍを有し、前述したステップＳＴ５に
よるノード選択の際に、最低の負荷のノードＮ２を選択
するのではなく、キューＱｍの先頭にあるノードＮｓを
移動先として選択する機能と、キューＱｍの先頭のノー
ドＮにサービスＳを移動可能なとき、サービスＳをその
先頭のノードＮに移動させる機能と、サービスＳを移動
させたノードＮをキューＱｍの末尾に接続する機能とを
有している。More specifically, the movement instructing section 4 has a queue Qm in which, for example, nodes N1 to N4 to be moved are arranged in order in addition to the above-described functions. A function of selecting the node Ns at the head of the queue Qm as a destination, instead of selecting the node N2 with the lowest load, and when the service S can be moved to the node N at the head of the queue Qm, the service S is It has a function to move to the head node N and a function to connect the node N to which the service S has moved to the end of the queue Qm.

【００８３】次に、以上のように構成された計算機シス
テムの動作を説明する。Next, the operation of the computer system configured as described above will be described.

【００８４】本実施形態では、ノード選択用のキューＱ
を設けた構成により、移動先の候補としてノードＮ１〜
Ｎｘが待ち行列に接続される。In this embodiment, the queue Q for node selection
, The nodes N1 to N1
Nx is connected to the queue.

【００８５】例えば、ノードＮ１→ノードＮ２→ノード
Ｎ３→ノードＮ４というキューＱｍがあり、ノードＮ２
から高負荷のサービスＳ１をノードＮ１に移動するとす
る。For example, there is a queue Qm of node N 1 → node N 2 → node N 3 → node N 4.
From the server S1 to the node N1.

【００８６】ここで、移動指示部４は、ノード選択の際
に、キューＱｍの先頭にあるノードＮ１を移動先として
選択し、そのノードＮ１の負荷を算出してそのノードＮ
１にサービスＳを移動可能なとき、サービスＳをそのノ
ードＮ１に移動させる。Here, at the time of selecting a node, the movement instructing section 4 selects the node N1 at the head of the queue Qm as a movement destination, calculates the load on the node N1, and calculates the load on the node N1.
When the service S can be moved to the node N1, the service S is moved to the node N1.

【００８７】また、移動指示部４は、サービスＳの移動
により負荷の増えたノードＮ１をキューＱｍの末尾に接
続する一方、サービスＳを移動して負荷の減ったノード
Ｎ２をキューＱｍの先頭に接続する。サービス移動後の
キューＱｍの状態は、ノードＮ１→ノードＮ３→ノード
Ｎ４→ノードＮ２のようになる。The movement instructing unit 4 connects the node N1 whose load has increased due to the movement of the service S to the tail of the queue Qm, and moves the node N2 whose load has decreased by moving the service S to the head of the queue Qm. Connecting. The state of the queue Qm after the service transfer is as follows: node N1 → node N3 → node N4 → node N2.

【００８８】このように、負荷を移動したノードがキュ
ーの先頭へ接続され、負荷の移動されたノードはキュー
の最後に接続されることにより、低負荷のノードがキュ
ーＱｍの先頭へ配置され、高負荷のノードがキューＱｍ
の後半に配置される。As described above, the node having moved the load is connected to the head of the queue, and the node having moved the load is connected to the end of the queue. High load node is queue Qm
Placed in the second half.

【００８９】従って、次回、負荷移動先を算出する場合
もキューＱｍの先頭からのヒット率が高くなり、全ノー
ドＮ１〜Ｎｎの負荷を算出する場合に比べ、オーバヘッ
ドを抑制することができる。Therefore, the next time the load destination is calculated, the hit rate from the head of the queue Qm becomes higher, and the overhead can be suppressed as compared with the case where the loads of all the nodes N1 to Nn are calculated.

【００９０】上述したように本実施形態によれば、第１
の実施形態の効果に加え、ノードＮ２がサービスＳの移
動先を選択する際に、他の全ノードＮ１，Ｎ３〜Ｎｎの
うち、一部のノードを検索すればよいので、サービス移
動のオーバヘッドを抑制することができる。As described above, according to the present embodiment, the first
In addition to the effects of the embodiment, when the node N2 selects the destination of the service S, it is only necessary to search some of the other nodes N1, N3 to Nn. Can be suppressed.

【００９１】なお、上記各第２〜第４の実施形態は、第
１の実施形態に個別に適用した場合を説明したが、これ
に限らず、適宜組合せて同時に適用する構成としても、
本発明を同様に実施して同様の効果を得ることができ
る。Although the above-described second to fourth embodiments have been described with respect to the case where they are individually applied to the first embodiment, the present invention is not limited to this.
The present invention can be implemented in a similar manner to obtain similar effects.

【００９２】また、上記実施形態に記載した手法は、コ
ンピュータに実行させることのできるプログラムとし
て、磁気ディスク（フロッピーディスク、ハードディス
クなど）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤなど）、
光磁気ディスク（ＭＯ）、半導体メモリなどの記憶媒体
に格納して頒布することもできる。The method described in the above embodiment can be executed by a computer as a program such as a magnetic disk (floppy disk, hard disk, etc.), an optical disk (CD-ROM, DVD, etc.),
It can also be stored in a storage medium such as a magneto-optical disk (MO) or a semiconductor memory and distributed.

【００９３】その他、本発明はその要旨を逸脱しない範
囲で種々変形して実施できる。In addition, the present invention can be variously modified and implemented without departing from the gist thereof.

【００９４】[0094]

【発明の効果】以上説明したように本発明によれば、メ
モリリソースに無駄を生じさせず、負荷集中時や分散要
求時に負荷分散を実行できる分散ノード間負荷分散方式
を提供できる。As described above, according to the present invention, it is possible to provide a load distribution method between distributed nodes which can execute load distribution at the time of load concentration or distribution request without wasting memory resources.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の第１の実施形態に係る分散ノード間負
荷分散方式の適用された計算機システムの構成を示す模
式図FIG. 1 is a schematic diagram illustrating a configuration of a computer system to which a load distribution method between distributed nodes according to a first embodiment of the present invention is applied;

【図２】同実施形態における負荷テーブルの構成を示す
模式図FIG. 2 is a schematic diagram showing a configuration of a load table according to the embodiment;

【図３】同実施形態における動作を説明するための模式
図FIG. 3 is a schematic diagram for explaining the operation in the embodiment.

【図４】同実施形態における移動指示部の動作を説明す
るためのフローチャートFIG. 4 is a flowchart for explaining the operation of a movement instruction unit according to the embodiment;

【図５】同実施形態における動作を説明するための負荷
状況を示す模式図FIG. 5 is a schematic diagram showing a load state for explaining an operation in the embodiment.

【図６】同実施形態における動作を説明するための負荷
状況を示す模式図FIG. 6 is a schematic diagram showing a load state for explaining an operation in the embodiment.

【図７】本発明の第２の実施形態に係る分散ノード間負
荷分散方式に用いられるキューの内容を示す模式図FIG. 7 is a schematic diagram showing the contents of a queue used in a distributed node load distribution method according to the second embodiment of the present invention;

【図８】同実施形態における動作を説明するための負荷
状況を示す模式図FIG. 8 is a schematic diagram showing a load state for explaining an operation in the embodiment.

【図９】同実施形態における動作を説明するためのキュ
ーの内容を示す模式図FIG. 9 is a schematic diagram showing the contents of a queue for explaining the operation in the embodiment;

【図１０】同実施形態における動作を説明するためのキ
ューの内容を示す模式図FIG. 10 is a schematic diagram showing the contents of a queue for explaining the operation in the embodiment;

【図１１】本発明の第３の実施形態に係る分散ノード間
負荷分散方式を説明するためのサービスの負荷値を示す
模式図FIG. 11 is a schematic diagram showing load values of services for explaining a load distribution method between distributed nodes according to the third embodiment of the present invention.

【符号の説明】[Explanation of symbols]

１…負荷テーブル２…負荷管理部３…性能値テーブル４…移動指示部５…負荷分散部６…プロセスファイル７…バックアップファイル８…ジャケットルーチン９…デーモン９ａ…未確定キュー９ｂ…確定キューＮ１〜Ｎｎ…ノードＰ１〜Ｐｍ…プロセス DESCRIPTION OF SYMBOLS 1 ... Load table 2 ... Load management part 3 ... Performance value table 4 ... Movement instruction part 5 ... Load distribution part 6 ... Process file 7 ... Backup file 8 ... Jacket routine 9 ... Daemon 9a ... Undetermined queue 9b ... Confirmed queue N1 Nn: nodes P1 to Pm: process

Claims

【特許請求の範囲】[Claims]

【請求項１】分散配置された複数のノードが互いに接
続され、前記各ノードの有する負荷を各ノード間で分散
させるための分散ノード間負荷分散方式であって、前記各ノードの現在の負荷値が記憶される負荷記憶手段
と、前記各ノードの処理性能値が記憶される性能値記憶手段
と、前記負荷記憶手段を参照して自ノードの過負荷を検出し
たとき、前記性能値記憶手段を参照して負荷を増加可能
な移動先ノードを選択し、負荷移動指示を生成する移動
指示手段と、前記移動指示手段により生成された負荷移動指示に基づ
いて、自ノードの負荷を所定単位毎に前記移動先ノード
に移動させる負荷分散手段とを備えたことを特徴とする分散ノード間負荷分散方式。1. A distributed node-to-node load distribution method for interconnecting a plurality of nodes arranged in a distributed manner and distributing a load of each node among the nodes, wherein a current load value of each node is provided. A load value storing means for storing the processing performance value of each node; and a performance value storing means for detecting an overload of the own node by referring to the load storing means. A transfer instruction unit for generating a load transfer instruction by selecting a destination node capable of increasing the load by referring to the destination node; And a load balancing means for moving the load to the destination node.

【請求項２】請求項１に記載の分散ノード間負荷分散
方式において、前記移動指示手段は、予め複数の度合のいずれかに負荷
が分類され、前記各度合毎に、移動対象の負荷が先頭に
あり、移動された負荷が末尾に接続されるキューを備え
たことを特徴とする分散ノード間負荷分散方式。2. The load distribution method between distributed nodes according to claim 1, wherein the movement instructing means classifies the load into one of a plurality of degrees in advance, and for each of the degrees, a load to be moved is a top load. Characterized in that the distributed load is provided with a queue connected to the end of the load.

【請求項３】請求項１に記載の分散ノード間負荷分散
方式において、前記負荷テーブルは、前記各負荷値と、前記各負荷値に
おける時系列的な平均値と、前記各負荷値の平均２乗誤
差とが記憶されており、前記移動指示手段は、前記自ノードの過負荷を検出した
とき、前記負荷テーブル内の平均値及び平均２乗誤差に
基づいて、現在の負荷値と前記平均値とが所定値以上離
れており、且つ前記平均２乗誤差の小さい負荷を移動対
象に選択しないことを特徴とする分散ノード間負荷分散
方式。3. The load distribution method between distributed nodes according to claim 1, wherein the load table is configured to store the load values, a time-series average value of the load values, and an average of the load values. And the movement instructing means detects a current load value and the average value based on an average value and an average square error in the load table when the overload of the own node is detected. And a load having a small mean square error is not selected as a movement target.

【請求項４】請求項１に記載の分散ノード間負荷分散
方式において、前記移動指示手段は、移動先対象のノードが先頭にあ
り、移動先にされたノードが末尾に接続されるキューを
備えたことを特徴とする分散ノード間負荷分散方式。4. The load balancing method between distributed nodes according to claim 1, wherein the movement instructing means includes a queue in which a destination target node is at the head and a destination node is connected at the end. A load distribution method between distributed nodes.

【請求項５】請求項１に記載の分散ノード間負荷分散
方式において、前記負荷テーブルは、前記負荷値がプロセス毎に記憶さ
れ、且つ１個以上のプロセスからなる集合が前記所定単
位として登録されていることを特徴とする分散ノード間
負荷分散方式。5. The load distribution method between distributed nodes according to claim 1, wherein in the load table, the load value is stored for each process, and a set of one or more processes is registered as the predetermined unit. A load sharing method between distributed nodes.

【請求項６】請求項１に記載の分散ノード間負荷分散
方式において、前記性能値テーブルは、ＭＩＰＳ（１００万命令／秒）
に基づいた前記処理性能値が記憶されることを特徴とす
る分散ノード間負荷分散方式。6. The distributed load between distributed nodes according to claim 1, wherein the performance value table is MIPS (1 million instructions / second).
A load distribution method between distributed nodes, characterized in that the processing performance value based on the storage performance is stored.

【請求項７】請求項１に記載の分散ノード間負荷分散
方式において、前記移動指示手段は、自ノードにおける移動対象の負荷
の負荷値、前記移動先ノードの処理性能値及び前記自ノ
ードの処理性能値に基づいて、前記移動対象の負荷を移
動した場合に前記移動先ノードで増加する負荷値を算出
し、前記増加する負荷値が前記移動先ノードでのＣＰＵ
のアイドル量よりも小のとき、前記負荷移動指示を生成
することを特徴とする分散ノード間負荷分散方式。7. The load distribution method between distributed nodes according to claim 1, wherein the movement instructing means includes a load value of a load to be moved in the own node, a processing performance value of the destination node, and a processing of the own node. Calculating, based on the performance value, a load value that increases at the destination node when the load of the migration target is moved, and the increased load value is determined by the CPU at the destination node;
Wherein the load transfer instruction is generated when the load amount is smaller than the idle amount of the load.

【請求項８】請求項１に記載の分散ノード間負荷分散
方式において、前記移動指示手段は、予め高負荷、中負荷又は小負荷の
いずれかの度合に負荷が分類され、前記各度合毎に、移
動対象の負荷が先頭にあり、移動された負荷が末尾に接
続されるキューを備えたことを特徴とする分散ノード間
負荷分散方式。8. The load distribution method between distributed nodes according to claim 1, wherein the movement instructing means classifies the load in advance into one of a high load, a medium load, and a small load, and A load to be moved is at the top, and a queue to which the moved load is connected at the end is provided.