JP2001195377A

JP2001195377A - Isolation judgment system, management method therefor and recording medium

Info

Publication number: JP2001195377A
Application number: JP2000008159A
Authority: JP
Inventors: Hidechika Aono; 英親青野
Original assignee: NEC Software Kyushu Ltd
Current assignee: NEC Solution Innovators Ltd
Priority date: 2000-01-17
Filing date: 2000-01-17
Publication date: 2001-07-19

Abstract

PROBLEM TO BE SOLVED: To recognize fault generation in respective slave nodes themselves and to reduce burdens for a management job as a master node. SOLUTION: In this management method of the isolation judgment system for which a management computer (master node) and plural computers (slave nodes) to be managed are connected by a network, it is constituted of plural management groups for which the several pieces of the slave nodes are used as one set, the slave nodes inside the management group exchange node information with each other and the respective slave nodes hold the node information inside the management group. The master node manages the node information of all the slave nodes by exchanging the node information with one piece of the slave node of a highest priority order inside each management group.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、管理コンピュータ
（マスターノード）と、複数台の被管理コンピュータ
（スレーブノード）がネットワークで接続されている孤
立判定システムの管理方法と孤立判定システム、及び該
管理方法を動作プログラムとし、該動作プログラムをコ
ンピュータが読み取り可能に格納した記録媒体に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a management method and an isolation determination system for an isolation determination system in which a management computer (master node) and a plurality of managed computers (slave nodes) are connected via a network, and the management. The present invention relates to a recording medium in which a method is an operation program and the operation program is stored in a computer-readable manner.

【０００２】[0002]

【従来の技術】従来のノード情報の交換方式の一例とし
て、マスターノードが全てのノードに対し、ノード情報
の交換を行う方式がある。2. Description of the Related Art As an example of a conventional node information exchange method, there is a method in which a master node exchanges node information with all nodes.

【０００３】このような方式のシステムはつぎのように
動作する。The system of such a system operates as follows.

【０００４】すなわち、マスターノードが全てのノード
に対してノード情報の交換を行って、マスターノードが
全てのノード情報を管理する方式である。That is, the master node exchanges node information with all nodes, and the master node manages all node information.

【０００５】この一例として、特開平２−１４３７５２
号公報には、主装置と複数のグループに分かれた従装置
との間で、データの伝送をするデータ通信方式が記載さ
れている。この主装置は、各従装置が属する１つ以上の
グループのそれぞれのグループコードを管理する管理手
段と、この管理手段を検索して取り出したデータを送る
グループコードをこの従装置に送る通信データに付加し
て送信のデータを作成する第１の処理手段と、この第１
の処理手段で作成したデータを前記複数のグループに送
信する第１の通信手段とを有し、前記複数の従装置は、
それぞれ、従装置の属する１つ以上のグループのグルー
プを予め設定する設定手段と、前記第１の通信手段から
のデータを受信する第２の通信手段と、受信したデータ
からグループコードを抽出し、このグループコードと設
定手段に設定したグループコードとを比較して解析する
第２の処理手段とを有し、第２の処理手段での解析の結
果、２つのグループコードが一致した場合、従装置は受
信したデータを有効とし、不一致の場合、受信したデー
タを破棄することが記載されている。[0005] As an example of this, Japanese Patent Application Laid-Open No. 2-137552 is disclosed.
This publication describes a data communication system for transmitting data between a main device and slave devices divided into a plurality of groups. The master device manages each group code of one or more groups to which each slave device belongs, and a group code for sending data retrieved by searching the management device to communication data to be sent to the slave device. First processing means for adding and creating transmission data;
And first communication means for transmitting the data created by the processing means to the plurality of groups, wherein the plurality of slave devices are:
Setting means for presetting one or more groups to which the slave device belongs, second communication means for receiving data from the first communication means, and extracting a group code from the received data; Second processing means for comparing and analyzing the group code and the group code set in the setting means, and when the two group codes match as a result of the analysis by the second processing means, Describes that the received data is valid, and if they do not match, the received data is discarded.

【０００６】また、特開平４−４２６３２号公報には、
システム管理方式として、主管理装置と、該主管理装置
に管理される従管理装置と被管理装置とを備え、従管理
装置が被管理装置に対するヘルスチェックのためのパト
ロールを行うシステム管理方式において、主管理装置
は、従管理装置のパトロール対象となる被管理装置の構
成情報及びパトロール監視情報を、動的に変化させる手
段を備え、一つの従管理装置に障害が発生した場合、こ
の障害発生の従管理装置の管理する被管理装置の群を代
行できる従管理装置の管理する被管理装置の群に加えて
管理し、生涯発生の従管理装置が回復した場合、元の状
態に戻すことが記載されている。これらは、一定間隔で
ヘルスチェックのパトロールを行うことで、傷害を発見
し、障害発生期間に、被管理装置を他の従管理装置にパ
トロールフレームの情報をやりとりすることによって管
理させることが記載されている。[0006] Japanese Patent Application Laid-Open No. 4-42632 discloses that
As a system management method, in a system management method including a main management device, a sub-management device and a managed device managed by the main management device, and the sub-management device performs a patrol for a health check on the managed device, The main management device is provided with means for dynamically changing the configuration information and patrol monitoring information of the managed device to be patrolled by the sub-management device. Describes that the group of managed devices managed by the slave management device is managed in addition to the group of managed devices managed by the slave management device that can act on behalf of the slave management device. Have been. These documents describe that patrols of health checks are performed at regular intervals to detect injuries, and during the failure period, managed devices are managed by exchanging patrol frame information with other slave management devices. ing.

【０００７】また、特開平７−１２１４８８号公報に
は、マスタプロセッサと、このマスタプロセッサとバス
を通して接続されたスレーブプロセッサとから構成され
るマルチプロセッサシステムの定期診断方式において、
複数のスレーブプロセッサを複数のグループに群分けし
た各ユニット内の複数のスレーブプロセッサから１つを
ヘルスチェックマスタとして選出し、マスタプロセッサ
は当該ヘルスチェックマスタだけを定期診断し、その診
断結果により障害発生時に再開処理を行うことが記載さ
れている。このヘルスチェックマスタは同一ユニット内
のスレーブプロセッサの定期診断を行い、その診断結果
をマスタプロセッサへ送信し、２段階の管理方式が記載
されている。Japanese Patent Application Laid-Open No. 7-112488 discloses a periodic diagnosis method for a multiprocessor system comprising a master processor and a slave processor connected to the master processor through a bus.
One of a plurality of slave processors in each unit in which a plurality of slave processors are grouped into a plurality of groups is selected as a health check master, and the master processor periodically diagnoses only the health check master, and a failure occurs based on the diagnosis result. It is described that a resumption process is sometimes performed. This health check master performs periodic diagnosis of slave processors in the same unit, transmits the diagnosis result to the master processor, and describes a two-stage management method.

【０００８】[0008]

【発明が解決しようとする課題】しかし、上記特開平４
−４２６３２号公報には、管理グループ毎に従管理装置
が存在し、従管理装置だけが管理グループ内の被管理装
置を管理しているが、管理グループ内に優先順位の判断
がなく、いずれにしても全従管理装置と共に被管理装置
を主管理装置が走査して管理せざるを得ず、主管理装置
の管理のための負担が大きいという問題点を有する。However, the above-mentioned Japanese Patent Application Laid-Open No.
In JP-A-42632, there is a subordinate management device for each management group, and only the subordinate management device manages the managed devices in the management group. However, there is no determination of the priority order in the management group. However, there is a problem that the main management device has to scan and manage the managed device together with all the slave management devices, and the burden of managing the main management device is large.

【０００９】また、特開平７−１２１４８８号公報に
は、ヘルスチェックマスタのみに管理グループ内の情報
が集まることとしている。ヘルスチェックマスタが変わ
る場合、マスタからの要求が来て、初めて自分の管理範
囲を知り、情報の収集を開始するので、孤立的に自己管
理ができず、他の管理グループから孤立したことを判断
できず、自己自体の傷害を発見することが困難である。Japanese Patent Laid-Open Publication No. 7-112488 discloses that information in a management group is collected only in a health check master. When the health check master changes, the master receives a request from the master and knows the scope of management for the first time and starts collecting information. Unable to find their own injuries.

【００１０】このように、これらの従来技術には、次の
ような問題点があった。As described above, these conventional techniques have the following problems.

【００１１】第１の問題点は、マスターノードあるいは
主装置、マスタプロセッサが、全てのスレーブノード或
いは従管理装置に管理される被管理装置、スレーブプロ
セッサに対して、ノード情報、ヘルスチェックのパトロ
ールの交換を行うため、マスターノードに負荷がかかる
ということである。The first problem is that the master node or the main unit or the master processor sends the node information and the health check patrol to the managed devices and the slave processors managed by all the slave nodes or the slave management units. This means that the load is placed on the master node to perform the replacement.

【００１２】第２の問題点は、ヘルスチェック処理に失
敗した場合、その原因がマスターノード側にあるのか、
他ノード側にあるのかを判別することができないため、
ノード情報の交換の信頼性が低いということである。The second problem is that if the health check process fails, is the cause on the master node side?
Because it is not possible to determine whether it is on another node side,
This means that the reliability of exchanging node information is low.

【００１３】そこで、本発明は、各スレーブノード自体
で障害発生を知ることができ、またマスターノードとし
ての管理業務のための負担を軽減することを課題とす
る。Accordingly, an object of the present invention is to make it possible for each slave node itself to know the occurrence of a fault, and to reduce the burden for management work as a master node.

【００１４】[0014]

【課題を解決するための手段】本発明は、上記課題を解
決するために、管理コンピュータ（マスターノード）
と、複数台の被管理コンピュータ（スレーブノード）が
ネットワークで接続されて各ノードが孤立しているか否
かを判定する孤立判定システムの管理方法において、前
記スレーブノード数台を１組とした複数の管理グループ
で構成し、前記管理グループ内の前記スレーブノード同
士がお互いにノード情報の交換を行い、それぞれのスレ
ーブノードが前記管理グループ内のノード情報を保持し
て、前記マスターノードが各管理グループ内の１台の優
先順位の高いスレーブノードに対してノード情報の交換
を行うことで、全スレーブノードのノード情報を管理す
ることを特徴とする。According to the present invention, there is provided a management computer (master node) for solving the above problems.
A plurality of managed computers (slave nodes) are connected via a network to determine whether or not each node is isolated. It comprises a management group, the slave nodes in the management group exchange node information with each other, each slave node holds node information in the management group, and the master node By exchanging node information with one of the high priority slave nodes, the node information of all the slave nodes is managed.

【００１５】また、本発明は、管理コンピュータ（マス
ターノード）と、複数台の被管理コンピュータ（スレー
ブノード）がネットワークで接続されている孤立判定シ
ステムにおいて、前記マスターノードは、前記スレーブ
ノードの数台を１組とした複数の管理グループ毎に設け
た管理グループ優先順位リストと、前記ネットワークに
ノード情報を送受信する送受信部とを有し、前記管理グ
ループ内の前記スレーブノードは前記マスターノードと
前記管理グループ内の他のスレーブノードとノード情報
を前記ネットワークを介して送受信する送受信部と、前
記管理グループ内の前記管理グループ優先順位リストと
を備え、前記マスターノードは前記各管理グループの優
先順位に従って各管理グループの一つに前記ノード情報
の取得要求を実行すると共に、前記管理グループ内のス
レーブノードからノード取得要求に応対してその取得要
求したスレーブノードの属する管理グループ優先順位リ
ストのノード情報を送信し、前記スレーブノードは少な
くとも前記管理グループ内の他のスレーブノードと相互
にノード情報の取得要求を実行して前記管理グループ優
先順位リストを更新することを特徴とする。Further, according to the present invention, in an isolation determination system in which a management computer (master node) and a plurality of managed computers (slave nodes) are connected via a network, the master node includes several slave nodes. A management group priority list provided for each of a plurality of management groups, and a transmission / reception unit for transmitting / receiving node information to / from the network, wherein the slave nodes in the management group include the master node and the management node. A transmission / reception unit that transmits / receives node information to / from other slave nodes in the group via the network, and the management group priority list in the management group, wherein the master node has a priority order of each of the management groups. Executes the node information acquisition request to one of the management groups In response to a node acquisition request from a slave node in the management group, the node information of the management group priority list to which the slave node that has requested the acquisition belongs is transmitted, and the slave node is configured to transmit at least another node in the management group. The management node priority list is updated by mutually executing a node information acquisition request with a slave node.

【００１６】また、本発明は、複数の管理コンピュータ
（マスターノード）と、前記マスターノードがそれぞれ
管理する複数台の被管理コンピュータ（スレーブノー
ド）がネットワークで接続されている孤立判定システム
において、前記マスターノードは、他のマスターノード
と相互にノード情報の取得要求を実行して当該ノード情
報を格納するマスタ管理グループ優先順位リストを保有
し、前記マスターノードは、前記スレーブノードの数台
を１組とした複数の管理グループ毎に設けた管理グルー
プ優先順位リストと、前記ネットワークにノード情報を
送受信する送受信部とを有し、前記管理グループ内の前
記スレーブノードは前記マスターノードと前記管理グル
ープ内の他のスレーブノードとノード情報を前記ネット
ワークを介して送受信する送受信部と、前記管理グルー
プ内の前記管理グループ優先順位リストとを備え、前記
マスターノードは前記各管理グループの優先順位に従っ
て各管理グループの一つに前記ノード情報の取得要求を
実行すると共に、前記管理グループ内のスレーブノード
からノード取得要求に応対してその取得要求したスレー
ブノードの属する管理グループ優先順位リストのノード
情報を送信し、前記スレーブノードは少なくとも前記管
理グループ内の他のスレーブノードと相互にノード情報
の取得要求を実行して前記管理グループ優先順位リスト
を更新することを特徴とする。The present invention also provides an isolation determination system in which a plurality of management computers (master nodes) and a plurality of managed computers (slave nodes) managed by the master nodes are connected via a network. The node holds a master management group priority list for storing node information by executing a request for acquiring node information mutually with other master nodes, and the master node sets several slave nodes as one set. A management group priority list provided for each of the plurality of management groups, and a transmission / reception unit for transmitting / receiving node information to / from the network, wherein the slave nodes in the management group are other than the master node and the slave nodes in the management group. Send and receive node information with other slave nodes via the network The transmitting and receiving unit, comprising the management group priority list in the management group, the master node executes the node information acquisition request to one of each management group according to the priority of each management group, In response to a node acquisition request from a slave node in the management group, transmits node information of a management group priority list to which the slave node that has requested the acquisition belongs, and the slave node communicates with at least another slave node in the management group. The management group priority order list is updated by mutually executing a node information acquisition request.

【００１７】また、本発明は、管理コンピュータ（マス
ターノード）と、複数台の被管理コンピュータ（スレー
ブノード）がネットワークで接続されて各ノードが孤立
しているか否かを判定する孤立判定システムの管理方法
を動作プログラムとし、該動作プログラムをコンピュー
タが読み取り可能に格納した記録媒体において、前記孤
立判定システムの管理方法は、前記スレーブノード数台
を１組とした複数の管理グループで構成し、前記管理グ
ループ内の前記スレーブノード同士がお互いにノード情
報の交換を行い、それぞれのスレーブノードが前記管理
グループ内のノード情報を保持して、前記マスターノー
ドが各管理グループ内の１台の優先順位の高いスレーブ
ノードに対してノード情報の交換を行うことで、前記マ
スターノードが管理する全スレーブノードのノード情報
を管理することを前記動作プログラムとして、該動作プ
ログラムをコンピュータが読み取り可能に格納したこと
を特徴とする。According to the present invention, a management computer (master node) and a plurality of managed computers (slave nodes) are connected via a network to manage an isolation determination system for determining whether each node is isolated. In a recording medium in which the method is an operation program and the operation program is stored in a computer-readable manner, the management method of the isolation determination system comprises a plurality of management groups each including a set of the slave nodes. The slave nodes in the group exchange node information with each other, each slave node holds the node information in the management group, and the master node has one of the higher priority in each management group. By exchanging node information with slave nodes, the master node can manage the nodes. To as the operation program to manage the node information of all the slave nodes, characterized in that the said operating program computer has stored readably.

【００１８】また、本発明は、管理コンピュータ（マス
ターノード）と、複数台の被管理コンピュータ（スレー
ブノード）がネットワークで接続されているシステムに
おいて、スレーブノード数台を１組とした複数の管理グ
ループで構成し、管理グループ内のスレーブノード同士
がお互いにノード情報の交換を行い、それぞれのスレー
ブノードがグループ内のノード情報を保持して、マスタ
ーノードが各グループ内の１台のスレーブノードに対し
てノード情報の交換を行うことで、全スレーブノードの
ノード情報を管理できる方式、および管理グループ内の
スレーブノードが、自ら孤立状態を判断し、システムを
停止する機能を提供するものである。Further, according to the present invention, in a system in which a management computer (master node) and a plurality of managed computers (slave nodes) are connected via a network, a plurality of management groups each including several slave nodes are set. The slave nodes in the management group exchange node information with each other, each slave node holds the node information in the group, and the master node sends one node to each slave node in each group. In this method, the node information of all slave nodes can be managed by exchanging the node information, and a function is provided in which the slave nodes in the management group determine an isolated state by themselves and stop the system.

【００１９】従って、各スレーブノードに障害が発生し
た場合には孤立的に傷害を回復すれば、他のスレーブノ
ードに影響を与えることはなく、マスターノードに障害
が発生した場合には、傷害検出用情報の来ない各スレー
ブノードは孤立的に動作実行するので、マスターノード
の傷害が回復されるまで、他のスレーブノードは孤立的
に処理実行でき、傷害発生ノードによる影響を極小にす
ることができる。Therefore, if a fault occurs in each slave node, if the fault is recovered in isolation, the other slave nodes are not affected, and if a fault occurs in the master node, fault detection is performed. Since each slave node that does not receive the operation information operates in isolation, the other slave nodes can execute processing in isolation until the failure of the master node is recovered, minimizing the effect of the node where the failure occurred. it can.

【００２０】また、本発明は、図１を参照しつつ説明す
れば、管理される各スレーブノード１１〜１９は、３台
で１組の管理グループ１１１〜１１３を構成している。
各スレーブノード１１〜１９は、通信機器１２０を経由
して、管理グループ内のスレーブノード同士で、ノード
情報の交換を行い、管理グループ内のノード情報を自ノ
ードの主記憶装置２１および補助記憶装置に保持する。Further, according to the present invention, referring to FIG. 1, each of the managed slave nodes 11 to 19 constitutes one set of management groups 111 to 113.
Each of the slave nodes 11 to 19 exchanges node information between the slave nodes in the management group via the communication device 120, and stores the node information in the management group in the main storage device 21 and the auxiliary storage device of the own node. To hold.

【００２１】マスターノード１は、各スレーブノードの
状態を管理するサーバとして、通信機器１２０を経由し
て、各管理グループ１１１〜１１３内の優先度が高いス
レーブノード１１とノード情報の交換を定期的に行う。
マスターノード１は、各管理グループ１１１〜１１３内
のノード情報を自ノードの主記憶装置１０１又は補助記
憶装置に保持することによって、管理している全スレー
ブノード１１〜１９のノード情報を管理することができ
る。また、各スレーブノード１１〜１９は、何らかの障
害で、グループ内の他スレーブノードとノード情報の交
換が行えない場合、システム内で孤立していると判断
し、マスターノード１に対してノード情報の交換を行う
ことにより、自ノードの障害か、他ノードの障害かをノ
ード自身が判断することができる。同様に、マスターノ
ード１は、自身が当該管理システムの他のスレーブノー
ドから孤立しているか否かを、通信機器１２０を介する
ノード情報を検出する際の応答によって判断できる。The master node 1 periodically exchanges node information with the high-priority slave nodes 11 in the management groups 111 to 113 via the communication device 120 as a server for managing the state of each slave node. To do.
The master node 1 manages the node information of all the managed slave nodes 11 to 19 by holding the node information in each of the management groups 111 to 113 in the main storage device 101 or the auxiliary storage device of the own node. Can be. If the slave nodes 11 to 19 cannot exchange node information with other slave nodes in the group due to some kind of failure, the slave nodes 11 to 19 determine that they are isolated in the system and notify the master node 1 of the node information. By performing the exchange, the node itself can determine whether it is a failure of its own node or a failure of another node. Similarly, the master node 1 can determine whether or not the master node 1 is isolated from other slave nodes of the management system based on a response when detecting node information via the communication device 120.

【００２２】このようにして、スレーブノード自身及び
マスタスレーブ自身が、自ノードの障害か、他ノードの
障害かを判断することができると共に、マスターノード
が全スレーブノードに対してノード情報の交換を行わな
いため、マスターノードが全スレーブノードに対してノ
ード情報の交換を行う従来の方式に比べ、ノード情報の
交換の信頼性をあげるとともに、マスターノードの負荷
を軽減させることを可能にする。In this way, the slave node itself and the master slave itself can determine whether the node itself has failed or another node has failed, and the master node exchanges node information with all slave nodes. Since this is not performed, the reliability of the exchange of the node information can be improved and the load on the master node can be reduced as compared with the conventional method in which the master node exchanges the node information with all the slave nodes.

【００２３】[0023]

【発明の実施の形態】本発明の実施形態について、図面
を参照しつつ詳細に説明する。Embodiments of the present invention will be described in detail with reference to the drawings.

【００２４】［第１の実施形態］（１）構成の説明図１を参照すると、本発明の第１の実施形態は、複数の
コンピュータで構築されたトランザクションシステムに
おいて、マスターノードは管理用、スレーブノードはト
ランザクションを実行するコンピュータと想定して、グ
ルーピングによるノード情報の交換方式は、マスターノ
ード１、スレーブノード１１〜１９、通信機器１２０か
ら構成されている。First Embodiment (1) Description of Configuration Referring to FIG. 1, in a first embodiment of the present invention, in a transaction system constructed by a plurality of computers, a master node is used for management and a slave is used. Assuming that the nodes are computers that execute transactions, the node information exchange method by grouping includes a master node 1, slave nodes 11 to 19, and a communication device 120.

【００２５】被管理コンピュータ１００として、スレー
ブノード１１〜１９がそれぞれ被管理コンピュータとし
て、それぞれ所謂パソコンと同様な動作で、経理業務処
理や、ワードプロセッサや、ＣＡＤ，設計業務等の分散
コンピュータ処理や、マスターノードや他のスレーブノ
ード等と通信接続して、データの採受を行うことができ
る。As the managed computer 100, the slave nodes 11 to 19 each operate as a managed computer in the same manner as a so-called personal computer, and perform accounting work processing, word processing, distributed computer processing such as CAD and design work, and master processing. It can communicate with a node, another slave node, or the like to receive data.

【００２６】また、マスターノード１は、各管理グルー
プの優先順位リストに基づいて、被管理コンピュータを
管理する業務と共に、不図示のデータベース等を備え、
例えばスレーブノードからの要求により種々のデータを
送信したり、スレーブノードの有するデータを受信して
データベースに格納したりなどの業務を実行することが
できる。The master node 1 is provided with a database and the like (not shown) together with a task for managing the managed computer based on the priority list of each management group.
For example, tasks such as transmitting various data in response to a request from the slave node, receiving data of the slave node and storing the data in a database can be performed.

【００２７】ここで、スレーブノード１１〜１３を、相
互にノード情報の取得要求を行いノード情報を共有する
管理グループ１１１、スレーブノード１４〜１６を相互
にノード情報の取得要求を行いノード情報を共有する管
理グループ１１２、スレーブノード１７〜１９を相互に
ノード情報の取得要求を行いノード情報を共有する管理
グループ１１３として形成する。それぞれのスレーブノ
ード１１〜１９は、自ノードが属する管理グループの優
先順位リストを持つ。Here, the slave nodes 11 to 13 mutually request the node information and share the node information. The management group 111 shares the node information. The slave nodes 14 to 16 mutually request the node information and share the node information. The management group 112 and the slave nodes 17 to 19 form a management group 113 that requests acquisition of node information mutually and shares the node information. Each of the slave nodes 11 to 19 has a priority list of the management group to which the own node belongs.

【００２８】マスターノード１は、管理グループ１１
１，１１２，１１３の各グループ内の優先順位リストを
持つ。各管理グループ優先順位リスト１０１〜１０３
は、各管理グループ内での問い合わせる順序の優先度の
高い順にソートされたノード名のリストであり、各スレ
ーブノード名と、通信可能か否かで傷害の有無を判断で
きる各ノードについてのＯＫ，ＮＧの状態と、その状態
の更新時刻とを保存する領域１２１〜１２３を持つ。The master node 1 has a management group 11
1, 112, and 113 in each group. Each management group priority list 101-103
Is a list of node names sorted in descending order of priority in the order of inquiries in each management group. Each slave node name and OK, OK, There are areas 121 to 123 for storing the status of NG and the update time of the status.

【００２９】マスターノード１は、管理コンピュータ１
００として、システム内の各グループの優先順位の高い
ノードにノード情報を問い合わせるノード問合せ機能
と、グループのノード情報を取得できない場合に、次の
優先度を持つノードへ問い合わせを切り替える切替機能
と、他ノードからのノード情報の問い合わせに対して返
答する返答機能を有している。The master node 1 is a management computer 1
00, a node inquiring function for inquiring node information of a higher priority node of each group in the system, a switching function of switching an inquiry to a node having the next priority when node information of the group cannot be acquired, It has a reply function for replying to an inquiry about node information from a node.

【００３０】管理グループ１〜３内の各スレーブノード
は、各グループ内の他のスレーブノードへノード情報を
問い合わせる情報問合せ機能と、他のスレーブノードか
らの問い合わせに対して自ノード情報と、保有している
他スレーブノードのノード情報を合わせて返答する返答
機能と、受け取ったノード情報を保存する場合に保有し
ているノード情報と比較して新しいノード情報であれば
更新する更新機能と、自ノードが属する管理グループの
他のスレーブノードへの問い合わせに返答がない場合に
マスターノード１に対して問い合わせを行うマスター問
合せ機能と、自ノードが知っているマスターノード、ス
レーブノードと通信が行えない場合に、孤立したと判断
して自システムを停止する停止機能を有している。Each slave node in each of the management groups 1 to 3 has an information inquiry function for inquiring other slave nodes in each group for node information, and own node information in response to an inquiry from another slave node. A reply function that replies together with the node information of the other slave nodes that are present, an update function that updates the received node information if it is newer node information compared to the stored node information, and an own node. A master inquiry function for making an inquiry to the master node 1 when there is no response to an inquiry to another slave node to which the management group to which the master node belongs, and a case where communication with the master node and the slave node known to the own node cannot be performed. Has a stop function of stopping the system by determining that it is isolated.

【００３１】（２）動作の説明次に、図１及び図２、図３を参照して、本実施形態の全
体の動作について詳細に説明する。まず図２において、
管理コンピュータ（マスターノード１）のノード情報収
集動作を説明する。(2) Description of Operation Next, the overall operation of the present embodiment will be described in detail with reference to FIG. 1, FIG. 2, and FIG. First, in FIG.
The node information collecting operation of the management computer (master node 1) will be described.

【００３２】マスターノード１は、予め定めた所定管理
時間毎に、情報収集対象となっている各管理グループの
優先順位の高いスレーブノードのノード名を取得する
（Ｓ１）。各管理グループ１１１〜１１３毎に取得した
ノード名に対応するスレーブノードに対して、通信機器
１２０を介してノード情報取得要求を行う（Ｓ２）。要
求に対する応答があった場合は、ノード情報（状態、時
刻）を記憶装置に保存する（Ｓ３，４）。マスターノー
ド１は時間をカウントし、所定時間応答が無い場合、次
の優先順にあたるノード名を取得する（Ｓ１）。所定時
間内に取得できた場合は、そのスレーブノードに対して
ノード情報取得要求を行う（Ｓ２）。管理グループのノ
ード情報が取得できるか、管理グループの優先順位のリ
スト内のスレーブノード全てに対してノード情報取得要
求の動作を行うまで繰り返す。この処理をシステムの全
ての管理グループに対して行う（Ｓ５）。全ての管理グ
ループに対してノード情報を収集しない場合には、ステ
ップＳ２に移行し、全ての管理グループに対してノード
情報を収集した場合には終了する。The master node 1 acquires a node name of a slave node having a higher priority in each management group to be collected for each predetermined management time (S1). A node information acquisition request is made to the slave node corresponding to the node name acquired for each of the management groups 111 to 113 via the communication device 120 (S2). If there is a response to the request, the node information (state, time) is stored in the storage device (S3, 4). The master node 1 counts the time, and if there is no response for a predetermined time, acquires a node name in the next priority order (S1). If the slave node has been acquired within a predetermined time, a node information acquisition request is made to the slave node (S2). This processing is repeated until node information of the management group can be obtained or a node information acquisition request operation is performed for all slave nodes in the priority list of the management group. This process is performed for all management groups of the system (S5). If node information has not been collected for all management groups, the process proceeds to step S2, and if node information has been collected for all management groups, the process ends.

【００３３】次に、図３を参照しつつ、マスターノード
１がノード情報要求を受けた場合の動作を説明する。Next, an operation when the master node 1 receives a node information request will be described with reference to FIG.

【００３４】マスターノード１は、ノード情報要求を受
けたら、それまでに収集した各管理グループのノード情
報から、ノード情報要求により指定されたスレーブノー
ドについてのノード情報を各管理グループ優先度リスト
を検索して読み込む（Ｓ６）。読み込んだノード情報を
要求元のスレーブノードに送信する（Ｓ７）。この場
合、要求元のスレーブノードに対して、他のスレーブノ
ードのノード情報を送信して、その他のスレーブノード
に傷害が発生しているのかどうかを要求元のスレーブノ
ードに知らせることができる。また、セキュリティを考
慮して、同一管理グループ内の他のスレーブノードのノ
ード情報を制限的に送信することもできる。この同一管
理グループ内のスレーブノードについてのノード情報と
いう制限によって、マスターノード１の動作を制限する
ことにもなって、マスターノードの管理業務の削減を図
ることができる。Upon receiving the node information request, the master node 1 searches each management group priority list for the node information of the slave node specified by the node information request from the node information of each management group collected so far. And read (S6). The read node information is transmitted to the requesting slave node (S7). In this case, the node information of the other slave node can be transmitted to the requesting slave node to notify the requesting slave node whether or not the other slave node has failed. Also, in consideration of security, it is possible to restrictively transmit node information of other slave nodes in the same management group. The operation of the master node 1 is restricted by the restriction of the node information on the slave nodes in the same management group, so that the management work of the master node can be reduced.

【００３５】次に、図４において、管理グループ１１
１，１１２，１１３内における各スレーブノード１１〜
１９のノード情報収集動作を説明する。Next, referring to FIG.
1, 112, 113 each slave node 11-
The nineteenth node information collection operation will be described.

【００３６】スレーブノード１１〜１９は、自己の有す
る管理グループ優先順位リストから自管理グループ内の
他のノード名を取得する（Ｓ８）。例えば、管理グルー
プ１１１においては、スレーブノード１１〜１３のリス
トを不図示のディスプレイに表示し、知りたいスレーブ
ノードのノード名をクリックすることでノード名を取得
する。取得したノード名に対応するスレーブノードに対
してノード情報取得要求を送る（Ｓ９）。該当スレーブ
ノードからの応答を所定時間内に受信したか否かを判断
し（Ｓ１０）、ノード情報が取得できない場合（Ｓ１
１）、該当ノード情報の状態を異常として管理グループ
優先順位リストの状態の項目に、例えば「ＮＧ」とその
ときの時刻を保存し、次のノード情報取得に移る（Ｓ
８）。ノード情報を取得できた場合、該当ノード情報の
状態を正常の「ＯＫ」として保存し、同時に送られてく
る該当ノード以外のノード情報と、保有している対応す
るノード情報を比較し、比較結果から記憶装置内の情報
より更新時刻が新しいノード情報を更新し、次のノード
情報取得に移る（Ｓ１２）。Each of the slave nodes 11 to 19 acquires another node name in its own management group from its own management group priority list (S8). For example, in the management group 111, a list of the slave nodes 11 to 13 is displayed on a display (not shown), and the node name is acquired by clicking the node name of the slave node that the user wants to know. A node information acquisition request is sent to the slave node corresponding to the acquired node name (S9). It is determined whether a response from the slave node has been received within a predetermined time (S10), and if node information cannot be obtained (S1).
1) Assuming that the status of the corresponding node information is abnormal, for example, “NG” and the time at that time are stored in the status item of the management group priority list, and the process proceeds to the next node information acquisition (S).
8). If the node information can be obtained, the state of the corresponding node information is stored as normal “OK”, and the node information other than the corresponding node transmitted at the same time is compared with the corresponding corresponding node information, and the comparison result is obtained. , The node information whose update time is newer than the information in the storage device is updated, and the process proceeds to the next node information acquisition (S12).

【００３７】管理グループ内のノード情報が全て収集で
きていたら、自グループ内の管理グループ優先順位リス
トから選択するスレーブノードはないので、スレーブノ
ードを取得することなく、ノード情報がすべて異常か否
かを判断し（Ｓ１３）、異常でない場合にはノード情報
収集動作を終了する（ＳＴＯＰ）。管理グループ内のノ
ード情報がすべて異常になっていた場合、マスターノー
ド１へ通信機器１２０を介して問い合わせる（Ｓ１３，
１４）。ここで、異常か否かの判断は、ノード情報の取
得情報を要求したのに所定時間応答がない場合や、自己
の取得要求が宛先のスレーブノードに送信されなかった
場合、通信機器に傷害が発生している場合、いずれの場
合も応答がないので異常と判断する。If all the node information in the management group has been collected, there is no slave node to be selected from the management group priority list in the own group. Is determined (S13), and if not abnormal, the node information collecting operation is terminated (STOP). If all the node information in the management group is abnormal, an inquiry is made to the master node 1 via the communication device 120 (S13,
14). Here, the determination as to whether or not there is an abnormality is made when there is no response for a predetermined time after requesting the acquisition information of the node information, or when the own acquisition request is not transmitted to the destination slave node, the communication device is damaged. If it has occurred, there is no response in any case, so it is determined to be abnormal.

【００３８】ステップＳ１３で、すべて異常であると判
断した場合には、マスターノード１の応答からノード情
報を収集できた場合は処理を終了する（Ｓ１５）。マス
ターノード１から所定時間内に応答がない場合、自ノー
ドがネットワークから孤立していると判断し、自スレー
ブノードのシステムを停止させる（Ｓ１６）。If it is determined in step S13 that all the information is abnormal, the process ends if node information can be collected from the response from the master node 1 (S15). When there is no response from the master node 1 within a predetermined time, it is determined that the own node is isolated from the network, and the system of the own slave node is stopped (S16).

【００３９】次に図５において、スレーブノードが、ノ
ード情報取得要求を受けた場合の動作を説明する。この
動作をヘルスチェックと称してもよい。このヘルスチェ
ックは、定期的にチェックするのを原則として、１日１
回でもよいし、１０分に１回でもよく、また、任意的
に、管理グループの業務処理に必要なグループ内の他の
スレーブノードのノード情報を知る必要のある頻度数に
応じて、ヘルスチェックを行う。Next, an operation when the slave node receives a node information acquisition request will be described with reference to FIG. This operation may be called a health check. This health check should be checked regularly, in principle, once a day.
Health check may be performed once every 10 minutes, or optionally, according to the frequency of needing to know node information of other slave nodes in the group necessary for business processing of the management group. I do.

【００４０】スレーブノード１〜９は、他のスレーブノ
ード或いはマスターノード１からノード情報取得要求を
受けた場合、自ノード情報を読み込み（Ｓ１７）、それ
までに収集して保有している管理グループ内の他のスレ
ーブノードのノード情報を自スレーブノードに保管して
いる管理グループ優先順位リストから読み込み（Ｓ１
８）、要求元に自ノード情報と、保有している自己の管
理グループ内の他ノード情報の両方をノード情報として
送信する（Ｓ１９）。When receiving a node information acquisition request from another slave node or the master node 1, the slave nodes 1 to 9 read their own node information (S17), and collects and retains the own node information up to that point in the management group. Is read from the management group priority list stored in the own slave node (S1).
8) Transmit both the own node information and the other node information in the own management group held as request information to the request source (S19).

【００４１】［第２の実施形態］次に、本発明の第２の
実施形態について、図面を参照して詳細に説明する。図
６を参照すると、マスターノード２０２、２０３、２０
４に管理されたシステムが相互にネットワーク２０１に
接続されている。マスターノード２０２、２０３、２０
４それぞれに上述の第１の実施形態の管理グループと同
様なスレーブノードと同様な動作を行わせることによ
り、システム間で相互に監視することが可能となる。[Second Embodiment] Next, a second embodiment of the present invention will be described in detail with reference to the drawings. Referring to FIG. 6, the master nodes 202, 203, 20
4 are mutually connected to the network 201. Master nodes 202, 203, 20
By making each of the four perform the same operation as the slave node similar to the management group of the above-described first embodiment, it is possible to mutually monitor the systems.

【００４２】図６において、マスターノード２０２は管
理グループ２１１と、管理グループ２１２と、管理グル
ープ２１３とをマスターとして通信機器の伝送路を介し
て管理する。また、マスターノード２０３は管理グルー
プ２１４と、管理グループ２１５とをマスターとして管
理する。マスターノード２０４は管理グループ２１６
と、管理グループ２１７とをマスターとして通信機器の
伝送路を介して管理する。マスターノードは、それぞ
れ、スレーブノードの管理業務を専用に行う管理サーバ
であってもよいし、スレーブノードの管理業務をマルチ
タスクの一部の業務として実行してもよい。In FIG. 6, a master node 202 manages a management group 211, a management group 212, and a management group 213 as masters via transmission paths of communication devices. The master node 203 manages the management group 214 and the management group 215 as masters. The master node 204 is a management group 216
And the management group 217 as a master through the transmission path of the communication device. Each of the master nodes may be a management server that exclusively performs the management task of the slave node, or may execute the management task of the slave node as a part of the multitask.

【００４３】各マスターノード２０２、２０３、２０４
は、このマスターノードグループを１つのグループシス
テムとして、相互のマスター管理グループ優先順位リス
トを保有し、優先順位をランダムに付け、或いはシステ
ムの大きさに応じた優先順位を付け、或いは複数のスレ
ーブノードを管理グループとして管理グループの数の多
さで優先順位を付け、或いは管理グループではなく管理
している全スレーブノードの数の多い順に優先順位を付
けて、各優先順位に従ったグループシステム優先順位リ
ストを相互に保管し、所定時間毎にヘルスチェックを他
のマスターノード間で実行する。Each master node 202, 203, 204
Uses this master node group as one group system, holds a mutual master management group priority list, assigns priorities randomly, or assigns priorities according to the size of the system, or a plurality of slave nodes. Is assigned as a management group, and priorities are assigned according to the number of management groups, or priorities are assigned in descending order of the number of all managed slave nodes instead of the management groups, and group system priorities according to the respective priorities The lists are mutually stored, and a health check is performed between other master nodes at predetermined time intervals.

【００４４】［第３の実施形態］さらに、図７を参照す
ると、本第３の実施形態はグルーピングによるノード情
報の交換および孤立判定システムの動作プログラムを記
憶した記憶媒体１３０を備える。この記憶媒体１３０は
マスターノード１のドライバ１３１に装着して、その動
作プログラムを書き込まれ、或いは読み出される。ま
た、この記憶媒体１３０は、ハードディスクやフロッピ
ーディスク等の磁気ディスク、フラッシュメモリを有す
るＩＣカードやスティックメモリ等の半導体メモリ、そ
の他のＣＤ−Ｒ，ＣＤ−ＲＷ、ＤＶＤ−ＲＡＭ等の記憶
媒体であってよい。[Third Embodiment] Still referring to FIG. 7, the third embodiment includes a storage medium 130 storing an operation program of a system for exchanging node information by grouping and an isolation determination. The storage medium 130 is mounted on the driver 131 of the master node 1 and the operation program is written or read. The storage medium 130 is a magnetic disk such as a hard disk or a floppy disk, a semiconductor memory such as an IC card or a stick memory having a flash memory, and other storage media such as a CD-R, a CD-RW, and a DVD-RAM. May be.

【００４５】この記憶媒体からのインストールは、マス
ターノードで行い、各スレーブノードにインストールす
る旨指示する。また、個別にスレーブノードが管理シス
テム中で孤立したか否かを判断するために、通信機器１
２０を介して、又は自己の記録媒体のドライブに挿入し
てパッケージとした記憶媒体から自己の記憶手段にイン
ストールしても良い。The installation from the storage medium is performed by the master node, and an instruction is given to each slave node to perform the installation. Further, in order to individually determine whether or not the slave node is isolated in the management system, the communication device 1
The storage medium may be installed in its own storage means through the storage medium 20 or inserted into a drive of its own recording medium and packaged.

【００４６】グルーピングによるノード情報の交換およ
び孤立判定システムの動作プログラムは記憶媒体１３０
からマスターノード１と、各コンピュータ（中央処理装
置；プロセッサ；データ処理装置）のスレーブノード１
１〜１９に読み込まれ、各スレーブノードのグルーピン
グによるノード情報の交換と、孤立判定を制御する。各
スレーブノードはこの読み込まれたグルーピングによる
ノード情報の交換、および孤立判定システムの動作プロ
グラムによって、第１の実施形態で示した処理と同一の
処理を行う。The operation program of the system for exchanging node information and isolating by grouping is stored in the storage medium 130.
To a master node 1 and a slave node 1 of each computer (central processing unit; processor; data processing unit)
1 to 19, control of exchange of node information by grouping of slave nodes and isolation determination. Each slave node performs the same processing as the processing shown in the first embodiment by exchanging the node information by the read grouping and by the operation program of the isolation determination system.

【００４７】この動作プログラムは、孤立判定システム
全体の動作プログラムと、マスターノード１の動作プロ
グラムと、各スレーブノード１１〜１９の動作プログラ
ムをそれぞれ個別に設けてもよく、好ましくは、マスタ
ーノード１の管理の下で管理され、更新する場合等には
マスターノード１からの指示に従って、各スレーブノー
ド毎に更新することが、システム管理上統一がとれる意
味で好ましい。In this operation program, an operation program for the entire isolation determination system, an operation program for the master node 1, and an operation program for each of the slave nodes 11 to 19 may be individually provided. In the case where the data is managed and updated under management, it is preferable to update the data for each slave node in accordance with an instruction from the master node 1 in terms of system management unification.

【００４８】また、動作プログラムは、第１の実施形態
で説明したように、各管理グループ毎に優先順位を設定
して、マスターノードが管理グループの中のいずれか１
つにノード情報を応答を要求し、各管理グループ内のス
レーブノードは同一管理グループ内の他のスレーブノー
ドにノード情報を要求して応答を待つ等という動作を実
行するコンピュータ用プログラムとしている。As described in the first embodiment, the operation program sets a priority order for each management group, and sets the master node to one of the management groups.
First, a slave computer in each management group requests node information from a response, and the slave node in each management group requests node information from another slave node in the same management group and waits for a response.

【００４９】なお、上記各実施形態において、マスター
ノードの有する管理グループ優先順位リスト１０１と、
スレーブノード１１〜１３の管理グループ優先順位リス
トとは同一内容を有しているので、いずれかのノードに
傷害が発生した場合には、その障害発生のノードが回復
した場合には、マスターノードの有する同一管理グルー
プの、或いは当該管理グループの他のいずれかのスレー
ブノードの管理グループ優先順位リストをコピーすれ
ば、同一管理グループ優先順位リストを得られるので、
当該孤立判定システムによって、回復処理が極めて容易
になる。In each of the above embodiments, the management group priority order list 101 of the master node,
Since the management group priority list of the slave nodes 11 to 13 has the same contents, if any node fails, if the failed node recovers, the master node By copying the management group priority list of the same management group or the management group priority list of any other slave node of the management group, the same management group priority list can be obtained.
The isolation determination system makes the recovery process extremely easy.

【００５０】[0050]

【発明の効果】本発明によれば、グループ内のスレーブ
ノード同士がヘルスチェックを行い、グループ内のスレ
ーブノード同士がヘルスチェックができない場合は、マ
スターノードに対してヘルスチェックを行う。２系統で
ヘルスチェックを行うために信頼性が増し、マスターノ
ードは各グループの１台としかヘルスチェックを行わな
いため、全てのノードに対してヘルスチェックを行う方
式に比べ、負荷を軽減できるので、ヘルスチェックの信
頼性を向上させ、マスターノードの負荷を軽減できる。According to the present invention, the slave nodes in the group perform a health check, and if the slave nodes in the group cannot perform a health check, the health check is performed on the master node. The reliability is increased because the health check is performed by two systems, and the master node performs a health check with only one of each group. Therefore, the load can be reduced compared to the method of performing a health check on all nodes. Thus, the reliability of the health check can be improved and the load on the master node can be reduced.

【００５１】また、グループ内の他ノードからのヘルス
チェック応答がない場合、応答のないスレーブノード以
外からの情報に、応答のないスレーブノードのノード情
報があり、その情報の時刻を参照することにより、マス
ターノードに問い合わせる前に判定が可能であり、また
最終的にマスターノードに問い合わせることによって、
自ノードで障害が発生しているか、グループ内の他ノー
ドで障害が発生しているかの原因の切り分けができるの
で、自ノードで障害が発生しているか、グループ内の他
ノードで障害が発生しているかの切り分けができる。If there is no health check response from another node in the group, the information from the non-responding slave nodes includes the node information of the non-responding slave node, and the time of the information is referred to. Can be determined before querying the master node, and finally by querying the master node,
It is possible to determine the cause of a failure in its own node or a failure in another node in the group.Therefore, if a failure has occurred in its own node or another node in the group has failed. Can be separated.

【００５２】また、マスターノードがノード状態を問い
合わせるノードに障害が発生した場合、そのノードが属
するグループ内のほかのノードもグループ内のノード情
報を保有している。そのため、マスターノードは、その
管理グループの他ノードに対してノード情報要求を行う
ことによって、全ノードの管理を支障なく、行うことが
できるので、マスターノードが情報を問い合わせるノー
ドに障害が発生しても、全ノードの状態を管理できるこ
とになる。When a failure occurs in a node for which the master node inquires about a node state, other nodes in the group to which the node belongs also have node information in the group. Therefore, the master node can perform management of all nodes without any problem by making a node information request to other nodes in the management group. Can also manage the state of all nodes.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明による孤立判定システムの構成ブロック
図である。FIG. 1 is a configuration block diagram of an isolation determination system according to the present invention.

【図２】本発明による孤立判定システムのマスターノー
ドの動作フローチャートである。FIG. 2 is an operation flowchart of a master node of the isolation determination system according to the present invention.

【図３】本発明による孤立判定システムのマスターノー
ドの動作フローチャートである。FIG. 3 is an operation flowchart of a master node of the isolation determination system according to the present invention.

【図４】本発明による孤立判定システムのスレーブノー
ドの動作フローチャートである。FIG. 4 is an operation flowchart of a slave node of the isolation determination system according to the present invention.

【図５】本発明による孤立判定システムのスレーブノー
ドの動作フローチャートである。FIG. 5 is an operation flowchart of a slave node of the isolation determination system according to the present invention.

【図６】本発明による孤立判定システムのネットワーク
の構成ブロック図である。FIG. 6 is a configuration block diagram of a network of an isolation determination system according to the present invention.

【図７】本発明による孤立判定システムの構成ブロック
図である。FIG. 7 is a configuration block diagram of an isolation determination system according to the present invention.

【符号の説明】[Explanation of symbols]

１マスターノード１１〜１９スレーブノード２１管理グループ優先順位リスト１００管理コンピュータ１０１〜１０３管理グループ優先順位リスト１１１〜１１３管理グループ１２０通信機器１２１〜１２３管理グループ優先順位リストの内容１３０記憶媒体 DESCRIPTION OF SYMBOLS 1 Master node 11-19 Slave node 21 Management group priority list 100 Management computer 101-103 Management group priority list 111-113 Management group 120 Communication equipment 121-123 Contents of management group priority list 130 Storage medium

Claims

【特許請求の範囲】[Claims]

【請求項１】管理コンピュータ（マスターノード）
と、複数台の被管理コンピュータ（スレーブノード）が
ネットワークで接続されて前記各ノードが孤立状態を判
定する孤立判定システムの管理方法において、前記スレーブノード数台を１組とした複数の管理グルー
プで構成し、前記管理グループ内の前記スレーブノード
同士がお互いにノード情報の交換を行い、それぞれのス
レーブノードが前記管理グループ内のノード情報を保持
して、前記マスターノードが各管理グループ内の１台の
優先順位の高いスレーブノードに対してノード情報の交
換を行うことにより、前記マスターノードが管理する全
スレーブノードのノード情報を管理することを特徴とす
る孤立判定システムの管理方法。1. Management computer (master node)
A plurality of managed computers (slave nodes) are connected via a network, and each of the nodes determines an isolated state. And the slave nodes in the management group exchange node information with each other, each slave node holds node information in the management group, and the master node is a node in each management group. The node information of all the slave nodes managed by the master node is exchanged by exchanging node information with the slave node having a higher priority order.

【請求項２】請求項１に記載の孤立判定システムの管
理方法において、前記各スレーブノードは、自ら同一管
理グループ内の他のスレーブノードから孤立状態である
ことを判断し、前記孤立状態の自スレーブノードの回復
のため前記自スレーブノードの動作機能を停止する機能
を有することを特徴とする孤立判定システムの管理方
法。2. The management method according to claim 1, wherein each of the slave nodes determines that the slave node is isolated from other slave nodes in the same management group, and determines whether the slave node is in the isolated state. A management method for an isolation determination system, comprising a function of stopping an operation function of the slave node for recovery of the slave node.

【請求項３】請求項１に記載の孤立判定システムの管
理方法において、前記マスターノードは前記各管理グル
ープ毎に管理グループ優先順位リストを有し、所定時間
毎に前記管理グループ優先順位リストの最優先順位のス
レーブノードにヘルスチェックを行うことを特徴とする
孤立判定システムの管理方法。3. The management method for an isolation determination system according to claim 1, wherein the master node has a management group priority list for each of the management groups, and the master node has a management group priority list at predetermined time intervals. A management method for an isolation determination system, wherein a health check is performed on a slave node having a priority.

【請求項４】請求項１に記載の孤立判定システムの管
理方法において、前記スレーブノードは、自己の属する
前記管理グループ内の管理グループ優先順位リストを有
し、所定時間毎に当該管理グループ優先順位リストから
他のスレーブノードにノード情報取得要求を送出し、応
答がない場合にはその他のスレーブノードにそのノード
情報取得要求を送出し、応答に応じて自己の管理グルー
プ優先順位リストを更新することを特徴とする孤立判定
システムの管理方法。4. The management method according to claim 1, wherein the slave node has a management group priority list in the management group to which the slave node belongs, and the slave node has a management group priority list at predetermined time intervals. Send a node information acquisition request to other slave nodes from the list, and if there is no response, send the node information acquisition request to other slave nodes, and update their own management group priority list according to the response. A management method for an isolation determination system characterized by the following.

【請求項５】管理コンピュータ（マスターノード）
と、複数台の被管理コンピュータ（スレーブノード）が
ネットワークで接続されている孤立判定システムにおい
て、前記マスターノードは、前記スレーブノードの数台を１
組とした複数の管理グループ毎に設けた管理グループ優
先順位リストと、前記ネットワークにノード情報を送受
信する送受信部とを有し、前記管理グループ内の前記スレーブノードは前記マスタ
ーノードと前記管理グループ内の他のスレーブノードと
ノード情報を前記ネットワークを介して送受信する送受
信部と、前記管理グループ内の前記管理グループ優先順
位リストとを備え、前記マスターノードは前記各管理グループの優先順位に
従って各管理グループの一つに前記ノード情報の取得要
求を実行すると共に、前記管理グループ内のスレーブノ
ードからノード取得要求に応対してその取得要求したス
レーブノードの属する管理グループ優先順位リストのノ
ード情報を送信し、前記スレーブノードは少なくとも前
記管理グループ内の他のスレーブノードと相互にノード
情報の取得要求を実行して前記管理グループ優先順位リ
ストを更新することを特徴とする孤立判定システム。5. A management computer (master node)
And an isolation determination system in which a plurality of managed computers (slave nodes) are connected via a network.
A management group priority list provided for each of the plurality of management groups, and a transmission / reception unit for transmitting / receiving node information to / from the network, wherein the slave nodes in the management group are in the master node and in the management group. A transmission / reception unit that transmits / receives node information to / from another slave node via the network, and the management group priority list in the management group, wherein the master node manages each management group according to the priority of each management group. Executing the acquisition request of the node information to one of the above, in response to the node acquisition request from the slave nodes in the management group, transmitting the node information of the management group priority list to which the slave node that has requested the acquisition belongs, The slave node has at least other slave nodes in the management group. Isolation determination system and updates the management group priority list by performing the acquisition request of the slave node and mutually node information.

【請求項６】請求項５に記載の孤立判定システムにお
いて、前記スレーブノードは、前記管理グループ内の他
のスレーブノードに前記ノード情報の取得要求を実行し
て応答がない場合には前記管理グループ優先順位リスト
のその他のスレーブノードにノード状態にＮＧフラグを
立て、更に前記マスターノードに前記ノード情報の取得
要求を実行して応答がない場合には自己機が孤立状態と
なったことを判定することを特徴とする孤立判定システ
ム。6. The isolation determination system according to claim 5, wherein the slave node executes a request for acquiring the node information to another slave node in the management group and does not respond to the management group. An NG flag is set to the node status for the other slave nodes in the priority list, and if the master node issues a request to acquire the node information and there is no response, it is determined that the own device has been isolated. An isolation determination system characterized in that:

【請求項７】複数の管理コンピュータ（マスターノー
ド）と、前記マスターノードがそれぞれ管理する複数台
の被管理コンピュータ（スレーブノード）がネットワー
クで接続されている孤立判定システムにおいて、前記マスターノードは、他のマスターノードと相互にノ
ード情報の取得要求を実行して当該ノード情報を格納す
るマスター管理グループ優先順位リストを保有し、前記マスターノードは、前記スレーブノードの数台を１
組とした複数の管理グループ毎に設けた管理グループ優
先順位リストと、前記ネットワークにノード情報を送受
信する送受信部とを有し、前記管理グループ内の前記スレーブノードは前記マスタ
ーノードと前記管理グループ内の他のスレーブノードと
ノード情報を前記ネットワークを介して送受信する送受
信部と、前記管理グループ内の前記管理グループ優先順
位リストとを備え、前記マスターノードは前記各管理グループの優先順位に
従って各管理グループの一つに前記ノード情報の取得要
求を実行すると共に、前記管理グループ内のスレーブノ
ードからノード取得要求に応対してその取得要求したス
レーブノードの属する管理グループ優先順位リストのノ
ード情報を送信し、前記スレーブノードは少なくとも前
記管理グループ内の他のスレーブノードと相互にノード
情報の取得要求を実行して前記管理グループ優先順位リ
ストを更新することを特徴とする孤立判定システム。7. An isolation determination system in which a plurality of management computers (master nodes) and a plurality of managed computers (slave nodes) respectively managed by the master node are connected via a network. The master node has a master management group priority list for storing node information by executing a node information acquisition request with the master node.
A management group priority list provided for each of the plurality of management groups, and a transmission / reception unit for transmitting / receiving node information to / from the network, wherein the slave nodes in the management group are in the master node and in the management group. A transmission / reception unit that transmits / receives node information to / from another slave node via the network, and the management group priority list in the management group, wherein the master node manages each management group according to the priority of each management group. Executing the acquisition request of the node information to one of the above, in response to the node acquisition request from the slave nodes in the management group, transmitting the node information of the management group priority list to which the slave node that has requested the acquisition belongs, The slave node has at least other slave nodes in the management group. Isolation determination system and updates the management group priority list by performing the acquisition request of the slave node and mutually node information.

【請求項８】管理コンピュータ（マスターノード）
と、複数台の被管理コンピュータ（スレーブノード）が
ネットワークで接続されて各ノードが孤立状態を判定す
る孤立判定システムの管理方法を動作プログラムとし、
該動作プログラムをコンピュータが読み取り可能に格納
した記録媒体において、前記孤立判定システムの管理方法は、前記スレーブノー
ド数台を１組とした複数の管理グループで構成し、前記
管理グループ内の前記スレーブノード同士がお互いにノ
ード情報の交換を行い、それぞれのスレーブノードが前
記管理グループ内のノード情報を保持して、前記マスタ
ーノードが各管理グループ内の１台の優先順位の高いス
レーブノードに対してノード情報の交換を行うことで、
前記マスターノードが管理している全スレーブノードの
ノード情報を管理することを前記動作プログラムとし
て、該動作プログラムをコンピュータが読み取り可能に
格納したことを特徴とする記録媒体。8. A management computer (master node)
A management method of an isolation determination system in which a plurality of managed computers (slave nodes) are connected via a network and each node determines an isolated state;
In a recording medium storing the operation program in a computer-readable manner, the management method of the isolation determination system comprises a plurality of management groups each including a set of the slave nodes, and the slave nodes in the management group. The nodes exchange node information with each other, each slave node holds node information in the management group, and the master node sends a node to one high-priority slave node in each management group. By exchanging information,
A recording medium, wherein the operation program is configured to manage node information of all slave nodes managed by the master node as the operation program, and the operation program is stored in a computer-readable manner.