JP2015118440A

JP2015118440A - Monitoring device, and method for updating failure recovery procedure

Info

Publication number: JP2015118440A
Application number: JP2013260066A
Authority: JP
Inventors: 大歩笠井; Daiho Kasai; 原田　英二; Eiji Harada; 英二原田; 秀樹齊藤; Hideki Saito
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2013-12-17
Filing date: 2013-12-17
Publication date: 2015-06-25

Abstract

PROBLEM TO BE SOLVED: To provide a recovery procedure corresponding to an actual failure recovery operation.SOLUTION: A monitoring device for monitoring a state of an apparatus is configured to: hold workflow definitions which are procedures for recovering from failures having occurred in the apparatus; monitor a failure having occurred in the apparatus; hold information on the failure acquired by the monitoring and a recovery procedure having been executed for recovering from the failure, as history information; and update the workflow definitions on the basis of the held history information.

Description

本発明は、機器を監視する監視装置に関し、特に、障害復旧手順の履歴に基づいて障害復旧のワークフロー定義を作成する方法に関する。 The present invention relates to a monitoring apparatus that monitors devices, and more particularly, to a method for creating a workflow definition for failure recovery based on a history of failure recovery procedures.

機器とその対向装置からなる通信システムなどを構成する機器は、発生した障害の内容を監視装置に通知する。監視装置は機器からの通知に基づいて障害の内容を表示する。保守者は、監視装置が表示した障害内容に基づいて障害を復旧するが、障害復旧の作業は複雑であることが多い。保守者の作業負担を緩和する技術として、過去の障害復旧の履歴情報を用いる技術が特許文献１に開示されている。 A device configuring a communication system including the device and the opposite device notifies the monitoring device of the content of the failure that has occurred. The monitoring device displays the content of the failure based on the notification from the device. The maintenance person recovers the failure based on the failure content displayed by the monitoring device, but the failure recovery work is often complicated. As a technique for reducing the work burden on the maintenance person, Patent Document 1 discloses a technique that uses past failure recovery history information.

すなわち、特許文献１には、過去に発生したネットワーク障害の履歴、およびその復旧方法を蓄積・管理しておき、ネットワーク障害が発生した時には、その蓄積した情報を基に操作者が行うべき復旧方法について指示を出すようにしてネットワークにおける障害を処理する方法が開示されている。 That is, Patent Document 1 stores and manages a history of network failures that have occurred in the past and recovery methods thereof, and when a network failure occurs, a recovery method that an operator should perform based on the stored information. A method for handling a failure in a network is disclosed.

特開平８−４４６４１号公報JP-A-8-44641

従来の方法では、監視対象である機器に発生した障害について、過去に実施した障害復旧の履歴情報から復旧手順を選択して復旧を行うが、実際の復旧においては、復旧手順の途中の状況や条件によって、その後の手順が異なることが多い。このため、復旧手順途中の条件によって分岐を考慮した手順を過去の履歴情報から作成して、実際の障害復旧作業に適した復旧手順を提供することが求められている。 In the conventional method, for the failure that occurred in the monitored device, recovery is performed by selecting the recovery procedure from the history information of fault recovery performed in the past, but in actual recovery, the situation in the middle of the recovery procedure and Subsequent procedures often differ depending on conditions. For this reason, it is required to create a procedure considering branching according to conditions during the recovery procedure from past history information and provide a recovery procedure suitable for actual failure recovery work.

本願において開示される発明の代表的な一例を示せば以下の通りである。すなわち、機器の状態を監視する監視装置であって、前記機器で発生した障害を復旧するための手順である障害復旧のワークフロー定義を保持し、前記機器で発生した障害情報とワークフロー定義の手順で実行した復旧手順を履歴情報として保持し、前記保持した履歴情報に基づいてワークフロー定義を更新する。 A typical example of the invention disclosed in the present application is as follows. That is, a monitoring device that monitors the state of a device, holds a workflow definition for failure recovery that is a procedure for recovering a failure that has occurred in the device, and uses the failure information and workflow definition procedure that has occurred in the device. The executed recovery procedure is stored as history information, and the workflow definition is updated based on the stored history information.

本発明の代表的な実施形態によれば、実際の障害復旧作業に適した復旧手順を提供することができる。前述した以外の課題、構成及び効果は、以下の実施例の説明により明らかにされる。 According to the representative embodiment of the present invention, it is possible to provide a recovery procedure suitable for actual failure recovery work. Problems, configurations, and effects other than those described above will become apparent from the description of the following embodiments.

本発明の実施例の通信システムの構成図である。It is a block diagram of the communication system of the Example of this invention. 本実施例の監視装置の構成を示すブロック図である。It is a block diagram which shows the structure of the monitoring apparatus of a present Example. 本実施例の復旧基本手順リストテーブルの構成を説明する図である。It is a figure explaining the structure of the recovery basic procedure list table of a present Example. 本実施例の復旧基本手順リストテーブルの復旧手順の項番付与則を説明する図である。It is a figure explaining the numbering rule of a restoration procedure of a restoration basic procedure list table of a present Example. 本実施例のコマンドリストテーブル（その１）の構成を説明する図である。It is a figure explaining the structure of the command list table (the 1) of a present Example. 本実施例のコマンドリストテーブル（その２）の構成を説明する図である。It is a figure explaining the structure of the command list table (the 2) of a present Example. 本実施例のコマンドリストテーブル（その３）の構成を説明する図である。It is a figure explaining the structure of the command list table (the 3) of a present Example. 本実施例のコマンドリストテーブル（その４）の構成を説明する図である。It is a figure explaining the structure of the command list table (the 4) of a present Example. 本実施例の構成情報リストテーブルの構成を説明する図である。It is a figure explaining the structure of the structure information list table of a present Example. 本実施例のワークフロー定義テーブルに保持されるワークフロー定義を説明する図である。It is a figure explaining the workflow definition hold | maintained at the workflow definition table of a present Example. 本実施例の復旧基本手順リストテーブルからワークフロー定義を作成する処理のフローチャートである。It is a flowchart of the process which produces a workflow definition from the recovery basic procedure list table of a present Example. 本実施例の障害復旧履歴テーブルの構成を説明する図である。It is a figure explaining the structure of the failure recovery log | history table of a present Example. 本実施例のコネクションエラーの履歴（その１）を説明する図である。It is a figure explaining the log | history (the 1) of the connection error of a present Example. 本実施例のコネクションエラーの履歴（その１）に基づいて更新したワークフロー定義を説明する図である。It is a figure explaining the workflow definition updated based on the connection error log | history (the 1) of a present Example. 本実施例のワークフロー定義の更新処理フローを説明する図である。It is a figure explaining the update process flow of the workflow definition of a present Example. 本実施例のコネクションエラーの履歴（その２）を説明する図である。It is a figure explaining the log | history (the 2) of the connection error of a present Example. 本実施例のコネクションエラーの履歴（その２）に基づいて更新したワークフロー定義を説明する図である。It is a figure explaining the workflow definition updated based on the log | history (the 2) of the connection error of a present Example. 本実施例のコネクションエラーの履歴（その３）を説明する図である。It is a figure explaining the log | history (the 3) of the connection error of a present Example. 本実施例のコネクションエラーの履歴（その３）に基づいて更新したワークフロー定義を説明する図である。It is a figure explaining the workflow definition updated based on the connection error log | history (the 3) of a present Example. 本実施例のコネクションエラーの履歴（その３）に基づいて更新したワークフロー定義を説明する図である。It is a figure explaining the workflow definition updated based on the connection error log | history (the 3) of a present Example. 本実施例のコネクションエラーの履歴（その４）を説明する図である。It is a figure explaining the log | history (the 4) of the connection error of a present Example. 本実施例のコネクションエラーの履歴（その４）に基づいて更新したワークフロー定義を説明する図である。It is a figure explaining the workflow definition updated based on the log | history (the 4) of the connection error of a present Example.

以下、図面を参照して、本発明の実施例について説明する。 Embodiments of the present invention will be described below with reference to the drawings.

図１は、本発明の実施例の通信システムの構成図である。 FIG. 1 is a configuration diagram of a communication system according to an embodiment of the present invention.

本実施例の通信システムは、監視対象となる複数の機器２及び機器２を監視する監視装置１を有し、監視装置１及び機器２は通信網４を介して接続される。また、監視対象となる機器２には対向装置３が接続されており、機器２と対向装置３とが通信している。 The communication system of the present embodiment includes a plurality of devices 2 to be monitored and a monitoring device 1 that monitors the devices 2, and the monitoring device 1 and the devices 2 are connected via a communication network 4. Further, the opposite device 3 is connected to the device 2 to be monitored, and the device 2 and the opposite device 3 communicate with each other.

機器２は、自身の監視機能によって動作状態を監視しており、障害検出時に監視装置１にアラームを通知する。監視装置１は、機器２から受けた通知によって、機器２に障害が発生したことを保守者に通知する。保守者は、障害を復旧するコマンドを機器２に実行することによって、機器２の障害を復旧する。なお、監視装置１は、保守者を支援する支援装置としても機能する。この場合、監視装置１は、機器２を監視しなくても、機器２の障害の情報を収集すればよい。 The device 2 monitors the operating state by its own monitoring function, and notifies the monitoring device 1 of an alarm when a failure is detected. Based on the notification received from the device 2, the monitoring device 1 notifies the maintenance person that a failure has occurred in the device 2. The maintenance person recovers the failure of the device 2 by executing a command for recovering the failure on the device 2. The monitoring device 1 also functions as a support device that supports a maintenance person. In this case, the monitoring device 1 may collect information on the failure of the device 2 without monitoring the device 2.

図２は、本実施例の監視装置の構成を示すブロック図である。 FIG. 2 is a block diagram illustrating the configuration of the monitoring apparatus according to the present embodiment.

監視装置１は、プログラムを実行するプロセッサ３０、プログラムを格納するメモリ３１、記憶装置３２、外部入出力インタフェース１０及び通信インタフェース１９を有する計算機である。 The monitoring device 1 is a computer having a processor 30 that executes a program, a memory 31 that stores a program, a storage device 32, an external input / output interface 10, and a communication interface 19.

プロセッサ（ＣＰＵ）３０は、メモリ３１に格納されたプログラムを実行する。メモリ３１は、ＤＲＡＭ（Dynamic Random Access Memory）のような高速かつ揮発性の記憶素子によって構成され、プロセッサ３０によって実行されるプログラム及びプログラムの実行時に使用されるデータを一時的に格納する。具体的には、プロセッサ３０が、メモリ３１に格納されたプログラムを実行することによって、障害復旧の基本手順格納部１１、ワークフロー定義作成部１２、障害受信部１３、復旧手順表示部１４、復旧手順実行部１５、障害復旧履歴保持部１６、ワークフロー定義更新部１７及びコマンド実行部１８の機能を実現する。 The processor (CPU) 30 executes a program stored in the memory 31. The memory 31 is configured by a high-speed and volatile storage element such as a DRAM (Dynamic Random Access Memory), and temporarily stores a program executed by the processor 30 and data used when the program is executed. More specifically, the processor 30 executes a program stored in the memory 31, so that the failure recovery basic procedure storage unit 11, workflow definition creation unit 12, failure reception unit 13, recovery procedure display unit 14, recovery procedure The functions of the execution unit 15, failure recovery history holding unit 16, workflow definition update unit 17, and command execution unit 18 are realized.

記憶装置３２は、例えば、磁気記憶装置（ＨＤＤ）、フラッシュメモリ（ＳＳＤ）等の大容量かつ不揮発性の記憶装置であり、プロセッサ３０が実行するプログラム及びプログラムの実行時に使用されるデータを格納する。具体的には、記憶装置３２は、復旧基本手順リストテーブル２０、コマンドリストテーブル２１、構成情報リストテーブル２２、ワークフロー定義テーブル２３及び障害復旧履歴テーブル２４を格納する。 The storage device 32 is a large-capacity non-volatile storage device such as a magnetic storage device (HDD) or a flash memory (SSD), and stores a program executed by the processor 30 and data used when the program is executed. . Specifically, the storage device 32 stores a recovery basic procedure list table 20, a command list table 21, a configuration information list table 22, a workflow definition table 23, and a failure recovery history table 24.

外部入出力インタフェース１０は、キーボード及びマウス等の、ユーザから入力を受ける入力装置、及び液晶ディスプレイ装置及びプリンタ等、プログラムの実行結果をユーザが視認可能な態様で表示する出力装置が接続される。具体的には、監視装置１は、外部入出力インタフェース１０を介して、保守者へのアラーム通知を出力したり、保守者からの障害復旧コマンドの入力を受け付ける。 The external input / output interface 10 is connected to an input device that receives input from the user such as a keyboard and a mouse, and an output device that displays the execution result of the program in a manner that the user can visually recognize, such as a liquid crystal display device and a printer. Specifically, the monitoring device 1 outputs an alarm notification to the maintenance person or accepts an input of a failure recovery command from the maintenance person via the external input / output interface 10.

通信インタフェース１９は、所定のプロトコルに従って、他の装置との通信を制御するネットワークインタフェース装置である。 The communication interface 19 is a network interface device that controls communication with other devices according to a predetermined protocol.

プロセッサ３０によって実行されるプログラムは、リムーバブルメディア（ＣＤ−ＲＯＭ、フラッシュメモリなど）又はネットワークを介して監視装置１に提供され、非一時的記憶媒体である不揮発性の記憶装置３２に格納される。このため、監視装置１は、リムーバブルメディアからデータを読み込むインタフェースを有するとよい。 The program executed by the processor 30 is provided to the monitoring device 1 via a removable medium (CD-ROM, flash memory, etc.) or a network, and is stored in a nonvolatile storage device 32 that is a non-temporary storage medium. For this reason, the monitoring device 1 may have an interface for reading data from a removable medium.

本実施例の監視装置１は、物理的に一つの計算機上で、又は、論理的又は物理的に構成された複数の計算機上で構成される計算機システムであり、同一の計算機上で別個のスレッドで動作してもよく、複数の物理的計算機資源上に構築された仮想計算機上で動作してもよい。 The monitoring apparatus 1 of this embodiment is a computer system configured on a single computer or a plurality of computers configured logically or physically, and is a separate thread on the same computer. It may operate on a virtual machine constructed on a plurality of physical computer resources.

障害復旧の基本手順格納部１１は、復旧基本手順リストテーブル２０を管理する。復旧基本手順リストテーブル２０は、図３に示すように、監視対象の機器２で発生しうる全ての障害について、障害名、障害名に対する復旧手順を格納する。復旧基本手順リストテーブル２０は、監視装置１の初期構築時に監視装置１に入力される。 The basic procedure storage unit 11 for fault recovery manages the recovery basic procedure list table 20. As shown in FIG. 3, the recovery basic procedure list table 20 stores a failure name and a recovery procedure for the failure name for all failures that may occur in the monitored device 2. The basic recovery procedure list table 20 is input to the monitoring device 1 when the monitoring device 1 is initially constructed.

復旧手順には、監視装置１で実行するコマンドだけでなく、監視装置１以外で実施する手順（関連部署等に電話の連絡等）も記載される。復旧手順には、実行する順番に、（１）、（２）、…の項番を手順に付する。復旧基本手順リストテーブル２０に保持される障害は、監視装置１の初期構築前、すなわち運用前の設計段階で想定される基本的な障害なので、その復旧手順の項番は（１）、（２）、…の記載を基本とするが、分岐を含む複雑な手順も記載可能としている。 In the recovery procedure, not only a command executed by the monitoring device 1 but also a procedure executed by a device other than the monitoring device 1 (such as calling a related department or the like) is described. In the restoration procedure, item numbers (1), (2),... Are added to the procedure in the order of execution. The failure held in the recovery basic procedure list table 20 is a basic failure assumed in the design stage before the initial construction of the monitoring apparatus 1, that is, before the operation, and therefore the item numbers of the recovery procedures are (1) and (2 ), ... are basically described, but it is also possible to describe complicated procedures including branching.

復旧基本手順リストテーブル２０に分岐を含む手順を記載する場合、規則に従って付与された復旧手順の項番の例を図４に示す。図４に示す様に、項番は（１）、（２）、…という数字の順序で付与し、分岐後の数字は分岐前の数字に１を加算した数字に「−１」、「−２」等のサフィックスを付加する。分岐条件がＹＥＳの場合に「−１」が付与され、分岐条件がＮＯの場合に「−２」が付与される。図４に示す例では（３−１）（３−２）が、この規則に該当する。 When a procedure including a branch is described in the recovery basic procedure list table 20, an example of the item number of the recovery procedure assigned according to the rule is shown in FIG. As shown in FIG. 4, the item numbers are given in the order of numbers (1), (2),..., And the numbers after branching are obtained by adding “-1”, “− A suffix such as “2” is added. When the branch condition is YES, “−1” is given, and when the branch condition is NO, “−2” is given. In the example shown in FIG. 4, (3-1) and (3-2) correspond to this rule.

さらに、この後、ＹＥＳ側の手順が続く場合に、（３−１）の次の項番は（４−１）になり、ＮＯ側の手順が続く場合に、（３−２）の次の項番は（４−２）になる。また、分岐した後の手順の後にまた分岐があるような場合は、分岐前の先頭の数字に１を加算した数字に「−１」又は「−２」を付加する。図４に示す例では（５−２−１）、（５−２−２）が、この規則に該当する。 Furthermore, after this, when the procedure on the YES side continues, the next item number of (3-1) becomes (4-1), and when the procedure on the NO side continues, the next item of (3-2) The item number is (4-2). If there is another branch after the procedure after branching, “−1” or “−2” is added to the number obtained by adding 1 to the first number before branching. In the example shown in FIG. 4, (5-2-1) and (5-2-2) correspond to this rule.

また、分岐の直後に分岐がある場合は、分岐の直後の手順に付与される予定の項番の先頭の数字に１を加算した数字に「−１」又は「−２」を付加する。図４に示す例では（１２−１−２−１）（１２−１−２−２）が、この規則に該当する。 Further, when there is a branch immediately after the branch, “−1” or “−2” is added to the number obtained by adding 1 to the first number of the item number to be assigned to the procedure immediately after the branch. In the example shown in FIG. 4, (12-1-2-1) (12-1-2-2) corresponds to this rule.

また、分岐自体の項番はフローの上から順序で付与する。また、分岐のＹＥＳに進む方向の順序で番号を付与し、その後下層の分岐でＮＯになった分岐に番号を付与する。図４に示す例では、分岐１、分岐２、分岐３、分岐４、分岐５が、この規則に該当する。 Also, the item numbers of the branches themselves are given in order from the top of the flow. Further, numbers are assigned in the order in which the branch advances to YES, and then numbers are assigned to the branches that become NO in the lower branch. In the example shown in FIG. 4, branch 1, branch 2, branch 3, branch 4, and branch 5 correspond to this rule.

復旧基本手順リストテーブル２０の復旧手順で分岐を表記する際は、この項番付与規則に従って、分岐後の手順の前に［分岐１Ｙ］［分岐１Ｎ］を記載し、［分岐１Ｙ］直後に分岐条件を記載する。この例では、「１」は分岐自体の項番、「Ｙ」は分岐条件がＹＥＳの場合を意味し、「Ｎ」は分岐条件がＮＯの場合を意味する。図３に示す例では、障害名がソフトウェア障害の復旧手順の［分岐１Ｙ］、［分岐１Ｎ］が、この規則に該当する。 When a branch is described in the recovery procedure of the recovery basic procedure list table 20, [Branch 1Y] [Branch 1N] is described before the procedure after the branch according to this item numbering rule, and a branch is performed immediately after [Branch 1Y]. Describe the conditions. In this example, “1” means the item number of the branch itself, “Y” means the case where the branch condition is YES, and “N” means the case where the branch condition is NO. In the example shown in FIG. 3, [branch 1Y] and [branch 1N] whose failure names are software failure recovery procedures correspond to this rule.

復旧基本手順リストテーブル２０には、入力値、実行結果が明確なったコマンドのみを記載する。 In the basic recovery procedure list table 20, only commands whose input values and execution results are clarified are described.

障害復旧の基本手順格納部１１は、全てのコマンドの内容を記載したコマンドリストテーブル２１を保持する。コマンドリストテーブル２１は、図５Ａ、図５Ｂ、図５Ｃ、図５Ｄに示すように、コマンド名、内容、種類（実行／確認）、コマンド、引数、書式例、実行結果、コマンドを自動実行した場合の影響度合い（影響なし／影響あり）の情報を保持し、人手によって入力される。 The basic procedure storage unit 11 for failure recovery holds a command list table 21 that describes the contents of all commands. As shown in FIGS. 5A, 5B, 5C, and 5D, the command list table 21 automatically executes command names, contents, types (execution / confirmation), commands, arguments, format examples, execution results, and commands. Information on the degree of influence (no influence / with influence) is held and manually input.

また、障害復旧の基本手順格納部１１は、監視対象となる機器２の構成情報リストテーブル２２を保持する。構成情報リストテーブル２２は、図６に示すように、機器Ｎｏ、ＩＰアドレス、機器種別、対向装置の機器ＮｏとＩＰアドレスの情報を保持し、人手によって入力される。 Further, the basic procedure storage unit 11 for failure recovery holds a configuration information list table 22 of the device 2 to be monitored. As shown in FIG. 6, the configuration information list table 22 holds information on the device number, IP address, device type, device number and IP address of the opposite device, and is manually input.

ワークフロー定義作成部１２は、復旧基本手順リストテーブル２０に保持した復旧手順及びコマンドリストテーブル２１からワークフロー定義を作成し、ワークフロー定義テーブル２３に格納する。ワークフロー定義テーブル２３は、障害復旧の基本手順格納部１１が復旧基本手順リストテーブル２０及びコマンドリストテーブル２１の作成が完了した時点で作成する。ワークフロー定義は、機器２で障害が発生した際に監視装置１が障害を復旧するための定義として使用される。 The workflow definition creation unit 12 creates a workflow definition from the recovery procedure and command list table 21 held in the recovery basic procedure list table 20 and stores the workflow definition in the workflow definition table 23. The workflow definition table 23 is created when the failure recovery basic procedure storage unit 11 completes the creation of the recovery basic procedure list table 20 and the command list table 21. The workflow definition is used as a definition for the monitoring device 1 to recover a failure when a failure occurs in the device 2.

図７は、ワークフロー定義テーブル２３に保持されるワークフロー定義のうち、障害名がコネクションエラーのワークフロー定義の例を説明する図である。 FIG. 7 is a diagram for explaining an example of a workflow definition whose failure name is a connection error among the workflow definitions held in the workflow definition table 23.

図７に示すワークフロー定義は、図３の復旧基本手順リストテーブル２０のコネクションエラーの復旧手順を元に作成されたものである。図７に示すワークフロー定義は、ワークフロー定義の開始（Ｓ１００）の後に、図３の復旧基本手順リストテーブル２０の復旧手順の（１）〜（４）の内容をフローチャート形式に変形したものである（Ｓ１０１〜Ｓ１０４）。 The workflow definition shown in FIG. 7 is created based on the connection error recovery procedure in the recovery basic procedure list table 20 of FIG. The workflow definition shown in FIG. 7 is obtained by transforming the contents of the recovery procedures (1) to (4) in the recovery basic procedure list table 20 of FIG. 3 into a flowchart format after the start of the workflow definition (S100) ( S101 to S104).

図８は、復旧基本手順リストテーブル２０からワークフロー定義を作成する処理のフローチャートである。 FIG. 8 is a flowchart of processing for creating a workflow definition from the recovery basic procedure list table 20.

図８において、ワークフロー上で手順を行う枠を処理枠、条件分岐をする枠を条件分岐枠と称する。ワークフロー定義上で、処理枠及び条件分岐枠を作成する位置は、復旧基本手順リストテーブル２０の復旧手順の項番から決定する。項番の付与規則については、図４に記載した規則に従う。 In FIG. 8, a frame for performing a procedure on the workflow is referred to as a processing frame, and a frame for performing conditional branching is referred to as a conditional branch frame. On the workflow definition, the position for creating the processing frame and the conditional branch frame is determined from the item number of the recovery procedure in the recovery basic procedure list table 20. The rule for assigning item numbers follows the rules described in FIG.

次に、図８に示すフローチャートの内容を説明する。 Next, the contents of the flowchart shown in FIG. 8 will be described.

ワークフロー定義の作成を開始すると（Ｓ１５０）、復旧手順の初めの項番の手順から以降の処理を行う（Ｓ１５１）。 When the creation of the workflow definition is started (S150), the subsequent processing is performed from the procedure of the first item number of the recovery procedure (S151).

まず、項番の前に分岐の記載があるかを判定する（Ｓ１５２）。その結果、分岐の記載がある場合は、ワークフロー定義に条件分岐枠を作成し、作成した条件分岐枠の中に分岐条件を記載する。条件分岐枠を作成する位置は、図４に示す項番付与規則によって決定する（Ｓ１５３）。その後、項番の前に記載された分岐条件のＹ／Ｎから、条件分岐枠の該当ＹＥＳ／ＮＯの分岐後の位置に処理枠を作成し、作成した処理枠の中に手順を記載する（Ｓ１５４）。 First, it is determined whether there is a branch description before the item number (S152). As a result, when there is a description of a branch, a conditional branch frame is created in the workflow definition, and the branch condition is described in the created conditional branch frame. The position where the conditional branch frame is created is determined by the item number assignment rule shown in FIG. 4 (S153). After that, from the branch condition Y / N described before the item number, a processing frame is created at the position after the branch of the corresponding YES / NO branch of the conditional branch frame, and the procedure is described in the created processing frame ( S154).

一方、ステップＳ１５２で、分岐の記載がないと判定された場合、ワークフロー定義に処理枠を作成し、作成した処理枠の中に次の手順を記載する。処理枠を作成する位置は、図４に示す項番付与規則によって決定する（Ｓ１５５）。 On the other hand, if it is determined in step S152 that there is no description of a branch, a processing frame is created in the workflow definition, and the next procedure is written in the created processing frame. The position where the processing frame is created is determined by the item number assignment rule shown in FIG. 4 (S155).

その後、最後の項番の手順であるかを判定する（Ｓ１５６）。その結果、最後の項番の手順である場合は、ワークフロー定義作成処理を終了する（Ｓ１５８）。一方、途中の項番の手順である場合は、次の項番の手順に移り（Ｓ１５７）、ステップＳ１５２に戻る。 Thereafter, it is determined whether the procedure is the last item number (S156). As a result, if it is the procedure of the last item number, the workflow definition creation process ends (S158). On the other hand, if the procedure is an intermediate item number, the procedure proceeds to the next item number (S157), and the process returns to step S152.

図２に戻り、監視装置の構成の説明を続ける。 Returning to FIG. 2, the description of the configuration of the monitoring device will be continued.

監視対象となる機器２で障害が発生した時、機器２はアラーム通知を送信する。機器２が送信したアラーム通知は、通信インタフェース１９を介して障害受信部１３が受信する。障害受信部１３は、受信した障害内容を外部入出力インタフェース１０を介して保守者に表示するためのデータを出力する。 When a failure occurs in the device 2 to be monitored, the device 2 transmits an alarm notification. The alarm notification transmitted by the device 2 is received by the failure receiving unit 13 via the communication interface 19. The failure receiving unit 13 outputs data for displaying the received failure content to the maintenance person via the external input / output interface 10.

復旧手順表示部１４は、障害受信部１３が受信した障害の障害名に基づいて、ワークフロー定義テーブル２３の該当障害名のワークフロー定義を、外部入出力インタフェース１０を介して保守者に表示するためのデータを出力する。保守者は、表示されたワークフロー定義の処理枠及び条件分岐枠を、外部入出力インタフェース１０に接続された入力装置を用いて選択できる。復旧手順実行部１５は、表示したワークフロー定義を実行するかを保守者に問い合わせるメッセージを外部入出力インタフェース１０から出力する。復旧手順実行部１５は、保守者からのワークフロー定義の実行決定の通知を外部入出力インタフェース１０を介して受信すると、ワークフロー定義の最上位の処理枠をフォーカスして外部入出力インタフェース１０を介して保守者に表示するためのデータを出力する。保守者は、表示されたワークフロー定義の処理枠及び条件分岐枠を外部入出力インタフェース１０に接続された入力装置を用いて選択できる。 The recovery procedure display unit 14 displays the workflow definition of the corresponding failure name in the workflow definition table 23 to the maintenance person via the external input / output interface 10 based on the failure name of the failure received by the failure receiving unit 13. Output data. The maintenance person can select the displayed workflow definition processing frame and conditional branch frame using an input device connected to the external input / output interface 10. The recovery procedure execution unit 15 outputs a message from the external input / output interface 10 that inquires the maintenance person whether to execute the displayed workflow definition. When the recovery procedure execution unit 15 receives a notification of execution determination of the workflow definition from the maintenance person via the external input / output interface 10, the restoration procedure execution unit 15 focuses on the highest processing frame of the workflow definition via the external input / output interface 10. Output data for display to maintenance personnel. The maintenance person can use the input device connected to the external input / output interface 10 to select the processing frame and the conditional branch frame of the displayed workflow definition.

復旧手順実行部１５は、フォーカスされた処理枠を実行するかを保守者に問い合わせるメッセージを外部入出力インタフェース１０から出力する。復旧手順実行部１５は、保守者からのフォーカスされた処理枠の実行決定の通知を外部入出力インタフェース１０を介して受信する。 The recovery procedure execution unit 15 outputs a message from the external input / output interface 10 that inquires the maintenance person whether to execute the focused processing frame. The recovery procedure execution unit 15 receives a notification of execution decision of the focused processing frame from the maintenance person via the external input / output interface 10.

復旧手順実行部１５は、フォーカスされた処理枠の内容が、監視装置１以外で実施する手順（関連部署等に電話の連絡等）である場合、保守者に手順を実行するようメッセージを表示し、手順が実行されたか保守者に問い合わせるメッセージを外部入出力インタフェース１０から出力する。復旧手順実行部１５は、保守者がフォーカスされた処理枠の手順を実行したことを示す通知を外部入出力インタフェース１０を介して受信すると、ワークフロー定義のフォーカスを次の処理枠又は条件分岐枠に移動して、外部入出力インタフェース１０を介して保守者に表示するためのデータを出力する。 When the content of the focused processing frame is a procedure to be performed by a device other than the monitoring device 1 (such as a telephone call to a related department), the recovery procedure execution unit 15 displays a message to execute the procedure to the maintenance person. The external input / output interface 10 outputs a message asking the maintenance person whether the procedure has been executed. When the recovery procedure execution unit 15 receives a notification indicating that the maintenance person has executed the procedure of the focused processing frame via the external input / output interface 10, the recovery procedure execution unit 15 shifts the focus of the workflow definition to the next processing frame or the conditional branch frame. It moves and outputs data for display to the maintenance person via the external input / output interface 10.

復旧手順実行部１５は、フォーカスされた処理枠の内容がコマンドである場合、コマンドを実行してよいかを保守者に問い合わせるメッセージを外部入出力インタフェース１０から出力する。復旧手順実行部１５は、保守者がフォーカスされた処理枠のコマンドの実行を決定したことの通知を外部入出力インタフェース１０を介して受信すると、コマンドの内容をコマンド実行部１８に送信する。 When the content of the focused processing frame is a command, the recovery procedure execution unit 15 outputs a message from the external input / output interface 10 asking the maintenance person whether the command can be executed. When the recovery procedure execution unit 15 receives a notification that the maintenance person has decided to execute the command of the focused processing frame via the external input / output interface 10, the recovery procedure execution unit 15 transmits the content of the command to the command execution unit 18.

コマンド実行部１８は、通信インタフェース１９を介して監視対象となる機器２にて実行するコマンドを送信する。実行結果が出力されるコマンドについては、コマンド実行部１８がコマンドの実行結果を機器２から通信インタフェース１９を介して受信する。コマンド実行部１８は、受信した実行結果を復旧手順実行部１５に送信する。復旧手順実行部１５は、受信した実行結果を外部入出力インタフェース１０を介して、フォーカスされた処理枠に表示し、ワークフロー定義のフォーカスを次の処理枠、又は条件分岐枠に移動して表示するためのデータを出力する。 The command execution unit 18 transmits a command to be executed by the device 2 to be monitored via the communication interface 19. For the command for which the execution result is output, the command execution unit 18 receives the execution result of the command from the device 2 via the communication interface 19. The command execution unit 18 transmits the received execution result to the recovery procedure execution unit 15. The recovery procedure execution unit 15 displays the received execution result in the focused processing frame via the external input / output interface 10 and moves the focus of the workflow definition to the next processing frame or the conditional branch frame for display. Data for output.

条件分岐枠にフォーカスされている場合は、保守者が条件分岐後のＹＥＳ側／ＮＯ側の処理枠を選択することによって、ワークフロー定義のフォーカスを次に処理枠に移動する。 When the conditional branch frame is focused, the maintenance person selects the processing frame on the YES side / NO side after the conditional branching to move the focus of the workflow definition to the processing frame next.

復旧手順実行部１５は、フォーカスがワークフロー定義の最後の処理枠となるまで上記の処理を繰り返す。ただし、フォーカスされた処理枠の手順の内容を実行しない通知を保守者から外部入出力インタフェース１０を介して受信した場合は、その時点でワークフロー定義の実行を取り止める。また、表示したワークフロー定義を実行するかの保守者への問い合わせに対し、ワークフロー定義を実行しない通知を保守者から外部入出力インタフェース１０を介して復旧手順実行部１５が受信した場合も、その時点でワークフロー定義の実行を取り止める。 The recovery procedure execution unit 15 repeats the above processing until the focus becomes the last processing frame of the workflow definition. However, if a notification that does not execute the contents of the procedure of the focused processing frame is received from the maintenance person via the external input / output interface 10, the execution of the workflow definition is canceled at that point. In addition, when the recovery procedure execution unit 15 receives a notification from the maintenance person via the external input / output interface 10 in response to an inquiry to the maintenance person about whether to execute the displayed workflow definition, at that time Cancel the execution of the workflow definition.

復旧手順実行部１５は、ワークフロー定義の実行を取り止めた後、ワークフロー定義ではない手順の実行を可能とする。 The recovery procedure execution unit 15 enables execution of a procedure that is not a workflow definition after canceling the execution of the workflow definition.

復旧手順実行部１５は、フォーカスされた処理枠について、処理枠内に書き込めるインタフェースを保守者に提供する。書き込みは＜＞で囲んだ範囲で記載する。 The restoration procedure execution unit 15 provides the maintenance person with an interface that can write the focused processing frame in the processing frame. Write in the range enclosed in <>.

復旧手順実行部１５は、フォーカスされた処理枠のコマンドについて、コマンドの引数を入力するインタフェースを保守者に提供する。復旧手順実行部１５は、外部入出力インタフェースを介して、保守者が入力したコマンドの引数を受信する。復旧手順実行部１５は、フォーカスされた処理枠のコマンド及び受信したコマンドの引数をコマンド実行部１８に送信する。 The recovery procedure execution unit 15 provides the maintenance person with an interface for inputting command arguments for the command in the focused processing frame. The recovery procedure execution unit 15 receives an argument of a command input by the maintenance person via the external input / output interface. The restoration procedure execution unit 15 transmits the command of the focused processing frame and the argument of the received command to the command execution unit 18.

コマンドの引数は、基本的に保守者が入力するが、復旧手順実行部１５は、コマンドリストテーブル２１の引数を参照し、構成情報リストテーブル２２から推測できるコマンドの引数は推測値を表示し、保守者に内容の確認を促してもよい。保守者は、表示された引数の推測値を修正することができる。 The command argument is basically input by the maintenance person, but the recovery procedure execution unit 15 refers to the argument of the command list table 21, displays the estimated value of the command argument that can be estimated from the configuration information list table 22, The maintenance person may be prompted to confirm the contents. The maintenance person can correct the estimated value of the displayed argument.

ワークフロー定義のコマンドは、基本的に、保守者が実行可否の判断を行った後に実行するが、ワークフロー定義のコマンドを監視装置１が自動的に実行してもよい。例えば、ワークフロー定義のコマンドを自動実行した際の影響度合いが、コマンドリストテーブル２１に「影響なし」と記載されており、かつ、構成情報リストテーブル２２からコマンドの引数の値を推測できた場合、保守者に確認することなく監視装置１がコマンドを自動実行する。一方、ワークフロー定義のコマンドを自動実行した際の影響度合いが「影響あり」である場合は自動実行しない。このように、監視装置１でコマンドを自動的に実行することができる。 The workflow definition command is basically executed after the maintenance person determines whether or not the workflow definition can be executed. However, the monitoring apparatus 1 may automatically execute the workflow definition command. For example, when the degree of influence when the workflow definition command is automatically executed is described as “no influence” in the command list table 21 and the value of the command argument can be estimated from the configuration information list table 22, The monitoring device 1 automatically executes the command without confirming with the maintenance person. On the other hand, if the degree of influence when the workflow definition command is automatically executed is “influenced”, it is not automatically executed. In this way, the command can be automatically executed by the monitoring device 1.

障害復旧履歴保持部１６は、監視装置１の初期構築後の運用時に、機器２で発生した障害及び障害復旧手順（障害発生から回復までの内容）を、障害復旧履歴テーブル２４として自動的に保持する。障害復旧履歴テーブル２４は、図９に示すように、障害名、発生日時、回復日時、機器Ｎｏ、機器のＩＰアドレス及び復旧手順の情報が保持される。 The failure recovery history holding unit 16 automatically holds, as a failure recovery history table 24, failures that occurred in the device 2 and the failure recovery procedure (contents from failure occurrence to recovery) during operation after the initial construction of the monitoring device 1 To do. As shown in FIG. 9, the failure recovery history table 24 holds information on a failure name, an occurrence date and time, a recovery date and time, a device number, a device IP address, and a recovery procedure.

障害復旧履歴保持部１６は、復旧手順実行部１５で実行したワークフロー定義の手順の内容を障害復旧履歴テーブル２４に格納する。また、障害復旧履歴保持部１６は、ワークフロー定義の実行を取り止めた後、ワークフロー定義ではない手順についても障害復旧履歴テーブル２４に格納する。 The failure recovery history holding unit 16 stores the contents of the workflow definition procedure executed by the recovery procedure executing unit 15 in the failure recovery history table 24. The failure recovery history holding unit 16 also stores the procedure that is not the workflow definition in the failure recovery history table 24 after canceling the execution of the workflow definition.

ワークフロー定義更新部１７は、障害復旧履歴保持部１６が新たな復旧手順を障害復旧履歴テーブル２４に格納したことを契機に、ワークフロー定義テーブル２３を更新する。 The workflow definition update unit 17 updates the workflow definition table 23 when the failure recovery history holding unit 16 stores a new recovery procedure in the failure recovery history table 24.

以降、ワークフロー定義作成後に機器２でコネクションエラーの障害が発生し、その障害を復旧した例を説明する。 Hereinafter, an example will be described in which a connection error failure occurs in the device 2 after the workflow definition is created and the failure is recovered.

図１０は、コネクションエラーの障害に対する履歴（その１）を説明する図である。 FIG. 10 is a diagram for explaining a history (part 1) for a connection error failure.

図１０は、コネクションエラーの障害の復旧手順が、障害復旧履歴テーブル２４に初めて格納され、その復旧手順をフローの処理枠に記載したものである。ワークフロー定義更新部１７は、障害復旧履歴テーブル２４のコネクションエラーの履歴から図１０のフローチャートを作成する。フローチャート中の処理枠内のコマンドに関する説明文は、障害名がコネクションエラーの内容欄の記載をコマンドリストテーブル２１から引用して作成する。 FIG. 10 shows a procedure for recovering a connection error failure for the first time in the failure recovery history table 24, and describes the recovery procedure in the processing frame of the flow. The workflow definition updating unit 17 creates the flowchart of FIG. 10 from the connection error history in the failure recovery history table 24. The explanatory text relating to the command in the processing frame in the flowchart is created by quoting from the command list table 21 the description in the content column where the fault name is connection error.

図１０のフローを説明する。 The flow of FIG. 10 will be described.

まず、ワークフロー定義の作成を開始する（Ｓ１０００）。 First, creation of a workflow definition is started (S1000).

機器２より受信した障害の障害名（コネクションエラー）、機器のＩＰアドレス（１０．１０．１０．１０）に基づいて、以降のフローを実施する（Ｓ１００１）。 Based on the failure name (connection error) received from the device 2 and the IP address (10.10.10.10) of the device, the subsequent flow is performed (S1001).

対向装置３のＩＰアドレスを取得するコマンド［ｏｐｐ＿ｄｅｖ＿ｓｅａｒｃｈ１０．１０．１０．１０］を実行し、［結果：１０．１０．１０．１１］を取得する（Ｓ１００２）。 A command [opp_dev_search 10.10.10.10] for acquiring the IP address of the opposite apparatus 3 is executed, and [result: 10.10.10.11] is acquired (S1002).

対向装置３のアラームを確認するコマンド［ｄｅｖ＿ａｌａｒｍ＿ｃｏｎｆ１０．１０．１０．１１］を実行し、［結果：アラームあり］を取得する（Ｓ１００３）。 A command [dev_alarm_conf 10.10.10.11] for confirming the alarm of the opposite device 3 is executed, and [result: with alarm] is acquired (S1003).

ステップＳ１００３の結果、対向装置３の側にもアラームが発生していることから、機器２と対向装置３との間のネットワークを被疑と判断し、機器２と対向装置３との間の経路切替コマンド［ｔｒａｎｓ＿ｃｈａｎｇｅ１０．１０．１０．１０１０．１０．１０．１１］を実行する（Ｓ１００４）。なお、フロー中＜＞の記載は保守者が入力した内容である。 As a result of step S1003, an alarm is also generated on the opposite device 3 side, so the network between the device 2 and the opposite device 3 is determined to be suspected, and the path between the device 2 and the opposite device 3 is switched. The command [trans_change 10.10.10.10 10.10.10.11] is executed (S1004). Note that <> in the flow is the content input by the maintenance person.

機器２と対向装置３との間の正常性を確認するコマンド［ｉｎｔ＿ｎｏｒｍａｌ＿ｃｏｎｆ１０．１０．１０．１０１０．１０．１０．１１］を実行し、［結果：正常］を取得する（Ｓ１００５）。 A command [int_normal_conf 10.10.10.10 10.10.10.11] for confirming the normality between the device 2 and the opposite apparatus 3 is executed, and [result: normal] is acquired (S1005).

ワークフロー定義更新部１７は、図１０のフローに基づいて、復旧基本手順リストテーブル２０より作成した図７のワークフロー定義を更新する。 The workflow definition update unit 17 updates the workflow definition of FIG. 7 created from the recovery basic procedure list table 20 based on the flow of FIG.

図１１は、図１０のフロー（コネクションエラーの履歴（その１））に基づいて更新したワークフロー定義を説明する図である。 FIG. 11 is a diagram for explaining the workflow definition updated based on the flow of FIG. 10 (connection error history (part 1)).

図７のステップＳ１０１からＳ１０３と、図１０のステップＳ１００１からＳ１００３とは、それぞれ、処理枠内の手順及びコマンド名がそれぞれ同じであるため、図７のステップＳ１０１からＳ１０３は更新しない。図７のステップＳ１０４と図１０のステップＳ１００４は処理枠内のコマンドが異なるため、この処理枠の前に何らかの判定が行われたと判断し、条件分岐枠（Ｓ２００）を追加する。条件分岐の条件内容については分からないため、空欄とする。 Since steps S101 to S103 in FIG. 7 and steps S1001 to S1003 in FIG. 10 have the same procedure and command name in the processing frame, steps S101 to S103 in FIG. 7 are not updated. Step S104 in FIG. 7 and step S1004 in FIG. 10 are different in the command within the processing frame. Therefore, it is determined that some determination has been made before this processing frame, and a conditional branch frame (S200) is added. Since the condition content of the conditional branch is unknown, leave it blank.

その後、ステップＳ１０４を条件分岐枠のＹＥＳ側の下に移動する。図１０のステップＳ１００４は、実際に復旧手順として実行されたときのコマンドの引数の値となっているため、ワークフロー定義として使用できない。このため、コマンドリストテーブル２１の該当コマンドの引数形式に変更したステップＳ２０１を作成し、条件分岐枠（Ｓ２００）のＮＯ側の下に配置する。なお、ステップＳ１００４の＜＞内の記載もステップＳ２０１に記載する。 Thereafter, step S104 is moved below the YES side of the conditional branch frame. Step S1004 in FIG. 10 cannot be used as a workflow definition because it is an argument value of a command when it is actually executed as a recovery procedure. For this reason, step S201 changed to the argument format of the corresponding command in the command list table 21 is created and placed under the NO side of the conditional branch frame (S200). Note that the description in <> of step S1004 is also described in step S201.

図１０のステップＳ１００５についても、実際に復旧手順として実行されたときの引数及び結果となっているため、コマンドリストテーブル２１の該当コマンドの引数及び実行結果の形式に変更し、ステップＳ２０２を作成する。図１０のステップＳ１００５は、ステップＳ１００４の次の手順であることから、ステップＳ２０２をステップＳ２０１の下に配置する。 Step S1005 in FIG. 10 also has arguments and results when actually executed as a recovery procedure, so the format is changed to the argument and execution result format of the corresponding command in the command list table 21, and step S202 is created. . Since step S1005 in FIG. 10 is the next procedure after step S1004, step S202 is arranged below step S201.

図１２は、ワークフロー定義の更新処理を説明する図である。ワークフロー定義の更新は、ワークフロー定義に無い図１０の手順を、ワークフロー定義に盛り込む形で行う。 FIG. 12 is a diagram for explaining a workflow definition update process. The workflow definition is updated by incorporating the procedure of FIG. 10 that is not included in the workflow definition into the workflow definition.

まず、ワークフロー定義の更新を開始する（Ｓ２５０）。 First, update of the workflow definition is started (S250).

ワークフロー定義の手順と障害復旧履歴テーブル２４の手順とを上から比較する。ただし、最初の手順は固定手順なので比較しない。比較の際、処理枠内のコマンドはコマンド名のみで比較する（Ｓ２５１）。 The procedure of the workflow definition and the procedure of the failure recovery history table 24 are compared from above. However, since the first procedure is a fixed procedure, it is not compared. At the time of comparison, the commands in the processing frame are compared only by the command name (S251).

障害復旧履歴テーブル２４の手順がワークフロー定義の手順と全て同じかを判定する（Ｓ２５２）。全ての手順が同じ場合は、障害復旧履歴テーブル２４の復旧手順が、既にワークフロー定義上にある復旧手順なのでワークフロー定義は更新しない（Ｓ２５３）。 It is determined whether the procedure of the failure recovery history table 24 is the same as the procedure of the workflow definition (S252). If all the procedures are the same, the workflow definition is not updated because the recovery procedure in the failure recovery history table 24 is a recovery procedure already in the workflow definition (S253).

一方、一部の手順が異なる場合は、ワークフロー定義と障害復旧履歴テーブル２４で上から比較して最初の異なる手順（以下、「異なる手順」と称する）の前に条件判定があったとして、ワークフロー定義の該当手順の前に条件分岐枠を作成する（Ｓ２５４）。そして、ワークフロー定義の異なる手順を条件分岐のＹＥＳの下に移す（Ｓ２５５）。 On the other hand, if some of the steps are different, the workflow definition and the failure recovery history table 24 compare the above from the top, assuming that there is a condition determination before the first different procedure (hereinafter referred to as “different procedure”). A conditional branch frame is created before the corresponding procedure of the definition (S254). Then, the procedure having a different workflow definition is moved to YES under the conditional branch (S255).

また、条件分岐のＮｏの下に処理枠を作成し、その中に障害復旧履歴の異なる手順を記載する。そして、障害復旧履歴テーブル２４の異なる手順以降の手順についても、異なる手順の下に各手順毎に処理枠を作成し、手順を記載する。尚、＜＞内の内容も記載する（Ｓ２５６）。 In addition, a processing frame is created under the condition branch No, and the procedure with different failure recovery history is described therein. And also about the procedure after a different procedure of the failure recovery log | history table 24, a process frame is produced for every procedure under a different procedure, and a procedure is described. The contents in <> are also described (S256).

その後、ワークフロー定義の更新を終了する（Ｓ２５７）。 Thereafter, the update of the workflow definition is ended (S257).

図１３は、コネクションエラーの障害に対する履歴（その２）を説明する図である。 FIG. 13 is a diagram for explaining a history (part 2) for a connection error failure.

図１３は、障害復旧履歴テーブル２４に図１０のコネクションエラーの障害の復旧手順が保持され、その後に格納されたコネクションエラーの障害の復旧手順をフローの処理枠に記載したものである。 FIG. 13 shows the recovery procedure of the connection error failure shown in FIG. 10 in the failure recovery history table 24, and describes the recovery procedure of the connection error failure stored thereafter in the flow processing frame.

図１３のフローを説明する。 The flow of FIG. 13 will be described.

まず、ワークフロー定義の作成を開始する（Ｓ１１００）。 First, creation of a workflow definition is started (S1100).

機器２より受信した障害の障害名（コネクションエラー）、機器のＩＰアドレス（２０．２０．２０．２０）に基づいて、以降のフローを実施する（Ｓ１１０１）。 Based on the failure name (connection error) of the failure received from the device 2 and the IP address (20.20.20.20) of the device, the subsequent flow is executed (S1101).

対向装置３のＩＰアドレスを取得するコマンド［ｏｐｐ＿ｄｅｖ＿ｓｅａｒｃｈ２０．２０．２０．２０］を実行し、［結果：２０．２０．２０．２１］を取得する（Ｓ１１０２）。 A command [opp_dev_search 20.0.20.20.20] for acquiring the IP address of the opposite apparatus 3 is executed, and [result: 20.20.20.21] is acquired (S1102).

対向装置３のアラームを確認するコマンド［ｄｅｖ＿ａｌａｒｍ＿ｃｏｎｆ２０．２０．２０．２１］を実行し、［結果：アラームなし］を取得する（Ｓ１１０３）。 The command [dev_alarm_conf 20.0.20.20.21] for confirming the alarm of the opposite device 3 is executed, and [result: no alarm] is acquired (S1103).

ステップＳ１１０３の結果、対向装置３の側にアラームが発生していないことから、機器２を被疑と判断し、機器２のプロセッサ切替コマンド［ｓｗａｃｔ２０．２０．２０．２０］を実行する（Ｓ１１０４）。なお、フロー中＜＞の記載は保守者が入力した内容である。 As a result of step S1103, since no alarm is generated on the opposite apparatus 3, the device 2 is determined to be suspected, and the processor switching command [swact 20.20.20.20] of the device 2 is executed (S1104). . Note that <> in the flow is the content input by the maintenance person.

その後、機器２のアラームを確認するコマンド［ｅｑｕ＿ａｌａｒｍ＿ｃｏｎｆ２０．２０．２０．２０］を実行し、［結果：アラームなし］を取得する（Ｓ１１０５）。 Thereafter, a command [equ_alarm_conf 20.20.20.20] for confirming the alarm of the device 2 is executed, and [result: no alarm] is acquired (S1105).

さらに、機器２の正常性を確認するコマンド［ｅｑｕ＿ｎｏｒｍａｌ＿ｃｏｎｆ２０．２０．２０．２０］を実行し、［結果：正常］を取得する（Ｓ１１０６）。 Further, a command [equ_normal_conf 20.20.20.20] for confirming the normality of the device 2 is executed, and [result: normal] is acquired (S1106).

ワークフロー定義更新部１７は、図１３のフローに基づいて、図１１のワークフロー定義を更新する。 The workflow definition update unit 17 updates the workflow definition of FIG. 11 based on the flow of FIG.

図１４は、図１３のフローに基づいて更新したワークフロー定義を説明する図である。図１４のフロー図を説明する図である。 FIG. 14 is a diagram for explaining the workflow definition updated based on the flow of FIG. It is a figure explaining the flowchart of FIG.

図１１のステップＳ１０１からＳ１０３と、図１３のステップＳ１１０１からＳ１１０３とは、それぞれ、処理枠内の手順及びコマンド名が同じであるため、図１１のステップＳ１０１からＳ１０３は更新しない。図１１のステップＳ１０４と図１３のステップＳ１１０４は処理枠内のコマンドが異なり、図１１のステップＳ２０１と図１３のステップＳ１１０４は処理枠内のコマンドが異なる。このため、これらの処理枠の前に何らかの判定が行われたと判断し、条件分岐枠（Ｓ３００）を追加する。条件分岐の条件内容については分からないため、空欄とする。 Since steps S101 to S103 in FIG. 11 and steps S1101 to S1103 in FIG. 13 have the same procedure and command name in the processing frame, steps S101 to S103 in FIG. 11 are not updated. Step S104 in FIG. 11 and step S1104 in FIG. 13 have different commands in the processing frame, and step S201 in FIG. 11 and step S1104 in FIG. 13 have different commands in the processing frame. For this reason, it is determined that some determination has been made before these processing frames, and a conditional branch frame (S300) is added. Since the condition content of the conditional branch is unknown, leave it blank.

その後、ステップＳ２０１を条件分岐枠のＹＥＳ側の下に移動する。図１３のステップＳ１１０４は、実際に復旧手順として実行されたときのコマンドの引数の値となっているため、ワークフロー定義として使用できない。このため、コマンドリストテーブル２１の該当コマンドの引数形式に変更したステップＳ３０１を作成し、条件分岐枠（Ｓ３００）のＮＯ側の下に配置する。なお、ステップＳ１１０４の＜＞内の記載もステップＳ３０１に記載する。 Thereafter, step S201 is moved below the YES side of the conditional branch frame. Step S1104 in FIG. 13 cannot be used as a workflow definition because it is an argument value of a command when actually executed as a recovery procedure. For this reason, step S301 changed to the argument format of the corresponding command in the command list table 21 is created and placed under the NO side of the conditional branch frame (S300). Note that the description in <> of step S1104 is also described in step S301.

図１３のステップＳ１１０５についても、実際に復旧手順として実行されたときの引数及び結果となっているため、コマンドリストテーブル２１の該当コマンドの引数及び実行結果の形式に変更し、ステップＳ３０２を作成する。図１３のステップＳ１１０５は、ステップＳ１１０４の次の手順であることから、ステップＳ３０２をステップＳ３０１の下に配置する。 Step S1105 in FIG. 13 also has arguments and results when actually executed as a recovery procedure. Therefore, the format is changed to the argument and execution result format of the corresponding command in the command list table 21, and step S302 is created. . Since step S1105 in FIG. 13 is the next procedure after step S1104, step S302 is arranged below step S301.

図１３のステップＳ１１０６についても、実際に復旧手順として実行されたときの引数及び結果となっているため、コマンドリストテーブル２１の該当コマンドの引数及び実行結果の形式に変更し、ステップＳ３０３を作成する。図１３のステップＳ１１０６は、ステップＳ１１０５の次の手順であることから、ステップＳ３０３をステップＳ３０２の下に配置する。 Step S1106 in FIG. 13 is also the argument and result when actually executed as a recovery procedure, so the argument and execution result format of the corresponding command in the command list table 21 are changed to create step S303. . Since step S1106 in FIG. 13 is the next procedure after step S1105, step S303 is arranged below step S302.

図１５は、コネクションエラーの障害に対する履歴（その３）を説明する図である。 FIG. 15 is a diagram for explaining a history (part 3) for a connection error failure.

図１５は、障害復旧履歴テーブル２４に図１３のコネクションエラーの障害の復旧手順が保持され、その後に格納されたコネクションエラーの障害の復旧手順をフローの処理枠に記載したものである。 FIG. 15 shows the recovery procedure of the connection error failure shown in FIG. 13 in the failure recovery history table 24 and describes the recovery procedure of the connection error failure stored in the processing frame of the flow.

図１５のフローを説明する。 The flow of FIG. 15 will be described.

まず、ワークフロー定義の作成を開始する（Ｓ１２００）。 First, creation of a workflow definition is started (S1200).

機器２より受信した障害の障害名（コネクションエラー）、機器のＩＰアドレス（３０．３０．３０．３０）に基づいて、以降のフローを実施する（Ｓ１２０１）。 Based on the failure name (connection error) of the failure received from the device 2 and the IP address (30.30.30.30) of the device, the subsequent flow is executed (S1201).

対向装置３のＩＰアドレスを取得するコマンド［ｏｐｐ＿ｄｅｖ＿ｓｅａｒｃｈ３０．３０．３０．３０］を実行し、［結果：３０．３０．３０．３１］を取得する（Ｓ１２０２）。 A command [opp_dev_search 30.30.30.30] for acquiring the IP address of the opposite apparatus 3 is executed, and [result: 30.30.30.31] is acquired (S1202).

対向装置３のアラームを確認するコマンド［ｄｅｖ＿ａｌａｒｍ＿ｃｏｎｆ３０．３０．３０．３１］を実行し、［結果：アラームなし］を取得する（Ｓ１２０３）。 The command [dev_alarm_conf 30.30.30.31] for confirming the alarm of the opposite device 3 is executed, and [result: no alarm] is acquired (S1203).

ステップＳ１２０３の結果、対向装置３の側にアラームが発生していないことから、機器２を被疑と判断し、機器２のプロセッサ切替コマンド［ｓｗａｃｔ３０．３０．３０．３０］を実行する（Ｓ１２０４）。なお、フロー中＜＞の記載は保守者が入力した内容である。 As a result of step S1203, since no alarm has occurred on the opposite device 3, the device 2 is determined to be suspicious and the processor switching command [swact 30.30.30.30] of the device 2 is executed (S1204). . Note that <> in the flow is the content input by the maintenance person.

その後、機器２のアラームを確認するコマンド［ｅｑｕ＿ａｌａｒｍ＿ｃｏｎｆ３０．３０．３０．３０］を実行し、［結果：アラームあり］を取得する（Ｓ１２０５）。 After that, a command [equ_alarm_conf 30.30.30.30] for confirming the alarm of the device 2 is executed, and [result: with alarm] is acquired (S1205).

また、機器２の再起動コマンド［ｒｅｓｅｔ３０．３０．３０．３０］を実行する（Ｓ１２０６）。 Further, the restart command [reset 30.30.30.30] of the device 2 is executed (S1206).

そして、機器２のアラームを確認するコマンド［ｅｑｕ＿ａｌａｒｍ＿ｃｏｎｆ３０．３０．３０．３０］を実行し、［結果：アラームなし］を取得する（Ｓ１２０７）。 Then, a command [equ_alarm_conf 30.30.30.30] for confirming the alarm of the device 2 is executed, and [result: no alarm] is acquired (S1207).

さらに、機器２の正常性を確認するコマンド［ｅｑｕ＿ｎｏｒｍａｌ＿ｃｏｎｆ３０．３０．３０．３０］を実行し、［結果：正常］を取得する（Ｓ１２０８）。 Further, a command [equ_normal_conf 30.30.30.30] for confirming the normality of the device 2 is executed, and [result: normal] is acquired (S1208).

ワークフロー定義更新部１７は、図１５のフローに基づいて、図１４のワークフロー定義を更新する。 The workflow definition update unit 17 updates the workflow definition of FIG. 14 based on the flow of FIG.

図１６Ａ、図１６Ｂは、図１５のフローに基づいて更新したワークフロー定義を説明する図である。 16A and 16B are diagrams for explaining the workflow definition updated based on the flow of FIG.

図１４のステップＳ１０１からＳ１０３と、図１５のステップＳ１２０１からＳ１２０３とは、それぞれ、処理枠内の手順及びコマンド名が同じであるため、図１４のステップＳ１０１からＳ１０３は更新しない。図１４のステップＳ１０４と図１５のステップＳ１２０４は処理枠内のコマンドが異なり、図１４のステップＳ２０１と図１５のステップＳ１２０４は処理枠内のコマンドが異なり、図１４のステップＳ３０１と図１５のステップＳ１２０４が同じであるため、図１４のステップＳ３０１は更新しない。図１４のステップＳ３０２と図１５のステップＳ１２０５が同じであるため、図１４のステップＳ３０２は更新しない。図１４のステップＳ３０３と図１５のステップＳ１２０６は処理枠内のコマンドが異なるため、この処理枠の前に何らかの判定が行われたと判断し、条件分岐枠（Ｓ４００）を追加する。条件分岐の条件内容については分からないため、空欄とする。 Since steps S101 to S103 in FIG. 14 and steps S1201 to S1203 in FIG. 15 have the same procedure and command name in the processing frame, steps S101 to S103 in FIG. 14 are not updated. Step S104 in FIG. 14 and step S1204 in FIG. 15 are different in the command in the processing frame, step S201 in FIG. 14 and step S1204 in FIG. 15 are different in the command in the processing frame, step S301 in FIG. 14 and step in FIG. Since S1204 is the same, step S301 in FIG. 14 is not updated. Since step S302 in FIG. 14 and step S1205 in FIG. 15 are the same, step S302 in FIG. 14 is not updated. Step S303 in FIG. 14 and step S1206 in FIG. 15 are different in the command within the processing frame. Therefore, it is determined that some determination has been made before this processing frame, and a conditional branch frame (S400) is added. Since the condition content of the conditional branch is unknown, leave it blank.

その後、ステップＳ３０３を条件分岐枠のＹＥＳ側の下に移動する。図１５のステップＳ１２０６は、実際に復旧手順として実行されたときのコマンドの引数の値となっているため、ワークフロー定義として使用できない。このため、コマンドリストテーブル２１の該当コマンドの引数形式に変更したステップＳ４０１を作成し、条件分岐枠（Ｓ４００）のＮＯ側の下に配置する。 Thereafter, step S303 is moved below the YES side of the conditional branch frame. Step S1206 in FIG. 15 cannot be used as a workflow definition because it is an argument value of a command when it is actually executed as a recovery procedure. For this reason, step S401 changed to the argument format of the corresponding command in the command list table 21 is created and placed under the NO side of the conditional branch frame (S400).

図１５のステップＳ１２０７についても、実際に復旧手順として実行されたときの引数及び結果となっているため、コマンドリストテーブル２１の該当コマンドの引数及び実行結果の形式に変更し、ステップＳ４０２を作成する。図１５のステップＳ１２０７は、ステップＳ１２０６の次の手順であることから、ステップＳ４０２をステップＳ４０１の下に配置する。 Step S1207 in FIG. 15 also has arguments and results when actually executed as a recovery procedure, so the command list table 21 is changed to the argument and execution result format of the corresponding command, and step S402 is created. . Since step S1207 in FIG. 15 is the next procedure after step S1206, step S402 is arranged below step S401.

図１５のステップＳ１２０８についても、実際に復旧手順として実行されたときの引数及び結果となっているため、コマンドリストテーブル２１の該当コマンドの引数及び実行結果の形式に変更し、ステップＳ４０３を作成する。図１５のステップＳ１２０８は、ステップＳ１２０７の次の手順であることから、ステップＳ４０３ステップをステップＳ４０２の下に配置する。 Step S1208 in FIG. 15 also has arguments and results when actually executed as a recovery procedure, so the format is changed to the argument and execution result format of the corresponding command in the command list table 21, and step S403 is created. . Since step S1208 in FIG. 15 is the next procedure after step S1207, step S403 is placed below step S402.

図１７は、コネクションエラーの障害に対する履歴（その４）を説明する図である。 FIG. 17 is a diagram for explaining a history (part 4) for a connection error failure.

図１７は、障害復旧履歴テーブル２４に図１５のコネクションエラーの障害の復旧手順が保持され、その後に格納されたコネクションエラーの障害の復旧手順をフローの処理枠に記載したものである。 FIG. 17 shows the recovery procedure of the connection error failure shown in FIG. 15 in the failure recovery history table 24 and describes the recovery procedure of the connection error stored in the processing frame of the flow.

図１７のフローを説明する。 The flow of FIG. 17 will be described.

まず、ワークフロー定義の作成を開始する（Ｓ１３００）。 First, creation of a workflow definition is started (S1300).

機器２より受信した障害の障害名（コネクションエラー）、機器のＩＰアドレス（４０．４０．４０．４０）に基づいて、以降のフローを実施する（Ｓ１３０１）。 Based on the failure name (connection error) of the failure received from the device 2 and the IP address (40.40.40.40) of the device, the subsequent flow is executed (S1301).

対向装置３のＩＰアドレスを取得するコマンド［ｏｐｐ＿ｄｅｖ＿ｓｅａｒｃｈ４０．４０．４０．４０］を実行し、［結果：４０．４０．４０．４１］を取得する（Ｓ１３０２）。 A command [opp_dev_search 40.40.40.40] for acquiring the IP address of the opposite apparatus 3 is executed, and [result: 40.40.40.41] is acquired (S1302).

対向装置３のアラームを確認するコマンド［ｄｅｖ＿ａｌａｒｍ＿ｃｏｎｆ４０．４０．４０．４１］を実行し、［結果：アラームなし］を取得する（Ｓ１３０３）。 The command [dev_alarm_conf 40.40.40.41] for confirming the alarm of the opposite device 3 is executed, and [result: no alarm] is acquired (S1303).

ステップＳ１３０３の結果、対向装置３の側にアラームが発生していないことから、機器２を被疑と判断し、機器２のプロセッサ切替コマンド［ｓｗａｃｔ４０．４０．４０．４０］を実行する（Ｓ１３０４）。なお、フロー中＜＞の記載は保守者が入力した内容である。 As a result of step S1303, since no alarm is generated on the opposite device 3, the device 2 is determined to be suspected, and the processor switching command [swact 40.40.40.40] of the device 2 is executed (S1304). . Note that <> in the flow is the content input by the maintenance person.

その後、機器２のアラームを確認するコマンド［ｅｑｕ＿ａｌａｒｍ＿ｃｏｎｆ４０．４０．４０．４０］を実行し、［結果：アラームあり］を取得する（Ｓ１３０５）。 Thereafter, a command [equ_alarm_conf 40.40.40.40] for confirming the alarm of the device 2 is executed, and [result: with alarm] is acquired (S1305).

また、機器２の再起動コマンド［ｒｅｓｅｔ４０．４０．４０．４０］を実行する（Ｓ１３０６）。 Also, a restart command [reset 40.40.40.40] of the device 2 is executed (S1306).

そして、機器２のアラームを確認するコマンド［ｅｑｕ＿ａｌａｒｍ＿ｃｏｎｆ４０．４０．４０．４０］を実行し、［結果：アラームあり］を取得する（Ｓ１３０７）。 Then, a command [equ_alarm_conf 40.40.40.40] for confirming the alarm of the device 2 is executed, and [result: with alarm] is acquired (S1307).

さらに、機器２を停止する停止コマンド［ｓｔｏｐ４０．４０．４０．４０］を実行する（Ｓ１３０８）。 Further, a stop command [stop 40.40.40.40] for stopping the device 2 is executed (S1308).

ワークフロー定義更新部１７は、図１７のフローに基づいて、図１６のワークフロー定義を更新する。 The workflow definition update unit 17 updates the workflow definition of FIG. 16 based on the flow of FIG.

図１８は、図１７のフローに基づいて更新したワークフロー定義を説明する図である。なお、図１７のフローに基づいて更新したワークフロー定義の前半は、図１６Ａと共通なので、その記載を省略した。すなわち、図１８のＳ４０１は、図１６ＡのＳ４００でＮＯの場合に実行される。 FIG. 18 is a diagram illustrating the workflow definition updated based on the flow of FIG. The first half of the workflow definition updated based on the flow of FIG. 17 is the same as that of FIG. That is, S401 in FIG. 18 is executed when S400 in FIG. 16A is NO.

すなわち、図１６ＡのステップＳ１０１からＳ１０３と、図１７のステップＳ１３０１からＳ１３０３は、それぞれ、処理枠内の手順及びコマンド名が同じであるため、図１６ＡのＳ１０１からＳ１０３は更新しない。図１６ＡのステップＳ１０４と図１７のステップＳ１３０４は処理枠内のコマンドが異なり、図１６ＡのステップＳ２０１と図１７のステップＳ１３０４は処理枠内のコマンドが異なり、図１６ＡのステップＳ３０１と図１７のステップＳ１３０４が同じであるため、図１６ＡのステップＳ３０１は更新しない。図１６ＡのステップＳ３０２と図１７のステップＳ１３０５が同じであるため、図１６ＡのステップＳ３０２は更新しない。 That is, steps S101 to S103 in FIG. 16A and steps S1301 to S1303 in FIG. 17 have the same procedure and command name in the processing frame, and therefore S101 to S103 in FIG. 16A are not updated. Step S104 in FIG. 16A and step S1304 in FIG. 17 have different commands in the processing frame. Step S201 in FIG. 16A and step S1304 in FIG. 17 have different commands in the processing frame, and step S301 in FIG. 16A and step in FIG. Since S1304 is the same, step S301 in FIG. 16A is not updated. Since step S302 in FIG. 16A and step S1305 in FIG. 17 are the same, step S302 in FIG. 16A is not updated.

図１６ＡのステップＳ３０３と図１７のステップＳ１３０６は処理枠内のコマンドが異なり、図１６ＢのステップＳ４０１と図１７のステップＳ１３０６が同じであるため、図１６ＢのステップＳ４０１は更新しない。図１６ＢのステップＳ４０２と図１７のステップＳ１３０７が同じであるため、図１６ＢのステップＳ４０２は更新しない。 Step S303 in FIG. 16A and step S1306 in FIG. 17 are different commands within the processing frame, and step S401 in FIG. 16B and step S1306 in FIG. 17 are the same, so step S401 in FIG. 16B is not updated. Since step S402 in FIG. 16B and step S1307 in FIG. 17 are the same, step S402 in FIG. 16B is not updated.

図１６ＢのステップＳ４０３と図１７のステップＳ１３０８ステップは処理枠内のコマンドが異なるため、この処理枠の前に何らかの判定が行われたと判断し、条件分岐枠（Ｓ５００）を追加する。条件分岐の条件内容はわからないため、空欄とする。 Step S403 in FIG. 16B and step S1308 in FIG. 17 have different commands in the processing frame. Therefore, it is determined that some determination has been made before this processing frame, and a conditional branch frame (S500) is added. Since the condition content of the conditional branch is unknown, leave it blank.

その後、ステップＳ４０３を条件分岐枠のＹＥＳ側の下に移動する。図１７のステップＳ１３０８は、実際に復旧手順として実行されたときのコマンドで引数の値となっているため、ワークフロー定義として使用できない。このため、コマンドリストテーブル２１の該当コマンドの引数形式に変更したステップＳ５０１を作成し、条件分岐枠（Ｓ５００）のＮＯ側の下に配置する。 Thereafter, step S403 is moved below the YES side of the conditional branch frame. Step S1308 in FIG. 17 cannot be used as a workflow definition because it is an argument value in a command that is actually executed as a recovery procedure. For this reason, step S501 changed to the argument format of the corresponding command in the command list table 21 is created and placed under the NO side of the conditional branch frame (S500).

ワークフロー定義更新部１７は、ワークフロー定義テーブル２３を更新した後、更新後のワークフロー定義を外部入出力インタフェース１０を介して表示するためのデータを出力する。ワークフロー定義更新部１７は、表示したワークフロー定義の処理枠内、条件分岐枠内に保守者が書き込めるインタフェースを提供する。書き込みは＜＞で囲んだ範囲に記載する。書き込まれた内容は、外部入出力インタフェース１０を介してワークフロー定義更新部１７に通知され、ワークフロー定義更新部１７は書き込まれた内容でワークフロー定義テーブル２３を更新する。 After updating the workflow definition table 23, the workflow definition updating unit 17 outputs data for displaying the updated workflow definition via the external input / output interface 10. The workflow definition update unit 17 provides an interface in which a maintenance person can write in a processing frame and a conditional branch frame of the displayed workflow definition. Write in the range enclosed in <>. The written content is notified to the workflow definition update unit 17 via the external input / output interface 10, and the workflow definition update unit 17 updates the workflow definition table 23 with the written content.

ワークフロー定義の条件分岐枠内の分岐条件は、基本的に保守者が入力する形式が、ワークフロー定義更新部１７は、条件分岐直前の処理枠のコマンドについて、コマンドリストテーブル２１の実行結果の形式を参照し、実行結果が二者択一でかつ条件分岐前まで実行した手順から分岐条件が推測できるものについては、条件分岐枠内に推測した分岐条件を表示し、保守者に内容の確認を促してもよい。推測した分岐条件は保守者が修正することを可能とする。 The branch condition in the conditional branch frame of the workflow definition is basically in a format input by the maintenance person, and the workflow definition update unit 17 sets the format of the execution result of the command list table 21 for the command in the processing frame immediately before the conditional branch. If the execution result is one of two alternatives and the branch condition can be inferred from the procedure executed before the conditional branch, the estimated branch condition is displayed in the conditional branch frame to prompt the maintenance personnel to confirm the contents. May be. The inferred branch condition can be corrected by the maintenance person.

前述したように本発明の実施例によれば、復旧手順途中の条件によって分岐を考慮したワークフロー定義を過去の履歴情報から作成して、実際の障害復旧作業に適した復旧手順を提供することができる。 As described above, according to the embodiment of the present invention, it is possible to provide a recovery procedure suitable for actual failure recovery work by creating a workflow definition considering branching according to conditions during the recovery procedure from past history information. it can.

また、ワークフロー定義と履歴情報とで異なる手順の前に条件分岐を追加し、異なると判定されたワークフロー定義の手順をＹＥＳ側に配置し、異なると判定された履歴情報の手順をＮＯ側に配置するので、ワークフロー定義と異なる復旧手順を用いてワークフロー定義を作成できる。 In addition, a conditional branch is added before the procedure that differs between the workflow definition and the history information, the workflow definition procedure determined to be different is placed on the YES side, and the history information procedure judged to be different is placed on the NO side Therefore, the workflow definition can be created using a recovery procedure different from the workflow definition.

さらに、実行したワークフロー定義の手順及び前記機器で発生した障害を履歴情報として保持するので、実行した復旧手順を用いて、自動的にワークフロー定義を更新できる。さらに、保持した基本復旧手順から障害復旧のワークフロー定義を作成するので、保守者の手を煩わせることなく、自動的にワークフロー定義を作成できる。 Furthermore, since the executed workflow definition procedure and the failure that has occurred in the device are stored as history information, the workflow definition can be automatically updated using the executed recovery procedure. Furthermore, since the workflow definition for failure recovery is created from the stored basic recovery procedure, the workflow definition can be automatically created without bothering the maintenance personnel.

なお、本発明は前述した実施例に限定されるものではなく、添付した特許請求の範囲の趣旨内における様々な変形例及び同等の構成が含まれる。例えば、前述した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに本発明は限定されない。また、ある実施例の構成の一部を他の実施例の構成に置き換えてもよい。また、ある実施例の構成に他の実施例の構成を加えてもよい。また、各実施例の構成の一部について、他の構成の追加・削除・置換をしてもよい。 The present invention is not limited to the above-described embodiments, and includes various modifications and equivalent configurations within the scope of the appended claims. For example, the above-described embodiments have been described in detail for easy understanding of the present invention, and the present invention is not necessarily limited to those having all the configurations described. A part of the configuration of one embodiment may be replaced with the configuration of another embodiment. Moreover, you may add the structure of another Example to the structure of a certain Example. In addition, for a part of the configuration of each embodiment, another configuration may be added, deleted, or replaced.

また、前述した各構成、機能、処理部、処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等により、ハードウェアで実現してもよく、プロセッサがそれぞれの機能を実現するプログラムを解釈し実行することにより、ソフトウェアで実現してもよい。 In addition, each of the above-described configurations, functions, processing units, processing means, etc. may be realized in hardware by designing a part or all of them, for example, with an integrated circuit, and the processor realizes each function. It may be realized by software by interpreting and executing the program to be executed.

各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリ、ハードディスク、ＳＳＤ（Solid State Drive）等の記憶装置、又は、ＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に格納することができる。 Information such as programs, tables, and files that realize each function can be stored in a storage device such as a memory, a hard disk, or an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.

また、制御線や情報線は説明上必要と考えられるものを示しており、実装上必要な全ての制御線や情報線を示しているとは限らない。実際には、ほとんど全ての構成が相互に接続されていると考えてよい。 Further, the control lines and the information lines are those that are considered necessary for the explanation, and not all the control lines and the information lines that are necessary for the mounting are shown. In practice, it can be considered that almost all the components are connected to each other.

１：監視装置
２：機器
３：対向装置
４：通信網
１０：外部入出力インタフェース
１１：障害復旧の基本手順格納部
１２：ワークフロー定義作成部
１３：障害受信部
１４：復旧手順表示部
１５：復旧手順実行部
１６：障害復旧履歴保持部
１７：ワークフロー定義更新部
１８：コマンド実行部
１９：通信インタフェース
２０：復旧基本手順リストテーブル
２１：コマンドリストテーブル
２２：構成情報リストテーブル
２３：ワークフロー定義テーブル
２４：障害復旧履歴テーブル 1: monitoring device 2: device 3: opposite device 4: communication network 10: external input / output interface 11: failure recovery basic procedure storage unit 12: workflow definition creation unit 13: failure reception unit 14: recovery procedure display unit 15: recovery Procedure execution unit 16: Failure recovery history holding unit 17: Workflow definition update unit 18: Command execution unit 19: Communication interface 20: Recovery basic procedure list table 21: Command list table 22: Configuration information list table 23: Workflow definition table 24: Disaster recovery history table

Claims

機器の状態を監視する監視装置であって、
前記機器で発生した障害を復旧するための手順であるワークフロー定義を保持し、
前記機器で発生した障害を監視し、
前記監視によって取得した障害の情報及び当該障害を復旧するために実行した復旧手順を履歴情報として保持し、
前記保持した履歴情報に基づいて前記ワークフロー定義を更新することを特徴とする監視装置。 A monitoring device for monitoring the state of equipment,
Hold the workflow definition, which is the procedure for recovering from the failure that occurred in the device,
Monitor failures that occur in the equipment,
Holding the failure information acquired by the monitoring and the recovery procedure executed to recover the failure as history information,
A monitoring apparatus that updates the workflow definition based on the retained history information.

請求項１に記載の監視装置であって、
前記保持されたワークフロー定義と前記保持された履歴情報とを比較し、
前記保持されたワークフロー定義と前記保持された履歴情報との少なくとも一つの手順が異なる場合、異なると判定されたワークフロー定義の手順の前に条件分岐を追加し、
前記異なると判定されたワークフロー定義の手順を、前記追加された分岐の条件を満たす場合に配置し、
前記異なると判定された履歴情報の手順を、前記追加された分岐の条件を満たさない場合に配置することを特徴とする監視装置。 The monitoring device according to claim 1,
Comparing the retained workflow definition with the retained history information;
When at least one procedure of the retained workflow definition and the retained history information is different, a conditional branch is added before the procedure of the workflow definition determined to be different,
The workflow definition procedure determined to be different is arranged when the added branch condition is satisfied,
A monitoring apparatus, wherein the procedure of history information determined to be different is arranged when the added branch condition is not satisfied.

請求項１又は２に記載の監視装置であって、
前記ワークフロー定義の手順を実行し、
前記実行したワークフロー定義の手順及び前記機器で発生した障害を前記履歴情報として保持することを特徴とする監視装置。 The monitoring device according to claim 1 or 2,
Execute the workflow definition procedure;
A monitoring apparatus that retains the procedure of the executed workflow definition and a failure that has occurred in the device as the history information.

請求項１又は２に記載の監視装置であって、
前記機器で発生した障害を復旧するための基本手順を保持し、
前記保持した基本手順から前記ワークフロー定義を作成することを特徴とする監視装置。 The monitoring device according to claim 1 or 2,
Maintain the basic procedure for recovering from the failure that occurred in the device,
A monitoring apparatus that creates the workflow definition from the stored basic procedure.

機器の障害復旧を支援する支援装置が実行する障害復旧手順更新方法であって、
前記支援装置は、前記機器で発生した障害を復旧するための手順であるワークフロー定義を保持し、
前記方法は、
前記支援装置が、前記機器で発生した障害の情報及び当該障害を復旧するために実行した復旧手順を履歴情報として保持し、
前記支援装置が、前記保持した履歴情報に基づいて前記ワークフロー定義を更新することを特徴とする障害復旧手順更新方法。 A failure recovery procedure update method executed by a support device that supports device failure recovery,
The support apparatus holds a workflow definition that is a procedure for recovering from a failure that has occurred in the device,
The method
The support device holds information on a failure that has occurred in the device and a recovery procedure executed to recover the failure as history information,
A failure recovery procedure update method, wherein the support device updates the workflow definition based on the retained history information.

請求項５に記載の障害復旧手順更新方法であって、
前記支援装置が、前記保持されたワークフロー定義と前記保持された履歴情報とを比較し、
前記支援装置が、前記保持されたワークフロー定義と前記保持された履歴情報との少なくとも一部の手順が異なる場合、異なると判定されたワークフロー定義の手順の前に条件分岐を追加し、
前記支援装置が、前記異なると判定されたワークフロー定義の手順を、前記追加された分岐の条件を満たす場合に配置し、
前記支援装置が、前記異なると判定された履歴情報の手順を、前記追加された分岐の条件を満たさない場合に配置することを特徴とする障害復旧手順更新方法。 The failure recovery procedure update method according to claim 5,
The support device compares the retained workflow definition with the retained history information;
The support device adds a conditional branch before the procedure of the workflow definition determined to be different when at least a part of the procedure of the retained workflow definition and the retained history information is different,
The support device arranges the workflow definition procedure determined to be different when the added branch condition is satisfied,
The failure recovery procedure update method, wherein the support device arranges the procedure of the history information determined to be different when the added branch condition is not satisfied.

請求項５又は６に記載の障害復旧手順更新方法であって、
前記支援装置が、前記ワークフロー定義の手順を実行し、
前記支援装置が、前記実行したワークフロー定義の手順及び前記機器で発生した障害を前記履歴情報として保持することを特徴とする障害復旧手順更新方法。 The failure recovery procedure update method according to claim 5 or 6,
The support device executes the workflow definition procedure,
A failure recovery procedure update method, wherein the support device retains the executed workflow definition procedure and a failure that has occurred in the device as the history information.

請求項５又は６に記載の障害復旧手順更新方法であって、
前記支援装置が、前記機器で発生した障害を復旧するための基本手順を保持し、
前記支援装置が、前記保持した基本手順から前記ワークフロー定義を作成することを特徴とする障害復旧手順更新方法。 The failure recovery procedure update method according to claim 5 or 6,
The support device holds a basic procedure for recovering from a failure that has occurred in the device,
The failure recovery procedure update method, wherein the support device creates the workflow definition from the stored basic procedure.