JPH08235132A

JPH08235132A - Hot stand-by control method for multiserver system

Info

Publication number: JPH08235132A
Application number: JP7033500A
Authority: JP
Inventors: Hiroyuki Tomizawa; 広幸富沢
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1995-02-22
Filing date: 1995-02-22
Publication date: 1996-09-13

Abstract

PURPOSE: To eliminate a need to switch network addresses by a client system when an in-use host is switched to a stand-by host. CONSTITUTION: In-use server hosts 101 and 103, a stand-by server host 102, and a client host 108 are connected to one another by a LAN 106. Further, the respective hosts are mutually connected by a LAN 107 for monitoring a host fault. The monitor program of the in-use host 101 sends the network address A of its host to the monitor program of the stand-by host 102 through the LAN 107. If a fault occurs to the in-use host 101, the monitor program informs the monitor program of the stand-by host 102 of a system switching instruction and the monitor program of the stand-by host 102 rewrites the network address into A.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、クライアントサーバ型
のオンライントランザクション処理に関し、特に、現用
ホストと予備ホストから構成され、現用ホストに異常が
発生したときに、予備ホストに切り替えるマルチサーバ
システムのホットスタンバイ制御方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a client / server type online transaction processing, and more particularly to a hot server of a multi-server system which is composed of a working host and a spare host and is switched to the spare host when an abnormality occurs in the working host. The present invention relates to a standby control method.

【０００２】[0002]

【従来の技術】通常、オンライン計算機システムなどに
おいては、計算機システムの障害に備えて別の計算機シ
ステムを用意した冗長構成が採られていて、これにより
システムの信頼性を確保している。このような例として
は、特開昭６２−２４８０３７号公報に記載された計算
機システムの切り替え方式がある。2. Description of the Related Art Normally, in an online computer system or the like, a redundant configuration in which another computer system is prepared in preparation for a failure of the computer system is adopted, thereby ensuring system reliability. As such an example, there is a computer system switching method described in JP-A-62-248037.

【０００３】この切り替え方式においては、オンライン
トランザクションプロセスサーバ（実行システム）と待
機システムとを備え、実行システムに障害が発生した場
合に、待機システムを使用することにより実行システム
の処理を継続し、待機中に実行していたプロセス（ジョ
ブ）については、切り替え時処理定義テーブルに定義さ
れている処理（ジョブの処理キャンセルなど）を実行す
ることにより、切り替え後のオンライン処理が、待機中
に実行されていたジョブによって影響を受けることを防
止するものである。In this switching method, an online transaction process server (execution system) and a standby system are provided, and when a failure occurs in the execution system, the standby system is used to continue the processing of the execution system and wait. For the process (job) that was being executed during the process, the online process after switching was executed while waiting by executing the process (such as canceling the job process) defined in the process definition table for switching. It prevents the job from being affected by the job.

【０００４】[0004]

【発明が解決しようとする課題】しかし、上記した従来
技術は、ローカルエリアネットワークを使用しておら
ず、またローカルエリアネットワークを使用したとして
も、切り替え後のサーバシステムがネットワークアドレ
スを引き継いでいない。このため切り替え後において、
多数のクライアントシステムは、新たなネットワークア
ドレスに再接続しなければならず、特にシステム切替発
生時に使用されていなかったクライアントシステムにつ
いても、ネットワークアドレスを切り替える必要があ
り、切り替え時の処理が煩雑になるという問題があっ
た。However, in the above-mentioned conventional technique, the local area network is not used, and even if the local area network is used, the server system after switching does not take over the network address. Therefore, after switching,
Many client systems have to reconnect to a new network address, and it is necessary to switch the network address even for a client system that was not used at the time of system switching, which makes the switching process complicated. There was a problem.

【０００５】このようなことから、オンライントランザ
クションサーバが障害になった場合、当該サーバが使用
していたネットワークアダプタのアドレスを待機システ
ムに切り替えて使用することにより、クライアントシス
テムは同じアドレスを用いてサーバシステムと通信する
ことができ、ホストが切り替わっても同じアドレスで接
続できる方式が求められている。Therefore, when the online transaction server fails, the client system uses the same address by switching the address of the network adapter used by the server to the standby system. There is a need for a method that can communicate with the system and can connect with the same address even if the host switches.

【０００６】本発明の目的は、現用ホストを予備ホスト
に切り替えたときに、クライアントシステムでのネット
ワークアドレスの切り替え処理を不要にしたマルチサー
バシステムのホットスタンバイ制御方法を提供すること
にある。An object of the present invention is to provide a hot standby control method for a multi-server system, which eliminates the need to switch the network address in the client system when the active host is switched to the spare host.

【０００７】[0007]

【課題を解決するための手段】前記目的を達成するため
に、本発明では、複数のサーバホストと、クライアント
ホストがネットワーク（以下、第１のネットワークとい
う）で接続され、該サーバホストは現用サーバホスト
と、待機系の予備サーバホストからなり、該現用サーバ
ホストに障害が発生したときに、該予備サーバホストに
切り替えて処理を実行するマルチサーバシステムのホッ
トスタンバイ制御方法において、前記現用サーバホスト
と予備サーバホストとをホスト障害監視用のネットワー
ク（以下、第２のネットワークという）で接続し、前記
現用サーバホストは、自ホストと前記第１のネットワー
クとを接続する第１のアドレスを管理し、前記予備サー
バホストは、自ホストと前記第１のネットワークとを接
続する第２のアドレスを管理し、前記現用サーバホスト
は、該第２のネットワークを介して、該第１のアドレス
を予備サーバホストに送信して記憶し、前記現用サーバ
ホストに障害が発生したとき、前記現用サーバホスト
は、自ホストで管理されている前記第１のアドレスを無
効にし、該第２のネットワークを介して、システム切り
替え指示を前記予備サーバホストに通知し、該通知され
た前記予備サーバホストは、自ホストで管理されている
前記第２のアドレスを、前記第１のアドレスに書き換え
ることを特徴としている。To achieve the above object, according to the present invention, a plurality of server hosts and a client host are connected by a network (hereinafter referred to as a first network), and the server host is an active server. A hot standby control method for a multi-server system comprising a host and a standby server host of a standby system, which switches to the standby server host and executes processing when a failure occurs in the active server host. A spare server host is connected to a network for monitoring a host failure (hereinafter referred to as a second network), and the active server host manages a first address connecting the own host and the first network, The spare server host is connected to a second address connecting the host and the first network. The active server host transmits the first address to the spare server host via the second network and stores the first address, and when the active server host fails, the active server host Disables the first address managed by the own host, notifies the spare server host of a system switching instruction via the second network, and the notified spare server host It is characterized in that the second address managed by the host is rewritten to the first address.

【０００８】[0008]

【作用】現用のサーバホストのモニタプログラムは、ド
ライバプログラムから自ホストのローカルエリアネット
ワークアドレスを得る。現用のサーバホストのモニタプ
ログラムは、監視用ローカルエリアネットワークを介し
て、予備のサーバホストのモニタプログラムに、該得ら
れたローカルネットワークアドレスを送信する。現用の
サーバホストに障害が発生したとき、現用のサーバホス
トのモニタプログラムは、現用のサーバホストのドライ
バプログラムに対して、ローカルエリアネットワークア
ドレスを無効にするとともに、システム切替指示を予備
のサーバホストのモニタプログラムに通知する。予備の
サーバホストのモニタプログラムは、ドライバプログラ
ムに対して、先に受信したローカルエリアネットワーク
アドレスへのアドレスの書き換えを要求し、ドライバプ
ログラムは、先に受信したローカルエリアネットワーク
アドレスに書き換える。これにより、クライアントホス
トでは、ネットワークアドレスを切り替える処理が不要
になる。The monitor program of the active server host obtains the local area network address of its own host from the driver program. The monitor program of the current server host sends the obtained local network address to the monitor program of the spare server host via the monitoring local area network. When a failure occurs in the active server host, the monitor program of the active server host invalidates the local area network address to the driver program of the active server host and sends a system switching instruction to the spare server host. Notify the monitor program. The monitor program of the spare server host requests the driver program to rewrite the address to the previously received local area network address, and the driver program rewrites to the previously received local area network address. This eliminates the need for the client host to switch the network address.

【０００９】[0009]

【実施例】以下、本発明の一実施例を図面を用いて具体
的に説明する。〈実施例１〉図１は、本発明の一実施例に係るシステム
構成を示す。図において、１０１は現用のサーバホスト
（＃１）、１０２は予備のサーバホスト、１０３は現用
のサーバホスト（＃２）、１０４は現用のサーバホスト
１０１と予備のサーバホスト１０２との間に設けられた
共用ディスク群、１０５は現用のサーバホスト１０３と
予備のサーバホスト１０２との間に設けられた共用ディ
スク群である。この共用ディスク群１０４、１０５に
は、ログ情報、データベースなどが格納され、サーバホ
ストによってアクセスされる。また、現用のサーバホス
ト（実行サーバホスト）１０１と１０３は、通常時に業
務を実行し、バックアップは予備のサーバホスト（待機
サーバホスト）１０２によって行われる。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be specifically described below with reference to the drawings. <Embodiment 1> FIG. 1 shows a system configuration according to an embodiment of the present invention. In the figure, 101 is a working server host (# 1), 102 is a spare server host, 103 is a working server host (# 2), and 104 is provided between the working server host 101 and the spare server host 102. The shared disk group 105 is a shared disk group provided between the active server host 103 and the spare server host 102. The shared disk groups 104 and 105 store log information, databases, etc., and are accessed by the server host. In addition, the active server hosts (execution server hosts) 101 and 103 execute jobs at normal times, and backup is performed by the backup server host (standby server host) 102.

【００１０】１０６はローカルエリアネットワーク（Ｌ
ＡＮ＃１）、１０７はホスト障害監視用のローカルエリ
アネットワーク（ＬＡＮ＃２）、１０８はクライアント
ホストである。Reference numeral 106 denotes a local area network (L
AN # 1), 107 are local area networks (LAN # 2) for host failure monitoring, and 108 is a client host.

【００１１】各サーバホスト１０１、１０２、１０３と
クライアントホスト１０８とは、ローカルエリアネット
ワーク１０６で相互に接続されている。また、現用のサ
ーバホスト１０１とローカルエリアネットワーク１０６
とは、アドレス（Ａ）のローカルエリアネットワークア
ダプタ１０９で接続されている。同様に、予備のサーバ
ホスト１０２とローカルエリアネットワーク１０６と
は、アドレス（Ｂ）のローカルエリアネットワークアダ
プタ１１０で接続され、現用のサーバホスト１０３とロ
ーカルエリアネットワーク１０６とは、アドレス（Ｃ）
のローカルエリアネットワークアダプタ１１１で接続さ
れている。クライアントホスト１０８とローカルエリア
ネットワーク１０６とは、アドレス（Ｄ）のローカルエ
リアネットワークアダプタ１１２で接続されている。The server hosts 101, 102, 103 and the client host 108 are connected to each other via a local area network 106. In addition, the active server host 101 and the local area network 106
Are connected by the local area network adapter 109 of the address (A). Similarly, the spare server host 102 and the local area network 106 are connected by the local area network adapter 110 having the address (B), and the active server host 103 and the local area network 106 have the address (C).
The local area network adapter 111 is connected. The client host 108 and the local area network 106 are connected by the local area network adapter 112 of the address (D).

【００１２】さらに、各サーバホスト１０１、１０２、
１０３とホスト障害監視用のローカルエリアネットワー
ク１０７とは、それぞれアドレスがＸ、Ｙ、Ｚのアダプ
タ１１３、１１４、１１５で接続されている。Further, each server host 101, 102,
103 and the local area network 107 for host fault monitoring are connected by adapters 113, 114 and 115 having addresses X, Y and Z, respectively.

【００１３】図２は、各サーバホストにおけるソフトウ
エアの構成例を示す。現用のサーバホスト１０１のソフ
トウエアは、オペレーティングシステム（ＯＳ１）２０
１と、ＯＳ１中に含まれたローカルエリアネットワーク
のアダプタドライバプログラム２０４（ＤＲＶ１）と、
これらの上位に存在する、ホットスタンバイを制御する
モニタプログラム２０７（ＭＯＮ１）から構成されてい
る。FIG. 2 shows a software configuration example in each server host. The software of the current server host 101 is the operating system (OS1) 20.
1 and a local area network adapter driver program 204 (DRV1) included in OS1,
It is composed of a monitor program 207 (MON1) which controls the hot standby, which exists above these.

【００１４】予備のサーバホスト１０２のソフトウエア
と、現用のサーバホスト１０３のソフトウエアについて
も、同様にそれぞれ、オペレーティングシステム（ＯＳ
２、３）２０２、２０３と、アダプタドライバプログラ
ム２０５（ＤＲＶ２）、２０６（ＤＲＶ３）と、モニタ
プログラム２０８（ＭＯＮ２）、２０９（ＭＯＮ３）か
ら構成されている。Similarly, the software of the spare server host 102 and the software of the active server host 103 are respectively set in the operating system (OS).
2, 3) 202 and 203, adapter driver programs 205 (DRV2) and 206 (DRV3), and monitor programs 208 (MON2) and 209 (MON3).

【００１５】また、現用のサーバホスト１０１、１０３
上には、それぞれオンライントランザクションプログラ
ム２１０（ＯＬＴＰ１）、２１１（ＯＬＴＰ２）とユー
ザアプリケーションプログラム２１２（ＵＡＰ１）、２
１３（ＵＡＰ２）が格納されている。予備のサーバホス
ト１０２には、２つのオンライントランザクションプロ
グラム２１４（ＯＬＴＰ２Ｓ）、２１５（ＯＬＴＰ１
Ｓ）と、２つのユーザアプリケーションプログラム２１
６（ＵＡＰ２Ｓ）、２１７（ＵＡＰ１Ｓ）が格納されて
いる。なお、上記した説明では、サーバ上の各プログラ
ムを区別して表しているが、予備のサーバホストが実行
サーバホストに切り替えられるものであるから、各サー
バホストのソフトウエア構成は基本的には同じである。In addition, the active server hosts 101 and 103
Above the online transaction programs 210 (OLTP1), 211 (OLTP2) and the user application program 212 (UAP1), 2 respectively.
13 (UAP2) is stored. The backup server host 102 includes two online transaction programs 214 (OLTP2S) and 215 (OLTP1).
S) and two user application programs 21
6 (UAP2S) and 217 (UAP1S) are stored. In the above description, each program on the server is shown separately, but since the spare server host is switched to the execution server host, the software configuration of each server host is basically the same. is there.

【００１６】図３は、本発明のシステム切替え時におけ
る、ネットワークアドレスの引き継ぎを説明する図であ
る。以下に、図１、２、３を参照して、現用のサーバホ
スト１０１から予備のサーバホスト１０２にシステムを
切替えたときのネットワークアドレスの引き継ぎ動作を
説明する。FIG. 3 is a diagram for explaining the takeover of the network address when the system is switched according to the present invention. A network address takeover operation when the system is switched from the active server host 101 to the spare server host 102 will be described below with reference to FIGS.

【００１７】現用のサーバホスト１０１のドライバプロ
グラム２０４は、ローカルエリアネットワークアドレス
（Ａ）をＬＡＮアダプタ１０９からメモリ上に読み込ん
で管理している。これにより、他のホストからのアドレ
ス要求に対して、メモリ上のローカルエリアネットワー
クアドレス（Ａ）を返すことができる。The driver program 204 of the active server host 101 reads and manages the local area network address (A) from the LAN adapter 109 on the memory. As a result, the local area network address (A) on the memory can be returned in response to an address request from another host.

【００１８】現用のサーバホスト１０１のモニタプログ
ラム２０７は、初期設定処理においてドライバプログラ
ム２０４から自ホスト１０１のローカルエリアネットワ
ークアドレス（Ａ）を取得する。そして、モニタプログ
ラム２０７は、監視用ローカルエリアネットワーク１０
７を介して、予備のサーバホスト１０２のモニタプログ
ラム２０８に、ローカルネットワークアドレス（Ａ）を
送信する。これにより、予備のサーバホスト１０２は、
現用のサーバホスト１０１と同じローカルエリアネット
ワークアドレス（Ａ）を認識することができる。The monitor program 207 of the active server host 101 acquires the local area network address (A) of its own host 101 from the driver program 204 in the initial setting process. Then, the monitor program 207 executes the monitoring local area network 10
7, the local network address (A) is transmitted to the monitor program 208 of the spare server host 102. As a result, the spare server host 102
The same local area network address (A) as the active server host 101 can be recognized.

【００１９】現用のサーバホスト１０１に障害が発生し
たとき、モニタプログラム２０７は現用のサーバホスト
１０１のドライバプログラム２０４に対して、ローカル
エリアネットワークアドレス（Ａ）を無効にするととも
に、システム切替指示を、監視用ローカルエリアネット
ワーク１０７を介して、予備のサーバホスト１０２のモ
ニタプログラム２０８に通知する。When a failure occurs in the active server host 101, the monitor program 207 disables the local area network address (A) and instructs the driver program 204 of the active server host 101 to switch the system. Notify the monitor program 208 of the spare server host 102 via the monitoring local area network 107.

【００２０】予備のサーバホスト１０２のモニタプログ
ラム２０８は、ドライバプログラム２０５に対して、ロ
ーカルエリアネットワークアドレス（Ａ）へのアドレス
の書き換えを要求する。ドライバプログラム２０５は、
メモリ上に書かれたローカルエリアネットワークアドレ
ス（Ｂ）を（Ａ）に書き換える。そして、切り替えられ
た予備のサーバホスト１０２が、障害となった現用のサ
ーバホスト１０１の処理を引き継ぐ。このように、シス
テム切替と関連したローカルエリアネットワークアドレ
スの書き換えは、ローカルエリアネットワークアドレス
の重複使用を防止することができる。また、現用のサー
バホストを予備のサーバホストに切り替えたときに、ク
ライアントシステムでのネットワークアドレスの切り替
え処理が不要になる。The monitor program 208 of the spare server host 102 requests the driver program 205 to rewrite the address to the local area network address (A). The driver program 205 is
The local area network address (B) written on the memory is rewritten to (A). Then, the spare server host 102 that has been switched over takes over the processing of the active server host 101 that has failed. In this way, rewriting the local area network address associated with system switching can prevent duplicate use of the local area network address. Also, when the active server host is switched to the spare server host, the network address switching process in the client system becomes unnecessary.

【００２１】〈実施例２〉上記した実施例のアドレス切
替え方式においては、オンライントランザクションサー
バと同一ホスト上にあって同じネットワークアドレスを
使用するディスク共有サーバ、リモートログインサーバ
等のサーバプロセスは、使用していたアドレスが待機シ
ステムに移るため、元のホストで継続してネットワーク
を使用した処理が続行できなくなる。そこで、本実施例
２では、オンライントランザクションサーバのような重
要な特定のプロセスを単独で待機システムに切り替える
のではなく、実行システムで処理していた全てのまたは
複数のプロセスを一括して待機システムに切り替える制
御方法を採るようにしている。<Embodiment 2> In the address switching method of the above embodiment, server processes such as a disk sharing server and a remote login server which are on the same host as the online transaction server and use the same network address are used. The original address is transferred to the standby system, and the original host cannot continue to use the network. Therefore, in the second embodiment, instead of switching an important specific process such as an online transaction server to the standby system alone, all or a plurality of processes processed by the execution system are collectively switched to the standby system. The control method for switching is adopted.

【００２２】図４は、プロセスのグループ単位でシステ
ムを切り替える実施例におけるテーブル構成例を示す。
すなわち、ホットスタンバイを制御するモニタプログラ
ムに、プロセスのグループ、重要なプロセス、再起動す
るプロセスの順番を定義して、重要なプロセスに障害が
発生した時に、システム切替の際に登録されているプロ
セスを起動するものである。なお、この実施例のシステ
ム構成は、図１に示す構成を用い、現用のサーバホスト
１０１と予備のサーバホスト１０２の図示しないメモリ
に、図４のテーブルが格納されている。FIG. 4 shows an example of a table structure in an embodiment in which the system is switched in units of process groups.
In other words, in the monitor program that controls hot standby, the order of process groups, important processes, and restart processes is defined, and when a failure occurs in an important process, the process registered when the system is switched over. Is to start. The system configuration of this embodiment uses the configuration shown in FIG. 1, and the table of FIG. 4 is stored in the memory (not shown) of the active server host 101 and the spare server host 102.

【００２３】現用のサーバホスト１０１で実行されるプ
ロセスグループをＧＡ、ＧＢ、ＧＣとし、プロセスグル
ープＧＡはプロセスＧＡＰ１〜ＧＡＰ４からなり、プロ
セスグループＧＢはプロセスＧＢＰ１〜ＧＢＰ３からな
り、プロセスグループＧＣはプロセスＧＣＰ１〜ＧＣＰ
３からなる。そして、例えばプロセスグループＧＡのプ
ロセスについては、プロセスＧＡＰ１、ＧＡＰ２、ＧＡ
Ｐ３、ＧＡＰ４が連繋し、重要度の高いプロセスＧＡＰ
１とＧＡＰ３にはシステム切替えフラグＯＮが設定さ
れ、重要度の低いプロセスＧＡＰ２とＧＡＰ４にはシス
テム切替えフラグＯＦＦが設定される。The process groups executed by the active server host 101 are GA, GB, and GC. The process group GA includes processes GAP1 to GAP4, the process group GB includes processes GBP1 to GBP3, and the process group GC includes process GCP1. ~ GCP
Consists of three. Then, for example, for processes of the process group GA, processes GAP1, GAP2, GA
P3 and GAP4 are linked, and process GAP of high importance
The system switching flag ON is set for 1 and GAP3, and the system switching flag OFF is set for the processes GAP2 and GAP4 of low importance.

【００２４】現用のサーバホスト１０１で、プロセスグ
ループＧＡに属する一つのプロセスＧＡＰ１が障害にな
ったとする。プロセスＧＡＰ１のシステム切替えフラグ
がＯＮに設定されているので、プロセスＧＡＰ１と連繋
している他のプロセス（ＧＡＰ２〜ＧＡＰ４）を強制的
に停止させて、予備のサーバホスト１０２において対応
するプロセス（ＧＡＰ１〜ＧＡＰ４）を再起動する。そ
の再起動を行うとき、グループ内の切替順位にしたがっ
て、まずプロセスＧＡＰ１を最初に起動し、起動の完了
確認と同期して、切り替え順位が２番目のプロセスＧＡ
Ｐ３を起動する。その後、同様に同期を取って、順位が
３番目の二つのプロセスＧＡＰ２とＧＡＰ４を並列に起
動して、切替が実行される。It is assumed that one process GAP1 belonging to the process group GA fails in the active server host 101. Since the system switch flag of the process GAP1 is set to ON, the other processes (GAP2 to GAP4) linked to the process GAP1 are forcibly stopped, and the corresponding process (GAP1 to GAP1 to GAP1 to GAP1) in the spare server host 102 is stopped. Restart GAP4). When the restart is performed, the process GAP1 is first started according to the switching order in the group, and the process GA having the second switching order is synchronized with the completion confirmation of the startup.
Start P3. After that, in the same manner, the two processes GAP2 and GAP4 having the third rank are activated in parallel, and the switching is executed.

【００２５】また、プロセスグループＧＡに属するプロ
セスＧＰＡ３が障害になったときの処理も、前述したプ
ロセスＧＡＰ１の障害の場合と同様である。しかし、シ
ステム切替えフラグがＯＦＦに設定されているプロセス
ＧＡＰ２またはＧＡＰ４が、現用のサーバホスト１０１
で障害になっても、重要度が低いので、前述したように
プロセスグループＧＡを予備のサーバホスト１０２にシ
ステム切替えせずに、現用のサーバホスト１０１上でプ
ロセスＧＡＰ２またはＧＡＰ４を再起動して、業務を続
行する。The processing when the process GPA3 belonging to the process group GA fails is also the same as in the case where the process GAP1 fails. However, the process GAP2 or GAP4 in which the system switching flag is set to OFF is not the active server host 101.
Even if a failure occurs, the importance is low, so as described above, the process GAP2 or GAP4 is restarted on the active server host 101 without switching the system of the process group GA to the spare server host 102. Continue work.

【００２６】このように、本実施例では、プロセスグル
ープごとに管理しているので、プロセスグループＧＡが
システム切替を実行しても、他のプロセスグループＧＢ
またはＧＣに対して影響を及ぼさず、またホスト全体が
障害になった場合には、プロセスグループごとにシステ
ム切替を行うことができる。As described above, in this embodiment, since each process group is managed, even if the process group GA executes system switching, another process group GB is used.
Alternatively, the system can be switched for each process group if it does not affect the GC and if the entire host fails.

【００２７】〈実施例３〉実施例１で説明したホットス
タンバイシステムにおいて、現用ホストと予備ホストが
１：１に対応している構成だけでなく、経済的に配慮さ
れた現用ホスト２機と予備ホスト１機の構成や、複数の
現用ホストが互いに他方の待機機能を持つような構成を
採る場合には、一つの現用ホストが障害になると、同一
ホストに実行サーバシステムと当該実行サーバシステム
とは異なる実行サーバシステムの待機サーバシステムが
共存する状態が生じ、共存したホスト上では実行サーバ
システムの性能の劣化の問題が発生する。そこで、本実
施例では、待機サーバシステムをシステム切替可能な状
態のままで第３の別のホストに移動する方法を採るよう
にしたものである。<Embodiment 3> In the hot standby system described in Embodiment 1, not only the configuration in which the active host and the spare host correspond to each other 1: 1 but also the economically considered active host 2 and the spare In the case of adopting a configuration of one host or a configuration in which a plurality of active hosts have the other standby function for each other, if one active host fails, the execution server system and the execution server system are on the same host. The standby server systems of different execution server systems coexist, and the performance degradation of the execution server system occurs on the coexisting hosts. Therefore, in this embodiment, a method is adopted in which the standby server system is moved to a third different host while the system can be switched.

【００２８】図５は、待機システムを予備のサーバホス
トから他のホストに移動する実施例の説明図である。ま
ず初期起動で第１段階の状態になる。この第１段階の状
態では、現用のサーバホスト１０１（ＳＨ１）、１０３
（ＳＨ３）がそれぞれ現用のシステムとなり、予備のサ
ーバホスト１０２（ＳＨ２）が待機システムとなる。FIG. 5 is an explanatory diagram of an embodiment in which the standby system is moved from the spare server host to another host. First of all, the state of the first stage is reached by initial startup. In the state of the first stage, the active server hosts 101 (SH1), 103
(SH3) becomes the active system, and the spare server host 102 (SH2) becomes the standby system.

【００２９】その後、サーバホスト１０３（ＳＨ３）で
実行されているオンライントランザクションプログラム
（ＯＬＴＰ２）に障害が発生したとき、システム切替を
行って、予備のサーバホスト１０２（ＳＨ２）の待機シ
ステム（ＯＬＴＰ２Ｓ）が実行サーバ（ＯＬＴＰ２）と
なって、第２段階の状態になる。After that, when a failure occurs in the online transaction program (OLTP2) executed by the server host 103 (SH3), the system is switched so that the standby system (OLTP2S) of the spare server host 102 (SH2) becomes available. It becomes the execution server (OLTP2) and enters the second stage.

【００３０】この結果、予備のサーバホスト１０２の実
行システム（ＯＬＴＰ２）は、待機システム（ＯＬＴＰ
１Ｓ）が存在しているために、メモリなどの資源が少な
くなり、障害発生前の現用のサーバホスト１０３（ＳＨ
２）で動作していた場合に比べ性能が低下してしまう。As a result, the execution system (OLTP2) of the spare server host 102 becomes the standby system (OLTP2).
1S) exists, resources such as memory are reduced, and the active server host 103 (SH
The performance will be lower than if it was operating in 2).

【００３１】そこで、予備のサーバホスト１０２（ＳＨ
２）にある待機システム（ＯＬＴＰ１Ｓ）を、サーバホ
スト１０３（ＳＨ３）に移動させる。すなわち、サーバ
ホスト１０２（ＳＨ２）にある待機システム２１５（Ｏ
ＬＴＰ１Ｓ）をそのままにして、サーバホスト１０３
（ＳＨ３）で待機システム（ＯＬＴＰ１Ｓ）を起動す
る。これにより、第３段階の状態になる。この起動が完
了する直前の状態で、サーバホスト１０２（ＳＨ２）の
待機システム（ＯＬＴＰ１Ｓ）を停止させ、サーバホス
ト１０３（ＳＨ３）の待機システム（ＯＬＴＰ１Ｓ）の
起動を完了させ、第４段階の状態となる。Therefore, the spare server host 102 (SH
The standby system (OLTP1S) in 2) is moved to the server host 103 (SH3). That is, the standby system 215 (O) in the server host 102 (SH2).
LTP1S) as it is and the server host 103
The standby system (OLTP1S) is activated at (SH3). As a result, the state of the third stage is reached. In the state immediately before the completion of this activation, the standby system (OLTP1S) of the server host 102 (SH2) is stopped, the activation of the standby system (OLTP1S) of the server host 103 (SH3) is completed, and the state of the fourth stage is set. Become.

【００３２】このように、本実施例の移動方法によれ
ば、待機システム（ＯＬＴＰ１Ｓ）の移動中に、サーバ
ホスト１０１（ＳＨ１）の実行システム（ＯＬＴＰ１）
が障害になっても、サーバホスト１０２（ＳＨ２）の待
機システム（ＯＬＴＰ１Ｓ）がシステム切替を実行する
ことができる。このように、多様なシステム運用が実現
されることから、ホットスタンバイシステムの適用範囲
が拡大した場合でも、きめ細かい運用が可能となる。As described above, according to the moving method of this embodiment, the execution system (OLTP1) of the server host 101 (SH1) is moved while the standby system (OLTP1S) is moving.
Even if a failure occurs, the standby system (OLTP1S) of the server host 102 (SH2) can perform system switching. In this way, since various system operations are realized, even if the application range of the hot standby system is expanded, fine operation can be performed.

【００３３】[0033]

【発明の効果】以上、説明したように、本発明によれ
ば、現用サーバホストと予備サーバホストとをホスト障
害監視用の第２のネットワークで接続し、現用サーバホ
ストは、自ホストと第１のネットワークとを接続する第
１のアドレスを管理し、予備サーバホストは、自ホスト
と第１のネットワークとを接続する第２のアドレスを管
理し、現用サーバホストは、第２のネットワークを介し
て第１のアドレスを予備サーバホストに送信して記憶
し、現用サーバホストに障害が発生したとき、現用サー
バホストは、自ホストで管理されている第１のアドレス
を無効にし、第２のネットワークを介して、システム切
り替え指示を予備サーバホストに通知し、通知された予
備サーバホストは、自ホストで管理されている第２のア
ドレスを、第１のアドレスに書き換えているので、障害
の発生した現用サーバホストのアドレスが予備サーバホ
ストに引き継がれ、これにより、クライアントホストに
おけるネットワークアドレスの切替運用が不要になる。
またクライアントからみたサーバシステムの障害回復時
間を最小限にすることができる。As described above, according to the present invention, the active server host and the spare server host are connected by the second network for host failure monitoring, and the active server host and the first server are connected to each other. Manages a first address for connecting to another network, the spare server host manages a second address for connecting the self-host and the first network, and the active server host manages via the second network. The first address is transmitted to and stored in the spare server host, and when a failure occurs in the active server host, the active server host invalidates the first address managed by its own host and sets the second network Through the system switching instruction to the spare server host, and the notified spare server host sends the first address to the second address managed by the own host. Since the rewriting to the scan, address of the primary server failed host has taken over the spare server host, thereby, the switching operation of the network address is not required in the client host.
Moreover, the failure recovery time of the server system seen from the client can be minimized.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の一実施例に係るシステム構成を示す。FIG. 1 shows a system configuration according to an embodiment of the present invention.

【図２】各サーバホストにおけるソフトウエアの構成例
を示す。FIG. 2 shows a configuration example of software in each server host.

【図３】本発明のシステム切替え時における、ネットワ
ークアドレスの引き継ぎを説明する図である。FIG. 3 is a diagram for explaining inheritance of a network address at the time of system switching according to the present invention.

【図４】プロセスのグループ単位でシステムを切り替え
る実施例におけるテーブル構成例を示す。FIG. 4 shows an example of a table configuration in an embodiment in which the system is switched for each group of processes.

【図５】待機システムを予備のサーバホストから他のホ
ストに移動する実施例の説明図である。FIG. 5 is an explanatory diagram of an embodiment in which a standby system is moved from a spare server host to another host.

【符号の説明】[Explanation of symbols]

１０１、１０３現用のサーバホスト１０２予備のサーバホスト１０４、１０５共用ディスク群１０６ローカルエリアネットワーク１０７ホスト障害監視用のローカルエリアネットワー
ク１０８クライアントホスト１０９、１１０、１１１、１１２、１１３、１１４、１
１５ローカルエリアネットワークアダプタ101, 103 Active server host 102 Spare server host 104, 105 Shared disk group 106 Local area network 107 Local area network for host failure monitoring 108 Client host 109, 110, 111, 112, 113, 114, 1
15 Local Area Network Adapter

Claims

【特許請求の範囲】[Claims]

【請求項１】複数のサーバホストと、クライアントホ
ストがネットワーク（以下、第１のネットワークとい
う）で接続され、該サーバホストは現用サーバホスト
と、待機系の予備サーバホストからなり、該現用サーバ
ホストに障害が発生したときに、該予備サーバホストに
切り替えて処理を実行するマルチサーバシステムのホッ
トスタンバイ制御方法において、前記現用サーバホスト
と予備サーバホストとをホスト障害監視用のネットワー
ク（以下、第２のネットワークという）で接続し、前記
現用サーバホストは、自ホストと前記第１のネットワー
クとを接続する第１のアドレスを管理し、前記予備サー
バホストは、自ホストと前記第１のネットワークとを接
続する第２のアドレスを管理し、前記現用サーバホスト
は、該第２のネットワークを介して、該第１のアドレス
を予備サーバホストに送信して記憶し、前記現用サーバ
ホストに障害が発生したとき、前記現用サーバホスト
は、自ホストで管理されている前記第１のアドレスを無
効にし、該第２のネットワークを介して、システム切り
替え指示を前記予備サーバホストに通知し、該通知され
た前記予備サーバホストは、自ホストで管理されている
前記第２のアドレスを、前記第１のアドレスに書き換え
ることを特徴するマルチサーバシステムのホットスタン
バイ制御方法。1. A plurality of server hosts and a client host are connected by a network (hereinafter referred to as a first network), and the server host comprises an active server host and a standby spare server host, and the active server host. In the hot standby control method of a multi-server system that switches to the spare server host and executes processing when a failure occurs in the backup server host, the active server host and the spare server host are connected to a network for host failure monitoring (hereinafter referred to as the second Network)), the active server host manages a first address connecting the own host and the first network, and the spare server host connects the own host and the first network. It manages the second address to be connected, and the active server host manages the second network. Via the network, the first address is transmitted to and stored in the spare server host, and when a failure occurs in the active server host, the active server host manages the first address managed by its own host. Via the second network, the system switching instruction is notified to the spare server host, and the notified spare server host sends the second address managed by the own host to the second address. A hot standby control method for a multi-server system, characterized by rewriting to a first address.