JPH05165582A

JPH05165582A - Interruption fault processing method for file system

Info

Publication number: JPH05165582A
Application number: JP3331867A
Authority: JP
Inventors: Noboru Kobayashi; 暢小林
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-12-16
Filing date: 1991-12-16
Publication date: 1993-07-02

Abstract

PURPOSE:To investigate the cause of a fault without imposing burden on a file controller by collecting internal information relating to a start I/O setting a fault operation command from a host as the opportunity when MIH(MISSING INTERRUPT HANDRER) is generated. CONSTITUTION:The file controller 1 makes access designated filing devices 2a-2n corresponding to an SIO(start I/O) command from the host, and applies interruption to the host 3 when access is completed, and performs data transfer between the host 3 and the filing devices 2a-2n by performing re-coupling. The host 3 detects time-out when no response from the file controller 1 for the SIO command is received within prescribed time, and issues the fault operation command(e). The file controller 1 cancels the SIO command replying to the fault operation command, and also, collects the internal information by the execution of the SIO command, then, preserves it in internal memory(f).

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】（目次）産業上の利用分野従来の技術（図５）発明が解決しようとする課題課題を解決するための手段（図１）作用実施例 (a) 一実施例の説明（図２乃至図４） (b) 他の実施例の説明発明の効果(Table of Contents) Industrial Application Field of the Prior Art (FIG. 5) Problem to be Solved by the Invention Means for Solving the Problem (FIG. 1) Action Example (a) Description of One Example (FIG. 2) To FIG. 4) (b) Description of another embodiment Effect of the invention

【０００２】[0002]

【産業上の利用分野】本発明は、電子計算機システムに
おける外部記憶に用いられるファイルシステムの割り込
み障害処理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an interrupt failure processing method for a file system used for external storage in an electronic computer system.

【０００３】計算機システムの外部記憶装置として用い
られるファイルシステムは、計算機処理の多重化、高速
化に伴い、多重処理、高速処理が要求され、特にファイ
ル制御装置の処理は複雑化している。A file system used as an external storage device of a computer system is required to have multiple processing and high-speed processing as computer processing is multiplexed and speeded up, and the processing of a file controller is particularly complicated.

【０００４】このため、ファイルシステムの障害も複雑
化しており、障害内容の解明が困難となっており、障害
解明を容易にできる技術が望まれている。For this reason, the file system failure is also complicated, and it is difficult to elucidate the content of the failure. Therefore, there is a demand for a technique capable of facilitating the elucidation of the failure.

【０００５】[0005]

【従来の技術】図５は従来技術の説明図である。図５
（Ａ）に示すように、ファイルシステムは、複数の磁気
ディスク装置（ファイルデバイス）２ａ〜２ｎと、各磁
気ディスク装置２ａ〜２ｎを制御するファイル制御装置
１とで構成され、磁気ディスク（ファイル）制御装置
（コントローラ）１が、ホストであるＣＰＵ３と接続さ
れる。2. Description of the Related Art FIG. 5 is an explanatory diagram of a conventional technique. Figure 5
As shown in (A), the file system includes a plurality of magnetic disk devices (file devices) 2a to 2n and a file control device 1 that controls each of the magnetic disk devices 2a to 2n. A control device (controller) 1 is connected to a CPU 3 that is a host.

【０００６】このシステムの動作は、図５（Ｂ）に示す
ように、ＣＰＵ３からのスタートＩ／Ｏ（ＳＩＯ）指令
をコントローラ１が受け、処理（磁気ディスク装置への
リード／ライト）するが、磁気ディスク装置のアクセス
は、磁気ヘッドの移動という機械的動作が伴うため、Ｃ
ＰＵ３、コントローラ１の処理に比べて大幅に処理時間
が遅い。In the operation of this system, as shown in FIG. 5B, the controller 1 receives a start I / O (SIO) command from the CPU 3 and processes it (read / write to the magnetic disk device). Since access to the magnetic disk device is accompanied by a mechanical operation of moving the magnetic head, C
The processing time is significantly slower than the processing of the PU 3 and the controller 1.

【０００７】このため、ＣＰＵ３は、コントローラ１に
ＳＩＯを発行したら、磁気ディスク装置のヘッド位置付
けが終わるまで、一時的に、そのコントローラ１と切り
離れ、他の処理を行い、磁気ディスク装置のヘッド位置
付けが終了すると、コントローラ１が、ＣＰＵ３に割り
込み、再結合して、処理を続行する。Therefore, after issuing the SIO to the controller 1, the CPU 3 temporarily disconnects from the controller 1 and performs other processing until the head positioning of the magnetic disk device is completed, and the head positioning of the magnetic disk device is performed. When is finished, the controller 1 interrupts the CPU 3, rejoins, and continues the processing.

【０００８】このコントローラ１が、ＣＰＵ３よりＳＩ
Ｏを受理し、切り離れた後、何らかの理由により障害が
発生した場合に、コントローラ１が検出可能な障害であ
れば、エラーを通知するが、検出できない障害である
と、ＣＰＵ３に割り込みが出来なかったり、割り込みを
しない。This controller 1 is controlled by the CPU 3 by the SI.
When a failure occurs for some reason after accepting O and disconnecting, an error is notified if the failure can be detected by the controller 1, but if the failure cannot be detected, the CPU 3 cannot be interrupted. Or do not interrupt.

【０００９】このような事態に備えて、ＣＰＵ３は、Ｓ
ＩＯ発行からタイマを起動し、一定時間内に、コントロ
ーラ１から割り込みがなかった時は、タイムアウトを検
出するようにしており、このタイムアウトを、ＭＩＨ
（ＮＩＳＳＩＮＧＩＮＴＥＲＲＵＰＴＨＡＮＤＲＥ
Ｒ）という。In preparation for such a situation, the CPU 3 causes the S
The timer is started from the IO issue, and if there is no interrupt from the controller 1 within a fixed time, a timeout is detected.
(NISSING INTERRUPT HANDRE
R).

【００１０】このＭＩＨは、コントローラ１で検出でき
ないため、ＭＩＨが生じると、ＣＰＵ３は、コントロー
ラ１に対し、ＭＩＨの対象となっている磁気ディスク装
置に対するホルト・オペレーション（ＨＡＬＴＯＰＥ
ＲＡＴＩＯＮ）指令を発行し、コントローラ１にＭＩＨ
発生を知らせ、そのＳＩＯと制御情報をキャンセルし、
磁気ディスク装置を解放して、その磁気ディスク装置に
対する次のＳＩＯを実行させ、処理を継続している。Since the MIH cannot be detected by the controller 1, when the MIH occurs, the CPU 3 causes the controller 1 to perform a halt operation (HALT OPE) for the magnetic disk device which is the target of the MIH.
Issue a RATION) command to MIH controller 1
Notify the occurrence, cancel the SIO and control information,
The magnetic disk device is released, the next SIO for the magnetic disk device is executed, and the processing is continued.

【００１１】従来は、ＭＩＨの発生は、ＣＰＵ３が把握
しているのみであり、コントローラ１のその時の状態
は、ホルト・オペレーション指令により、キャンセルさ
れるから、状態を把握することはできなかった。Conventionally, the occurrence of MIH is only grasped by the CPU 3, and the state of the controller 1 at that time is canceled by the halt operation command, so that the state could not be grasped.

【００１２】[0012]

【発明が解決しようとする課題】しかしながら、従来技
術では、次の問題があった。ＭＩＨ発生時の状態が把握できないため、ＭＩＨの原
因を解析できない。However, the prior art has the following problems. The cause of MIH cannot be analyzed because the state when MIH occurs cannot be grasped.

【００１３】コントローラ１で、常時、ＳＩＯとその
状態を保存しておく方法があり（例えば、特開昭６２−
２９１７５２号公報等）、この方法では、保存内容を解
析して、ＭＩＨの原因を解明できるが、ＳＩＯの処理毎
に保存するため、コントローラ１の処理速度が遅くな
り、処理能力が低下するばかりか、膨大なメモリを必要
とする。There is a method in which the controller 1 always saves the SIO and its state (see, for example, Japanese Patent Laid-Open No. 62-
With this method, the contents of MIH can be analyzed and the cause of MIH can be clarified by this method, but since it is saved for each SIO process, the processing speed of the controller 1 slows down, and not only the processing capability decreases. , Requires huge memory.

【００１４】従って、本発明は、ファイル制御装置の処
理速度を低下することなく、ＭＩＨの原因を解析するこ
とができるファイルシステムの割り込み障害処理方法を
提供することを目的とする。Therefore, it is an object of the present invention to provide a file system interrupt failure processing method capable of analyzing the cause of MIH without reducing the processing speed of the file controller.

【００１５】[0015]

【課題を解決するための手段】図１は本発明の原理図で
ある。本発明の請求項１は、ホスト３に接続されたファ
イル制御装置１と、該ファイル制御装置１に接続され、
制御される複数のファイルデバイス２ａ〜２ｎとを有
し、該ファイル制御装置１が、ホスト３からのスタート
Ｉ／Ｏ指令に応じて、指定された該ファイルデバイス２
ａ〜２ｎをアクセスして、アクセス完了により、該ホス
ト３に割り込み、再結合して、該ホスト３と該ファイル
デバイス２ａ〜２ｎとのデータ転送を行うファイルシス
テムにおいて、該ファイル制御装置１は、該ホスト３
が、該スタートＩ／Ｏ指令発行から一定時間以内に割り
込みがないことにより発行するホルト・オペレーション
指令に応じて、該スタートＩ／Ｏ指令をキャンセルする
とともに、該スタートＩ／Ｏ指令の実行による内部情報
を収集して、内部メモリに保存しておくことを特徴とす
る。FIG. 1 shows the principle of the present invention. According to claim 1 of the present invention, the file controller 1 connected to the host 3 and the file controller 1 connected to the file controller 1,
The file control device 1 has a plurality of controlled file devices 2a to 2n, and the file control device 1 designates the specified file device 2 in response to a start I / O command from the host 3.
In the file system that accesses a to 2n, interrupts the host 3 upon completion of access, rejoins, and transfers data between the host 3 and the file devices 2a to 2n, the file control device 1 The host 3
However, the start I / O command is canceled in accordance with a halt operation command issued when there is no interrupt within a fixed time after the start I / O command is issued, and the internal operation is executed by executing the start I / O command. The feature is that information is collected and stored in an internal memory.

【００１６】本発明の請求項２は、請求項１において、
前記内部情報は、前記ホルト・オペレーション指令で対
象とする前記ファイルデバイスの制御情報であることを
特徴とする。According to claim 2 of the present invention, in claim 1,
The internal information is control information of the file device targeted by the halt operation command.

【００１７】本発明の請求項３は、請求項１又は２にお
いて、前記ファイル制御装置１は、該ホスト３のチャネ
ル３ａ〜３１と接続するためのチャネルアダプタ４ａ〜
４ｍと、該複数のファイルデバイス２ａ〜２ｎと接続す
るためのデバイスアダプタ５ａ〜５ｍと、全体を管理す
る全体管理部６とを有し、該全体管理部６の制御テーブ
ル９ａの制御情報を収集して、該内部メモリに保存する
ことを特徴とする。According to a third aspect of the present invention, in the first or second aspect, the file control device 1 is connected to the channels 3a to 31 of the host 3 through channel adapters 4a to 4a.
4m, device adapters 5a to 5m for connecting to the plurality of file devices 2a to 2n, and an overall management unit 6 that manages the whole, and collects control information of the control table 9a of the overall management unit 6. And is stored in the internal memory.

【００１８】本発明の請求項４は、請求項１又は２又は
３において、前記ファイル制御装置１は、前記ホスト３
からの情報転送コマンドに応じて、前記内部メモリの保
存情報を前記ホスト３に転送することを特徴とする。According to a fourth aspect of the present invention, in the first, second or third aspect, the file controller 1 is the host 3
The storage information in the internal memory is transferred to the host 3 in response to an information transfer command from the.

【００１９】[0019]

【作用】ファイル制御装置１の処理速度を低下しないた
めには、ＭＩＨの発生時のみ、ファイル制御装置の状態
情報を保存すれば良いが、ホルト・オペレーション指令
により、状態が変化してしまうので、その前に状態を保
存する必要がある。In order to prevent the processing speed of the file controller 1 from decreasing, it is sufficient to save the status information of the file controller only when MIH occurs, but the status changes due to the halt operation command. Before that we need to save the state.

【００２０】このファイル制御装置１が、ＭＩＨの発生
を最初に知るのは、ホスト３からのホルト・オペレーシ
ョン指令であるから、これを契機に情報を保存すればよ
い。そこで、本発明の請求項１では、ファイル制御装置
１は、該ホスト３が、該スタートＩ／Ｏ指令発行から一
定時間以内に割り込みがないことにより発行するホルト
・オペレーション指令に応じて、該スタートＩ／Ｏ指令
をキャンセルするとともに、該スタートＩ／Ｏ指令の実
行による内部情報を収集して、内部メモリに保存してお
くようにして、ＭＩＨの原因解明を可能とした。Since the file control device 1 first knows the occurrence of MIH is the halt operation command from the host 3, the information may be saved with this as a trigger. Therefore, in claim 1 of the present invention, the file controller 1 starts the host 3 in response to a halt operation command issued when no interruption occurs within a fixed time after the start I / O command is issued. The cause of MIH can be clarified by canceling the I / O command, collecting internal information by executing the start I / O command, and storing the internal information in the internal memory.

【００２１】本発明の請求項２では、ファイル制御装置
１の全ての内部情報を保存することは望ましいが、この
ようにすると、処理得度が低下し、無駄も多いので、障
害解明に最低必要なホルト・オペレーション指令で対象
とする前記ファイルデバイスの制御情報に限った。According to the second aspect of the present invention, it is desirable to save all the internal information of the file control device 1. However, in this case, the processing efficiency is lowered and waste is large, so that it is at least necessary for the failure elucidation. It is limited to the control information of the file device targeted by the Holt operation command.

【００２２】本発明の請求項３では、前記ファイル制御
装置１は、該ホスト３のチャネル３ａ〜３１と接続する
ためのチャネルアダプタ４ａ〜４ｍと、該複数のファイ
ルデバイス２ａ〜２ｎと接続するためのデバイスアダプ
タ５ａ〜５ｍと、全体を管理する全体管理部６とを有す
るものでは、全体管理部６が制御を握っているので、全
体管理部６の制御テーブル９ａの制御情報を収集して、
該内部メモリに保存するようにした。In the third aspect of the present invention, the file controller 1 connects the channel adapters 4a to 4m for connecting to the channels 3a to 31 of the host 3 and the plurality of file devices 2a to 2n. In the device having the device adapters 5a to 5m and the overall management unit 6 that manages the entire device, the overall management unit 6 is in control, so the control information of the control table 9a of the overall management unit 6 is collected,
It was saved in the internal memory.

【００２３】本発明の請求項４では、前記ファイル制御
装置１は、前記ホスト３からの情報転送コマンドに応じ
て、前記内部メモリの保存情報を前記ホスト３に転送す
るので、ホスト３側のコンソール等で、処理に影響を与
えずに、ＭＩＨの原因を解明できる。In claim 4 of the present invention, the file control device 1 transfers the stored information in the internal memory to the host 3 in response to an information transfer command from the host 3, so that the console on the host 3 side In this way, the cause of MIH can be elucidated without affecting the processing.

【００２４】[0024]

【実施例】【Example】

(a) 一実施例の説明図２は本発明の一実施例構成図、図３は本発明の一実施
例全体管理部の構成図であり、磁気ディスクファイルシ
ステムを示している。(a) Description of an Embodiment FIG. 2 is a configuration diagram of an embodiment of the present invention, and FIG. 3 is a configuration diagram of an overall management unit of an embodiment of the present invention, showing a magnetic disk file system.

【００２５】図中、図１及び図５で示したものと同一の
ものは、同一の記号で示してあり、図２において、ＣＰ
Ｕ３は、複数のチャネル３ａ〜３１を有し、複数の磁気
ディスク制御装置（コントローラ）１に接続される。In the figure, the same parts as those shown in FIGS. 1 and 5 are indicated by the same symbols, and in FIG.
U3 has a plurality of channels 3a to 31 and is connected to a plurality of magnetic disk control devices (controllers) 1.

【００２６】磁気ディスク制御装置１は、ＣＰＵ３のチ
ャネル３ａ〜３１との接続のためのチャネルアダプタ４
ａ〜４ｍと、磁気ディスク装置（デバイス）との接続の
ためのデバイスアダプタ５ａ〜５ｍと、バス７を介しこ
れらと接続し、全体を制御するための全体管理部６とを
有している。The magnetic disk controller 1 includes a channel adapter 4 for connecting to the channels 3a to 31 of the CPU 3.
a to 4 m, device adapters 5 a to 5 m for connecting to a magnetic disk device (device), and an overall management unit 6 connected to these via a bus 7 and controlling the whole.

【００２７】磁気ディスク装置は、各々２つのデバイス
アダプタに接続された磁気ディスク装置２０ａ〜２０
ｎ、２１ａ〜２１ｎ、２２ａ〜２２ｎを備える。磁気デ
ィスク制御装置１は、複数のＣＰＵ３に接続される。The magnetic disk devices are magnetic disk devices 20a to 20 connected to two device adapters, respectively.
n, 21a to 21n, 22a to 22n. The magnetic disk controller 1 is connected to a plurality of CPUs 3.

【００２８】全体管理部６は、ＣＰＵ３からのスタート
Ｉ／Ｏ指令を受理したチャネルアダプタ４ａ〜４ｍから
のバス７を介した解読内容により、指定された磁気ディ
スク装置（デバイス）の状態（使用中、使用されていな
い、リザーブされている等）を判定し、使用されていな
ければ、その磁気ディスク装置のデバイスアダプタ５ａ
〜５ｍを選択し、その磁気ディスク装置をアクセスし、
アクセス完了により、当該チャネルアダプタ４ａ〜４ｍ
とデバイスアダプタ５ａ〜５ｍとを接続して、ＣＰＵ３
と磁気ディスク装置とをオンライン状態として、データ
のリード／ライトを行わせる。The overall management unit 6 determines the state (in use) of the specified magnetic disk device (device) according to the contents of decoding via the bus 7 from the channel adapters 4a to 4m that have received the start I / O command from the CPU 3. , Not used, reserved, etc.), and if not used, the device adapter 5a of the magnetic disk device.
Select ~ 5m, access the magnetic disk device,
Upon completion of access, the channel adapters 4a-4m
And the device adapters 5a to 5m, and the CPU3
And the magnetic disk device are brought into an online state to read / write data.

【００２９】この全体管理部６は、図３に示すように、
マイクロプロセッサ（ＭＰＵ）８と、メモリ９とを有
し、メモリ９に制御情報テーブル９ａを設け、この処理
を実行する。As shown in FIG. 3, the overall management unit 6 is
It has a microprocessor (MPU) 8 and a memory 9, and a control information table 9a is provided in the memory 9 to execute this processing.

【００３０】制御情報テーブル９ａは、各磁気ディスク
装置（デバイス）の状態（使用中、使用されていない、
リザーブされている等）を格納するデバイス状態エリア
９０と、選択された磁気ディスク装置のＳＩＯ実行状態
（結合中、切離し中、シーク完了、再結合中、リード／
ライト中等）を格納するデバイス実行状態エリア９１
と、各チャネルアダプタ４ａ〜４ｍの状態（正常／異常
等）を格納するチャネルアダプタ状態エリア９２と、各
デバイスアダプタ５ａ〜５ｍの状態（正常／異常／動作
中等）を格納するデバイスアダプタ状態エリア９３と、
収集エリア９４等を有する。The control information table 9a shows the status of each magnetic disk device (device) (in use, not used,
A device status area 90 for storing (reserved, etc.) and an SIO execution status of the selected magnetic disk device (combining, disconnecting, seek completion, recombining, read / read)
Device execution status area 91 for storing (writing, etc.)
And a channel adapter status area 92 for storing the status (normal / abnormal, etc.) of each channel adapter 4a-4m, and a device adapter status area 93 for storing the status (normal / abnormal / operating, etc.) of each device adapter 5a-5m. When,
It has a collection area 94 and the like.

【００３１】尚、ＳＩＯは、メモリ９の図示しないエリ
アに格納され、同一磁気ディスク装置を対象とするもの
は、待ち行列に登録され、前のＳＩＯが実行完了となる
と、そのＳＩＯが削除され、実行される。Incidentally, the SIO is stored in an area (not shown) of the memory 9, and those for the same magnetic disk device are registered in the queue, and when the execution of the previous SIO is completed, the SIO is deleted, Executed.

【００３２】図４は本発明の一実施例ＭＩＨ収集処理フ
ロー図である。ＣＰＵ３が、ＳＩＯを発行すると、ＣＰＵ３はタイマ
監視を開始する。ＳＩＯを受理した磁気ディスク制御装置１のチャネルア
ダプタ４ａ〜４ｍは、全体管理部６に通知し、ＣＰＵ３
から切り離され、全体管理部６は、メモリ９にこのＳＩ
Ｏを格納した後、制御情報テーブル９ａのデバイス状態
エリア９０、デバイスアダプタ状態エリア９３を参照し
て、指定された磁気ディスク装置のデバイスアダプタ５
ａ〜５ｍを起動する。FIG. 4 is a flow chart showing the MIH collecting process according to the embodiment of the present invention. When the CPU 3 issues the SIO, the CPU 3 starts timer monitoring. The channel adapters 4a to 4m of the magnetic disk control device 1 that received the SIO notify the overall management unit 6 and the CPU 3
And the general management unit 6 stores this SI in the memory 9.
After O is stored, the device status area 90 and the device adapter status area 93 of the control information table 9a are referred to and the device adapter 5 of the specified magnetic disk device is referred to.
Start a-5m.

【００３３】デバイスアダプタ５ａ〜５ｍは、指定され
た磁気ディスク装置にシーク動作を命じて、切離し、磁
気ディスク装置のシーク完了による割り込みにより再結
合し、全体管理部６の制御により、チャネルアダプタ４
ａ〜４ｍと接続し、チャネルアダプタ４ａ〜４ｍは、Ｃ
ＰＵ３に割り込みを行う。The device adapters 5a to 5m command the designated magnetic disk device to perform a seek operation, disconnect, and reconnect by an interrupt upon completion of the seek of the magnetic disk device, and under the control of the general manager 6, the channel adapter 4
a to 4 m, and the channel adapters 4 a to 4 m are C
Interrupt PU3.

【００３４】この動作の状態は、全て全体管理部６へ通
知され、制御テーブル９ａに格納される。上述が正常動
作であるが、何らかの理由により、ＣＰＵ３に割り込み
を通知しない場合がある。All the states of this operation are notified to the general manager 6 and stored in the control table 9a. Although the above is the normal operation, the CPU 3 may not be notified of the interrupt for some reason.

【００３５】ＣＰＵ３は、磁気ディスク制御装置１か
らのＳＩＯに対する割り込みがタイマ監視時間内に到来
しないと、タイムアウトとなり、ＳＩＯのタイムアウト
を検出する。If the interrupt from the magnetic disk controller 1 to the SIO does not arrive within the timer monitoring time, the CPU 3 times out and detects the SIO time out.

【００３６】ＣＰＵ３は、タイムアウトを検出した磁
気ディスク制御装置１に対して、ホルト・オペレーショ
ン指令を発行する。このホルト・オペレーション指令は、磁気ディスク制御
装置１に、割り込みが監視時間内になかったことを知ら
せる意味と、当該処理途中のＳＩＯとその制御情報を排
除させ、次のＳＩＯを実行させ、ＳＩＯの対象となって
いる磁気ディスク装置を解放する意味がある。The CPU 3 issues a halt operation command to the magnetic disk controller 1 which has detected the time-out. This halt operation command means that the magnetic disk controller 1 is informed that the interrupt was not within the monitoring time and that the SIO in the middle of the process and its control information are eliminated and the next SIO is executed. It makes sense to release the target magnetic disk device.

【００３７】ホルト・オペレーション指令を受けた磁
気ディスク制御装置１では、全体管理部６が、メモリ９
の制御情報の収集を行う。この収集情報は、ＣＰＵ３からどんなＳＩＯが発行され
たかのトレースと、ＳＩＯを実行している制御情報であ
る。In the magnetic disk control device 1 which has received the halt operation command, the general management unit 6 operates the memory 9
The control information of is collected. This collected information is a trace of what SIO has been issued from the CPU 3 and control information for executing SIO.

【００３８】この制御情報は、大量にあるため、全部を
保持すると、時間がかかり、ＣＰＵ３を待たせることに
なるから、そのホルト・オペレーション指令で指定され
た磁気ディスク装置に関する制御情報等とし、全体管理
部６のＭＰＵ８は、図３に示すように、そのＳＩＯの内
容と、デバイス情報エリア９０の指定磁気ディスク装置
の状態と、デバイス実行状態エリア９１の指定磁気ディ
スク装置の実行状態と、チャネルアダプタ状態エリア９
２のそのＳＩＯに関連するチャネルアダプタの状態と、
デバイスアダプタ状態エリア９３の指定磁気ディスク装
置に対応するデバイスアダプタの状態とを取り出し、編
集して、収集エリア９４に格納する。Since this control information is in a large amount, if all of it is held, it takes time and makes the CPU 3 wait. Therefore, the control information regarding the magnetic disk device specified by the halt operation command is used as the whole control information. As shown in FIG. 3, the MPU 8 of the management unit 6 includes the contents of the SIO, the state of the designated magnetic disk device in the device information area 90, the execution state of the designated magnetic disk device in the device execution state area 91, and the channel adapter. State area 9
2, the state of the channel adapter associated with that SIO,
The status of the device adapter corresponding to the designated magnetic disk device in the device adapter status area 93 is extracted, edited, and stored in the collection area 94.

【００３９】そして、ＭＰＵ８は、当該ＳＩＯを排除
し、制御テーブル９ａの当該ＳＩＯの制御情報を排除し
て、デバイスアダプタ、磁気ディスク装置を解放する。
これにより、全体管理部６は、次のＳＩＯの実行が可能
となる。Then, the MPU 8 eliminates the SIO, eliminates the control information of the SIO in the control table 9a, and releases the device adapter and the magnetic disk device.
As a result, the overall management unit 6 can execute the next SIO.

【００４０】このメモリ９の収集エリア９４の収集情
報は、ＣＰＵ３からの転送コマンド（磁気ディスク制御
装置の情報を転送するコマンドであり、例えば、ＳＵＢ
ＳＹＳＴＥＭＣＯＮＴＲＯＬ、ＳＵＢＳＹＳＴＥＭ
ＲＥＡＤというコマンド）により、ＣＰＵ３に吸い上げ
られ、コンソール等に表示して、障害の究明ができる。The collection information in the collection area 94 of the memory 9 is a transfer command from the CPU 3 (a command for transferring information of the magnetic disk control device, for example, SUB
SYSTEM CONTROL, SUBSYSTEM
The command (READ) causes the CPU 3 to download and display it on a console or the like so that the failure can be investigated.

【００４１】例えば、デバイス状態からは、リザーブさ
れているのに処理されているとか、デバイスの実行状態
では、実行されているのに処理されていないとか、チャ
ネルアダプタの状態では、チャネルアダプタが異常であ
るとか、デバイスアダプタの状態では、デバイスアダプ
タが動作しなかった等の分析ができる。For example, from the device state, the channel adapter is reserved but is being processed, the device is in the executing state, is being executed but not being processed, or the channel adapter is abnormal in the channel adapter state. It is possible to analyze that the device adapter did not operate in the state of the device adapter.

【００４２】このような状態は、エラーとして検出され
ず、エラー通知ができないものである。このようにし
て、ＣＰＵ３のホルト・オペレーション指令を契機に内
部情報を収集するので、エラーにならないＭＩＨ障害の
分析が可能である。Such a state is not detected as an error and an error cannot be notified. In this way, since the internal information is collected with the halt operation command of the CPU 3 as a trigger, it is possible to analyze the MIH failure without causing an error.

【００４３】このホルト・オペレーション指令を契機と
すると、その原因となる障害が発生時よりも時間差があ
る情報を収集するが、情報収集しないと、制御情報を変
化してしまい、障害究明が一層困難となり、磁気ディス
ク制御装置１が、最初にＭＩＨを認識できるホルト・オ
ペレーション指令で実行すれば、できる限り障害発生時
に近い時点での情報が収集できる。When this halt operation command is used as an opportunity, information that has a time difference from that at the time of the occurrence of the failure is collected, but if the information is not collected, the control information will change, making it more difficult to identify the failure. Therefore, if the magnetic disk controller 1 executes the halt operation command capable of recognizing the MIH first, information can be collected at the time when the failure occurs as close as possible.

【００４４】しかも、収集情報を、ＭＩＨの対象となっ
たＳＩＯに限っているので、収集処理に時間がかから
ず、次のＳＩＯの処理を長時間待たせることがない。 (b) 他の実施例の説明上述の実施例の他に、本発明は、次のような変形が可能
である。Moreover, since the collection information is limited to the SIO targeted for MIH, the collection processing does not take time and the next SIO processing does not wait for a long time. (b) Description of Other Embodiments In addition to the above embodiments, the present invention can be modified as follows.

【００４５】ファイルデバイスを、磁気ディスク装
置、ファイル制御装置を磁気ディスク制御装置で説明し
たが、光ディスク装置等の他のファイルデバイスにも適
用できる。Although the magnetic disk device is used as the file device and the magnetic disk controller is used as the file control device, the file device can be applied to other file devices such as an optical disk device.

【００４６】収集エリア９４の内容を、例えばハード
ディスク等に保持し、ファイル制御装置のコンソール、
保守パネル等から必要な時に、読み出してもよい。以上、本発明を実施例により説明したが、本発明の主旨
の範囲内で種々の変形が可能であり、これらを本発明の
範囲から排除するものではない。The contents of the collection area 94 are held in, for example, a hard disk, and the console of the file control device,
It may be read from the maintenance panel or the like when necessary. Although the present invention has been described with reference to the embodiments, various modifications are possible within the scope of the gist of the present invention, and these modifications are not excluded from the scope of the present invention.

【００４７】[0047]

【発明の効果】以上説明したように、本発明によれば、
次の効果を奏する。ＭＩＨの発生時に、ホスト３からのホルト・オペレー
ション指令を契機に、ＳＩＯに関連する内部情報を収集
するので、障害の究明がファイル制御装置の負担となら
ずに行うことができる。As described above, according to the present invention,
It has the following effects. When MIH occurs, internal information related to SIO is collected triggered by a halt operation command from the host 3, so that the failure can be investigated without burdening the file controller.

【００４８】又、ＳＩＯに関連する内部情報を収集す
るので、次のＳＩＯの処理を長時間待たせることなく、
内部情報の収集が可能となる。Further, since internal information related to SIO is collected, the next SIO process can be performed without waiting for a long time.
It becomes possible to collect internal information.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の原理図である。FIG. 1 is a principle diagram of the present invention.

【図２】本発明の一実施例構成図である。FIG. 2 is a configuration diagram of an embodiment of the present invention.

【図３】本発明の一実施例全体管理部の構成図である。FIG. 3 is a configuration diagram of an overall management unit according to an embodiment of the present invention.

【図４】本発明の一実施例ＭＩＨ収集処理フロー図であ
る。FIG. 4 is a flow chart of MIH collection processing according to an embodiment of the present invention.

【図５】従来技術の説明図である。FIG. 5 is an explanatory diagram of a conventional technique.

【符号の説明】[Explanation of symbols]

１ファイル制御装置（磁気ディスク制御装置）２ａ〜２ｎファイルデバイス（磁気ディスク装置）３ホスト（ＣＰＵ）４ａ〜４ｎチャネルアダプタ５ａ〜５ｎデバイスアダプタ６全体管理部８ＭＰＵ９メモリ９ａ制御テーブル９４収集エリア 1 File Control Device (Magnetic Disk Control Device) 2a to 2n File Device (Magnetic Disk Device) 3 Host (CPU) 4a to 4n Channel Adapter 5a to 5n Device Adapter 6 Overall Management Unit 8 MPU 9 Memory 9a Control Table 94 Collection Area

Claims

【特許請求の範囲】[Claims]

【請求項１】ホスト（３）に接続されたファイル制御
装置（１）と、該ファイル制御装置（１）に接続され、
制御される複数のファイルデバイス（２ａ〜２ｎ）とを
有し、該ファイル制御装置（１）が、ホスト（３）から
のスタートＩ／Ｏ指令に応じて、指定された該ファイル
デバイス（２ａ〜２ｎ）をアクセスして、アクセス完了
により、該ホスト（３）に割り込み、再結合して、該ホ
スト（３）と該ファイルデバイス（２ａ〜２ｎ）とのデ
ータ転送を行うファイルシステムにおいて、該ファイル制御装置（１）は、該ホスト（３）が、該ス
タートＩ／Ｏ指令発行から一定時間以内に割り込みがな
いことにより発行するホルト・オペレーション指令に応
じて、該スタートＩ／Ｏ指令をキャンセルするととも
に、該スタートＩ／Ｏ指令の実行による内部情報を収集し
て、内部メモリに保存しておくことを特徴とするファイ
ルシステムの割り込み障害処理方法。1. A file controller (1) connected to a host (3), and a file controller (1) connected to the file controller (1),
A plurality of controlled file devices (2a to 2n), and the file control device (1) specifies the specified file device (2a to 2n) in response to a start I / O command from the host (3). 2n), and upon completion of the access, interrupt the host (3), rejoin, and transfer the data between the host (3) and the file devices (2a to 2n). The control device (1) cancels the start I / O command in response to the halt operation command issued by the host (3) when there is no interruption within a fixed time after the start I / O command is issued. At the same time, the internal information generated by the execution of the start I / O command is collected and stored in the internal memory, which causes an interrupt failure of the file system. Management method.

【請求項２】前記内部情報は、前記ホルト・オペレー
ション指令で対象とする前記ファイルデバイスの制御情
報であることを特徴とする請求項１のファイルシステム
の割り込み障害処理方法。2. The file system interrupt failure processing method according to claim 1, wherein the internal information is control information of the file device targeted by the halt operation command.

【請求項３】前記ファイル制御装置（１）は、該ホス
ト（３）のチャネル（３ａ〜３１）と接続するためのチ
ャネルアダプタ（４ａ〜４ｍ）と、該複数のファイルデ
バイス（２ａ〜２ｎ）と接続するためのデバイスアダプ
タ（５ａ〜５ｍ）と、全体を管理する全体管理部（６）
とを有し、該全体管理部（６）の制御テーブル（９ａ）
の制御情報を収集して、該内部メモリに保存することを
特徴とする請求項１又は２のファイルシステムの割り込
み障害処理方法。3. The file controller (1) includes channel adapters (4a-4m) for connecting to channels (3a-31) of the host (3), and the plurality of file devices (2a-2n). Device adapters (5a to 5m) for connecting with the device, and an overall management unit (6) for managing the entire device
And a control table (9a) of the overall management unit (6)
3. The interrupt failure processing method for a file system according to claim 1, wherein the control information of 1. is collected and stored in the internal memory.

【請求項４】前記ファイル制御装置（１）は、前記ホ
スト（３）からの情報転送コマンドに応じて、前記内部
メモリの保存情報を前記ホスト（３）に転送することを
特徴とする請求項１又は２又は３のファイルシステムの
割り込み障害処理方法。4. The file control device (1) transfers information stored in the internal memory to the host (3) in response to an information transfer command from the host (3). An interrupt failure processing method for the file system of 1 or 2 or 3.