JPS62130442A

JPS62130442A - File subsystem

Info

Publication number: JPS62130442A
Application number: JP60272064A
Authority: JP
Inventors: Norihiko Sakurai; 櫻井　紀彦; Hiroshi Yamaguchi; 博山口; Yasuo Kinouchi; 木ノ内　康夫
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1985-12-03
Filing date: 1985-12-03
Publication date: 1987-06-12

Abstract

PURPOSE:To reduce the load on a host device by accessing an external device when trouble occurs to a cache and accessing a target data block on the external storage device when a target process can be substituted. CONSTITUTION:A controller 2 is equipped with the cache 4 between a main memory device and the external memory device 3, a flag 241 which indicates whether access is already attained or not is provided for a cache trouble monitoring part 26 and the device 3, and a part which controls data transfer between the host controller 1, and the device 3 and cache 4 is further provided. Then, the monitor part 26 when detects the trouble of the cache 4 confirms the flag 241, when the device 43 is not accessed, the device 3 is accessed and when the device 3 is accessed, the device 1 is informed after its normal end as if the cache 1 were not accessed.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は主記憶装置と複数の外部記憶装置を含む計算機
システムにおいて、主記憶装置と外部記憶装置との間に
キャッシュを有するファイルサブシステムに関する。Detailed Description of the Invention [Field of Industrial Application] The present invention relates to a file subsystem having a cache between the main storage device and the external storage device in a computer system including a main storage device and a plurality of external storage devices. .

〔従来の技術〕[Conventional technology]

キャッシュを有するファイルサブシステムでは。 In a file subsystem that has a cache.

外部記憶装置の一部のデータをキャッシュに一時的に格
納し、目的のデータ処理をそのキャッシュ上のデータブ
ロックを用いて行える様に制御することによって、レス
ポンスタイムやスループットの向丘が期待できる。こＮ
で、キャッシュは例えばＭＯＳメモリ素子で構成された
キャッシュメモリやそれにアクセスするための論理回路
等で構成されている。また外部記憶装置としては磁気デ
ィスク装置や磁気テープ装置などのキャッシュよりも低
速なファイル記憶装置が適用される。また制御回路では
これらを制御してデータ転送などの処理を司どるが、一
般にキャッシュはオプシＪンとして構成される場合が多
く、キャッシュなしの場合にも外部記憶装置に対して処
理を行うように構成されていることもある。By temporarily storing some data in an external storage device in a cache and controlling the data so that the desired data processing can be performed using the data blocks on the cache, improvements in response time and throughput can be expected. This N
The cache is composed of, for example, a cache memory composed of a MOS memory element, a logic circuit for accessing the cache memory, and the like. Further, as the external storage device, a file storage device such as a magnetic disk device or a magnetic tape device that is slower than a cache is applied. In addition, the control circuit controls these and manages processes such as data transfer, but the cache is generally configured as an option, so even if there is no cache, processing can be performed on an external storage device. Sometimes it is configured.

これらのファイルサブシステムの中で、特に読みだし命
令に対しては、目的のデータの複写がキャッシュ上に存
在する場合、そのデータを該キャッシュより得られる様
に制御し、また書込み命令に対しては、キャッシュと外
部記憶装置の両方の目的のデータブロックに対し書込み
処理を行うファイルサブシステムは、ストアイン方式の
キャッシュ付ファイルサブシステムと呼ばれている。Among these file subsystems, especially for read commands, if a copy of the target data exists on the cache, control is performed so that the data is obtained from the cache, and for write commands A file subsystem that performs write processing on data blocks for both cache and external storage is called a store-in cache-equipped file subsystem.

この様なファイルサブシステムにおいて、キャッシュに
障害が発生した場合には、一定の回数の再試行を行った
後に回復できない障害の発生としてそのＩ１０処理を異
常終了させ、例えばオペレーティングシステム（ＯＳ）
等に対しセンスバイトを報告するなどの手段で、上位の
制御手段にその状況を報告し、障害回復処理の制御を委
ねている。上位の制御手段では、これに対して、例えば
キャッシャを論理的に切り離す等の障害回復処理を行い
、外部記憶装置に直接アクセスする等によりデータ処理
を行っている。In such a file subsystem, if a failure occurs in the cache, the I10 process is abnormally terminated as an unrecoverable failure after a certain number of retries, and the operating system (OS)
The situation is reported to the higher-level control means by means such as reporting sense bytes to the host controller, etc., and control of failure recovery processing is entrusted to the higher-level control means. In response to this, the higher-level control means performs failure recovery processing such as logically separating the cacher, and performs data processing by directly accessing the external storage device.

一方、キャッシュのみの更新を先に行い、適宜外部記憶
装置の更新を行うキャッシュ書込み方式もある。この方
式は、ストアイン方式とよばれ、ＷＲＩＴＥアクセスに
おけるアクセスタイムが高速であることより、特に高速
性を要求されるデータに対して適用される。この方式の
場合、キャッシュ障害が発生した時点で、必ずしも外部
記憶装置に対して情報の複写が行われれているとは限ら
ない。そこで従来のファイルサブシステムでは、キャッ
シュに障害が発生したことを上位制御手段に報告し、上
位制御手段は、ファイル回復やデバイス全体の障害回復
処理を行っている。On the other hand, there is also a cache write method in which only the cache is updated first, and then the external storage device is updated as appropriate. This method is called a store-in method, and because the access time in WRITE access is fast, it is particularly applied to data that requires high speed. In this method, information is not necessarily copied to the external storage device at the time a cache failure occurs. Therefore, in conventional file subsystems, the occurrence of a failure in the cache is reported to the higher-level control means, and the higher-level control means performs file recovery and failure recovery processing for the entire device.

〔発明が解決しようとする問題点〕[Problem that the invention seeks to solve]

上記の様な方式を用いた場合、以下の様な問題が生じる
。When the above-mentioned method is used, the following problems occur.

（１）本来の記憶媒体である外部記憶装置には正常なデ
ータブロックを記憶している場合でも、キャッシュに障
害が生じると、そのＩ１０処理を異常終了させ、上位の
制御手段に一度障害処理を依頼する。すなわち、本来の
外部記憶装置に直接アクセスすれば、一度のＩ１０処理
で正常に必要なデータブロックにアクセスできるところ
を、さらに、障害処理および再■／○処理が必要となり
、目的のデータ処理ヲ完了するまでの時間が大幅に増加
するという問題がある。(1) Even if a normal data block is stored in the external storage device, which is the original storage medium, if a failure occurs in the cache, the I10 processing will be abnormally terminated and the upper control means will be required to handle the failure once. Make a request. In other words, if the original external storage device is accessed directly, the necessary data blocks can be accessed normally with a single I10 process, but now additional failure handling and re-processing are required to complete the desired data processing. There is a problem in that the time it takes to complete the process increases significantly.

（２）上位の制御手段は、ファイルサブシステムからの
障害処理の依頼に応じるために、キャッシュの存在に留
意し、かつファイルサブシステム障害処理のための手続
きをあらかじめ用意しておかなければならない。(2) In order to respond to requests for troubleshooting from the file subsystem, the upper control means must be aware of the existence of a cache and prepare procedures for troubleshooting in the file subsystem in advance.

前述の説明の様に、ストアスル一方式では、キャッシュ
に異常が生じても、外部記憶装置のデータブロックの内
容は保証されている。従ってＩ１０処理を実行するとい
う本来の目的は、代替手段として外部記憶装置へのアク
セスを行うことによりそれを遂行可能である。また、ス
トアイン方式でも、その時点でのメモリの複写が外部記
憶装置上に存在する場合やＷＲＩＴＥアクセスの場合に
は、ストアスル一方式の場合と同様に処理ができるが、
その他の場合には、キャッシュに異常が生じるとデータ
の回復処理が必要となる。この場合、キャッシュ上に複
写を存在させておいた外部記憶装置に対して、必要な時
点から再度Ｉ１０処理をやりなおすなどの障害回復処理
が必要であり、結果的に外部記憶装置自身に障害が発生
した場合と同様な障害回復処理を行うことになる。As explained above, in the store-through type, even if an abnormality occurs in the cache, the contents of the data blocks in the external storage device are guaranteed. Therefore, the original purpose of executing the I10 process can be achieved by accessing the external storage device as an alternative means. In addition, even with the store-in method, if a copy of the memory at that point exists on an external storage device or in the case of WRITE access, processing can be performed in the same way as with the store-through method.
In other cases, if an abnormality occurs in the cache, data recovery processing is required. In this case, it is necessary to perform failure recovery processing such as redoing the I10 process from the necessary point on the external storage device that has a copy in the cache, and as a result, a failure occurs in the external storage device itself. Failure recovery processing will be performed in the same way as in the case of

本発明の目的は、キャッシュ障害時の上位装置の負担を
極力軽減するファイルサブシステムを提供することにあ
る。SUMMARY OF THE INVENTION An object of the present invention is to provide a file subsystem that reduces the burden on a host device as much as possible in the event of a cache failure.

〔問題点を解決するための手段及び作用〕本発明はキャ
ッシュの障害発生時、外部記憶装置にアクセスすること
によって目的の処理が代替できる場合には、外部記憶装
置をアクセスする制御手段を有する。[Means and operations for solving the problems] The present invention has a control means for accessing an external storage device when a cache failure occurs and the target process can be substituted by accessing the external storage device.

〔作　用〕[For production]

ストアスル一方式の場合、キャッシュ上の本来のデータ
ブロックは外部記憶装置にあり、キャッシュの障害発生
時、上位に障害を報告することなく、外部記憶装置から
目的のデータブロックを読出す。ストアインの場合でも
、本来のデータブロックが外部記憶装置に存在する場合
、同様に扱うことができる。In the case of the store-through type, the original data block on the cache is in an external storage device, and when a cache failure occurs, the target data block is read from the external storage device without reporting the failure to a higher level. Even in the case of store-in, if the original data block exists in the external storage device, it can be handled in the same way.

〔実施例〕〔Example〕

具体的な実施例を説明する前に、本発明の詳細な説明す
る。Before describing specific embodiments, the present invention will be explained in detail.

本発明の実施方法は、ストアスル一方式のキャッシュ付
ファイルサブシステムに適用する場合と、ストアイン方
式のキャッシュ付ファイルサブシスシムに適用する場合
でや＼異なる。そこで、まずストアスル一方式について
説明し、続けてストアイン方式のキャッシャ付ファイル
サブシステムに適用した場合の留意点を説明するものと
する。The method of implementing the present invention differs depending on whether it is applied to a store-through type cached file subsystem or a store-in type cached file subsystem. Therefore, we will first explain the store-through method, and then explain the points to keep in mind when applying it to a store-in method file subsystem with a cache.

一般にストアスル一方式では、更新等を行う場合、キャ
ッシュと外部記憶装置の両方に書込み命令を発行してい
る。そこで、本発明の第１の特徴はキャッシュに障害が
発生した場合（キャッシュに障害が発生したと判断する
時点では、一定の回数の再試行処理が既におこなわれて
いる）に、その時点では障害とは考えず、キャッシュの
複写が存在する外部記憶装置の目的のデータブロックに
対するアクセスを行う手段を設ける。この時、キャッシ
ュの障害検出のタイミングによっては、既に外部記憶装
置にアクセスが行われている場合が考えられる。従って
重複した外部記憶装置にアクセスすることをさけるため
に、外部記憶装置に対して既にアクセスを行っているか
どうかを記憶しておく手段を設ける。Generally, in the store-through type, when updating or the like is performed, a write command is issued to both the cache and the external storage device. Therefore, the first feature of the present invention is that when a cache failure occurs (by the time it is determined that a cache failure has occurred, a certain number of retries have already been performed), Instead of considering this, a means is provided to access the target data block in the external storage device where the cache copy exists. At this time, depending on the timing of cache failure detection, the external storage device may have already been accessed. Therefore, in order to avoid accessing duplicate external storage devices, means is provided for storing whether or not an external storage device has already been accessed.

また、本発明の第２の特徴は外部記憶装置へのアクセス
が正常であった場合には、上位の制御手段に対して正常
な報告を行い、さらにそのアクセスにおいて障害が発生
した場合には、外部記憶装置の障害として上位制御手段
に報告する手段を設ける。ストアイン方式の場合は、キ
ャッシュ上のデータと外部記憶装置上のデータの内容が
一致しているかどうか、すなわちキャッシュの更新と外
部記憶装置の更新の同期がとられているかどうかで処理
が異なる。内容の一致がとられている場合には、ストア
スル一方式の場合と同様の処理が行われる。一致がとら
れていない場合には、そのデータブロックの情報が失わ
れたことになり、例えばデバイス障害やファイル障害の
ような外部記憶装置の障害として上位制御手段に報告し
なければならない。これらのことにより、ストアスル一
方式の場合と比較して、キャッシュ上のデータと外部記
憶装置上のデータの内容が一致しているかどうかを判断
する手段と一致がとれていないと判断した場合もしくは
それが不明の場合には、外部記憶装置へのアクセスは行
わず、キャッシュの障害を外部記憶装置の障害として報
告する手段を設ける。またストアスル一方式とストアイ
ン方式が混在するキャッシュ付ファイルサブシステムで
は、上記両者の特徴的手段を具備しなければならないこ
とはいうまでもない。The second feature of the present invention is that if the access to the external storage device is normal, a normal report is sent to the upper control means, and if a failure occurs in the access, A means is provided to report a failure of the external storage device to the higher-level control means. In the case of the store-in method, processing differs depending on whether the contents of the data on the cache and the data on the external storage device match, that is, whether the cache update and the external storage device update are synchronized. If the contents match, the same processing as in the case of the store-through type is performed. If there is no match, the information in the data block has been lost and must be reported to the higher level control means as an external storage failure, such as a device failure or file failure. Due to these factors, compared to the case of one-way storage, the means for determining whether the data in the cache and the data on the external storage device match, or if it is determined that they do not match, If it is unknown, the external storage device is not accessed, and means is provided to report the cache failure as an external storage device failure. Furthermore, it goes without saying that a file subsystem with a cache in which both the store-through type and the store-in type are used must be equipped with the characteristic means of both types.

本発明の第３の特徴は、障害が発生している間、再びキ
ャッシュにアクセスすることがないように制御するため
に、キャッシュの障害の発生時点からその障害に対する
障害回復が完了するまでキャッシュへのアクセスを禁止
するフラグを発生する手段を設ける。そして、キャッシ
ュの障害に対する何らかの障害回復手段が完了した後に
、上記により表示されたアクセス禁止フラグを解除する
。The third feature of the present invention is that in order to prevent the cache from being accessed again while a failure is occurring, the cache is Provide a means for generating a flag that prohibits access. Then, after some failure recovery means for the cache failure is completed, the access prohibition flag displayed above is canceled.

フラグの設定および解除は、上位制御手段の制御のもと
に行われるのではなく、ファイルサブシステム内の制御
で行う。Setting and canceling of flags is not performed under the control of a higher-level control means, but is performed under control within the file subsystem.

本発明の第４の特徴は、制御装置が正常に動作している
場合、上位装置の手を借りずに、キャッシュに対して適
切な障害処理を施すことを可能とするために、発生した
障害の情報を記憶しておく手段と、障害パターン対応に
障害回復手順を記憶しておく手段と、その障害回復手順
により障害処理を行う手段を設けて対処する。The fourth feature of the present invention is that when the control device is operating normally, it is possible to perform appropriate fault handling on the cache without the help of the host device. To cope with this problem, there are provided a means for storing information on the problem, a means for storing a failure recovery procedure corresponding to the failure pattern, and a means for handling the failure according to the failure recovery procedure.

これらの実現方法としては、適当な信号線を用い論理回
路によって実現することも、また、専用の記憶領域を形
成しマイクロプログラム制御により実現することも可能
である。These can be realized by logic circuits using appropriate signal lines, or by forming a dedicated storage area and controlling by microprogram.

以下の実施例はストアスル一方式のキャッシュ付ファイ
ルサブシステムにおいて、マイクロプログラム制御によ
って実現した一例を示す。The following embodiment shows an example of a store-through cached file subsystem realized by microprogram control.

第１図は本発明の一実施例の構成図である。１は上位制
御装置、２は外部記憶装置およびキャッシュの制御装置
、３は外部記憶装置である。４はキャッシュであり、キ
ャッシュメモリアクセス制御部４１とキャッシュメモリ
４２からなる。２１は上位制御手段１とのインタフェー
ス制御部、２２は上位制御装置１および外部記憶装置３
とキャッシュ４の間のデータ転送を制御する部分である
。FIG. 1 is a block diagram of an embodiment of the present invention. 1 is a host control device, 2 is an external storage device and cache control device, and 3 is an external storage device. 4 is a cache, which consists of a cache memory access control section 41 and a cache memory 42. 21 is an interface control unit with the upper control means 1; 22 is the upper control device 1 and the external storage device 3;
This is a part that controls data transfer between the cache 4 and the cache 4.

２４はマイクロコントローラ部であり、外部記憶装置３
にアクセスを発行済かどうかを示すフラグ２４１、キャ
ッシュ４に障害が発生しそれが回復しないことを示すフ
ラグ２４２、回復処理手順情報テーブル２４３、マイク
ロプロセッサ２４４を有する。２３は外部記憶装置イン
タフェース制御部である。２６はキャッシュ障害監視部
であり、キャッシュ４の障害の検出とその記憶を行う。24 is a microcontroller section, and external storage device 3
It has a flag 241 indicating whether access has been issued to the cache 4, a flag 242 indicating that a failure has occurred in the cache 4 and will not be recovered, a recovery processing procedure information table 243, and a microprocessor 244. 23 is an external storage device interface control section. A cache failure monitoring unit 26 detects failures in the cache 4 and stores them.

なお、第１図においては、本発明に直接関係のないファ
イルサブシステム構成要素は省略しである。Note that in FIG. 1, file subsystem components that are not directly related to the present invention are omitted.

例えば、従来のキャッシュ付ファイルサブシステムにお
いては、ＲＥＡＤアクセス時にキャッシュで障害が発生
した場合には、それを障害検出回路が検出し、マイクロ
プロセッサ２４４に報告する。マイクロプロセッサ２４
４は予め設定された回数の再試行処理を行う。そして規
定回数の再試行を繰返すうちに障害が回復しなかった場
合には、マイクロプロセッサ２４４は例えば永久障害と
判断し、障害情報を作成し、２２．２１を経由して、上
位装置１に処理を依頼する。この場合、本来アクセスさ
れたデータブロックが格納されている外部記憶装置３に
はアクセスしないため、データブロック自身は正常に保
存されている場合でも、上位装置に障害処理を依頼しな
ければならない。さらに、障害情報を連絡しあうために
、キャッシュを明確に認識していなければならず、上位
装置にか＼る負荷は無視できない。For example, in a conventional file subsystem with a cache, if a failure occurs in the cache during a READ access, the failure detection circuit detects it and reports it to the microprocessor 244. microprocessor 24
4 performs retry processing a preset number of times. If the failure is not recovered after repeating the specified number of retries, the microprocessor 244 determines that the failure is permanent, creates failure information, and sends the process to the host device 1 via 22.21. request. In this case, since the external storage device 3 in which the accessed data block is originally stored is not accessed, a request must be made to the higher-level device to handle the failure even if the data block itself is normally stored. Furthermore, in order to communicate fault information, the cache must be clearly recognized, and the load placed on the host device cannot be ignored.

そこで、本発明では、障害監視部２６でキャッシュ４の
障害が発生したことを検出すると、従来のキャッシュ付
ファイルサブシステムのようにただちにその障害状況を
上位装置に報告することはせずに、そのデータブロック
の複写元である外部記憶装置３に正常にアクセスを行う
。しかしストアスル一方式のキャッシュ付ファイルサブ
システムでは、特に、データブロックの更新を行う場合
などでは、キャッシュアクセスと同時に外部記憶装置３
にアクセスする場合があり、障害発生時にはすでに外部
記憶装置３ヘアクセスしていることも考えられる。この
場合に再度アクセスを繰返して効率低下することが無い
様に、フラグ２４１にすでに外部記憶装置にアクセスし
ているかどうか表示する。つまり、障害監視部２６でキ
ャッシュの障害を検出すると、マイクロプロセッサ２４
４はフラグ２４１の内容を確認して、まだ外部記憶装置
３にアクセスされていない場合にはただちに２２を制御
し、２３を経由して外部記憶装置３にアクセスする。こ
の場合のコマンドは例えばマイクロプロセッサ２４４の
内部メモリ等に記憶されている。またすでに外部記憶装
置３にアクセスされている場合には、アクセスをおこな
わない。そこで、外部記憶装置３へのアクセスが正常終
了すると、マイクロプロセッサ２４４はあたかもキャッ
シュ４にはアクセスを行わなかったかのように上位装置
１に対し報告する。一方、外部記憶装置３へのアクセス
が異常終了した場合でも、マイクロプロセッサ２４４は
あたかもキャッシュにはアクセスを行わなかったかのよ
うに、キャッシュを具備しないファイルサブシステムの
障害状況として上位装置に報告し、その障害回復処理を
依頼する。Therefore, in the present invention, when the failure monitoring unit 26 detects that a failure has occurred in the cache 4, it does not immediately report the failure status to the higher-level device as in the conventional file subsystem with cache. The external storage device 3 that is the copy source of the data block is accessed normally. However, in a store-only cached file subsystem, especially when updating a data block, the external storage device 3 is accessed at the same time as cache access.
It is also possible that the external storage device 3 is already being accessed when a failure occurs. In this case, a flag 241 indicates whether or not the external storage device has already been accessed so that the efficiency will not decrease due to repeated accesses. In other words, when the failure monitoring unit 26 detects a cache failure, the microprocessor 24
4 checks the contents of the flag 241, and if the external storage device 3 has not been accessed yet, immediately controls 22 and accesses the external storage device 3 via 23. The command in this case is stored, for example, in the internal memory of the microprocessor 244. Further, if the external storage device 3 has already been accessed, no access is made. Therefore, when the access to the external storage device 3 ends normally, the microprocessor 244 reports to the host device 1 as if the cache 4 had not been accessed. On the other hand, even if access to the external storage device 3 terminates abnormally, the microprocessor 244 reports this to the higher-level device as a failure status of a file subsystem that does not have a cache, as if the cache had not been accessed. Request failure recovery processing.

このように制御することによって、上位装置１はキャッ
シュ４の障害に対処するために専用の障害回復手段を用
意する必要はなく、キャッシュを具備していないファイ
ルサブシステムと同様に扱うことが可能である。なお、
たとえば上位装置が関知しないセンスバイトに情報を埋
め込むといったように、上位制御による制御回復手段か
らキャッシュが陽に見えないような手段により、キャッ
シュ障害状況を報告することはかまわない。By controlling in this way, the host device 1 does not need to prepare a dedicated failure recovery means to deal with failures in the cache 4, and can be treated in the same way as a file subsystem that does not have a cache. be. In addition,
For example, it is possible to report the cache failure status by means such as embedding information in a sense byte that the higher-level device is not concerned with, so that the cache is not explicitly visible to the higher-level control recovery means.

上記報告方法を用いた場合、キャッシュの障害回復処理
を上位装置の手を借りずに行う必要がある。キャッシュ
に障害が発生するとキャッシュにアクセスを禁止するこ
とを表示するフラグ２４２をセットし、さらに、その障
害に関する情報を２６に記憶する。こ＼で全てのファイ
ルサブシステムへのアクセスは最初に上記のフラグ２４
２を参照する様にマイクロップロセッサ２４４により制
御されるため、フラグ２４２がセットされている間はキ
ャッシュ４には、アクセスできない。マイクロプロセッ
サ２４４は、通常のＩ１０処理が終了した時点で、上記
のフラグ２４２を参照し、キャッシュ４に対する障害処
理を開始する。一方、テーブル２４３には標準的なキャ
ッシュの障害パターン対応に障害回復手順を記憶してお
り、２６に記憶されている障害パターンと同一のパター
ンを検索することによりその障害回復方法が示されるよ
うになっている。障害回復方法としては、障害が発生し
た回路を含む障害切り離し単位の切り離しとその部分に
対するキャッシュエントリの抹消、外部記憶装置から得
られた情報により新しいキャッシュエントリの作成など
が考えられる。さらに、上位制御手段に報告することな
く、ファイルサブシステム内で自動的に障害回復処理を
実行させる一連の特徴的な手段が含まれる６第２図（ａ
）は回復処理手順情報テーブル２４３を示し、第２図（
ｂ）にその１エントリの具体例を示す。２４３１の部分
は予想される障害パターンを、２４３２の部分はその障
害パターンが発生した場合の処理を示す部分で、こ＼で
は一例として、走行するマイクロプログラムの先頭アド
レスが記憶されている。第２図（ｃ）はキャッシュ障害
監視部２６の具体例を示しキャッシュ障害検出回路２６
１で障害を検出すると、それに応じた障害パターンを障
害情報記憶部２６２に記憶する。When the above reporting method is used, it is necessary to perform cache failure recovery processing without the help of a host device. When a failure occurs in the cache, a flag 242 indicating that access to the cache is prohibited is set, and information regarding the failure is stored in 26. This allows access to all file subsystems by first setting flag 24 above.
2, the cache 4 cannot be accessed while the flag 242 is set. When the normal I10 processing is completed, the microprocessor 244 refers to the flag 242 and starts fault processing for the cache 4. On the other hand, the table 243 stores failure recovery procedures corresponding to standard cache failure patterns, and by searching for the same failure pattern as the failure pattern stored in the table 243, the failure recovery method is shown. It has become. Possible fault recovery methods include isolating the fault isolation unit that includes the faulty circuit, deleting the cache entry for that part, and creating a new cache entry using information obtained from the external storage device. Furthermore, it includes a series of characteristic means for automatically executing failure recovery processing within the file subsystem without reporting to the upper control means.
) shows the recovery processing procedure information table 243, and FIG.
b) shows a specific example of one entry. The part 2431 shows the expected failure pattern, and the part 2432 shows the process to be performed when the failure pattern occurs.Here, as an example, the start address of the running microprogram is stored. FIG. 2(c) shows a specific example of the cache failure monitoring unit 26, and shows the cache failure detection circuit 26.
When a failure is detected in step 1, a failure pattern corresponding to the failure pattern is stored in the failure information storage unit 262.

この記憶部２６２に格納された障害パターンと一致する
第２図（ａ）の２４３１の部分の障害パターンを持つエ
ントリの障害処理が実行される。Failure processing is executed for an entry having a failure pattern 2431 in FIG. 2(a) that matches the failure pattern stored in the storage unit 262.

これらの手段を用意することにより、マイクロプロセッ
サ２４４は発生した障害パターンを解析し自らその障害
回復処理を試みることができる。By providing these means, the microprocessor 244 can analyze the failure pattern that has occurred and attempt its own failure recovery process.

この処理によってキャッシュ４の障害が回復した場合に
は、フラグ２４２はセットされ、キャッシュ４へのアク
セスが可能となる。一方回復が不可能だった場合にはこ
のフラグ２４２はリセットされずに、論理的にキャッシ
ュが切り離された状態が実現可能である６以上の処理をする間、上位装置は、キャッシュ障害につ
いて関知することなく動作可能である。If the fault in the cache 4 is recovered by this process, the flag 242 is set and the cache 4 can be accessed. On the other hand, if recovery is not possible, this flag 242 is not reset, and a state in which the cache is logically separated can be realized.6 During the above processing, the host device is not concerned about the cache failure. It is possible to operate without

なお、可能な場合には外部記憶装置へのその他のアクセ
スと並行して該処理を行ってもさしつかえないことはい
うまでもない。Note that, if possible, it goes without saying that this processing may be performed in parallel with other accesses to the external storage device.

第３図、第４図は、上記例を実施した場合の制御の流れ
の一例を示した図である。ストアイン方式のキャッシュ
付ファイルサブシステムもしくはストアスル一方式とス
トアイン方式が混在するファイルサブシステムに適用し
た場合には、第３図（ｂ）の本傘印のブロックの前に該
データブロックの情報が、キャッシュ上を外部記憶装置
とで一致しているがどうかを判断する手順を具備する必
要がある。この判断は例えば第５図に示すようなデータ
ブロック同期状態管理テーブル２７を参照することによ
り実現可能である。第５図においては２７１はデータブ
ロックのエントリを、２７２はそのエントリが同期を取
った後に更新アクセスがない場合（すなわちキャッシュ
と外部記憶装置の一致がとれている場合）に１．それ以
外の場合に０を示すフラグである。この処理で一致して
いると判断された場合には、そのまへ串傘印のブロック
に進み、ストアスル一方式の場合と同様に処理される。FIGS. 3 and 4 are diagrams showing an example of the flow of control when the above example is implemented. When applied to a file subsystem with a store-in cache or a file subsystem with a store-in method and a store-in method, the information of the data block is added before the block marked with the umbrella mark in Figure 3(b). However, it is necessary to have a procedure for determining whether the cache matches the external storage device. This determination can be realized, for example, by referring to a data block synchronization state management table 27 as shown in FIG. In FIG. 5, 271 indicates a data block entry, and 272 indicates 1. if there is no update access after the entry is synchronized (that is, if the cache and external storage device match). This flag indicates 0 in other cases. If it is determined that they match in this process, the process proceeds directly to the block with the skewered umbrella mark, and is processed in the same way as in the case of the store-through type.

こ＼で一致がとられていないと判断した場合には、その
時点でＩ１０処理を中断し、外部記憶装置の障害（例え
ばデバイス全体の及び障害等）として上位制御手段に報
告する。If it is determined that there is no match, the I10 process is interrupted at that point, and a failure of the external storage device (for example, a failure of the entire device) is reported to the higher-level control means.

〔発明の効果〕〔Effect of the invention〕

以上の説明から明らかな様に、本発明によれば次の様の
ような利点が得られる。As is clear from the above description, the present invention provides the following advantages.

（１）キャッシュに障害が発生した場合でも、該データ
ブロックの複写元である外部記憶装置にアクセス可能で
あると判断した場合には、上位装置への障害回復処理の
依頼によるＩｌｏの中断をすることなく、データブロッ
クに対する処理が実行できる。これにより、不必要な障
害処理依頼を受付ける処理や再工／○処理が削減できる
ため、キャッシュ障害時の上位制御手段の負荷が削減で
き、キャッシュ障害発生時のＩ１０レスポンスタイムの
増加が抑えられる。(1) Even if a failure occurs in the cache, if it is determined that the external storage device that is the copy source of the data block can be accessed, Ilo is interrupted by requesting the higher-level device to perform failure recovery processing. Processing can be performed on data blocks without having to do so. As a result, unnecessary failure handling request acceptance processing and rework/○ processing can be reduced, so the load on the upper control means when a cache failure occurs can be reduced, and an increase in I10 response time when a cache failure occurs can be suppressed.

（２）キャッシュに障害が発生した場合でも、あたかも
キャッシュを具備していないファイルサブシステムにア
クセスしたかのように、アクセスの結果を上位装置に報
告することができる。つまり、上位装置は障害処理に関
しキャッシュの存在を陽に意識する必要がなく、構成が
簡潔になる。(2) Even if a failure occurs in the cache, the access result can be reported to the higher-level device as if a file subsystem not equipped with a cache was accessed. In other words, the host device does not need to be explicitly aware of the existence of the cache in handling failures, resulting in a simpler configuration.

（３）また、障害回復処理中は、キャッシュへのアクセ
スができないように制御されるため、障害中のキャッシ
ュにアクセスを行う様な無駄な処理を排除することがで
きる。(3) Furthermore, since access to the cache is prohibited during failure recovery processing, unnecessary processing such as accessing a cache during failure can be eliminated.

（４）さらに、従来上位装置によっておこなわていたキ
ャッシュ障害回復処理の多くの部分を、ファイルサブシ
ステム内で処理することができる。(4) Furthermore, much of the cache failure recovery processing that was conventionally performed by a host device can be performed within the file subsystem.

（５）また、障害回復処理が完了した後は、上位装置が
意識することなく、キャッシュへのアクセスを再開する
ことが可能である。(5) Furthermore, after the failure recovery process is completed, access to the cache can be resumed without the host device being aware of it.

【図面の簡単な説明】[Brief explanation of drawings]

第１図は本発明の一実施例を示すブロック図、第２図は
回復処理手順情報テーブル等が記憶されている情報域の
一例を示す図、第３図、第４図は上記例を実施した場合
の制御の流れを示す図、第５図はデータブロック同期状
態管理テーブルの一例を示した図である。１・・・上位装置、　２・・・キャッシュと外部記憶装
置、　　２１・・・インタフェース制御部、２２・・・
上位制御手段１および外部記憶装置とキャッシュの間の
データ転送を制御する部分、２３・・・外部記憶装置イ
ンタフェース、２４・・・マイクロコントローラ部、２４１・・・フラグ、　　２４２・・・フラグ、２４３
・・・回復処理手順情報テーブル、２４４・・・マイク
ロプロセッサ、２６・・・キャッシュ障害監視部、２７・・・データブロック同期状態管理テーブル、３・
・・外部記憶装置、　４・・・キャッシュ、４１・・キ
ャッシュメモリアクセス制御部、４２・・・キャッシャ
メモリ。第　　１　　　図第　　２　　図（ｂン第　　３　　図　　（幻づ３組（ｂ）へ第　　４　　図第　　５　　図FIG. 1 is a block diagram showing an embodiment of the present invention, FIG. 2 is a diagram showing an example of an information area in which a recovery processing procedure information table, etc. is stored, and FIGS. 3 and 4 are diagrams showing an example of implementing the above example. FIG. 5 is a diagram showing an example of a data block synchronization state management table. DESCRIPTION OF SYMBOLS 1... Upper device, 2... Cache and external storage device, 21... Interface control unit, 22...
A part that controls data transfer between the upper control means 1 and the external storage device and the cache, 23... external storage device interface, 24... microcontroller section, 241... flag, 242... flag, 243
...Recovery processing procedure information table, 244...Microprocessor, 26...Cache failure monitoring unit, 27...Data block synchronization state management table, 3.
... external storage device, 4 ... cache, 41 ... cache memory access control unit, 42 ... cashier memory. Figure 1 Figure 2 (b) Figure 3 (Go to Genzu group 3 (b) Figure 4 Figure 5

Claims

【特許請求の範囲】[Claims]

（１）主記憶装置と外部記憶装置の間にキャッシャを具
備し、かつキャッシュと外部記憶装置の制御手段、およ
び外部記憶装置を具備するファイルサブシステムにおい
て、キャッシュの障害発生時、上記外部記憶装置にアク
セスすることよってその目的の処理が代替できる場合に
は、該外部記憶装置の目的のデータブロックに対してア
クセスを行う様に制御する手段を有することを特徴とす
るファイルサブシステム。(1) In a file subsystem that includes a cacher between a main storage device and an external storage device, and also includes a control means for the cache and external storage device, and an external storage device, when a cache failure occurs, the external storage device 1. A file subsystem comprising means for controlling access to a target data block of the external storage device when the target processing can be substituted by accessing the target data block.

（２）上記外部記憶装置の目的のデータブロックに対し
てアクセスを行った結果が正常であった場合には上位装
置に対し正常な報告を行い、そのアクセスにおいて異常
が発生した場合には、外部記憶装置またはその制御手段
における異常として上位装置に報告する手段を持つこと
を特徴とする特許請求の範囲第１項記載のファイルサブ
システム。(2) If the result of accessing the target data block in the external storage device is normal, a normal report is sent to the host device, and if an abnormality occurs in the access, the external 2. The file subsystem according to claim 1, further comprising means for reporting an abnormality in a storage device or its control means to a host device.

（３）キャッシュへのアクセスが正常に行われないとい
う事象の発生に応答して、キャッシャへのアクセスを禁
止する手段を有することを特徴とする特許請求の範囲第
１項もしくは第２項記載のファイルサブシステム。(3) The device according to claim 1 or 2, further comprising means for prohibiting access to the cache in response to occurrence of an event in which access to the cache is not performed normally. file subsystem.

（４）キャッシュの障害回復手段を行う手段を有するこ
とを特徴とする特許請求の範囲第１項、第２項、第３項
のいずれかの項記載のファイルサブシステム。(4) The file subsystem according to any one of claims 1, 2, and 3, further comprising means for performing cache failure recovery means.

（５）キャッシュに対する障害回復処理が完了後、キャ
ッシュへのアクセスの禁止を解除することを特徴とする
特許請求の範囲第４項記載のファイルサブシステム。(5) The file subsystem according to claim 4, wherein the prohibition of access to the cache is lifted after failure recovery processing for the cache is completed.