JP2013167942A

JP2013167942A - Distributed file access device, distributed file access system, distributed file access method and distributed file access program

Info

Publication number: JP2013167942A
Application number: JP2012029403A
Authority: JP
Inventors: Masashi Ikuta; 将史生田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2012-02-14
Filing date: 2012-02-14
Publication date: 2013-08-29
Anticipated expiration: 2032-02-14
Also published as: JP5966404B2

Abstract

PROBLEM TO BE SOLVED: To provide a distributed file access device for avoiding occurrence of performance deterioration of a system due to a bottleneck in which access to a metadata file converges.SOLUTION: A distributed file access device comprises: metadata file storage means which divides a metadata file of one data file into a plurality of files and stores them; metadata file storage area determination means which inputs identification information of the data file designated by an access instruction to the data file and information about an access address in the data file into a hash function, and uniquely determines identification information of the metadata file storage means storing the metadata file corresponding to the access address of the data file, from an output value of the hash function; and metadata file access means which accesses the metadata file storage means determined by the metadata file storage area determination means.

Description

本願発明は、分散ファイルシステムを持つコンピュータシステムで、ファイルへのアクセス性能を向上させるための分散ファイルアクセス装置、分散ファイルアクセスシステム、分散ファイルアクセス方法、及び、分散ファイルアクセスプログラムに関する。 The present invention relates to a distributed file access apparatus, a distributed file access system, a distributed file access method, and a distributed file access program for improving file access performance in a computer system having a distributed file system.

高速な科学技術計算性能が要求されるＨＰＣ（ＨｉｇｈＰｅｒｆｏｒｍａｎｃｅＣｏｍｐｕｔｉｎｇ）の領域では高並列化の技術が発達し、最近では実行するプログラムの並列数が数千から数万に達することも珍しくなくなってきている。プログラムの並列数が増すに従い使用するファイル数も増加するため、ファイルの取り扱いが困難になる。このため、並列Ｉ／Ｏ処理機能であるＭＰＩ−ＩＯを使用し、複数のプロセスが、１つのファイルにおける他のプロセスと重ならないブロックに対して同時にアクセスするような方式も出てきている。 In the area of HPC (High Performance Computing), where high-speed scientific computing performance is required, highly parallel technology has developed, and recently, the number of programs to be executed has reached the parallel number of thousands to tens of thousands. Yes. Since the number of files to be used increases as the number of parallel programs increases, handling of files becomes difficult. For this reason, a method has been developed in which MPI-IO, which is a parallel I / O processing function, is used so that a plurality of processes simultaneously access blocks that do not overlap with other processes in one file.

このような高並列コンピュータシステムでは、プログラムの並列数が増加すればするほど１つのファイルへアクセスするプログラムの数が増加する。このため、そのファイルを格納したサーバの処理負荷が増大してシステム全体のボトルネックとなり、システム性能が低下する原因の一つとなっている。 In such a highly parallel computer system, the number of programs accessing one file increases as the number of parallel programs increases. For this reason, the processing load of the server storing the file increases, which becomes a bottleneck of the entire system, which is one of the causes of system performance degradation.

このようなシステム性能の低下を回避するための技術として、特許文献１には、論理ファイルを複数ディスク装置に分割して保存することで、並行して複数ディスク装置のファイルブロックにアクセスできるようにしたストライピングファイル機構による論理ファイルの分散管理方法に関するものが公開されている。ここで公開されている技術は、ファイル読み出し時のディスク装置へ並列での読み出しの効率化を図り、大容量ファイルの読み出し時間を短縮化するための方法である。 As a technique for avoiding such a decrease in system performance, Patent Document 1 discloses that a logical file is divided and stored in a plurality of disk devices so that file blocks of the plurality of disk devices can be accessed in parallel. A method for distributing and managing logical files using the striped file mechanism is disclosed. The technology disclosed here is a method for reducing the reading time of a large-capacity file by improving the efficiency of reading to a disk device in parallel when reading a file.

また、特許文献２には、仮想ファイルサーバが設定された複数の装置によって構成されるクラスタ内で仮想ファイルサーバを動的に移動させることにより、仮想ファイルサーバ単位の負荷分散を行い、特定のファイルサーバへの負荷の集中を回避するシステムが公開されている。 Further, Patent Document 2 discloses that a specific file is distributed by performing load distribution in units of virtual file servers by dynamically moving the virtual file servers in a cluster constituted by a plurality of devices in which virtual file servers are set. A system that avoids the concentration of load on the server has been released.

特開2002-182953JP2002-182953 特開2005-267327JP2005-267327

上記のような分散ファイルシステムを持つコンピュータシステムにおいては、ファイルへのアクセス時にファイルの存在場所等の情報を入手する必要があるため、まずファイルのメタ情報を格納したメタデータファイルへアクセスすることになる。高並列コンピュータシステムでは、このメタデータファイルへのアクセスが集中することによるメタデータファイルを格納したサーバの処理負荷が増大することがあるが、前述の特許文献１や特許文献２に記載された技術は、このメタデータファイルを格納したサーバの負荷を軽減するものではない。したがって、高並列化コンピュータシステムにおいては、メタデータファイルへのアクセスの集中がボトルネックとなり、システムの性能低下が生じる恐れがあるという問題がある。 In a computer system having a distributed file system as described above, since it is necessary to obtain information such as the location of the file when accessing the file, it is first necessary to access a metadata file that stores the meta information of the file. Become. In a highly parallel computer system, the processing load on the server storing the metadata file may increase due to the concentration of access to the metadata file. However, the techniques described in Patent Document 1 and Patent Document 2 described above. Does not reduce the load on the server storing the metadata file. Therefore, in the highly parallel computer system, there is a problem that the concentration of access to the metadata file becomes a bottleneck and the performance of the system may be lowered.

本願発明の目的は、これらの問題点を解決した分散ファイルアクセス装置、分散ファイルアクセスシステム、分散ファイルアクセス方法、及び、分散ファイルアクセスプログラムを提供することである。 An object of the present invention is to provide a distributed file access apparatus, a distributed file access system, a distributed file access method, and a distributed file access program that solve these problems.

本願発明の一実施形態の分散ファイルアクセス装置は、１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して格納する複数のメタデータファイル格納手段と、前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記メタデータファイル格納手段の識別情報を一意に決定するメタデータファイル格納領域決定手段と、前記メタデータファイル格納領域決定手段が決定した前記メタデータファイル格納手段へアクセスするメタデータファイルアクセス手段と、を備える。 A distributed file access apparatus according to an embodiment of the present invention includes: a plurality of metadata file storage units that store a metadata file that stores metadata information of one data file by dividing the metadata file; and access to the data file The identification information of the data file specified in the instruction and the access address information in the data file are input to a hash function, and the metadata corresponding to the access address of the data file is output from the output value of the hash function Metadata file storage area determination means for uniquely determining identification information of the metadata file storage means for storing a file, and a metadata file for accessing the metadata file storage means determined by the metadata file storage area determination means And access means .

本願発明の一実施形態の分散ファイルアクセス方法は、１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して複数の記憶域に格納し、前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記記憶域の識別情報を一意に決定し、決定した前記識別情報が示す前記記憶域へアクセスする。 A distributed file access method according to an embodiment of the present invention is a method of dividing a metadata file storing meta information of one data file into a plurality of storage areas and specifying in a data file access instruction. The identification information of the data file and the access address information in the data file are input to the hash function, and the metadata file corresponding to the access address of the data file is stored from the output value of the hash function The storage area identification information is uniquely determined, and the storage area indicated by the determined identification information is accessed.

本願発明の一実施形態の分散ファイルアクセスプログラムは、１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して複数の記憶域に格納するメタデータファイル格納処理と、前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記記憶域の識別情報を一意に決定するメタデータファイル格納領域決定処理と、前記メタデータファイル格納領域決定処理が決定した前記記憶域へアクセスするメタデータファイルアクセス処理と、をコンピュータに実行させる。 A distributed file access program according to an embodiment of the present invention includes a metadata file storing process for dividing a metadata file storing meta information of one data file into a plurality of storage areas, and the data file The identification information of the data file specified in the access instruction and the access address information in the data file are input to the hash function, and the output value of the hash function corresponds to the access address of the data file. A metadata file storage area determination process for uniquely determining the identification information of the storage area for storing the metadata file; and a metadata file access process for accessing the storage area determined by the metadata file storage area determination process; , Execute on the computer.

本願発明は、メタデータファイルへのアクセスにおいて負荷分散を実現する。 The present invention realizes load distribution in accessing a metadata file.

本願発明の第１の実施形態の構成を示すブロック図である。It is a block diagram which shows the structure of the 1st Embodiment of this invention. 本願発明の第１の実施形態の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the 1st Embodiment of this invention. 本願発明の第１の実施形態におけるメタデータの分散格納の構成例である。It is an example of a structure of the distributed storage of the metadata in 1st Embodiment of this invention. 本願発明の第１の実施形態におけるプロセス実行部からのファイルアクセス命令の構成例である。It is an example of a structure of the file access command from the process execution part in 1st Embodiment of this invention. 本願発明の第１の実施形態におけるメタデータファイルの構成例である。It is a structural example of the metadata file in 1st Embodiment of this invention. 本願発明の第２の実施形態の構成を示すブロック図である。It is a block diagram which shows the structure of 2nd Embodiment of this invention. 本願発明の第２の実施形態におけるハッシュ関数設定情報の構成例である。It is a structural example of the hash function setting information in 2nd Embodiment of this invention. 本願発明の第３の実施形態の構成を示すブロック図である。It is a block diagram which shows the structure of 3rd Embodiment of this invention. 本願発明の第３の実施形態におけるメタデータの分散格納の構成例である。It is a structural example of the distributed storage of metadata in the third embodiment of the present invention.

本願発明の第一の実施の形態について図面を参照して詳細に説明する。 A first embodiment of the present invention will be described in detail with reference to the drawings.

図１は本願発明の第一の実施形態の構成を示すブロック図である。 FIG. 1 is a block diagram showing the configuration of the first embodiment of the present invention.

本実施形態の分散ファイルアクセスシステムは、８個のプロセス実行部１０−０乃至１０−７と、４個のデータファイル格納部１１−０乃至１１−３と、３個のメタデータファイル格納部１２−０乃至１２−２と、メタデータファイルアクセス部１３と、メタデータファイル格納領域決定部１４と、データファイルアクセス部１５とを包含している。 The distributed file access system of the present embodiment includes eight process execution units 10-0 to 10-7, four data file storage units 11-0 to 11-3, and three metadata file storage units 12. -0 to 12-2, a metadata file access unit 13, a metadata file storage area determination unit 14, and a data file access unit 15.

プロセス実行部１０−０乃至１０−７は並列化されたプロセスを実行している。プロセスがファイルアクセスを行う場合、ファイルアクセス命令を発行してメタデータファイルアクセス部１３へ送付する。 The process execution units 10-0 to 10-7 are executing parallelized processes. When the process performs file access, it issues a file access command and sends it to the metadata file access unit 13.

各プロセスが、データファイル１００「／ｄｉｒ＿１／ｆｉｌｅ＿Ａ」にアクセスするときのメタデータの分散格納の構成例を図３に示す。プロセス実行部１０−０乃至１０−７が実行しているプロセス０乃至プロセス７は、それぞれデータファイル１００における他のプロセスとは重ならない２Ｇバイトの領域（ブロック０乃至ブロック７）をアクセスする。例えば、プロセス０はブロック０（０乃至２Ｇバイト領域）、プロセス１はブロック１（２乃至４Ｇバイト領域）、プロセス７はブロック７（１４乃至１６Ｇバイト領域）をアクセスする。 FIG. 3 shows a configuration example of the distributed storage of metadata when each process accesses the data file 100 “/ dir_1 / file_A”. Processes 0 to 7 executed by the process execution units 10-0 to 10-7 each access a 2-Gbyte area (block 0 to block 7) that does not overlap with other processes in the data file 100. For example, process 0 accesses block 0 (0 to 2 GB area), process 1 accesses block 1 (2 to 4 GB area), and process 7 accesses block 7 (14 to 16 GB area).

各プロセスが発行するファイルアクセス命令１１０の構成例を図４に示す。例えばプロセス実行部１０−２におけるプロセス２が発行するデータファイル１００へのリードアクセス命令の場合、Ｒｅａｄ／Ｗｒｉｔｅ区分はＲｅａｄ、データファイル名称は／ｄｉｒ＿１／ｆｉｌｅ＿Ａ、開始オフセットは４Ｇバイト以上６Ｇバイト未満の値、アクセスサイズは０バイト以上２Ｇバイト以下の値となる。 A configuration example of the file access instruction 110 issued by each process is shown in FIG. For example, in the case of a read access instruction to the data file 100 issued by the process 2 in the process execution unit 10-2, the Read / Write classification is Read, the data file name is / dir_1 / file_A, and the start offset is 4 GB or more and less than 6 GB. The value and the access size are 0 bytes or more and 2 Gbytes or less.

データファイル格納部１１−０乃至１１−３は、データファイル１００を分割して格納しており、それぞれのシステム管理上の名称はＳＳ０乃至ＳＳ３である。 The data file storage units 11-0 to 11-3 divide and store the data file 100, and their system management names are SS0 to SS3.

メタデータファイル格納部１２−０乃至１２−２は、データファイル１００のメタデータを記録したメタデータファイルを分割して格納しており、それぞれのシステム管理上の名称はＭＤＳ０乃至ＭＤＳ３である。 The metadata file storage units 12-0 to 12-2 divide and store metadata files that record the metadata of the data file 100, and the system management names thereof are MDS0 to MDS3.

メタデータファイルの構成例を図５に示す。メタデータファイルは、メタデータファイル属性情報部１２０とメタデータファイル対応表部１３０を包含している。メタデータファイル属性情報部１２０は、データファイル１００の属性情報を記録したものであり、データファイル名称、アクセス権、所有者、サイズ、メタデータブロックサイズを包含している。メタデータファイル属性情報部１２０の格納データは、ライトアクセスのファイルアクセス命令１１０が発行されたときに、メタデータファイル格納部１２−０乃至１２−２により、ファイルアクセス命令１１０が包含する情報やシステム情報に基づいて書き込まれたものである。 A configuration example of the metadata file is shown in FIG. The metadata file includes a metadata file attribute information part 120 and a metadata file correspondence table part 130. The metadata file attribute information section 120 records attribute information of the data file 100 and includes a data file name, access right, owner, size, and metadata block size. The data stored in the metadata file attribute information section 120 includes information and systems included in the file access instruction 110 by the metadata file storage sections 12-0 to 12-2 when the write access file access instruction 110 is issued. It was written based on information.

メタデータファイル対応表部１３０は、データファイル１００の格納先の情報を記録したものである。メタデータファイル対応表部１３０の格納データは、ライトアクセスのファイルアクセス命令１１０が発行されたときに、メタデータファイル格納部１２−０乃至１２−２により、ファイルアクセス命令１１０が包含するデータファイル名称、開始オフセット、アクセスサイズの情報を基にして書き込まれたものである。 The metadata file correspondence table unit 130 records information on the storage destination of the data file 100. The data stored in the metadata file correspondence table unit 130 includes data file names included in the file access command 110 by the metadata file storage units 12-0 to 12-2 when the write access file access command 110 is issued. This is written based on the information of the start offset and the access size.

本実施形態の分散ファイルアクセスシステムは、メタデータファイル対応表部１３０をメタデータファイル格納部１２−０乃至１２−２に分散格納する。メタデータファイル属性情報部１２０については分散格納を行わずに、メタデータファイル対応表部１３０の先頭部分のデータを格納するメタデータファイル格納部に格納する。 In the distributed file access system of this embodiment, the metadata file correspondence table unit 130 is distributedly stored in the metadata file storage units 12-0 to 12-2. The metadata file attribute information section 120 is stored in a metadata file storage section that stores data at the head of the metadata file correspondence table section 130 without performing distributed storage.

図５に示すとおり、本実施形態では、メタデータファイル格納部１２−０乃至１２−２は、データファイル１００を５３６８７０９１２バイト（５１２Ｍバイト）単位のブロックに分割し、各ブロックをデータファイル格納部１１−０（ＳＳ０）乃至１１−３（ＳＳ３）にラウンドロビンで順番に格納している。尚、これは一例であり、メタデータファイル格納部１２−０乃至１２−２は、データファイル100を他のサイズのブロックで分割する場合もあれば、分割しない場合もある。 As shown in FIG. 5, in the present embodiment, the metadata file storage units 12-0 to 12-2 divide the data file 100 into blocks of 5368870912 bytes (512 Mbytes), and each block is the data file storage unit 11. The data are stored in the round robin in order from −0 (SS0) to 11-3 (SS3). This is only an example, and the metadata file storage units 12-0 to 12-2 may or may not divide the data file 100 by blocks of other sizes.

メタデータファイル格納部１２−０乃至１２−２は、各ブロックのシステム管理上の実ファイル名を、データファイル１００の名称と、上述した５１２Ｍバイトブロックの開始オフセットと終了オフセットを基に、一意になるように付与している。例えば最初の５１２Ｍバイトブロックの場合は、ｄｉｒ＿１＿ｆｉｌｅ＿Ａ：０＿５３６８７０９１１となる。 The metadata file storage units 12-0 to 12-2 uniquely identify the actual file name in the system management of each block based on the name of the data file 100 and the start offset and end offset of the 512 Mbyte block described above. Is granted. For example, in the case of the first 512 Mbyte block, dir_1_file_A: 0_536870911.

図３、図５に示すとおり、メタデータファイル格納部１２−０（ＭＤＳ０）乃至１２−２（ＭＤＳ２）は、データファイル１００のメタデータファイルを、データファイル１００の３Ｇバイト領域単位に対応したメタデータに分割して分散格納している。例えば、メタデータファイル格納部１２−１（ＭＤＳ１）は、データファイル１００の０乃至３Ｇ−１バイト領域に対応するメタデータを格納する。メタデータファイル格納部１２−２（ＭＤＳ２）は、データファイル100の３Ｇ乃至６Ｇ−１バイト領域に対応するメタデータを格納する。 As shown in FIGS. 3 and 5, the metadata file storage units 12-0 (MDS 0) to 12-2 (MDS 2) convert the metadata file of the data file 100 into the metadata corresponding to the 3 Gbyte area unit of the data file 100. The data is divided and stored. For example, the metadata file storage unit 12-1 (MDS1) stores metadata corresponding to the 0 to 3G-1 byte area of the data file 100. The metadata file storage unit 12-2 (MDS2) stores metadata corresponding to the 3G to 6G-1 byte area of the data file 100.

メタデータファイルアクセス部１３は、プロセス実行部１０−０乃至１０−７から受信した、ファイルアクセス命令１１０におけるデータファイル名称と開始オフセットとアクセスサイズの情報を、メタデータファイル格納領域決定部１４に送付する。 The metadata file access unit 13 sends the data file name, start offset, and access size information in the file access command 110 received from the process execution units 10-0 to 10-7 to the metadata file storage area determination unit 14. To do.

メタデータファイル格納領域決定部１４は、データファイル名称と開始オフセットとアクセスサイズの情報を基に、ハッシュ関数１４０を用いて、データファイル１００のメタデータを格納したメタデータファイル格納部を決定する。ハッシュ関数１４０の具体例ＭＤＳｎｕｍ（ｘ，ｂ）を以下に示す。
The metadata file storage area determination unit 14 determines a metadata file storage unit that stores the metadata of the data file 100 using the hash function 140 based on the information of the data file name, the start offset, and the access size. A specific example MDSnum (x, b) of the hash function 140 is shown below.

ＭＤＳｎｕｍ（ｘ，ｂ）＝（ｆｌｏｏｒ（ｘ／３ＧＢ）＋ａｓｃｉｉ（ｂ））ｍｏｄ３

このハッシュ関数において、ｘはデータファイル１００における位置を示すオフセット、ｂはデータファイル１００の名称、ｆｌｏｏｒ（ｚ）はｚ以下の最大の整数を出力する関数、ａｓｃｉｉ（ｂ）は文字列ｂの各文字のアスキーコードの和を出力する関数である。データファイル１００「／ｄｉｒ＿１／ｆｉｌｅ＿Ａ」へのアクセスの場合、ａｓｃｉｉ（／ｄｉｒ＿１／ｆｉｌｅ＿Ａ）の値は１１３８となる。ハッシュ関数１４０がｆｌｏｏｒ（ｘ／３ＧＢ）を使用するのは、データファイル１００の３Ｇバイト領域単位でメタデータを分割して、メタデータファイル格納部１２−０乃至１２−２を割り当てるためである。ハッシュ関数１４０がｍｏｄ３を使用するのは、メタデータファイル格納部１２−０乃至１２−２の数が３個であるためである。 MDSnum (x, b) = (floor (x / 3GB) + ascii (b)) mod 3

In this hash function, x is an offset indicating the position in the data file 100, b is the name of the data file 100, floor (z) is a function that outputs the largest integer less than or equal to z, and ascii (b) is each character string b. This function outputs the sum of ASCII characters. In the case of access to the data file 100 “/ dir_1 / file_A”, the value of ascii (/ dir_1 / file_A) is 1138. The reason why the hash function 140 uses floor (x / 3 GB) is to divide the metadata in units of 3 Gbyte areas of the data file 100 and allocate the metadata file storage units 12-0 to 12-2. The reason why the hash function 140 uses mod 3 is that the number of the metadata file storage units 12-0 to 12-2 is three.

例えば、プロセス実行部１０−１が実行しているプロセス１が、データファイル１００における開始オフセット２Ｇバイトから２Ｇバイトサイズの領域へのファイルアクセス命令１１０を発行した場合、メタデータファイル格納領域決定部１４は、ＭＤＳｎｕｍ（２ＧＢ，／ｄｉｒ＿１／ｆｉｌｅ＿Ａ）＝１、ＭＤＳｎｕｍ（３ＧＢ，／ｄｉｒ＿１／ｆｉｌｅ＿Ａ）＝２、ＭＤＳｎｕｍ（４Ｇ−１Ｂ，／ｄｉｒ＿１／ｆｉｌｅ＿Ａ）＝２を算出する。その結果、メタデータファイル格納領域決定部１４は、メタデータファイルアクセス部１３に対して、「／ｄｉｒ＿１／ｆｉｌｅ＿Ａのオフセット２Ｇバイト乃至３Ｇ−１バイトの領域に対するメタデータファイル格納部はＭＤＳ１」であることと、「／ｄｉｒ＿１／ｆｉｌｅ＿Ａのオフセット３Ｇバイト乃至４Ｇ−１バイトの領域に対するメタデータファイル格納部はＭＤＳ２」であることを通知する。 For example, when the process 1 executed by the process execution unit 10-1 issues a file access instruction 110 from the start offset 2 Gbytes to the 2 Gbyte size area in the data file 100, the metadata file storage area determination unit 14 Calculates MDSnum (2GB, / dir_1 / file_A) = 1, MDSnum (3GB, / dir_1 / file_A) = 2, and MDSnum (4G-1B, / dir_1 / file_A) = 2. As a result, the metadata file storage area determination unit 14 determines that the metadata file storage unit for the 2 Gbyte to 3G-1 byte offset of / dir_1 / file_A is MDS1 with respect to the metadata file access unit 13. And that the metadata file storage unit for the 3 Gbyte to 4G-1 byte offset of / dir_1 / file_A is MDS2.

前記の例で、ＭＤＳｎｕｍ（３ＧＢ，／ｄｉｒ＿１／ｆｉｌｅ＿Ａ）＝２を算出するのは、データファイル１００の３Ｇバイト領域単位でメタデータを分割して、メタデータファイル格納部１２−０乃至１２−２を割り当てるためである。すなわち、３Ｇバイト境界を境に格納先のメタデータファイル格納部１２−０乃至１２−２が変わるからである。メタデータファイル格納領域決定部１４は、データファイル１００のアクセス領域の開始オフセットと終了オフセットの間に３Ｇバイト境界が存在する場合は、境界部分についてハッシュ関数１４０の値を算出する。 In the above example, MDSnum (3GB, / dir_1 / file_A) = 2 is calculated by dividing the metadata in units of 3 Gbyte area of the data file 100, and the metadata file storage units 12-0 to 12-2. It is for assigning. In other words, the metadata file storage units 12-0 to 12-2 as storage destinations change at the 3 Gbyte boundary. When a 3 Gbyte boundary exists between the start offset and the end offset of the access area of the data file 100, the metadata file storage area determination unit 14 calculates the value of the hash function 140 for the boundary part.

メタデータファイルアクセス部１３は、メタデータ格納領域決定部１４から受信したメタデータファイル格納先の情報に従い、メタデータファイル格納部１２−０乃至１２−２の何れかにアクセスする。プロセス実行部１０−１のプロセス１が、データファイル１００における開始オフセット２Ｇバイトから２Ｇバイトサイズの領域へのファイルアクセス命令１１０を発行した場合の例では、メタデータファイルアクセス部１３は、上述の通りメタデータファイル格納部１２−１（ＭＤＳ１）、１２−２（ＭＤＳ２）へアクセスする。 The metadata file access unit 13 accesses any one of the metadata file storage units 12-0 to 12-2 according to the metadata file storage location information received from the metadata storage area determination unit 14. In the case where the process 1 of the process execution unit 10-1 issues a file access instruction 110 from the start offset 2 Gbytes to the 2 Gbyte size area in the data file 100, the metadata file access unit 13 is as described above. The metadata file storage unit 12-1 (MDS1) and 12-2 (MDS2) are accessed.

メタデータファイルアクセス部１３は、図５におけるメタデータファイル対応表部１３０に記載のとおり、メタデータファイル格納部１２−１（ＭＤＳ１）から、データファイル１００のオフセット２Ｇバイト乃至３Ｇ−１バイトの領域のデータの格納先がデータファイル格納部１１−０（ＳＳ０）と１１−１（ＳＳ１）（５１２Ｍバイトブロック単位で分割格納）であることを、その実ファイル名とともに読み出す。続いてメタデータファイルアクセス部１３は、メタデータファイル格納部１２−２（ＭＤＳ２）から、データファイル１００のオフセット３Ｇバイト乃至４Ｇ−１バイトの領域のデータの格納先がデータファイル格納部１１−２（ＳＳ２）と１１−３（ＳＳ３）（５１２Ｍバイトブロック単位で分割格納）であることを、その実ファイル名とともに読み出す。メタデータファイルアクセス部１３は、読み出したデータファイル格納先と実ファイル名の情報を、データファイルアクセス部１５へ送付する。 As described in the metadata file correspondence table unit 130 in FIG. 5, the metadata file access unit 13 has an offset 2 Gbyte to 3G-1 byte area of the data file 100 from the metadata file storage unit 12-1 (MDS1). Are stored together with their actual file names that are stored in the data file storage units 11-0 (SS0) and 11-1 (SS1) (512 Mbyte block unit). Subsequently, the metadata file access unit 13 determines that the data file storage unit 11-2 stores the data in the offset 3G byte to 4G-1 byte area of the data file 100 from the metadata file storage unit 12-2 (MDS2). (SS2) and 11-3 (SS3) (divided storage in units of 512 Mbyte blocks) are read together with their actual file names. The metadata file access unit 13 sends the read data file storage destination and actual file name information to the data file access unit 15.

データファイルアクセス部１５は、メタデータファイルアクセス部１３から受信したテータファイル格納先と実ファイル名の情報に基づき、データファイル格納部１１−０乃至１１−３へアクセスする。 The data file access unit 15 accesses the data file storage units 11-0 to 11-3 based on the data file storage destination and the actual file name information received from the metadata file access unit 13.

次に図２のフローチャートを参照して、本実施形態の動作について詳細に説明する。 Next, the operation of this embodiment will be described in detail with reference to the flowchart of FIG.

プロセス実行部１０−０乃至１０−７が実行しているプロセスは、データファイル１００へのファイルアクセス命令１１０を発行し、メタデータファイルアクセス部１３へ送付する（Ｓ１０１）。 The processes executed by the process execution units 10-0 to 10-7 issue a file access command 110 to the data file 100 and send it to the metadata file access unit 13 (S101).

メタデータファイルアクセス部１３は、ファイルアクセス命令１１０をメタデータファイル格納領域決定部１４へ送付する（Ｓ１０２）。 The metadata file access unit 13 sends a file access command 110 to the metadata file storage area determination unit 14 (S102).

メタデータファイル格納領域決定部１４は、ファイルアクセス命令１１０におけるデータファイル名称と開始オフセットとアクセスサイズの情報から、ハッシュ関数１４０を使用してデータファイル１００に対応するメタデータファイルの格納先となるメタデータファイル格納部を決定し、メタデータファイル格納先の情報をメタデータファイルアクセス部１３へ送付する（Ｓ１０３）。 The metadata file storage area determination unit 14 uses a hash function 140 to store a metadata file corresponding to the data file 100 from the data file name, start offset, and access size information in the file access instruction 110. The data file storage unit is determined, and the metadata file storage location information is sent to the metadata file access unit 13 (S103).

メタデータファイルアクセス部１３は、受信したメタデータファイル格納先の情報が示す何れかのメタデータファイル格納部１２−ｎ（ｎ＝０乃至２の何れか）に、ファイルアクセス命令１１０を送付する（Ｓ１０４）。 The metadata file access unit 13 sends a file access command 110 to any one of the metadata file storage units 12-n (n = 0 to 2) indicated by the received metadata file storage location information ( S104).

ファイルアクセス命令１１０がＷｒｉｔｅリクエストである場合（Ｓ１０５でＹｅｓ）、メタデータファイル格納部１２−ｎは、ファイルアクセス命令１１０の情報から、データファイル１１０のメタデータを作成し、メタデータファイル属性情報部１２０とメタデータファイル対応表部１３０に、メタデータを書き込む（Ｓ１０６）。 When the file access command 110 is a write request (Yes in S105), the metadata file storage unit 12-n creates metadata of the data file 110 from the information of the file access command 110, and a metadata file attribute information unit The metadata is written in 120 and the metadata file correspondence table unit 130 (S106).

メタデータファイル格納部１２−ｎは、作成したデータファイル１１０のメタデータをメタデータファイルアクセス部１３へ送付する（Ｓ１０７）。 The metadata file storage unit 12-n sends the metadata of the created data file 110 to the metadata file access unit 13 (S107).

ファイルアクセス命令１１０がＷｒｉｔｅリクエストでない場合（Ｓ１０５でＮｏ）、ファイルアクセス命令１１０はＲｅａｄリクエストであり、メタデータファイル格納部１２−ｎは、メタデータファイル対応表部１３０から、ファイルアクセス命令１１０が包含する情報から決定するレコードに格納されたメタデータを読み込む（Ｓ１０８）。 When the file access command 110 is not a write request (No in S105), the file access command 110 is a read request, and the metadata file storage unit 12-n is included in the file access command 110 from the metadata file correspondence table unit 130. The metadata stored in the record determined from the information to be read is read (S108).

メタデータファイル格納部１２−ｎは、読み込んだデータファイル１１０のメタデータを、メタデータファイルアクセス部１３へ送付する（Ｓ１０９）。 The metadata file storage unit 12-n sends the metadata of the read data file 110 to the metadata file access unit 13 (S109).

メタデータファイルアクセス部1３は、データファイル１１０のメタデータを、データファイルアクセス部１５へ送付する（Ｓ１１０）。 The metadata file access unit 13 sends the metadata of the data file 110 to the data file access unit 15 (S110).

データファイルアクセス部１５は、データファイル１１０のメタデータが示す何れかのデータファイル格納部１１−０乃至１１−３に対して、データファイル１１０のメタデータが示す実ファイル名でアクセスする（Ｓ１１１）。 The data file access unit 15 accesses any one of the data file storage units 11-0 to 11-3 indicated by the metadata of the data file 110 with the actual file name indicated by the metadata of the data file 110 (S111). .

本実施形態には、データファイルのメタデータを格納したメタデータファイルを分散配置させることで、メタデータファイルを格納したある特定のサーバへの負荷の集中を回避する効果がある。なぜなら、メタデータファイル格納領域決定部１４が、プロセス実行部１０−０〜１０−７で実行されるプロセスが発行する、データファイル１００へのファイルアクセス命令１１０におけるデータファイル名称とオフセット情報を基に、ハッシュ関数１４０を用いて、データファイル１００のメタデータファイルを分散配置させるからである。 The present embodiment has an effect of avoiding concentration of load on a specific server storing the metadata file by distributing and arranging the metadata file storing the metadata of the data file. This is because the metadata file storage area determination unit 14 is based on the data file name and offset information in the file access instruction 110 to the data file 100 issued by the processes executed by the process execution units 10-0 to 10-7. This is because the metadata file of the data file 100 is distributed using the hash function 140.

メタデータファイルの分散配置により、複数のメタデータファイル格納サーバに負荷を分散させることで、メタデータ格納ファイルサーバの処理のボトルネックによるシステム性能の低下を回避することが可能となる。 By distributing the metadata file, the load is distributed to a plurality of metadata file storage servers, so that it is possible to avoid a decrease in system performance due to a bottleneck in processing of the metadata storage file server.

また本実施形態は、１つのデータファイルにおけるブロック領域単位に対応したメタデータを分散配置しているため、１つのデータファイルについてブロック領域単位でデータを並列処理するような大きな配列変数の演算を行うＨＰＣ領域のアプリケーションにおいて、大きな効果がある。
＜第二の実施形態＞
次に、本願発明の第二の実施形態について図面を参照して詳細に説明する。 In addition, since the metadata corresponding to the block area unit in one data file is distributed and arranged in the present embodiment, a large array variable is calculated such that data is processed in parallel for each block area for one data file. This has a great effect in the application in the HPC region.
<Second Embodiment>
Next, a second embodiment of the present invention will be described in detail with reference to the drawings.

図６は本願発明の第二の実施形態の構成を示すブロック図である。 FIG. 6 is a block diagram showing the configuration of the second embodiment of the present invention.

本実施形態は、第一の実施形態の構成に加えて、ハッシュ関数数式設定部１４１とハッシュ関数設定情報格納部１６とを備えている。ハッシュ関数数式設定部１４１とハッシュ関数設定情報格納部１６を除く各構成要素の機能は第一の実施形態と同様である。 The present embodiment includes a hash function formula setting unit 141 and a hash function setting information storage unit 16 in addition to the configuration of the first embodiment. The functions of the constituent elements other than the hash function formula setting unit 141 and the hash function setting information storage unit 16 are the same as those in the first embodiment.

ハッシュ関数数式設定入力部１４１は、メタデータファイルアクセス部１３から受信した、ファイルアクセス命令１１０におけるデータファイル名称を、ハッシュ関数設定情報格納部１６に送付する。 The hash function formula setting input unit 141 sends the data file name in the file access command 110 received from the metadata file access unit 13 to the hash function setting information storage unit 16.

ハッシュ関数設定情報格納部１６は、ハッシュ関数設定情報１６０を格納している。ハッシュ関数設定情報１６０の構成例を図７に示す。ハッシュ関数設定情報１６０はデータファイル１００とハッシュ関数１４０との対応関係を格納している。例えば、データファイル１００が「／ｄｉｒ＿１／ｆｉｌｅ＿Ａ」の場合は、メタデータファイル格納領域決定部１４０は、ハッシュ関数として、

ＭＤＳｎｕｍ（ｘ，ｂ）＝（ｆｌｏｏｒ（ｘ／３ＧＢ）＋ａｓｃｉｉ（ｂ））ｍｏｄ３

を使用し、データファイル１００が「／ｄｉｒ＿１／ｆｉｌｅ＿Ｂ」の場合は、メタデータファイル格納領域決定部１４０は、ハッシュ関数として、

ＭＤＳｎｕｍ（ｘ，ｂ）＝（ｆｌｏｏｒ（ｘ／６ＧＢ）＋ａｓｃｉｉ（ｂ））ｍｏｄ３

を使用することを示している。 The hash function setting information storage unit 16 stores hash function setting information 160. A configuration example of the hash function setting information 160 is shown in FIG. The hash function setting information 160 stores the correspondence between the data file 100 and the hash function 140. For example, when the data file 100 is “/ dir_1 / file_A”, the metadata file storage area determination unit 140 uses a hash function as

MDSnum (x, b) = (floor (x / 3GB) + ascii (b)) mod 3

When the data file 100 is “/ dir_1 / file_B”, the metadata file storage area determination unit 140 uses a hash function as

MDSnum (x, b) = (floor (x / 6GB) + ascii (b)) mod 3

Shows that you use.

ハッシュ関数設定情報１６０は、実行するアプリケーションのデータファイルへのアクセスの特性を熟知した担当者が設定する。 The hash function setting information 160 is set by a person in charge who is familiar with the characteristics of access to the data file of the application to be executed.

ハッシュ関数設定情報格納部１６は、ハッシュ関数数式設定入力部１４１から受信したデータファイル名称と、データファイル名称が一致するレコードをハッシュ関数設定情報１６０から読み出して、読み出したレコードにおけるハッシュ関数数式の情報をハッシュ関数数式設定入力部１４１へ送付する。 The hash function setting information storage unit 16 reads from the hash function setting information 160 a record whose data file name matches the data file name received from the hash function equation setting input unit 141, and information on the hash function equation in the read record Is sent to the hash function formula setting input unit 141.

ハッシュ関数数式設定入力部１４１は、受信したハッシュ関数数式をハッシュ関数１４０に設定する。 The hash function formula setting input unit 141 sets the received hash function formula in the hash function 140.

本実施形態には、メタデータファイルの分散配置を決定するハッシュ関数の数式をデータファイルにより可変としたことで、データファイル毎にメタデータファイルの分散配置の仕方を変更できる効果がある。なぜなら、ハッシュ関数数式設定部１４１が、アクセスするデータファイルに対応したハッシュ関数の数式をハッシュ関数設定情格納部１６から取得し、その数式をメタデータファイル格納領域決定部１４で使用するハッシュ関数１４０として設定するからである。 In the present embodiment, the formula of the hash function for determining the distributed arrangement of the metadata file is variable depending on the data file, so that the method of distributing the distributed metadata file can be changed for each data file. This is because the hash function formula setting unit 141 acquires the formula of the hash function corresponding to the data file to be accessed from the hash function setting information storage unit 16 and uses the formula in the metadata file storage area determination unit 14. It is because it sets as.

メタデータファイルの分散配置の仕方をデータファイルにより可変とすることで、実行するプログラムのデータファイルへのアクセスの仕方の特徴をふまえたメタデータファイルの分散配置の仕方を設定し、チューニングを行うことが可能となる。
＜第三の実施形態＞
次に、本願発明の第三の実施形態について図面を参照して詳細に説明する。 By making the method of distributed arrangement of metadata files variable depending on the data file, setting the method of distributed arrangement of metadata files based on the characteristics of how to access the data file of the program to be executed and tuning Is possible.
<Third embodiment>
Next, a third embodiment of the present invention will be described in detail with reference to the drawings.

図８は本願発明の第三の実施形態の構成を示すブロック図である。 FIG. 8 is a block diagram showing the configuration of the third embodiment of the present invention.

本実施形態の分散ファイルアクセス装置は、複数、例えば３個のメタデータファイル格納部２２−０乃至２２−２と、メタデータファイルアクセス部２３と、メタデータファイル格納領域決定部２４とを包含している。 The distributed file access apparatus according to this embodiment includes a plurality of, for example, three metadata file storage units 22-0 to 22-2, a metadata file access unit 23, and a metadata file storage area determination unit 24. ing.

メタデータファイル格納部２２−０乃至２２−２は、データファイル２００のメタデータを記録したメタデータファイルを分割して格納しており、それぞれのシステム管理上の名称はＭＤＳ０乃至ＭＤＳ２である。 The metadata file storage units 22-0 to 22-2 divide and store the metadata file that records the metadata of the data file 200, and the system management names thereof are MDS0 to MDS2.

メタデータファイル格納領域決定部２４は、データファイル２００へのアクセス命令におけるデータファイル２００の識別情報とアクセスアドレスの情報を基に、ハッシュ関数２４０を用いて、データファイル２００のメタデータが格納されたメタデータファイル格納部２２−０（ＭＤＳ０）乃至２２−２（ＭＤＳ２）の何れかを一意に決定する。 The metadata file storage area determination unit 24 stores the metadata of the data file 200 using the hash function 240 based on the identification information and access address information of the data file 200 in the access command to the data file 200. Any one of the metadata file storage units 22-0 (MDS0) to 22-2 (MDS2) is uniquely determined.

データファイル２００へのアクセス命令の構成は、第一の実施例と同様、図４に示すとおりである。したがって、データファイル２００の識別情報としてファイルアクセス命令１１０におけるデータファイル名称を使用し、前記のアクセスアドレスの情報として、ファイルアクセス命令１１０における開始オフセットとアクセスサイズを使用する。 The configuration of the access command to the data file 200 is as shown in FIG. 4 as in the first embodiment. Therefore, the data file name in the file access instruction 110 is used as the identification information of the data file 200, and the start offset and the access size in the file access instruction 110 are used as the access address information.

メタデータファイルアクセス部２３は、メタデータファイル格納領域決定部２４が決定したメタデータファイル格納部２２−０（ＭＤＳ０）乃至２２−２（ＭＤＳ２）の何れかに格納されたメタデータファイルにアクセスする。 The metadata file access unit 23 accesses a metadata file stored in any one of the metadata file storage units 22-0 (MDS0) to 22-2 (MDS2) determined by the metadata file storage area determination unit 24. .

本実施形態におけるデータファイル２００のメタデータの分散格納の構成例を図９に示す。本実施形態では、メタデータファイル格納部２２−０乃至２２−２が、第一の実施形態の様にデータファイル１００のブロック領域単位に対応するメタデータを格納するのではなく、データファイル２００のアドレス単位に対応するメタデータを格納する。 A configuration example of the distributed storage of metadata of the data file 200 in the present embodiment is shown in FIG. In this embodiment, the metadata file storage units 22-0 to 22-2 do not store the metadata corresponding to the block area unit of the data file 100 as in the first embodiment, but instead of storing the metadata of the data file 200. Stores metadata corresponding to address units.

本実施形態には、第一の実施形態と同様に、データファイルのメタデータを格納したメタデータファイルを分散配置させることで、メタデータファイルを格納したある特定のサーバへの負荷の集中を回避する効果がある。 As in the first embodiment, the present embodiment distributes metadata files that store metadata of data files, thereby avoiding concentration of load on a specific server that stores the metadata files. There is an effect to.

また本実施形態は、１つデータファイルにおけるアドレス単位に対応したメタデータを分散配置しているため、１つのデータファイル内のデータを不規則に同時に参照して処理するような一般的なアプリケーションにおいて効果がある。 In the present embodiment, since metadata corresponding to the address unit in one data file is distributed and arranged, in a general application in which data in one data file is simultaneously referred to and processed irregularly. effective.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記実施形態に限定されたものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 While the present invention has been described with reference to the embodiments, the present invention is not limited to the above embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

１０−０乃至１０−７プロセス実行部
１１−０乃至１１−３データファイル格納部
１２−０乃至１２−２メタデータファイル格納部
１３メタデータファイルアクセス部
１４メタデータファイル格納領域決定部
１４０ハッシュ関数
１５データファイルアクセス部
１００データファイル
１１０ファイルアクセス命令
１２０メタデータファイル属性情報部
１３０メタデータファイル対応表部
１４１ハッシュ関数数式設定部
１６ハッシュ関数設定情報格納部
１６０ハッシュ関数設定情報
２２−０乃至２２−２メタデータファイル格納部
２３メタデータファイルアクセス部
２４メタデータファイル格納領域決定部
２４０ハッシュ関数
２００データファイル 10-0 to 10-7 Process execution unit 11-0 to 11-3 Data file storage unit 12-0 to 12-2 Metadata file storage unit 13 Metadata file access unit 14 Metadata file storage area determination unit 140 Hash function DESCRIPTION OF SYMBOLS 15 Data file access part 100 Data file 110 File access instruction 120 Metadata file attribute information part 130 Metadata file correspondence table part 141 Hash function numerical formula setting part 16 Hash function setting information storage part 160 Hash function setting information 22-0 thru | or 22- 2 Metadata File Storage Unit 23 Metadata File Access Unit 24 Metadata File Storage Area Determination Unit 240 Hash Function 200 Data File

Claims

１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して格納する複数のメタデータファイル格納手段と、
前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記メタデータファイル格納手段の識別情報を一意に決定するメタデータファイル格納領域決定手段と、
前記メタデータファイル格納領域決定手段が決定した前記メタデータファイル格納手段へアクセスするメタデータファイルアクセス手段と、
を備える分散ファイルアクセス装置。 A plurality of metadata file storage means for storing a metadata file storing the metadata of one data file by dividing the metadata file into a plurality of metadata files;
The identification information of the data file specified in the access instruction to the data file and the access address information in the data file are input to the hash function, and the access address of the data file is calculated from the output value of the hash function. Metadata file storage area determination means for uniquely determining identification information of the metadata file storage means for storing the metadata file corresponding to
Metadata file access means for accessing the metadata file storage means determined by the metadata file storage area determination means;
A distributed file access device comprising:

前記メタデータファイル格納手段は、１つの前記データファイルを構成する複数のブロック領域単位で、前記ブロック領域に対応する前記メタデータファイルを格納し、
前記メタデータファイル格納領域決定手段は、前記アクセス命令において指定された当該データファイルにおける前記アクセスアドレスを、前記ブロック領域単位で、当該データファイルの識別情報とともに前記ハッシュ関数に入力して、当該ブロック領域に対応する前記メタデータファイルを格納する前記メタデータファイル格納手段の識別情報を一意に決定する
請求項１の分散ファイルアクセス装置。 The metadata file storage means stores the metadata file corresponding to the block area in a plurality of block area units constituting one data file,
The metadata file storage area determination means inputs the access address in the data file specified in the access instruction to the hash function together with the identification information of the data file in the block area unit, The distributed file access device according to claim 1, wherein identification information of the metadata file storage unit that stores the metadata file corresponding to is uniquely determined.

前記データファイルの識別情報と、前記ハッシュ関数の数式との対応情報を格納する、ハッシュ関数設定情報格納手段を更に備え、
前記メタデータファイル格納領域決定手段は、前記対応情報を構成するレコードの中から、前記データファイルの識別情報が、前記データファイルへのアクセス命令における当該データファイルの識別情報と一致するレコードを参照して、当該レコードに格納された前記ハッシュ関数の数式を使用する、
請求項１乃至２の何れかの分散ファイルアクセス装置。 A hash function setting information storage unit that stores correspondence information between the identification information of the data file and the mathematical formula of the hash function;
The metadata file storage area determination means refers to a record in which the identification information of the data file matches the identification information of the data file in the access instruction to the data file, among the records constituting the correspondence information. And using the hash function formula stored in the record,
The distributed file access apparatus according to claim 1.

請求項１から３に記載の分散ファイルアクセス装置と、
前記データファイルへのアクセス命令を発行して前記データファイルへアクセスするプロセス実行装置と、
前記データファイルを格納するデータファイル格納装置と、
を包含する分散ファイルアクセスシステム。 The distributed file access device according to claim 1,
A process execution device that issues an access instruction to the data file to access the data file;
A data file storage device for storing the data file;
A distributed file access system.

１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して複数の記憶域に格納し、
前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記記憶域の識別情報を一意に決定し、
決定した前記識別情報が示す前記記憶域へアクセスする
分散ファイルアクセス方法。 A metadata file storing meta information of one data file is divided into a plurality of pieces and stored in a plurality of storage areas.
The identification information of the data file specified in the access instruction to the data file and the access address information in the data file are input to the hash function, and the access address of the data file is calculated from the output value of the hash function. Uniquely identifying the storage area storing the metadata file corresponding to
A distributed file access method for accessing the storage area indicated by the determined identification information.

前記メタデータファイルが格納された記憶域は、１つの前記データファイルを構成する複数のブロック領域単位で、前記ブロック領域に対応する前記メタデータファイルを格納し、
前記アクセス命令において指定された当該データファイルにおける前記アクセスアドレスを、前記ブロック領域単位で、当該データファイルの識別情報とともに前記ハッシュ関数に入力して、当該ブロック領域に対応する前記メタデータファイルが格納された前記メタデータファイルを格納する前記記憶域の識別情報を一意に決定する
請求項５の分散ファイルアクセス方法。 The storage area in which the metadata file is stored stores the metadata file corresponding to the block area in a plurality of block area units constituting one data file,
The access address in the data file specified in the access command is input to the hash function together with the identification information of the data file in the block area unit, and the metadata file corresponding to the block area is stored. 6. The distributed file access method according to claim 5, wherein identification information of the storage area for storing the metadata file is uniquely determined.

前記データファイルの識別情報と、前記ハッシュ関数の数式との対応情報を記憶域に格納し、
前記対応情報を構成するレコードの中から、前記データファイルの識別情報が、前記データファイルへのアクセス命令における当該データファイルの識別情報と一致するレコードを参照して、当該レコードに格納された前記ハッシュ関数の数式を、前記メタデータファイルを格納する記憶域の識別情報を決定するために使用する
請求項５乃至６の何れかの分散ファイルアクセス方法。 Storing correspondence information between the identification information of the data file and the formula of the hash function in a storage area;
The hash stored in the record with reference to the record in which the identification information of the data file matches the identification information of the data file in the access instruction to the data file from among the records constituting the correspondence information The distributed file access method according to claim 5, wherein a mathematical expression of a function is used to determine identification information of a storage area in which the metadata file is stored.

１つのデータファイルのメタ情報を格納したメタデータファイルを、複数に分割して複数の記憶域に格納するメタデータファイル格納処理と、
前記データファイルへのアクセス命令において指定された当該データファイルの識別情報と、当該データファイルにおけるアクセスアドレスの情報とをハッシュ関数に入力し、前記ハッシュ関数の出力値から、当該データファイルの当該アクセスアドレスに対応する前記メタデータファイルを格納する前記記憶域の識別情報を一意に決定するメタデータファイル格納領域決定処理と、
前記メタデータファイル格納領域決定処理が決定した前記記憶域へアクセスするメタデータファイルアクセス処理と、
をコンピュータに実行させる分散ファイルアクセスプログラム。 A metadata file storing process in which a metadata file storing metadata information of one data file is divided into a plurality of pieces and stored in a plurality of storage areas;
The identification information of the data file specified in the access instruction to the data file and the access address information in the data file are input to the hash function, and the access address of the data file is calculated from the output value of the hash function. Metadata file storage area determination processing for uniquely determining the storage area identification information for storing the metadata file corresponding to
A metadata file access process for accessing the storage area determined by the metadata file storage area determination process;
Distributed file access program that causes computers to execute.

前記メタデータファイルを格納する記憶域は、１つの前記データファイルを構成する複数のブロック領域単位で、前記ブロック領域に対応する前記メタデータファイルを格納し、
前記メタデータファイル格納領域決定処理は、前記アクセス命令において指定された当該データファイルにおける前記アクセスアドレスを、前記ブロック領域単位で、当該データファイルの識別情報とともに前記ハッシュ関数に入力して、当該ブロック領域に対応する前記メタデータファイルを格納する前記メタデータファイル格納処理の記憶域の識別情報を一意に決定する
請求項８の分散ファイルアクセスプログラム。 The storage area for storing the metadata file stores the metadata file corresponding to the block area in a plurality of block area units constituting one data file,
The metadata file storage area determination process inputs the access address in the data file specified in the access instruction to the hash function together with the identification information of the data file in the block area unit, The distributed file access program according to claim 8, wherein identification information of a storage area of the metadata file storing process for storing the metadata file corresponding to is uniquely determined.

前記データファイルの識別情報と、前記ハッシュ関数の数式との対応情報を記憶域に格納するハッシュ関数設定情報格納処理と、
前記対応情報を構成するレコードの中から、前記データファイルの識別情報が、前記データファイルへのアクセス命令における当該データファイルの識別情報と一致するレコードを参照して、当該レコードに格納された前記ハッシュ関数の数式を使用する前記メタデータファイル格納領域決定処理と、
をコンピュータに実行させる請求項８乃至９の何れかの分散ファイルアクセスプログラム。 Hash function setting information storage processing for storing correspondence information between the identification information of the data file and the formula of the hash function in a storage area;
The hash stored in the record with reference to the record in which the identification information of the data file matches the identification information of the data file in the access instruction to the data file from among the records constituting the correspondence information The metadata file storage area determination process using a mathematical expression of a function;
10. The distributed file access program according to claim 8, wherein the computer executes the program.