JP2021022287A

JP2021022287A - Backup processor, backup processing method, and backup processing program

Info

Publication number: JP2021022287A
Application number: JP2019139757A
Authority: JP
Inventors: 祐介内村; Yusuke UCHIMURA
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2019-07-30
Filing date: 2019-07-30
Publication date: 2021-02-18
Anticipated expiration: 2039-07-30
Also published as: JP7326970B2

Abstract

To efficiently perform backup processing in a system that backs up a difference between data when backup data of some data is nonexistent.SOLUTION: A backup processor 40 comprises: a determination unit 41 which, when backing up first data 51-1, determines whether or not backup data 44-1 exists in the local device on the basis of a differential between first data 51-1 stored in a first other device 50-1 and backup data 44-1 of first data 51-1 stored in the local device; a specification unit 42 which, when the backup data 44-1 does not exist in the local device, specifies the backup data 44-2 of second data 51-2 stored in a second other device 50-2, which exists in the local device and satisfies a difference in specification with the first data 51-1; and an execution unit 43 for executing a backup process on the basis of a difference between the first data 51-1 and the backup data 44-2.SELECTED DRAWING: Figure 3

Description

本願発明は、データのバックアップを、ある時点のバックアップデータと現在のデータとの差分に基づいて行う技術に関する。 The present invention relates to a technique for backing up data based on the difference between the backup data at a certain point in time and the current data.

様々なコンピュータシステムにおいて、障害の発生に備えて、システムにおいて記憶されている全てのデータあるいは重要なデータをバックアップしておく運用が、一般的に行われている。そして、このようなバックアップ処理の実行に伴うサービスの停止時間を最小限に抑えるために、ある時点のバックアップデータと現在のデータとの差分に基づいてバックアップ処理を行う様々な差分バックアップ方式が考案されており、この差分バックアップを効率的に行う技術への期待が高まってきている。 In various computer systems, it is common practice to back up all the data or important data stored in the system in case of a failure. Then, in order to minimize the service downtime associated with the execution of such a backup process, various differential backup methods have been devised that perform the backup process based on the difference between the backup data at a certain point in time and the current data. Therefore, expectations for a technology for efficiently performing this differential backup are increasing.

このような技術に関連する技術として、特許文献１には、管理サーバと管理対象マシン間の通信量を抑えながら、差分データ量を最適化するバックアップ方式が開示されている。このバックアップ方式における管理サーバは、フルバックアップデータを格納するフルバックアップデータ格納手段を備える。この管理サーバは、フルバックアップデータを複数のブロックに分割するブロックサイズと、各ブロックに対応する複数のハッシュ値とを含むハッシュデータを格納するワーキングエリアを備える。この管理サーバは、フルバックアップデータと現在のデータとの差分データを、ハッシュデータを用いて抽出させるブートイメージを管理対象マシンに送信する。この管理サーバは、そのブートイメージを用いて管理対象マシンが起動された後、ハッシュデータを管理対象マシンへ送信する。そして、この管理サーバは、管理対象マシンから差分データを受信し、フルバックアップデータと差分データとを用いて、次のバックアップに用いるブロックサイズを計算する。 As a technique related to such a technique, Patent Document 1 discloses a backup method that optimizes the amount of differential data while suppressing the amount of communication between the management server and the managed machine. The management server in this backup method includes a full backup data storage means for storing the full backup data. This management server has a working area for storing hash data including a block size for dividing the full backup data into a plurality of blocks and a plurality of hash values corresponding to each block. This management server sends a boot image to the managed machine to extract the difference data between the full backup data and the current data using hash data. This management server sends hash data to the managed machine after the managed machine is started by using the boot image. Then, this management server receives the difference data from the managed machine, and calculates the block size to be used for the next backup by using the full backup data and the difference data.

また、特許文献２には、差分バックアップの所要時間を短縮する差分バックアップシステムが開示されている。この差分バックアップシステムでは、クライアントとバックアップ管理サーバとが相互に接続されている。バックアップ管理サーバは、第１乃至第３のブロックサイズの各々で、第１のハッシュデータを算出し、この第１のハッシュデータを試験用領域とそれ以外の領域のものとに分割してクライアントに送信する。クライアントは、第１のハッシュデータを利用して試験用領域について試験的差分バックアップを行い、最も所要時間の少なかったブロックサイズを最適サイズとして決定する。そして、クライアントは、最適サイズにより残領域をブロックに区切って差分バックアップを行う。 Further, Patent Document 2 discloses a differential backup system that shortens the time required for differential backup. In this differential backup system, the client and the backup management server are interconnected. The backup management server calculates the first hash data for each of the first to third block sizes, divides the first hash data into a test area and other areas, and sends it to the client. Send. The client performs a test differential backup for the test area using the first hash data, and determines the block size with the shortest required time as the optimum size. Then, the client divides the remaining area into blocks according to the optimum size and performs differential backup.

特許第５２８７３６６号公報Japanese Patent No. 5287366 特開２０１１−２１００６８号公報Japanese Unexamined Patent Publication No. 2011-210068

上述した差分バックアップを行うシステムにおいて、例えばシステム内におけるあるディスクのデータについて最初のバックアップ処理を行う場合、当該データのバックアップデータがまだ存在していない状態にある。したがって、差分バックアップを行うシステムにおいても、あるディスクのデータのバックアップデータがまだ存在していない場合におけるバックアップ処理は、通常、当該データ全体について行われる（即ちフルバックアップが行われる）。 In the system for performing the differential backup described above, for example, when the first backup process is performed on the data of a certain disk in the system, the backup data of the data does not yet exist. Therefore, even in a system that performs differential backup, the backup process when the backup data of the data of a certain disk does not yet exist is usually performed for the entire data (that is, a full backup is performed).

しかしながら、大規模化によって膨大なデータを記憶する昨今のコンピュータシステムにおいて、あるディスクのデータのバックアップデータがまだ存在していない場合のバックアップ処理をフルバックアップによって行うことによって生じるサービスの停止時間等が及ぼす影響は、必ずしも小さいとはいえない。すなわち、差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を効率的に行うことが課題である。特許文献１及び２は、このような課題については特に言及していない。本願発明の主たる目的は、この課題を解決するバックアップ処理装置等を提供することである。 However, in recent computer systems that store huge amounts of data due to large scale, the service downtime caused by performing the backup process when the backup data of the data of a certain disk does not yet exist by full backup has an effect. The impact is not always small. That is, in a system that performs differential backup, it is an issue to efficiently perform backup processing when backup data of certain data does not yet exist. Patent Documents 1 and 2 do not particularly mention such a problem. A main object of the present invention is to provide a backup processing apparatus or the like that solves this problem.

本願発明の一態様に係るバックアップ処理装置は、第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが自装置に存在するか否かを判定する判定手段と、前記第一のデータのバックアップデータが自装置に存在しない場合、自装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定する特定手段と、現在の前記第一のデータと前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う実行手段と、を備える。 The backup processing device according to one aspect of the present invention is a backup of the current first data stored in the first other device and the previously generated or updated first data stored in the own device. When performing backup processing on the first data based on the difference from the data, a determination means for determining whether or not the backup data of the first data exists in the own device, and the first data. If the backup data of is not present in the own device, identify the backup data of the second data stored in the second other device that exists in the own device and the difference in specifications from the first data meets the criteria. The specific means is provided, and an execution means for performing a backup process on the first data based on the difference between the current first data and the backup data of the second data.

上記目的を達成する他の見地において、本願発明の一態様に係るバックアップ処理方法は、情報処理装置によって、第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが前記情報処理装置に存在するか否かを判定し、前記第一のデータのバックアップデータが前記情報処理装置に存在しない場合、前記情報処理装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定し、現在の前記第一のデータと、前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う。 From another point of view of achieving the above object, the backup processing method according to one aspect of the present invention is stored in the own device together with the current first data stored in the first other device by the information processing device. When the backup process for the first data is performed based on the difference from the backup data of the first data generated or updated in the past, the backup data of the first data is sent to the information processing apparatus. It is determined whether or not the data exists, and if the backup data of the first data does not exist in the information processing device, the difference in specifications from the first data existing in the information processing device satisfies the standard. The backup data of the second data stored in the second other device is specified, and the backup data of the first data is based on the difference between the current first data and the backup data of the second data. Perform backup processing.

また、上記目的を達成する更なる見地において、本願発明の一態様に係るバックアップ処理プログラムは、第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが自装置に存在するか否かを判定する判定処理と、前記第一のデータのバックアップデータが自装置に存在しない場合、自装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定する特定処理と、現在の前記第一のデータと、前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う実行処理と、を自装置に実行させる。 Further, from the further viewpoint of achieving the above object, the backup processing program according to one aspect of the present invention includes the current first data stored in the first other device and the past stored in the own device. Whether or not the backup data of the first data exists in the own device when the backup process for the first data is performed based on the difference between the generated or updated backup data of the first data and the backup data. When the judgment process for determining the above and the backup data of the first data do not exist in the own device, the difference in specifications from the first data existing in the own device satisfies the standard, and the second other device The backup process for the first data is performed based on the specific process for specifying the backup data of the stored second data and the difference between the current first data and the backup data for the second data. Let the own device execute the execution process to be performed.

更に、本願発明は、係るバックアップ処理プログラム（コンピュータプログラム）が格納された、コンピュータ読み取り可能な、不揮発性の記録媒体によっても実現可能である。 Further, the present invention can also be realized by a computer-readable, non-volatile recording medium in which the backup processing program (computer program) is stored.

本願発明は、データの差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を、効率的に行うことを可能とする。 The present invention makes it possible to efficiently perform a backup process when backup data of a certain data does not yet exist in a system for performing differential data backup.

本願発明の第１の実施形態に係るバックアップ処理システム１の構成を示すブロック図である。It is a block diagram which shows the structure of the backup processing system 1 which concerns on 1st Embodiment of this invention. 本願発明の第１の実施形態に係るバックアップ処理システム１の動作を示すフローチャート（１／２）である。It is a flowchart (1/2) which shows the operation of the backup processing system 1 which concerns on 1st Embodiment of this invention. 本願発明の第１の実施形態に係るバックアップ処理システム１の動作を示すフローチャート（２／２）である。It is a flowchart (2/2) which shows the operation of the backup processing system 1 which concerns on 1st Embodiment of this invention. 本願発明の第２の実施形態に係るバックアップ処理装置４０の構成を示すブロック図である。It is a block diagram which shows the structure of the backup processing apparatus 40 which concerns on 2nd Embodiment of this invention. 本願発明の各実施形態に係るバックアップ処理装置を実行可能な情報処理装置９００の構成を示すブロック図である。It is a block diagram which shows the structure of the information processing apparatus 900 which can execute the backup processing apparatus which concerns on each embodiment of this invention.

以下、本願発明の実施の形態について図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜第１の実施形態＞
図１は、本願発明の第１の実施の形態に係るバックアップ処理システム１の構成を示すブロック図である。バックアップ処理システム１は、大別して、バックアップ処理装置１０と、クライアント装置２０−１乃至２０−３とを有する。バックアップ処理装置１０と、クライアント装置２０−１乃至２０−３とは、通信ネットワーク３０によって通信可能に接続されている。 <First Embodiment>
FIG. 1 is a block diagram showing a configuration of a backup processing system 1 according to the first embodiment of the present invention. The backup processing system 1 is roughly classified into a backup processing device 10 and client devices 20-1 to 20-3. The backup processing device 10 and the client devices 20-1 to 20-3 are communicably connected by the communication network 30.

クライアント装置２０−１乃至２０−３は、例えばパーソナルコンピュータ等の情報処理装置であり、後述するようにサーバ装置としての機能を備えるバックアップ処理装置１０へアクセスすることによって、バックアップ処理装置１０から様々なサービスを提供される。クライアント装置２０−１乃至２０−３は、順にディスク２１−１乃至２１−３を備え、ディスク２１−１乃至２１−３に、使用するデータ２１０−１乃至２１０−３を記憶している。但し、ディスク２１−１乃至２１−３は、例えばＨＤＤ（Hard Disk Drive）等の記憶デバイスである。 The client devices 20-1 to 20-3 are information processing devices such as a personal computer, and various devices from the backup processing device 10 can be used by accessing the backup processing device 10 having a function as a server device as described later. Service is provided. The client devices 20-1 to 20-3 include disks 21-1 to 21-3 in order, and the data 210-1 to 210-3 to be used are stored in the disks 21-1 to 21-3. However, the disks 21-1 to 21-3 are storage devices such as an HDD (Hard Disk Drive).

尚、本願では以降、クライアント装置２０−１乃至２０−３をまとめてクライアント装置２０と称し、ディスク２１−１乃至２１−３をまとめてディスク２１と称し、データ２１０−１乃至２１０−３をまとめてデータ２１０と称する場合がある。また、図１には一例として３つのクライアント装置２０−１乃至２０−３が示されているが、バックアップ処理システム１は、２以上の任意の数のクライアント装置２０を有すればよい。 In the present application, the client devices 20-1 to 20-3 are collectively referred to as a client device 20, the disks 21-1 to 21-3 are collectively referred to as a disk 21, and the data 210-1 to 210-3 are collectively referred to as a disk 21. It may be referred to as data 210. Further, although three client devices 20-1 to 20-3 are shown as an example in FIG. 1, the backup processing system 1 may have two or more arbitrary number of client devices 20.

バックアップ処理装置１０は、例えばサーバ装置等の情報処理装置であり、クライアント装置２０のディスク２１に記憶されているデータ２１０のバックアップ処理を、差分バックアップにより行う機能を備える。 The backup processing device 10 is, for example, an information processing device such as a server device, and has a function of performing backup processing of data 210 stored in the disk 21 of the client device 20 by differential backup.

バックアップ処理装置１０は、判定部１１、特定部１２、実行部１３、及び、記憶部１４を備えている。 The backup processing device 10 includes a determination unit 11, a specific unit 12, an execution unit 13, and a storage unit 14.

記憶部１４は、例えばＨＤＤあるいは電子メモリ等の記憶デバイスであり、バックアップデータ１４１−１乃至１４１−３、及び、差分データ１４２−１乃至１４２−３を記憶している。 The storage unit 14 is a storage device such as an HDD or an electronic memory, and stores backup data 141-1 to 141-3 and difference data 142-1 to 142-3.

バックアップデータ１４１−１は、過去のある時点において、クライアント装置２０−１におけるデータ２１０−１がバックアップされたデータである。差分データ１４２−１は、バックアップデータ１４１−１と、現在のデータ２１０−１との差分を表すデータであり、バックアップ処理装置１０が、現在のデータ２１０−１のバックアップ処理を行う際に生成するデータである。 The backup data 141-1 is data in which the data 210-1 in the client device 20-1 is backed up at a certain point in the past. The difference data 142-1 is data representing the difference between the backup data 141-1 and the current data 210-1, and is generated when the backup processing device 10 performs the backup process of the current data 210-1. It is data.

バックアップデータ１４１−２あるいは１４１−３もバックアップデータ１４１−１と同様に、過去のある時点において、クライアント装置２０−２あるいは２０−３におけるデータ２１０−２あるいは２１０−３がバックアップされたデータである。差分データ１４２−２あるいは１４２−３も差分データ１４２−１と同様に、バックアップデータ１４１−２あるいは１４１−３と、現在のデータ２１０−２あるいは２１０−３との差分を表すデータであり、バックアップ処理装置１０が、現在のデータ２１０−２あるいは２１０−３のバックアップ処理を行う際に生成するデータである。 Similar to the backup data 141-1, the backup data 141-2 or 141-3 is also the data in which the data 210-2 or 210-3 in the client device 20-2 or 20-3 is backed up at a certain point in the past. .. Similar to the difference data 142-1, the difference data 142-2 or 142-3 is also data representing the difference between the backup data 141-2 or 141-3 and the current data 210-2 or 210-3, and is a backup. This is data generated when the processing device 10 backs up the current data 210-2 or 210-3.

尚、本願では以降、バックアップデータ１４１−１乃至１４１−３をまとめてバックアップデータ１４１と称し、差分データ１４２−１乃至１４２−３をまとめて差分データ１４２と称する場合がある。 In the present application, the backup data 141-1 to 141-3 may be collectively referred to as backup data 141, and the difference data 142-1 to 142-3 may be collectively referred to as difference data 142.

判定部１１は、バックアップ処理装置１０が、例えばデータ２１０−１（第一のデータ）に対するバックアップ処理を行う際に、バックアップ処理装置１０によってデータ２１０−１のバックアップ処理がこれまでに少なくとも１回以上行われた結果、過去のある時点におけるバックアップデータ１４１−１が記憶部１４（自装置）に存在するか否かを判定する。 In the determination unit 11, when the backup processing device 10 performs backup processing for, for example, data 210-1 (first data), the backup processing device 10 has performed backup processing for data 210-1 at least once so far. As a result of this, it is determined whether or not the backup data 141-1 at a certain point in the past exists in the storage unit 14 (own device).

まず、記憶部１４にバックアップデータ１４１−１が存在する場合における、バックアップ処理装置１０によるデータ２１０−１に対するバックアップ動作について説明する。 First, a backup operation for data 210-1 by the backup processing device 10 when backup data 141-1 exists in the storage unit 14 will be described.

実行部１３は、上述した判定部１１による判定結果を受けて、バックアップデータ１４１−１を所定の大きさのブロックに分割して、そのブロックごとにハッシュ値を算出する。実行部１３は、そのブロックに含まれるデータの内容に基づいて、例えば所定のハッシュ関数を用いてハッシュ値を算出する。実行部１３は、算出したバックアップデータ１４１−１の各ブロックのハッシュ値をクライアント装置２０−１に送信するとともに、現在のデータ２１０−１における当該ブロックに相当するブロックごとにハッシュ値を算出するように、クライアント装置２０−１を制御する。 The execution unit 13 receives the determination result by the determination unit 11 described above, divides the backup data 141-1 into blocks of a predetermined size, and calculates a hash value for each block. The execution unit 13 calculates a hash value using, for example, a predetermined hash function based on the contents of the data included in the block. The execution unit 13 transmits the hash value of each block of the calculated backup data 141-1 to the client device 20-1, and calculates the hash value for each block corresponding to the block in the current data 210-1. In addition, the client device 20-1 is controlled.

もし、あるブロックに関して、データ２１０−１がバックアップデータ１４１−１から更新されていない場合、バックアップデータ１４１−１の当該ブロックのハッシュ値と、データ２１０−１の当該ブロックのハッシュ値とは等しくなる。一方、あるブロックに関して、データ２１０−１がバックアップデータ１４１−１から更新されている場合、バックアップデータ１４１−１の当該ブロックのハッシュ値と、データ２１０−１の当該ブロックのハッシュ値とは、異なる値になる。 If the data 210-1 has not been updated from the backup data 141-1 for a certain block, the hash value of the block of the backup data 141-1 becomes equal to the hash value of the block of the data 210-1. .. On the other hand, when data 210-1 is updated from backup data 141-1 for a certain block, the hash value of the block of backup data 141-1 and the hash value of the block of data 210-1 are different. Become a value.

クライアント装置２０−１は、実行部１３による制御を受けて、バックアップデータ１４１−１に関するハッシュ値とデータ２１０−１に関するハッシュ値とが異なるブロック（即ち、データ２１０−１において、バックアップデータ１４１−１が生成あるいは更新された時点から更新が発生しているブロック）を特定する。クライアント装置２０−１は、データ２１０−１における特定したブロックと、当該ブロックを識別可能な識別子とを、バックアップ処理装置１０へ送信する。 Under the control of the execution unit 13, the client device 20-1 has a block in which the hash value for the backup data 141-1 and the hash value for the data 210-1 are different (that is, in the data 210-1, the backup data 141-1). (Blocks that have been updated since they were created or updated). The client device 20-1 transmits the block specified in the data 210-1 and an identifier that can identify the block to the backup processing device 10.

実行部１３は、クライアント装置２０−１から受信したデータ２１０−１におけるブロック及びその識別子を、差分データ１４２−１として生成する。実行部１３は、生成した差分データ１４２−１をバックアップデータ１４１−１に反映することによって、バックアップデータ１４１−１を、現在のデータ２１０−１の内容に更新し、バックアップ処理を完了する。 The execution unit 13 generates the block in the data 210-1 received from the client device 20-1 and its identifier as the difference data 142-1. The execution unit 13 updates the backup data 141-1 to the contents of the current data 210-1 by reflecting the generated difference data 142-1 in the backup data 141-1, and completes the backup process.

次に、バックアップ処理装置１０によってデータ２１０−１のバックアップ処理がまだ１回も行われていないこと等により、過去のある時点におけるバックアップデータ１４１−１が記憶部１４に存在しない場合における、バックアップ処理装置１０のバックアップ動作について説明する。 Next, the backup process when the backup data 141-1 at a certain point in the past does not exist in the storage unit 14 because the backup process of the data 210-1 has not been performed even once by the backup processing device 10. The backup operation of the device 10 will be described.

この場合、特定部１２は、判定部１１によるバックアップデータ１４１−１が記憶部１４に存在しないという判定結果を受けて、記憶部１４に存在する、データ２１０−１との仕様の差分が基準を満たす、クライアント装置２０−１（第一の他装置）以外のクライアント装置２０−ｉ（ｉは２以上のいずれかの整数）（第二の他装置）のディスク２１−ｉに記憶されたデータ２１０−ｉのバックアップデータ１４１−ｉを特定する。 In this case, the specific unit 12 receives the determination result that the backup data 141-1 by the determination unit 11 does not exist in the storage unit 14, and the difference in specifications from the data 210-1 existing in the storage unit 14 is used as a reference. Data 210 stored on disk 21-i of client device 20-i (i is any integer of 2 or more) (second other device) other than client device 20-1 (first other device) to be satisfied. Identify the backup data 141-i of −i.

特定部１２は、上述した基準として、例えば、クライアント装置２０においてデータ２１０を管理するファイルシステムの種別、ディスク２１の記憶領域の大きさ、及びディスク２１のパーティション構成について、データ２１０−１と同様であることを用いることができる。但し、特定部１２は、各クライアント装置２０におけるファイルシステムの種別、ディスク２１の記憶領域の大きさ、及びディスク２１のパーティション構成を表す情報を、例えば各クライアント装置２０などから取得可能であることとする。 The identification unit 12 is the same as the data 210-1 in terms of, for example, the type of file system that manages the data 210 in the client device 20, the size of the storage area of the disk 21, and the partition configuration of the disk 21 as the above-mentioned criteria. Something can be used. However, the specific unit 12 can acquire information representing the type of file system in each client device 20, the size of the storage area of the disk 21, and the partition configuration of the disk 21 from, for example, each client device 20. To do.

ここでは、特定部１２によって特定されたバックアップデータ１４１−ｉがバックアップデータ１４１−２である（即ち、ｉ＝２である）場合を例に、バックアップ処理装置１０の動作を説明する。 Here, the operation of the backup processing device 10 will be described by taking as an example the case where the backup data 141-i specified by the specific unit 12 is the backup data 141-2 (that is, i = 2).

実行部１３は、上述した特定部１２による特定結果を受けて、バックアップデータ１４１−２を所定の大きさのブロック（第一のブロック）に分割して、そのブロックごとにハッシュ値（第一のハッシュ値）を算出する。実行部１３は、そのブロックに含まれるデータの内容に基づいて、例えば上述した所定のハッシュ関数を用いてハッシュ値を算出する。実行部１３は、算出したバックアップデータ１４１−２の各ブロックのハッシュ値をクライアント装置２０−１に送信するとともに、現在のデータ２１０−１における当該ブロックに相当するブロック（第二のブロック）ごとにハッシュ値（第二のハッシュ値）を算出するように、クライアント装置２０−１を制御する。 The execution unit 13 receives the specific result of the specific unit 12 described above, divides the backup data 141-2 into blocks of a predetermined size (first block), and has a hash value (first block) for each block. Hash value) is calculated. The execution unit 13 calculates a hash value based on the contents of the data included in the block, for example, by using the predetermined hash function described above. The execution unit 13 transmits the hash value of each block of the calculated backup data 141-2 to the client device 20-1, and for each block (second block) corresponding to the block in the current data 210-1. The client device 20-1 is controlled so as to calculate the hash value (second hash value).

クライアント装置２０−１は、実行部１３による制御を受けて、バックアップデータ１４１−２に関する第一のハッシュ値とデータ２１０−１に関する第二のハッシュ値とが異なるブロックを特定する。クライアント装置２０−１は、データ２１０−１における特定したブロックと、当該ブロックを識別可能な識別子とを、バックアップ処理装置１０へ送信する。 Under the control of the execution unit 13, the client device 20-1 identifies a block in which the first hash value for the backup data 141-2 and the second hash value for the data 210-1 are different. The client device 20-1 transmits the block specified in the data 210-1 and an identifier that can identify the block to the backup processing device 10.

実行部１３は、クライアント装置２０−１から受信したデータ２１０−１におけるブロック及びその識別子を、差分データ１４２−１として生成する。実行部１３は、生成した差分データ１４２−１とバックアップデータ１４１−２とに基づいて、現在のデータ２１０−１と等しいバックアップデータ１４１−１を生成する。 The execution unit 13 generates the block in the data 210-1 received from the client device 20-1 and its identifier as the difference data 142-1. The execution unit 13 generates backup data 141-1 equal to the current data 210-1 based on the generated difference data 142-1 and backup data 141-2.

次に図２Ａ及び２Ｂのフローチャートを参照して、本実施形態に係るバックアップ処理システム１の動作（処理）について詳細に説明する。 Next, the operation (processing) of the backup processing system 1 according to the present embodiment will be described in detail with reference to the flowcharts of FIGS. 2A and 2B.

バックアップ処理装置１０は、クライアント装置２０−１のデータ２１０−１に対するバックアップ処理の実行指示を、外部の装置から受信する（ステップＳ１０１）。判定部１１は、記憶部１４に、データ２１０−１のバックアップデータ１４１−１が存在するか否かを判定する（ステップＳ１０２）。 The backup processing device 10 receives a backup processing execution instruction for the data 210-1 of the client device 20-1 from an external device (step S101). The determination unit 11 determines whether or not the backup data 141-1 of the data 210-1 exists in the storage unit 14 (step S102).

記憶部１４にバックアップデータ１４１−１が存在する場合（ステップＳ１０３でＹｅｓ）、実行部１３は、バックアップデータ１４１−１のブロックごとのハッシュ値を算出し、算出したハッシュ値をクライアント装置２０−１に送信する。そして実行部１３は、データ２１０−１のブロックごとのハッシュ値の算出、及び、ブロックごとのバックアップデータ１４１−１とデータ２１０−１とのハッシュ値の比較を行うように、クライアント装置２０−１を制御する（ステップＳ１０４）。 When the backup data 141-1 exists in the storage unit 14 (Yes in step S103), the execution unit 13 calculates a hash value for each block of the backup data 141-1 and uses the calculated hash value as the client device 20-1. Send to. Then, the execution unit 13 calculates the hash value for each block of the data 210-1, and compares the hash value between the backup data 141-1 and the data 210-1 for each block, so that the client device 20-1 is used. Is controlled (step S104).

クライアント装置２０−１は、実行部１３による制御を受けて、データ２１０−１のブロックごとのハッシュ値を算出する（ステップＳ１０５）。クライアント装置２０−１は、ブロックごとにステップＳ１０４で受信したハッシュ値とステップＳ１０５で算出したハッシュ値とを比較する（ステップＳ１０６）。クライアント装置２０−１は、その比較結果においてハッシュ値が異なるデータ２１０−１のブロック、及び当該ブロックの識別子を、バックアップ処理装置１０に送信する（ステップＳ１０７）。 The client device 20-1 calculates a hash value for each block of data 210-1 under the control of the execution unit 13 (step S105). The client device 20-1 compares the hash value received in step S104 with the hash value calculated in step S105 for each block (step S106). The client device 20-1 transmits a block of data 210-1 having a different hash value in the comparison result and an identifier of the block to the backup processing device 10 (step S107).

実行部１３は、クライアント装置２０−１から受信したデータ２１０−１のブロック及びその識別子を、差分データ１４２−１とすることによって、バックアップデータ１４１−１を現在のデータ２１０−１と等しくなるように更新し（ステップＳ１０８）、全体の処理は終了する。 The execution unit 13 sets the block of the data 210-1 received from the client device 20-1 and its identifier as the difference data 142-1 so that the backup data 141-1 becomes equal to the current data 210-1. (Step S108), and the entire process ends.

記憶部１４にバックアップデータ１４１−１が存在しない場合（ステップＳ１０３でＮｏ）、特定部１２は、ファイルシステムの種別、ディスクの記憶領域の大きさ、パーティション構成について、データ２１０−１と同様であるという基準を満たすデータ２１０のバックアップデータ１４１が記憶部１４に存在するか否か判定する（ステップＳ１０９）。 When the backup data 141-1 does not exist in the storage unit 14 (No in step S103), the specific unit 12 is the same as the data 210-1 in terms of the type of file system, the size of the storage area of the disk, and the partition configuration. It is determined whether or not the backup data 141 of the data 210 satisfying the above-mentioned criterion exists in the storage unit 14 (step S109).

当該基準を満たすバックアップデータ１４１が存在しない場合（ステップＳ１１０でＮｏ）、実行部１３は、データ２１０−１に対して、差分バックアップ処理ではなくフルバックアップ処理を行うことにより、バックアップデータ１４１−１を生成し（ステップＳ１１１）、全体の処理は終了する。 When there is no backup data 141 that satisfies the criteria (No in step S110), the execution unit 13 performs the backup data 141-1 on the data 210-1 by performing a full backup process instead of the differential backup process. It is generated (step S111), and the entire process ends.

当該基準を満たすバックアップデータ１４１が存在する（但しここでは、バックアップデータ１４１−２が当該基準を満たすこととする）場合（ステップＳ１１０でＹｅｓ）、実行部１３は、基準を満たすバックアップデータ１４１−２のブロックごとのハッシュ値を算出し、算出したハッシュ値をクライアント装置２０−１に送信する。そして実行部１３は、データ２１０−１のブロックごとのハッシュ値の算出、及び、ブロックごとのバックアップデータ１４１−２とデータ２１０−１とのハッシュ値の比較を行うように、クライアント装置２０−１を制御する（ステップＳ１１２）。 When there is backup data 141 that meets the criteria (however, here, it is assumed that the backup data 141-2 meets the criteria) (Yes in step S110), the execution unit 13 determines the backup data 141-2 that meets the criteria. The hash value for each block is calculated, and the calculated hash value is transmitted to the client device 20-1. Then, the execution unit 13 calculates the hash value of the data 210-1 for each block and compares the hash value of the backup data 141-2 and the data 210-1 for each block with the client device 20-1. Is controlled (step S112).

クライアント装置２０−１は、実行部１３による制御を受けて、データ２１０−１のブロックごとのハッシュ値を算出する（ステップＳ１１３）。クライアント装置２０−１は、ブロックごとにステップＳ１１２で受信したハッシュ値とステップＳ１１３で算出したハッシュ値とを比較する（ステップＳ１１４）。クライアント装置２０−１は、その比較結果においてハッシュ値が異なるデータ２１０−１のブロック、及び当該ブロックの識別子を、バックアップ処理装置１０に送信する（ステップＳ１１５）。 The client device 20-1 calculates a hash value for each block of data 210-1 under the control of the execution unit 13 (step S113). The client device 20-1 compares the hash value received in step S112 with the hash value calculated in step S113 for each block (step S114). The client device 20-1 transmits a block of data 210-1 having a different hash value in the comparison result and an identifier of the block to the backup processing device 10 (step S115).

実行部１３は、クライアント装置２０−１から受信したデータ２１０−１のブロック及びその識別子を、差分データ１４２−１として、差分データ１４２−１とバックアップデータ１４１−２とを用いてバックアップデータ１４１−１を生成し（ステップＳ１１６）、全体の処理は終了する。 The execution unit 13 uses the block of data 210-1 received from the client device 20-1 and its identifier as the difference data 142-1 and backs up data 141- using the difference data 142-1 and the backup data 141-2. 1 is generated (step S116), and the entire process is completed.

本実施形態に係るバックアップ処理装置１０は、データの差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を、効率的に行うことができる。その理由は、バックアップ処理装置１０は、データ２１０−１のバックアップデータ１４１−１が記憶部１４に存在しない場合、記憶部１４に存在する、データ２１０−１との仕様の差分が基準を満たすデータ２１０−ｉのバックアップデータ１４１−ｉを特定し、特定したバックアップデータ１４１−ｉを利用した差分バックアップを行うからである。 The backup processing device 10 according to the present embodiment can efficiently perform the backup processing when the backup data of a certain data does not yet exist in the system for performing the differential backup of the data. The reason is that when the backup data 141-1 of the data 210-1 does not exist in the storage unit 14, the backup processing device 10 has the data in the storage unit 14 whose specification difference from the data 210-1 satisfies the standard. This is because the backup data 141-i of 210-i is specified, and differential backup is performed using the specified backup data 141-i.

以下に、本実施形態に係るバックアップ処理装置１０によって実現される効果について、詳細に説明する。 The effects realized by the backup processing device 10 according to the present embodiment will be described in detail below.

差分バックアップを行うシステムにおいて、例えばシステム内におけるあるディスクのデータのバックアップデータがまだ存在していない場合におけるバックアップ処理は、通常、差分バックアップではなくフルバックアップにより行われる。しかしながら、膨大なデータを記憶する昨今の情報処理システムにおいて、あるディスクのデータのバックアップデータがまだ存在していない場合のバックアップ処理をフルバックアップによって行うことによって生じるサービスの停止時間等が及ぼす影響は、必ずしも小さいとはいえない。すなわち、差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を効率的に行うことが課題である。 In a system that performs differential backup, for example, when backup data of data on a certain disk in the system does not yet exist, the backup process is usually performed by full backup instead of differential backup. However, in a recent information processing system that stores a huge amount of data, the influence of the service downtime caused by performing the backup process by full backup when the backup data of the data of a certain disk does not exist yet is Not necessarily small. That is, in a system that performs differential backup, it is an issue to efficiently perform backup processing when backup data of certain data does not yet exist.

このような課題に対して、本実施形態に係るバックアップ処理装置１０は、判定部１１と、特定部１２と、実行部１３と、を備え、例えば図１、図２Ａ及び２Ｂを参照して上述した通り動作する。即ち、判定部１１は、クライアント装置２０−１（第一の他装置）に記憶された現在のデータ２１０−１（第一のデータ）と、バックアップ処理装置１０（自装置）に記憶された、過去に生成あるいは更新されたデータ２１０−１のバックアップデータ１４１−１との差分に基づいて、データ２１０−１に対するバックアップ処理を行う場合に、データ２１０−１のバックアップデータ１４１−１がバックアップ処理装置１０に存在するか否かを判定する。特定部１２は、データ２１０−１のバックアップデータ１４１−１がバックアップ処理装置１０に存在しない場合、バックアップ処理装置１０に存在する、データ２１０−１との仕様の差分が基準を満たす、クライアント装置２０−２（第二の他装置）に記憶されたデータ２１０−２（第二のデータ）のバックアップデータ１４１−２を特定する。そして、実行部１３は、現在のデータ２１０−１とデータ２１０−２のバックアップデータ１４１−２との差分に基づいて、データ２１０−１に対するバックアップ処理を行う。 In response to such a problem, the backup processing device 10 according to the present embodiment includes a determination unit 11, a specific unit 12, and an execution unit 13, for example, with reference to FIGS. 1, 2A and 2B described above. It works as you did. That is, the determination unit 11 stores the current data 210-1 (first data) stored in the client device 20-1 (first other device) and the backup processing device 10 (own device). When performing backup processing on data 210-1 based on the difference between the data 210-1 generated or updated in the past and the backup data 141-1, the backup data 141-1 of the data 210-1 is the backup processing device. It is determined whether or not it exists in 10. When the backup data 141-1 of the data 210-1 does not exist in the backup processing device 10, the specifying unit 12 has the client device 20 in which the difference in specifications from the data 210-1 existing in the backup processing device 10 satisfies the standard. The backup data 141-2 of the data 210-2 (second data) stored in -2 (second other device) is specified. Then, the execution unit 13 performs backup processing on the data 210-1 based on the difference between the current data 210-1 and the backup data 141-2 of the data 210-2.

即ち、本実施形態に係るバックアップ処理装置１０は、あるデータのバックアップデータがまだ存在していない場合であっても、当該データと仕様（特徴）の差分が基準を満たす別のデータのバックアップデータを利用することによって差分バックアップを実行する。バックアップ処理装置１０は、より具体的には、例えば、クライアント装置２０においてデータ２１０を管理するファイルシステムの種別、ディスク２１の記憶領域の大きさ、及びディスク２１のパーティション構成が同様であることを、当該基準として使用する。バックアップ対象のデータとは別のデータがこのような基準を満たす場合、当該バックアップ対象のデータ及び当該別のデータは、同じようにブロックに分割することができる。以上のことから、本実施形態に係るバックアップ処理装置１０は、データの差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を、効率的に行うことができる。 That is, the backup processing device 10 according to the present embodiment can perform backup data of another data in which the difference between the data and the specifications (features) satisfies the standard even when the backup data of a certain data does not yet exist. Perform differential backup by using it. More specifically, the backup processing device 10 indicates that, for example, the type of the file system that manages the data 210 in the client device 20, the size of the storage area of the disk 21, and the partition configuration of the disk 21 are the same. Used as the standard. If data other than the data to be backed up meets such criteria, the data to be backed up and the other data can be similarly divided into blocks. From the above, the backup processing device 10 according to the present embodiment can efficiently perform the backup process when the backup data of a certain data does not yet exist in the system for performing the differential backup of the data.

また、本実施形態に係る実行部１３は、クライアント装置２０−１においてデータ２１０−１（第一のデータ）を管理するファイルシステムを解析することによって、データ２１０−１に関してバックアップ処理を行う必要がある領域（例えばデータとして有効な領域）を特定し、特定した領域に関してバックアップ処理を行うようにしてもよい。これにより、本実施形態に係るバックアップ処理装置１０は、バックアップ処理をより効率的に行うことができる。 Further, the execution unit 13 according to the present embodiment needs to perform backup processing on the data 210-1 by analyzing the file system that manages the data 210-1 (first data) in the client device 20-1. A certain area (for example, an area effective as data) may be specified, and backup processing may be performed on the specified area. As a result, the backup processing device 10 according to the present embodiment can perform the backup processing more efficiently.

また、本実施形態に係るクライアント装置２０は、仮想マシンであってもよい。この場合において、特定部１２は、クライアント装置２０−１（第一の仮想マシン）に記憶されたデータ２１０−１（第一のデータ）のバックアップデータ１４１−１が記憶部１４に存在しない場合、上述した仕様の差分が基準を満たす、クライアント装置２０−２（第二の仮想マシン）に記憶されたデータ２１０−２（第二のデータ）のバックアップデータ１４１−２を特定する。そしてこの場合、特定部１２は、クライアント装置２０−１及び２０−２が、同一の仮想マシンテンプレートに基づいて生成されていることをもって、バックアップデータ１４１−２を特定してもよい。 Further, the client device 20 according to the present embodiment may be a virtual machine. In this case, the specific unit 12 determines that the backup data 141-1 of the data 210-1 (first data) stored in the client device 20-1 (first virtual machine) does not exist in the storage unit 14. The backup data 141-2 of the data 210-2 (second data) stored in the client device 20-2 (second virtual machine), to which the difference of the above specifications satisfies the standard, is specified. Then, in this case, the identification unit 12 may specify the backup data 141-2 by the fact that the client devices 20-1 and 20-2 are generated based on the same virtual machine template.

昨今の仮想環境を構築したコンピュータシステムでは、同一の仮想マシンテンプレートに基づいて多数の仮想マシンを構築する場合がある。そして、同一の仮想マシンテンプレートに基づいて生成された仮想マシン同士では、通常、データ２１０を管理するファイルシステムの種別、ディスク２１の記憶領域の大きさ、及びディスク２１のパーティション構成は同様となる。したがってこの場合、特定部１２は、仕様の差分が基準を満たすバックアップデータ１４１を特定することを、より効率的に行うことができる。 In a computer system that has built a virtual environment these days, many virtual machines may be built based on the same virtual machine template. Then, in the virtual machines generated based on the same virtual machine template, the type of the file system that manages the data 210, the size of the storage area of the disk 21, and the partition configuration of the disk 21 are usually the same. Therefore, in this case, the specifying unit 12 can more efficiently specify the backup data 141 whose specification difference satisfies the reference.

また、本実施形態に係る特定部１２は、クライアント装置２０−１及び２０−２（第一及び第二の物理マシン）のディスク２１−１及び２１−２に構築されたＯＳが同一のマスターイメージからクローニングされることによって構築されていることをもって、バックアップデータ１４１−２を特定してもよい。 Further, the specific unit 12 according to the present embodiment is a master image having the same OS built on the disks 21-1 and 21-2 of the client devices 20-1 and 20-2 (first and second physical machines). Backup data 141-2 may be identified by being constructed by being cloned from.

昨今のコンピュータシステムでは、同一のマスターイメージからＯＳのクローニングを行うことによって、多数の物理マシンのＯＳ環境を自動構築する場合がある。そして、同一のマスターイメージからＯＳのクローニングが行われたクライアント装置２０同士では、通常、データ２１０を管理するファイルシステムの種別、ディスク２１の記憶領域の大きさ、及びディスク２１のパーティション構成は同様となる。したがってこの場合、特定部１２は、仕様の差分が基準を満たすバックアップデータ１４１を特定することを、より効率的に行うことができる。 In recent computer systems, the OS environment of many physical machines may be automatically constructed by cloning the OS from the same master image. Then, in the client devices 20 in which the OS is cloned from the same master image, the type of the file system that manages the data 210, the size of the storage area of the disk 21, and the partition configuration of the disk 21 are usually the same. Become. Therefore, in this case, the specifying unit 12 can more efficiently specify the backup data 141 whose specification difference satisfies the reference.

＜第２の実施形態＞
図３は、本願発明の第２の実施形態に係るバックアップ処理装置４０の構成を示すブロック図である。 <Second embodiment>
FIG. 3 is a block diagram showing the configuration of the backup processing device 40 according to the second embodiment of the present invention.

本実施形態に係るバックアップ処理装置４０は、判定部４１、特定部４２、及び、実行部４３を備えている。 The backup processing device 40 according to the present embodiment includes a determination unit 41, a specific unit 42, and an execution unit 43.

判定部４１は、第一の他装置５０−１に記憶された現在の第一のデータ５１−１と、バックアップ処理装置４０（自装置）に記憶された、過去に生成あるいは更新された第一のデータ５１−１のバックアップデータ４４−１との差分に基づいて、第一のデータ５１−１に対するバックアップ処理を行う場合に、第一のデータ５１−１のバックアップデータ４４−１がバックアップ処理装置４０に存在するか否かを判定する。 The determination unit 41 has the current first data 51-1 stored in the first other device 50-1 and the first data generated or updated in the past stored in the backup processing device 40 (own device). When the backup process for the first data 51-1 is performed based on the difference from the backup data 44-1 of the data 51-1 of the above, the backup data 44-1 of the first data 51-1 is the backup processing device. It is determined whether or not it exists in 40.

尚、第一の他装置５０−１は、例えば、第一の実施形態に係るクライアント装置２０と同等な装置であってもよい。また、第一のデータ５１−１は、例えば、第一の実施形態に係るデータ２１０と同様に、第一の他装置５０−１が備えるディスクに記憶されたデータであってもよい。また、バックアップデータ４４−１は、例えば、第一の実施形態に係るバックアップデータ１４１と同様に、バックアップ処理装置４０が備える記憶部に記憶されたデータであってもよい。 The first other device 50-1 may be, for example, a device equivalent to the client device 20 according to the first embodiment. Further, the first data 51-1 may be, for example, data stored in a disk included in the first other device 50-1, similarly to the data 210 according to the first embodiment. Further, the backup data 44-1 may be data stored in a storage unit included in the backup processing device 40, as in the case of the backup data 141 according to the first embodiment, for example.

特定部４２は、第一のデータ５１−１のバックアップデータ４４−１がバックアップ処理装置４０に存在しない場合、バックアップ処理装置４０に存在する、第一のデータ５１−１との仕様の差分が基準を満たす、第二の他装置５０−２に記憶された第二のデータ５１−２のバックアップデータ４４−２を特定する。 When the backup data 44-1 of the first data 51-1 does not exist in the backup processing device 40, the specific unit 42 uses the difference in specifications from the first data 51-1 existing in the backup processing device 40 as a reference. The backup data 44-2 of the second data 51-2 stored in the second other device 50-2 that satisfies the above is specified.

尚、第二の他装置５０−２は、例えば、第一の実施形態に係るクライアント装置２０と同等な装置であってもよい。また、第二のデータ５１−２は、例えば、第一の実施形態に係るデータ２１０と同様に、第二の他装置５０−２が備えるディスクに記憶されたデータであってもよい。また、バックアップデータ４４−１は、例えば、第一の実施形態に係るバックアップデータ１４１と同様に、バックアップ処理装置４０が備える記憶部に記憶されたデータであってもよい。また、特定部４２は、バックアップデータ４４−２を特定する際に、第一の実施形態に係る特定部１２が用いる基準と同様な基準を用いてもよい。 The second other device 50-2 may be, for example, a device equivalent to the client device 20 according to the first embodiment. Further, the second data 51-2 may be, for example, data stored in a disk included in the second other device 50-2, similarly to the data 210 according to the first embodiment. Further, the backup data 44-1 may be data stored in a storage unit included in the backup processing device 40, as in the case of the backup data 141 according to the first embodiment, for example. Further, when specifying the backup data 44-2, the specific unit 42 may use the same standard as the standard used by the specific unit 12 according to the first embodiment.

実行部４３は、現在の第一のデータ５１−１と第二のデータ５１−２のバックアップデータ４４−２との差分に基づいて、第一のデータ５１−１に対するバックアップ処理を行う。 The execution unit 43 performs backup processing on the first data 51-1 based on the difference between the current first data 51-1 and the backup data 44-2 of the second data 51-2.

本実施形態に係るバックアップ処理装置４０は、データの差分バックアップを行うシステムにおいて、あるデータのバックアップデータがまだ存在していない場合のバックアップ処理を、効率的に行うことができる。その理由は、バックアップ処理装置４０は、第一のデータ５１−１のバックアップデータ４４−１がバックアップ処理装置４０に存在しない場合、バックアップ処理装置４０に存在する、第一のデータ５１−１との仕様の差分が基準を満たす第二のデータ５１−２のバックアップデータ４４−２を特定し、特定したバックアップデータ４４−２を利用した差分バックアップを行うからである。 The backup processing device 40 according to the present embodiment can efficiently perform the backup processing when the backup data of a certain data does not yet exist in the system for performing the differential backup of the data. The reason is that the backup processing device 40 has the backup data 44-1 of the first data 51-1 with the first data 51-1 existing in the backup processing device 40 when the backup data 44-1 does not exist in the backup processing device 40. This is because the backup data 44-2 of the second data 51-2 whose specification difference satisfies the standard is specified, and the differential backup using the specified backup data 44-2 is performed.

＜ハードウェア構成例＞
上述した各実施形態において図１、及び、図３に示したバックアップ処理装置における各部は、専用のＨＷ（ＨａｒｄＷａｒｅ）（電子回路）によって実現することができる。また、図１、及び、図３において、少なくとも、下記構成は、ソフトウェアプログラムの機能（処理）単位（ソフトウェアモジュール）と捉えることができる。
・判定部１１及び４１、
・特定部１２及び４２、
・実行部１３及び４３、
・記憶部１４における記憶制御機能。 <Hardware configuration example>
In each of the above-described embodiments, each part of the backup processing apparatus shown in FIGS. 1 and 3 can be realized by a dedicated HW (HardWare) (electronic circuit). Further, in FIGS. 1 and 3, at least the following configuration can be regarded as a function (processing) unit (software module) of the software program.
・ Judgment units 11 and 41,
・ Specific parts 12 and 42,
・ Execution units 13 and 43,
-Memory control function in the storage unit 14.

但し、これらの図面に示した各部の区分けは、説明の便宜上の構成であり、実装に際しては、様々な構成が想定され得る。この場合のハードウェア環境の一例を、図４を参照して説明する。 However, the division of each part shown in these drawings is a configuration for convenience of explanation, and various configurations can be assumed at the time of mounting. An example of the hardware environment in this case will be described with reference to FIG.

図４は、本願発明の各実施形態に係るバックアップ処理装置を実行可能な情報処理装置９００（コンピュータ）の構成を例示的に説明する図である。即ち、図４は、図１、及び、図３に示したバックアップ処理装置を実現可能なコンピュータ（情報処理装置）の構成であって、上述した実施形態における各機能を実現可能なハードウェア環境を表す。 FIG. 4 is a diagram illustrating an example of a configuration of an information processing device 900 (computer) capable of executing a backup processing device according to each embodiment of the present invention. That is, FIG. 4 shows a configuration of a computer (information processing device) capable of realizing the backup processing device shown in FIGS. 1 and 3, and provides a hardware environment capable of realizing each function in the above-described embodiment. Represent.

図４に示した情報処理装置９００は、構成要素として下記を備えている。
・ＣＰＵ（Ｃｅｎｔｒａｌ＿Ｐｒｏｃｅｓｓｉｎｇ＿Ｕｎｉｔ）９０１、
・ＲＯＭ（Ｒｅａｄ＿Ｏｎｌｙ＿Ｍｅｍｏｒｙ）９０２、
・ＲＡＭ（Ｒａｎｄｏｍ＿Ａｃｃｅｓｓ＿Ｍｅｍｏｒｙ）９０３、
・ハードディスク（記憶装置）９０４、
・通信インタフェース９０５、
・バス９０６（通信線）、
・ＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ＿Ｄｉｓｃ＿Ｒｅａｄ＿Ｏｎｌｙ＿Ｍｅｍｏｒｙ）等の記録媒体９０７に格納されたデータを読み書き可能なリーダライタ９０８、
・モニターやスピーカ、キーボード等の入出力インタフェース９０９。 The information processing apparatus 900 shown in FIG. 4 includes the following as components.
-CPU (Central_Processing_Unit) 901,
-ROM (Read_Only_Memory) 902,
・ RAM (Random_Access_Memory) 903,
-Hard disk (storage device) 904,
-Communication interface 905,
・ Bus 906 (communication line),
A reader / writer 908 that can read and write data stored in a recording medium 907 such as a CD-ROM (Compact_Disc_Read_Only_Memory),
-Input / output interface 909 for monitors, speakers, keyboards, etc.

即ち、上記構成要素を備える情報処理装置９００は、これらの構成がバス９０６を介して接続された一般的なコンピュータである。情報処理装置９００は、ＣＰＵ９０１を複数備える場合もあれば、マルチコアにより構成されたＣＰＵ９０１を備える場合もある。 That is, the information processing device 900 including the above components is a general computer in which these components are connected via the bus 906. The information processing device 900 may include a plurality of CPUs 901, or may include a CPU 901 configured by a multi-core processor.

そして、上述した実施形態を例に説明した本願発明は、図４に示した情報処理装置９００に対して、次の機能を実現可能なコンピュータプログラムを供給する。その機能とは、その実施形態の説明において参照したブロック構成図（図１、及び、図３）における上述した構成、或いはフローチャート（図２Ａ及び２Ｂ）の機能である。本願発明は、その後、そのコンピュータプログラムを、当該ハードウェアのＣＰＵ９０１に読み出して解釈し実行することによって達成される。また、当該装置内に供給されたコンピュータプログラムは、読み書き可能な揮発性のメモリ（ＲＡＭ９０３）、または、ＲＯＭ９０２やハードディスク９０４等の不揮発性の記憶デバイスに格納すれば良い。 Then, the present invention described by taking the above-described embodiment as an example supplies the computer program capable of realizing the following functions to the information processing apparatus 900 shown in FIG. The function is the above-described configuration in the block configuration diagrams (FIGS. 1 and 3) referred to in the description of the embodiment, or the function of the flowchart (FIGS. 2A and 2B). The present invention is then achieved by reading, interpreting, and executing the computer program in the CPU 901 of the hardware. Further, the computer program supplied in the device may be stored in a readable / writable volatile memory (RAM 903) or a non-volatile storage device such as a ROM 902 or a hard disk 904.

また、前記の場合において、当該ハードウェア内へのコンピュータプログラムの供給方法は、現在では一般的な手順を採用することができる。その手順としては、例えば、ＣＤ−ＲＯＭ等の各種記録媒体９０７を介して当該装置内にインストールする方法や、インターネット等の通信回線を介して外部よりダウンロードする方法等がある。そして、このような場合において、本願発明は、係るコンピュータプログラムを構成するコード或いは、そのコードが格納された記録媒体９０７によって構成されると捉えることができる。 Further, in the above case, as a method of supplying the computer program into the hardware, a general procedure can be adopted at present. As the procedure, for example, there are a method of installing in the device via various recording media 907 such as a CD-ROM, a method of downloading from the outside via a communication line such as the Internet, and the like. Then, in such a case, the present invention can be regarded as being composed of a code constituting the computer program or a recording medium 907 in which the code is stored.

以上、上述した実施形態を模範的な例として本願発明を説明した。しかしながら、本願発明は、上述した実施形態には限定されない。即ち、本願発明は、本願発明のスコープ内において、当業者が理解し得る様々な態様を適用することができる。 The invention of the present application has been described above using the above-described embodiment as a model example. However, the present invention is not limited to the above-described embodiments. That is, the present invention can apply various aspects that can be understood by those skilled in the art within the scope of the present invention.

１バックアップ処理システム
１０バックアップ処理装置
１１判定部
１２特定部
１３実行部
１４記憶部
１４１−１乃至１４１−３バックアップデータ
１４２−１乃至１４２−３差分データ
２０−１乃至２０−３クライアント装置
２１−１乃至２１−３ディスク
２１０−１乃至２１０−３データ
３０通信ネットワーク
４０バックアップ処理装置
４１判定部
４２特定部
４３実行部
４４−１バックアップデータ
４４−２バックアップデータ
５０−１第一の他装置
５０−２第二の他装置
５１−１第一のデータ
５１−２第二のデータ
９００情報処理装置
９０１ＣＰＵ
９０２ＲＯＭ
９０３ＲＡＭ
９０４ハードディスク（記憶装置）
９０５通信インタフェース
９０６バス
９０７記録媒体
９０８リーダライタ
９０９入出力インタフェース 1 Backup processing system 10 Backup processing device 11 Judgment unit 12 Specific unit 13 Execution unit 14 Storage unit 141-1 to 141-3 Backup data 142-1 to 142-3 Difference data 20-1 to 20-3 Client device 21-1 ~ 21-3 Disk 210-1 ~ 210-3 Data 30 Communication network 40 Backup processing device 41 Judgment unit 42 Specific unit 43 Execution unit 44-1 Backup data 44-2 Backup data 50-1 First other device 50-2 Second other device 51-1 First data 51-2 Second data 900 Information processing device 901 CPU
902 ROM
903 RAM
904 hard disk (storage device)
905 Communication interface 906 Bus 907 Recording medium 908 Reader / writer 909 Input / output interface

Claims

第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが自装置に存在するか否かを判定する判定手段と、
前記第一のデータのバックアップデータが自装置に存在しない場合、自装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定する特定手段と、
現在の前記第一のデータと前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う実行手段と、
を備えるバックアップ処理装置。 Based on the difference between the current first data stored in the first other device and the backup data of the first data stored in the own device and generated or updated in the past, the first When performing backup processing on data, a determination means for determining whether or not the backup data of the first data exists in the own device, and
When the backup data of the first data does not exist in the own device, the second data stored in the second other device, which exists in the own device and whose specification difference from the first data satisfies the standard. Specific means to identify the backup data of
An execution means for performing backup processing on the first data based on the difference between the current backup data of the first data and the backup data of the second data.
A backup processing device equipped with.

前記特定手段は、前記第一及び第二のデータを管理するファイルシステムの種別、前記第一及び第二のデータが格納された記憶領域の大きさ、及び、前記記憶領域に関するパーティション構成に関する前記仕様の差分に基づいて、前記第二のデータのバックアップデータを特定する、
請求項１に記載のバックアップ処理装置。 The specific means is the specification regarding the type of the file system that manages the first and second data, the size of the storage area in which the first and second data are stored, and the partition configuration related to the storage area. Identify the backup data of the second data based on the difference of
The backup processing device according to claim 1.

前記実行手段は、前記第一のデータを管理するファイルシステムを解析することによって、前記第一のデータに関して前記バックアップ処理を行う必要がある領域を特定し、特定した前記領域に関して前記バックアップ処理を行う、
請求項１または請求項２に記載のバックアップ処理装置。 The execution means identifies an area in which the backup process needs to be performed on the first data by analyzing a file system that manages the first data, and performs the backup process on the specified area. ,
The backup processing device according to claim 1 or 2.

前記特定手段は、第一の仮想マシンである前記第一の他装置に記憶された前記第一のデータのバックアップデータが自装置に存在しない場合、第二の仮想マシンである前記第二の他装置に記憶された前記第二のデータのバックアップデータを特定し、
前記第一及び第二の仮想マシンは、同一の仮想マシンテンプレートに基づいて生成されている、
請求項１乃至請求項３のいずれか一項に記載のバックアップ処理装置。 The specific means is the second virtual machine, the second other, when the backup data of the first data stored in the first other device, which is the first virtual machine, does not exist in the own device. Identify the backup data of the second data stored in the device and
The first and second virtual machines are generated based on the same virtual machine template.
The backup processing device according to any one of claims 1 to 3.

前記特定手段は、第一の物理マシンである前記第一の他装置の記憶デバイスに記憶された前記第一のデータのバックアップデータが自装置に存在しない場合、第二の物理マシンである前記第二の他装置の記憶デバイスに記憶された前記第二のデータのバックアップデータを特定し、
前記第一の他装置の記憶デバイスに構築されたＯＳ（Operating System）、及び、前記第二の他装置の記憶デバイスに構築されたＯＳは、同一のマスターイメージからクローニングされることによって構築されている、
請求項１乃至請求項３のいずれか一項に記載のバックアップ処理装置。 The specific means is the second physical machine when the backup data of the first data stored in the storage device of the first other device, which is the first physical machine, does not exist in the own device. Identify the backup data of the second data stored in the storage device of the second other device,
The OS (Operating System) built in the storage device of the first other device and the OS built in the storage device of the second other device are built by cloning from the same master image. Yes,
The backup processing device according to any one of claims 1 to 3.

前記実行手段は、前記第二のデータのバックアップデータに含まれる第一のブロックごとに第一のハッシュ値を算出して、算出した前記第一のハッシュ値を前記第一の他装置に送信するとともに、前記第一の他装置によって算出された、前記第一のデータにおいて前記第一のブロックに対応する第二のブロックの第二のハッシュ値が、前記第一のハッシュ値とは異なる前記第二のブロックを、前記第一の他装置から受信し、受信した前記第二のブロックを、現在の前記第一のデータと前記第二のデータのバックアップデータとの差分として使用する、
請求項１乃至請求項５のいずれか一項に記載のバックアップ処理装置。 The executing means calculates a first hash value for each first block included in the backup data of the second data, and transmits the calculated first hash value to the first other device. At the same time, the second hash value of the second block corresponding to the first block in the first data calculated by the first other device is different from the first hash value. The second block is received from the first other device, and the received second block is used as the difference between the current first data and the backup data of the second data.
The backup processing device according to any one of claims 1 to 5.

請求項１乃至請求項６のいずれか一項に記載のバックアップ処理装置と、
前記第一及び第二の他装置と、
を含むバックアップ処理システム。 The backup processing device according to any one of claims 1 to 6.
With the first and second other devices,
Backup processing system including.

情報処理装置によって、
第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが前記情報処理装置に存在するか否かを判定し、
前記第一のデータのバックアップデータが前記情報処理装置に存在しない場合、前記情報処理装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定し、
現在の前記第一のデータと、前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う、
バックアップ処理方法。 Depending on the information processing device
Based on the difference between the current first data stored in the first other device and the backup data of the first data stored in the own device and generated or updated in the past, the first When performing backup processing on data, it is determined whether or not the backup data of the first data exists in the information processing apparatus.
When the backup data of the first data does not exist in the information processing device, it is stored in the second other device in which the difference in specifications from the first data exists in the information processing device and satisfies the standard. Identify the backup data of the second data,
A backup process for the first data is performed based on the difference between the current first data and the backup data of the second data.
Backup processing method.

前記第一及び第二のデータを管理するファイルシステム、前記第一及び第二のデータが格納された記憶領域の大きさ、及び、前記記憶領域に関するパーティション構成に関する前記仕様の差分に基づいて、前記第二のデータのバックアップデータを特定する、
請求項８に記載のバックアップ処理方法。 Based on the difference in the file system that manages the first and second data, the size of the storage area in which the first and second data are stored, and the specifications regarding the partition configuration for the storage area. Identify the backup data of the second data,
The backup processing method according to claim 8.

第一の他装置に記憶された現在の第一のデータと、自装置に記憶された、過去に生成あるいは更新された前記第一のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う場合に、前記第一のデータのバックアップデータが自装置に存在するか否かを判定する判定処理と、
前記第一のデータのバックアップデータが自装置に存在しない場合、自装置に存在する、前記第一のデータとの仕様の差分が基準を満たす、第二の他装置に記憶された第二のデータのバックアップデータを特定する特定処理と、
現在の前記第一のデータと、前記第二のデータのバックアップデータとの差分に基づいて、前記第一のデータに対するバックアップ処理を行う実行処理と、
を自装置に実行させるためのバックアップ処理プログラム。 Based on the difference between the current first data stored in the first other device and the backup data of the first data stored in the own device and generated or updated in the past, the first When performing backup processing on data, determination processing for determining whether or not the backup data of the first data exists in the own device, and determination processing.
When the backup data of the first data does not exist in the own device, the second data stored in the second other device, which exists in the own device and whose specification difference from the first data satisfies the standard. Specific processing to identify the backup data of
An execution process that performs a backup process on the first data based on the difference between the current first data and the backup data of the second data, and an execution process.
A backup processing program that allows your device to execute.