JP2010271862A

JP2010271862A - System and appaastus for processing information

Info

Publication number: JP2010271862A
Application number: JP2009122315A
Authority: JP
Inventors: Shinko Riku; 振宏陸; Rei Yano; 令矢野; Kenjiro Naemura; 健二郎苗村; Ai Noshiro; 愛能代
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2009-05-20
Filing date: 2009-05-20
Publication date: 2010-12-02

Abstract

<P>PROBLEM TO BE SOLVED: To achieve the appropriate transfer processing of structured data and unstructured data. <P>SOLUTION: In a structured text creation client 10, when a structured text including the reference information of an image has been updated by an input operation to a structured text editing UI part 11 and when a hash matching determination part 15 determines that the hash value of an original image already stored in a hash cache part 14 is matched with the hash value of the image of the updated structured text, it is determined that the image of the updated structured text has not been updated, and the updated structured text is transmitted by a server communication part 12 to a structured text server 20, and the image of the updated structured text is not transmitted. The structured text server 20 updates the structured text shown by the structured text URI (Uniform Resource Identifier) of the received structured text among the structured texts stored in a document DB 40 to the received structured text. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、ＸＭＬ（登録商標）形式などの構造化データを扱う分散編集環境において、画像、音声、映像等の非構造化データを同時に取り扱い、サーバとクライアント間で、ネットワークを経由して非構造化データを含む構造化データをやり取りする情報処理システムおよび情報処理装置に関する。 The present invention simultaneously handles unstructured data such as images, sounds, and videos in a distributed editing environment that handles structured data such as XML (registered trademark) format, and unstructured between a server and a client via a network. The present invention relates to an information processing system and an information processing apparatus that exchange structured data including structured data.

従来、ＸＭＬＤＢを含むＸＭＬ分散編集環境において、ＸＭＬ形式の構造化文書と画像などの非構造化データを両方扱う要望が高まってきている。しかしＸＭＬＤＢにおいては、画像などの非構造化データを扱うため標準的な手法が確立されていない。 Conventionally, in an XML distributed editing environment including XMLDB, there is an increasing demand for handling both structured documents in XML format and unstructured data such as images. However, in XMLDB, a standard method has not been established for handling unstructured data such as images.

手法の一例として、画像などの非構造化データをBase64などでエンコードして、バイナリデータとして構造化データに取り込むことは可能だが、データサイズが増えてしまうので、画像の描画速度など性能の劣化につながることが多い。 As an example of the technique, it is possible to encode unstructured data such as images with Base64 etc. and import it into structured data as binary data, but the data size will increase, resulting in performance degradation such as image drawing speed Often connected.

また、例えば特許文献１に開示されるように、構造化文書や非構造化データにＵＲＩ（Uniform Resource Identifier）を付与し、HTTPアドレスで定義される統一ユーザインタフェースを用いてクライアント・サーバ間で構造化文書や非構造化データの通信を行なう技術が開示されている。 Further, as disclosed in, for example, Patent Document 1, a URI (Uniform Resource Identifier) is assigned to a structured document or unstructured data, and a structure is established between a client and a server using a unified user interface defined by an HTTP address. A technique for communicating structured documents and unstructured data is disclosed.

特開２００１−３５７０８９号公報JP 2001-357089 A

構造化文書や非構造化データの通信に関わる従来の技術では、構造化文書作成クライアントが、構造化文書を構造化文書サーバに送信するとともに、構造化文書に含まれる画像などの非構造化データを非構造化データサーバに送信する。 In conventional technology related to communication of structured documents and unstructured data, a structured document creation client sends a structured document to a structured document server and unstructured data such as images included in the structured document. To the unstructured data server.

しかしながら、構造化文書の更新を行なった場合は、非構造化データの更新がなされていない場合でも、構造化文書作成クライアントは、構造化文書及び非構造化データを構造化文書サーバや非構造化データサーバにそれぞれ送信することになり、ネットワークへ負荷が高くなる問題があった。 However, if the structured document is updated, the structured document creation client can send the structured document and the unstructured data to the structured document server or unstructured data even if the unstructured data is not updated. There was a problem of increasing the load on the network due to the transmission to each data server.

そこで、本発明の目的は、構造化データおよび非構造化データの適切な伝送処理を行なうことが可能になる情報処理システムおよび情報処理装置を提供することにある。 Therefore, an object of the present invention is to provide an information processing system and an information processing apparatus that can perform appropriate transmission processing of structured data and unstructured data.

すなわち、本発明に係わる情報処理システムは、クライアント装置、構造化データサーバ装置、および非構造化データサーバ装置を有する情報処理システムであって、前記クライアント装置は、構造化データおよび当該構造化データに関わる非構造化データの作成のための入力操作を受け付ける入力手段と、前記入力手段により作成された構造化データに関わる非構造化データのハッシュ値を計算する第１の計算手段と、前記入力手段により作成された構造化データおよび当該構造化データに関わる非構造化データを前記構造化データサーバ装置に送信する第１の送信手段と、前記作成された構造化データが前記入力手段への入力操作により更新された場合に、当該更新された構造化データに関わる非構造化データのハッシュ値を計算する第２の計算手段と、前記第２の計算手段により計算したハッシュ値と前記第１の計算手段により計算した、更新前の構造化データに関わる非構造化データのハッシュ値とが一致するか否かを判別する判別手段と、前記ハッシュ値が一致すると前記判別手段により判別した場合に、前記更新された構造化データを前記構造化データサーバ装置に送信し、前記更新された構造化データに関わる非構造化データは送信しない第２の送信手段とを有し、前記構造化データサーバ装置は、前記構造化データを記憶するための構造化データ記憶装置と、前記クライアント装置からの前記作成時で更新前の構造化データに関わる非構造化データを前記非構造化データサーバ装置に送信する送信手段と、前記クライアント装置からの前記作成時で更新前の構造化データに、前記非構造化データサーバ装置からの前記非構造化データの参照情報を含ませて前記構造化データ記憶装置に記憶する第１の記憶制御手段と、前記構造化データ記憶装置に記憶される前記更新前の構造化データを、前記クライアント装置からの前記更新された構造化データに書き換える第２の記憶制御手段とを有し、前記非構造化データサーバ装置は、前記非構造化データを記憶するための非構造化データ記憶装置と、前記構造化データサーバ装置からの前記作成時で更新前の構造化データに関わる非構造化データを前記非構造化データ記憶装置に記憶する記憶制御手段と、前記作成時で更新前における、前記構造化データサーバ装置からの非構造化データを前記更新前の構造化データから参照するための参照情報を前記構造化データサーバ装置に送信する送信手段とを備えたことを特徴とする。 That is, an information processing system according to the present invention is an information processing system having a client device, a structured data server device, and an unstructured data server device, wherein the client device stores structured data and the structured data. Input means for accepting an input operation for creating related unstructured data, first calculation means for calculating a hash value of unstructured data related to structured data created by the input means, and the input means A first transmission means for transmitting the structured data created by the method and unstructured data related to the structured data to the structured data server device, and an operation for inputting the created structured data to the input means. The first step is to calculate the hash value of unstructured data related to the updated structured data. Whether the hash value calculated by the second calculation unit and the hash value of the unstructured data related to the structured data before the update calculated by the first calculation unit match. When the discriminating unit discriminates that the hash value matches, the discriminating unit sends the updated structured data to the structured data server device, and the non-structure related to the updated structured data The structured data server device includes a structured data storage device for storing the structured data, and a pre-update at the time of creation from the client device. Transmitting means for transmitting unstructured data related to the structured data to the unstructured data server device, and structuring before update at the time of creation from the client device First storage control means for storing the reference data of the unstructured data from the unstructured data server device in the structured data storage device and storing the data in the structured data storage device And second storage control means for rewriting the updated structured data before the update to the updated structured data from the client device, wherein the unstructured data server device includes the unstructured data And storage control for storing unstructured data related to structured data before update at the time of creation from the structured data server device in the unstructured data storage device And reference information for referring to unstructured data from the structured data server device before the update at the time of creation from the structured data before the update. And a transmission means for transmitting to the data server device.

また、本発明に係わる情報処理装置は、構造化データおよび当該構造化データに関わる非構造化データの作成のための入力操作を受け付ける入力手段と、前記入力手段により作成された構造化データに関わる非構造化データのハッシュ値を計算する第１の計算手段と、前記入力手段により作成された構造化データを構造化データサーバ装置に記憶するための構造化データとして送信し、当該構造化データに関わる非構造化データを非構造化データサーバ装置に記憶するための非構造化データとして送信する第１の送信手段と、前記作成されて前記構造化データサーバに記憶された構造化データが前記入力手段により更新された場合に、当該構造化データに関わる非構造化データのハッシュ値を計算する第２の計算手段と、前記第２の計算手段により計算したハッシュ値と前記第１の計算手段により計算した、前記更新前の構造化データに関わる非構造化データのハッシュ値とが一致するか否かを判別する判別手段と、前記ハッシュ値が一致すると前記判別手段により判別した場合に、前記更新された構造化データを前記構造化データサーバ装置に記憶するための構造化データとして送信し、前記更新された構造化データに関わる非構造化データは送信しない第２の送信手段とを備えたことを特徴とする。 An information processing apparatus according to the present invention relates to input means for receiving an input operation for creating structured data and unstructured data related to the structured data, and to structured data created by the input means. The first calculation means for calculating the hash value of the unstructured data and the structured data created by the input means are transmitted as structured data for storing in the structured data server device, and the structured data is sent to the structured data. First transmission means for transmitting unstructured data concerned as unstructured data for storing in the unstructured data server device; and the structured data created and stored in the structured data server as the input A second calculating means for calculating a hash value of unstructured data related to the structured data when updated by the means; and the second calculating means Determining means for determining whether or not the hash value calculated by the first calculation means matches the hash value of the unstructured data related to the structured data before update, and the hash value is When it is determined by the determining means that they match, the updated structured data is transmitted as structured data for storage in the structured data server device, and unstructured data related to the updated structured data Is provided with a second transmitting means that does not transmit.

本発明によれば、構造化データおよび非構造化データの適切な伝送処理を行なうことができる。 According to the present invention, it is possible to perform appropriate transmission processing of structured data and unstructured data.

本発明を実現するための情報処理システムの構成例を示すブロック図。The block diagram which shows the structural example of the information processing system for implement | achieving this invention. 本発明を実現するための情報処理システムによる構造化文書作成処理の一例を示すフローチャート。The flowchart which shows an example of the structured document creation process by the information processing system for implement | achieving this invention. 本発明を実現するための情報処理システムによる構造化文書更新処理の一例を示すフローチャート。The flowchart which shows an example of the structured document update process by the information processing system for implement | achieving this invention. 本発明の実施形態における情報処理システムの構成例を示すブロック図。The block diagram which shows the structural example of the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の作成の手順を示す図。The figure which shows the procedure of preparation of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の伝送処理の一例を示すフローチャート。6 is a flowchart showing an example of transmission processing of a structured document with an image by the information processing system according to the embodiment of the present invention. 本発明の実施形態における情報処理システムによる構造化文書の作成画面の一例を示す図。The figure which shows an example of the creation screen of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書のソースコードの一例を示す図。The figure which shows an example of the source code of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の画像ハッシュ値の保存後のソースコードの一例を示す図。The figure which shows an example of the source code after the preservation | save of the image hash value of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の送信情報の一例を示す図。The figure which shows an example of the transmission information of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の画像参照の書き換え後のソースコードの一例を示す図。The figure which shows an example of the source code after rewriting of the image reference of the structured document by the information processing system in the embodiment of the present invention. 本発明の実施形態における情報処理システムによる構造化文書の参照の手順を示す図。The figure which shows the procedure of the reference of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の参照処理の一例を示すフローチャート。6 is a flowchart showing an example of a reference process for a structured document with an image by the information processing system according to the embodiment of the present invention. 本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第１の例を示す図。The figure which shows the 1st example of the procedure of the update of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第１の例を示すフローチャート。The flowchart which shows the 1st example of the update process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる処理対象の画像データ同士の近似度の高低を示す図。The figure which shows the level of the closeness of the image data of the process target by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第２の例を示す図。The figure which shows the 2nd example of the procedure of the update of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第２の例を示すフローチャート。The flowchart which shows the 2nd example of the update process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の上書き処理後の参照関係の一例を示す図。The figure which shows an example of the reference relationship after the overwrite process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第３の例を示す図。The figure which shows the 3rd example of the procedure of the update of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第３の例を示すフローチャート。The flowchart which shows the 3rd example of the update process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の置換処理後の参照関係の一例を示す図。The figure which shows an example of the reference relationship after the replacement process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる構造化文書の削除の手順の一例を示す図。The figure which shows an example of the procedure of deletion of the structured document by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムによる画像付き構造化文書の削除処理の第３の例を示すフローチャート。The flowchart which shows the 3rd example of the deletion process of the structured document with an image by the information processing system in embodiment of this invention. 本発明の実施形態における情報処理システムの構成の変形例を示すブロック図。The block diagram which shows the modification of a structure of the information processing system in embodiment of this invention. 本発明の実施形態の変形例における情報処理システムによる構造化文書の変更の手順の一例を示す図。The figure which shows an example of the procedure of the change of the structured document by the information processing system in the modification of embodiment of this invention. 本発明の実施形態の変形例における情報処理システムによる画像付き構造化文書の変更処理の一例を示すフローチャート。The flowchart which shows an example of the change process of the structured document with an image by the information processing system in the modification of embodiment of this invention.

本発明では、ＸＭＬＤＢを含む構造化文書の分散編集環境における画像などの非構造化データの取り扱い方法として、標準的な処理方式を提供する。
また、従来技術では、画像などの非構造化データを含む構造化文書を更新する際、非構造化データの更新がされていなくても、当該非構造化データがサーバへ送信され、更新されてしまう無駄があるが、本発明では、無駄のない非構造化データの更新方式を提供する。 The present invention provides a standard processing method as a method for handling unstructured data such as images in a distributed editing environment for structured documents including XMLDB.
Further, in the prior art, when updating a structured document including unstructured data such as an image, even if the unstructured data is not updated, the unstructured data is transmitted to the server and updated. However, the present invention provides a method for updating unstructured data without waste.

また、本発明では、複数の構造化文書から、ある非構造化データを参照している場合において、非構造化データが更新された場合に、構造化文書のプロパティなどの情報を元に、自動的に非構造化データへの参照情報を更新する手段を提供する。 Also, in the present invention, when unstructured data is updated in a case where some unstructured data is referenced from a plurality of structured documents, an automatic operation is performed based on information such as the properties of the structured document. A means for updating reference information to unstructured data is provided.

図１は、本発明を実現するための情報処理システムの構成例を示すブロック図である。
図１に示すように、本発明の実施形態における情報処理システムは、構造化文書作成クライアント１、構造化文書サーバ２、非構造化データサーバ３、構造化文書ＤＢ（データベース）４および非構造化データＤＢ５を備える。 FIG. 1 is a block diagram showing a configuration example of an information processing system for realizing the present invention.
As shown in FIG. 1, an information processing system according to an embodiment of the present invention includes a structured document creation client 1, a structured document server 2, an unstructured data server 3, a structured document DB (database) 4, and an unstructured system. A data DB 5 is provided.

構造化文書作成クライアント１、構造化文書サーバ２および非構造化データサーバ３は、それぞれの間で通信可能に接続される。また、構造化文書ＤＢ４は構造化文書サーバ２と通信可能に接続され、非構造化データＤＢ５は非構造化データサーバ３と通信可能に接続される。また、構造化文書サーバ２は構造化文書ＤＢ４を内蔵する形態であってもよく、非構造化データサーバ３は非構造化データＤＢ５を内蔵する形態であってもよい。 The structured document creation client 1, the structured document server 2, and the unstructured data server 3 are connected so as to be able to communicate with each other. The structured document DB 4 is communicably connected to the structured document server 2, and the unstructured data DB 5 is communicably connected to the unstructured data server 3. Further, the structured document server 2 may have a form in which the structured document DB 4 is built, and the unstructured data server 3 may have a form in which the unstructured data DB 5 is built.

構造化文書ＤＢ４は、例えばハードディスクドライブや不揮発性メモリなどの記憶媒体であり、構造化文書の管理のためのデータベース機能を有する。
非構造化データＤＢ５は、例えばハードディスクドライブや不揮発性メモリなどの記憶媒体であり、非構造化データの管理のためのデータベース機能を有する。 The structured document DB 4 is a storage medium such as a hard disk drive or a nonvolatile memory, and has a database function for managing structured documents.
The unstructured data DB5 is a storage medium such as a hard disk drive or a nonvolatile memory, and has a database function for managing unstructured data.

構造化文書作成クライアント１は、構造化文書編集ＵＩ（ユーザインタフェース）部１ａ、サーバ通信部１ｂ、非構造化データハッシュ計算部１ｃ、ハッシュキャッシュ部１ｄ、ハッシュ一致判別部１ｅを有する。 The structured document creation client 1 includes a structured document editing UI (user interface) unit 1a, a server communication unit 1b, an unstructured data hash calculation unit 1c, a hash cache unit 1d, and a hash match determination unit 1e.

構造化文書編集ＵＩ部１ａは、非構造化データを含む構造化データである構造化文書を編集するためのユーザインタフェース機能を有し、キーボードやマウスなどの入力装置、およびディスプレイ装置などの出力装置により実現される。 The structured document editing UI unit 1a has a user interface function for editing a structured document that is structured data including unstructured data, and includes an input device such as a keyboard and a mouse, and an output device such as a display device. It is realized by.

サーバ通信部１ｂは、構造化文書サーバ２や非構造化データサーバ３と通信するためのクライアントサイド機能を有する。
非構造化データハッシュ計算部１ｃは、非構造化データのハッシュ値を計算する。 The server communication unit 1 b has a client side function for communicating with the structured document server 2 and the unstructured data server 3.
The unstructured data hash calculation unit 1c calculates a hash value of unstructured data.

ハッシュキャッシュ部１ｄは、不揮発性メモリなどの記憶媒体であり、非構造化データハッシュ計算部１ｃによって得られた非構造化データのハッシュを保存する。
ハッシュ一致判別部１ｅは、ハッシュキャッシュ部１ｄに保存されるハッシュ値と、この保存後に当該保存されたハッシュに関わる非構造化データを参照する構造化文書が更新された際に、当該更新された構造化文書が参照する非構造化データに関して非構造化データハッシュ計算部１ｃによって新たに得られたハッシュ値とが一致するか否かを判別する。 The hash cache unit 1d is a storage medium such as a non-volatile memory, and stores a hash of unstructured data obtained by the unstructured data hash calculation unit 1c.
The hash match determination unit 1e is updated when the hash value stored in the hash cache unit 1d and the structured document that refers to the unstructured data related to the stored hash are updated after the storage. It is determined whether or not the hash value newly obtained by the unstructured data hash calculation unit 1c matches the unstructured data referred to by the structured document.

構造化文書サーバ２は、クライアント通信部２ａ、非構造化データサーバ通信部２ｂ、構造化文書ＩＤ発行部２ｃ、非構造化データＵＲＩ変換部２ｄ、非構造化データ近似度計算部２ｅ、非構造化データマルチ参照処理部２ｆおよび構造化文書ＤＢアクセス部２ｇを有する。 The structured document server 2 includes a client communication unit 2a, an unstructured data server communication unit 2b, a structured document ID issuing unit 2c, an unstructured data URI conversion unit 2d, an unstructured data approximation degree calculation unit 2e, an unstructured A structured data multi-reference processing unit 2f and a structured document DB access unit 2g.

クライアント通信部２ａは、構造化文書作成クライアント１と通信を行なうためのサーバサイド機能を有する。
非構造化データサーバ通信部２ｂは、非構造化データサーバ３と通信を行なうためのクライアントサイド機能を有する。 The client communication unit 2a has a server-side function for communicating with the structured document creation client 1.
The unstructured data server communication unit 2 b has a client side function for communicating with the unstructured data server 3.

構造化文書ＩＤ発行部２ｃは、構造化文書を識別するための構造化文書ＩＤを発行する。
非構造化データＵＲＩ変換部２ｄは、構造化文書が参照する非構造化データが更新されて非構造化データの識別情報である非構造化データＩＤが新たに発行された場合、当該構造化文書の中に書かれた非構造化データへの参照情報である非構造化データＵＲＩを必要に応じて書き換える。 The structured document ID issuing unit 2c issues a structured document ID for identifying the structured document.
When the unstructured data referred to by the structured document is updated and an unstructured data ID that is identification information of the unstructured data is newly issued, the unstructured data URI conversion unit 2d The unstructured data URI that is reference information to the unstructured data written in is rewritten as necessary.

非構造化データ近似度計算部２ｅは、構造化文書が参照する非構造化データが更新された場合に、更新前の構造化文書が参照する非構造化データと更新後の構造化文書が参照する非構造化データとの近似度を計算する。
非構造化データマルチ参照処理部２ｆは、ある非構造化データを参照する構造化文書が複数存在する場合で当該非構造化データが更新された場合に、各構造化文書の中に書かれた、当該該非構造化データＵＲＩを必要に応じて書き換える。 When the unstructured data that is referred to by the structured document is updated, the unstructured data approximation degree calculation unit 2e refers to the unstructured data that is referenced by the structured document before the update and the structured document after the update. The degree of approximation with unstructured data is calculated.
The unstructured data multi-reference processing unit 2f is written in each structured document when there are a plurality of structured documents that refer to certain unstructured data and the unstructured data is updated. The unstructured data URI is rewritten as necessary.

構造化文書ＤＢアクセス部２ｇは、構造化文書ＤＢ４にアクセスするための機能を有し、この構造化文書ＤＢ４に対する構造化文書の作成、参照、更新、削除といった機能を有する。 The structured document DB access unit 2g has a function for accessing the structured document DB 4, and has a function of creating, referencing, updating, and deleting the structured document with respect to the structured document DB 4.

非構造化データサーバ３は、リクエスト処理部３ａ、非構造化データＩＤ発行部３ｂおよび非構造化データＤＢアクセス部３ｃを有する。
リクエスト処理部３ａは、構造化文書作成クライアント１や構造化文書サーバ２と通信を行なうためのサーバサイド機能を有する。 The unstructured data server 3 includes a request processing unit 3a, an unstructured data ID issuing unit 3b, and an unstructured data DB access unit 3c.
The request processing unit 3 a has a server side function for communicating with the structured document creation client 1 and the structured document server 2.

非構造化データＩＤ発行部３ｂは、非構造化データを識別するための非構造化データＩＤを発行する。
非構造化データＤＢアクセス部３ｃは、非構造化データＤＢ５にアクセスするための機能を有し、この非構造化データＤＢ５に対する非構造化データの作成、参照、更新、削除といった機能を有する。 The unstructured data ID issuing unit 3b issues an unstructured data ID for identifying unstructured data.
The unstructured data DB access unit 3c has a function for accessing the unstructured data DB5, and has a function of creating, referring to, updating, and deleting unstructured data for the unstructured data DB5.

図２は、本発明を実現するための情報処理システムによる構造化文書作成処理の一例を示すフローチャートである。
まず、構造化文書作成クライアント１は、編集者による構造化文書編集ＵＩ部１ａへの操作により構造化文書が作成されると（ステップＡ１）、この構造化文書の中に非構造化データが含まれるか否かをチェックする（ステップＡ２）。 FIG. 2 is a flowchart showing an example of structured document creation processing by the information processing system for realizing the present invention.
First, when the structured document is created by the editor operating the structured document editing UI unit 1a (step A1), the structured document creating client 1 includes unstructured data. Is checked (step A2).

構造化文書に非構造化データが含まれる場合は、構造化文書作成クライアント１は、作成済みの構造化文書が参照する非構造化データのハッシュ値を非構造化データハッシュ計算部１ｃにより計算する（ステップＡ３）。 When unstructured data is included in the structured document, the structured document creation client 1 calculates the hash value of the unstructured data referred to by the created structured document by the unstructured data hash calculation unit 1c. (Step A3).

このように非構造化データを含む構造化文書は、前述した非構造化データを参照するための非構造化データ参照情報を含む。構造化文書の作成時では、非構造化データ参照情報は当該非構造化データのファイル名である。 As described above, the structured document including unstructured data includes unstructured data reference information for referring to the above-described unstructured data. At the time of creating the structured document, the unstructured data reference information is the file name of the unstructured data.

そして、構造化文書作成クライアント１は、この計算したハッシュ値をハッシュキャッシュ部１ｄに保存し（ステップＡ４）、サーバ通信部１ｂにより、マルチパート形式を用いて非構造化データと構造化文書の両方を構造化文書サーバ２へ送信する（ステップＡ６）。 Then, the structured document creation client 1 stores the calculated hash value in the hash cache unit 1d (step A4), and the server communication unit 1b uses the multipart format for both the unstructured data and the structured document. Is transmitted to the structured document server 2 (step A6).

一方、構造化文書に非構造化データが含まれない場合は、構造化文書作成クライアント１は、サーバ通信部１ｂにより、構造化文書をそのまま構造化文書サーバ２へ送信する（ステップＡ５）。 On the other hand, when the unstructured data is not included in the structured document, the structured document creation client 1 transmits the structured document as it is to the structured document server 2 through the server communication unit 1b (step A5).

構造化文書サーバ２は、構造化文書作成クライアント１からの構造化文書をクライアント通信部２ａにより受信して（ステップＡ１１）、この構造化文書が非構造化データを含むか否かを判別する（ステップＡ１２）。 The structured document server 2 receives the structured document from the structured document creation client 1 by the client communication unit 2a (step A11), and determines whether or not the structured document includes unstructured data (step A11). Step A12).

ステップＡ１１の処理で受信済みの構造化文書に非構造化データが含まれない場合は、構造化文書サーバ２は、この受信済みの構造化文書を識別するための構造化文書ＩＤを構造化文書ＩＤ発行部２ｃにより発行して（ステップＡ１３）、この構造化文書を当該発行した構造化文書ＩＤとともに構造化文書ＤＢアクセス部２ｇにより構造化文書ＤＢ４に保存する（ステップＡ１４）。 If the structured document received in step A11 does not contain unstructured data, the structured document server 2 sets the structured document ID for identifying the received structured document as a structured document. Issued by the ID issuing unit 2c (step A13), the structured document is stored in the structured document DB 4 by the structured document DB access unit 2g together with the issued structured document ID (step A14).

そして、クライアント通信部２ａは、前述のように発行された構造化文書ＩＤから計算可能なＵＲＩである構造化文書ＵＲＩを構造化文書作成クライアント１に返信する（ステップＡ１５）。この構造化文書ＵＲＩは、構造化文書作成クライアント１が受け取り（ステップＡ１６）、処理が終了する。 Then, the client communication unit 2a returns a structured document URI, which is a URI that can be calculated from the structured document ID issued as described above, to the structured document creation client 1 (step A15). The structured document URI is received by the structured document creation client 1 (step A16), and the process ends.

一方、ステップＡ１１の処理で受信済みの構造化文書に非構造化データが含まれる場合は、構造化文書サーバ２は、この非構造化データを非構造化データサーバ通信部２ｂにより非構造化データサーバ３に送信する（ステップＡ１７）。 On the other hand, if the structured document that has been received in step A11 contains unstructured data, the structured document server 2 sends the unstructured data to the unstructured data server communication unit 2b. It transmits to the server 3 (step A17).

非構造化データサーバ３は、構造化文書サーバ２からの非構造化データをリクエスト処理部３ａにより受信すると（ステップＡ２１）、この非構造化データを識別するための非構造化データＩＤを非構造化データＩＤ発行部３ｂにより発行する（ステップＡ２２）。 When the unstructured data server 3 receives the unstructured data from the structured document server 2 by the request processing unit 3a (step A21), the unstructured data server 3 identifies an unstructured data ID for identifying the unstructured data. Issued by the data ID issuing unit 3b (step A22).

そして、非構造化データサーバ３は、ステップＡ２１の処理で受信した非構造化データを発行済みの非構造化データＩＤとともに非構造化データＤＢアクセス部３ｃにより新規に非構造化データＤＢ５に保存し（ステップＡ２３）、発行済みの非構造化データＩＤから計算可能である非構造化データＵＲＩを構造化文書サーバ２に返信する（ステップＡ２４）。 Then, the unstructured data server 3 newly stores the unstructured data received in step A21 together with the issued unstructured data ID in the unstructured data DB 5 by the unstructured data DB access unit 3c. (Step A23), an unstructured data URI that can be calculated from the issued unstructured data ID is returned to the structured document server 2 (Step A24).

そして、構造化文書サーバ２は、非構造化データサーバ３からの非構造化データＵＲＩを非構造化データサーバ通信部２ｂにより受信し（ステップＡ３１）、この受信した非構造化データＵＲＩを用いて、ステップＡ１１の処理で構造化文書作成クライアント１により受信済みの構造化文書の中の非構造化データ参照情報を非構造化データＵＲＩ変換部２ｄにより受信済みの非構造化データＵＲＩに書き換える（ステップＡ３２）。 The structured document server 2 receives the unstructured data URI from the unstructured data server 3 by the unstructured data server communication unit 2b (step A31), and uses the received unstructured data URI. In step A11, the unstructured data reference information in the structured document received by the structured document creation client 1 is rewritten to the unstructured data URI received by the unstructured data URI conversion unit 2d (step A11). A32).

その後、構造化文書ＩＤ発行部２ｃは、ステップＡ１１の処理で構造化文書作成クライアント１により受信済みの構造化文書の構造化文書ＩＤを発行し（ステップＡ１３）、当該構造化文書を当該発行した構造化文書ＩＤとともに構造化文書ＤＢアクセス部２ｇにより構造化文書ＤＢ４に新規保存する（ステップＡ１４）。 Thereafter, the structured document ID issuing unit 2c issues a structured document ID of the structured document that has been received by the structured document creation client 1 in the process of step A11 (step A13), and issues the structured document. The structured document ID is newly stored in the structured document DB 4 by the structured document DB access unit 2g together with the structured document ID (step A14).

そして、構造化文書サーバ２は、前述のように発行された構造化文書ＩＤから計算可能なＵＲＩである構造化文書ＵＲＩとステップＡ３１の処理で非構造化データサーバ３から受信した非構造化データＵＲＩとのペアをクライアント通信部２ａにより構造化文書作成クライアント１に返信する（ステップＡ１５）。 The structured document server 2 then sends the structured document URI, which is a URI that can be calculated from the structured document ID issued as described above, and the unstructured data received from the unstructured data server 3 in step A31. A pair with the URI is returned to the structured document creation client 1 by the client communication unit 2a (step A15).

構造化文書作成クライアント１は、構造化文書サーバ２からの構造化文書ＵＲＩと非構造化データＵＲＩとをサーバ通信部１ｂにより受信し（ステップＡ１６）、これらのＵＲＩを構造化文書編集ＵＩ部１１により表示させることで、構造化文書の作成処理を終了する。 The structured document creation client 1 receives the structured document URI and the unstructured data URI from the structured document server 2 by the server communication unit 1b (step A16), and these URIs are structured document editing UI unit 11. The structured document creation process is terminated by displaying the above.

次に、前述のように作成された構造化文書を更新する際の処理の流れについて説明する。図３は、本発明を実現するための情報処理システムによる構造化文書更新処理の一例を示すフローチャートである。
まず、構造化文書作成クライアント１は、編集者による構造化文書編集ＵＩ部１１への操作による更新対象の構造化文書の構造化文書ＵＲＩの指定にともなって、当該更新対象の構造化文書を構造化文書編集ＵＩ部１１により表示させ、この構造化文書が入力操作により更新されると（ステップＢ１）、この更新された構造化文書に非構造化データが含まれるか否かをチェックする（ステップＢ２）。 Next, the flow of processing when updating the structured document created as described above will be described. FIG. 3 is a flowchart showing an example of structured document update processing by the information processing system for realizing the present invention.
First, the structured document creation client 1 constructs the structured document to be updated in accordance with the designation of the structured document URI of the structured document to be updated by an operation to the structured document editing UI unit 11 by the editor. When the structured document is displayed by the structured document editing UI unit 11 and this structured document is updated by an input operation (step B1), it is checked whether or not unstructured data is included in the updated structured document (step B1). B2).

構造化文書作成クライアント１は、更新された構造化文書に非構造化データが含まれない場合は、この構造化文書を当該文書の構造化文書ＵＲＩとともにサーバ通信部１ｂにより構造化文書サーバ２にそのまま送信する（ステップＢ３）。 When the updated structured document does not include unstructured data, the structured document creation client 1 sends the structured document to the structured document server 2 by the server communication unit 1b together with the structured document URI of the document. It transmits as it is (step B3).

一方、構造化文書作成クライアント１は、更新された構造化文書に非構造化データが含まれる場合は、この非構造化データのハッシュ値を非構造化データハッシュ計算部１ｃにより計算する。（ステップＢ４）
ハッシュ一致判別部１ｅは、既にハッシュキャッシュ部１ｄに保存された元のハッシュ値を取得し（ステップＢ５）、この取得したハッシュ値とステップＢ４の処理により計算済みのハッシュ値とを比較することで両者が一致するか否かを判別する（ステップＢ６，Ｂ７）。 On the other hand, when the structured document that has been updated contains unstructured data, the structured document creation client 1 calculates the hash value of the unstructured data by the unstructured data hash calculation unit 1c. (Step B4)
The hash match determination unit 1e acquires the original hash value already stored in the hash cache unit 1d (step B5), and compares the acquired hash value with the hash value that has been calculated by the process of step B4. It is determined whether or not both match (steps B6 and B7).

構造化文書作成クライアント１は、この判別の結果、ハッシュ値同士が一致する場合は、更新された構造化文書の非構造化データが更新されていないと判別して、構造化文書を当該文書の構造化文書ＵＲＩとともにサーバ通信部１ｂにより構造化文書サーバ２に送信する一方、前述した更新されていない非構造化データは送信対象としない（ステップＢ３）。 If the hash values match as a result of this determination, the structured document creation client 1 determines that the unstructured data of the updated structured document has not been updated, and the structured document is stored in the document. The server communication unit 1b transmits the structured document URI together with the structured document URI to the structured document server 2 while the unstructured unstructured data described above is not a transmission target (step B3).

一方、構造化文書作成クライアント１は、ステップＢ７の処理による判別の結果、両者のハッシュ値が一致しない場合は、更新された構造化文書の非構造化データが更新されたとみなし、この非構造化データを含む構造化文書を当該文書の構造化文書ＵＲＩとともにサーバ通信部１ｂにより構造化文書サーバ２に送信する（ステップＢ８，Ｂ９）。 On the other hand, if the hash value of the structured document creation client 1 does not match as a result of the determination in step B7, the structured document creation client 1 considers that the unstructured data of the updated structured document has been updated, and this unstructured data. A structured document including data is transmitted to the structured document server 2 by the server communication unit 1b together with the structured document URI of the document (steps B8 and B9).

構造化文書サーバ２は、構造化文書作成クライアント１からの構造化文書をクライアント通信部２ａにより受信すると（ステップＢ１１）、この受信した構造化文書に非構造化データが含まれるか否かをチェックする（ステップＢ１２）。 When the structured document server 2 receives the structured document from the structured document creation client 1 by the client communication unit 2a (step B11), the structured document server 2 checks whether or not the received structured document includes unstructured data. (Step B12).

構造化文書サーバ２は、構造化文書作成クライアント１からの構造化文書に非構造化データが含まれない場合は、構造化文書ＤＢ４に保存される構造化文書のうち、ステップＢ１１の処理で受信した構造化文書の構造化文書ＵＲＩから計算される構造化文書ＩＤに対応する構造化文書の更新処理のみを構造化文書ＤＢアクセス部２ｇにより行ない（ステップＢ１３）、更新完了通知をクライアント通信部２ａを介して構造化文書作成クライアント１に発行する（ステップＢ１４）。
構造化文書作成クライアント１は、構造化文書サーバ２からの更新完了通知をサーバ通信部１ｂにより受領し（ステップＢ１５）、更新処理を終了する。 When the structured document from the structured document creation client 1 does not include unstructured data, the structured document server 2 receives the structured document stored in the structured document DB 4 in the process of step B11. The structured document DB access unit 2g performs only the updating process of the structured document corresponding to the structured document ID calculated from the structured document URI of the structured document (step B13), and notifies the client communication unit 2a of the update completion. Is issued to the structured document creation client 1 (step B14).
The structured document creation client 1 receives the update completion notification from the structured document server 2 by the server communication unit 1b (step B15), and ends the update process.

一方、構造化文書サーバ２は、構造化文書作成クライアント１からの構造化文書に非構造化データが含まれる場合は、元の非構造化データを非構造化データサーバ３から取得した上で、この元の非構造化データと前述のように更新後の構造化文書に含まれる非構造化データとの近似度を計算する（ステップＢ１６）。 On the other hand, when the structured document from the structured document creation client 1 includes unstructured data, the structured document server 2 acquires the original unstructured data from the unstructured data server 3, The degree of approximation between the original unstructured data and the unstructured data included in the updated structured document as described above is calculated (step B16).

構造化文書サーバ２は、ステップＢ１６の処理の結果、両者の近似度が高い場合、更新前後の非構造化データが同じものとみなして、この非構造化データは上書き対象のデータであると判別し、ステップＢ１１の処理で構造化文書作成クライアント１から受信済みの構造化文書の中で示される非構造化データＵＲＩを取得する（ステップＢ１７→Ｂ１８）。 The structured document server 2 determines that the unstructured data before and after the update is the same and determines that the unstructured data is the data to be overwritten when the degree of approximation between both is high as a result of the process of step B16. Then, the unstructured data URI indicated in the structured document received from the structured document creation client 1 in the process of step B11 is acquired (step B17 → B18).

構造化文書サーバ２は、上書き対象である非構造化データを取得済みの非構造化データＵＲＩおよび上書きの指示とともに非構造化データサーバ通信部２ｂにより非構造化データサーバ３に送信する（ステップＢ１９）。 The structured document server 2 transmits the unstructured data to be overwritten to the unstructured data server 3 through the unstructured data server communication unit 2b together with the acquired unstructured data URI and an overwriting instruction (step B19). ).

非構造化データサーバ３は、構造化文書サーバ２からの非構造化データなどをリクエスト処理部３ａにより受信すると（ステップＢ２１）、非構造化データＤＢアクセス部３ｃは、非構造化データＤＢ５に保存される非構造化データのうち、ステップＢ２１の処理で構造化文書サーバ２から受信した非構造化データＵＲＩから計算される非構造化データＩＤに対応する非構造化データを、同じくステップＢ１１の処理で構造化文書サーバ２から受信した非構造化データに上書き更新し（ステップＢ２２）、当該非構造化データの更新完了通知をリクエスト処理部３ａにより構造化文書サーバ２に返信する（ステップＢ２３）。 When the unstructured data server 3 receives the unstructured data from the structured document server 2 by the request processing unit 3a (step B21), the unstructured data DB access unit 3c stores the unstructured data DB 3 in the unstructured data DB 5. Among the unstructured data to be processed, the unstructured data corresponding to the unstructured data ID calculated from the unstructured data URI received from the structured document server 2 in the process of step B21 is similarly processed in the process of step B11. Then, the unstructured data received from the structured document server 2 is overwritten and updated (step B22), and an update completion notification of the unstructured data is returned to the structured document server 2 by the request processing unit 3a (step B23).

構造化文書サーバ２は、非構造化データサーバ３からの更新完了通知を非構造化データサーバ通信部２ｂにより受領すると（ステップＢ２４）、構造化文書ＤＢ４に保存される構造化文書のうち、既にステップＢ１１の処理で構造化文書作成クライアント１から受信した構造化文書の構造化文書ＵＲＩから計算される構造化文書ＩＤに対応する構造化文書の更新処理を構造化文書ＤＢアクセス部２ｇにより行ない（ステップＢ１３）、更新完了通知をクライアント通信部２ａにより構造化文書作成クライアント１に発行する（ステップＢ１４）。
構造化文書作成クライアント１は、構造化文書サーバ２からの更新完了通知をサーバ通信部１ｂにより受領し（ステップＢ１５）、更新処理を終了する。 When the structured document server 2 receives the update completion notification from the unstructured data server 3 by the unstructured data server communication unit 2b (step B24), the structured document server 2 already has the structured document stored in the structured document DB 4. The structured document DB access unit 2g performs the updating process of the structured document corresponding to the structured document ID calculated from the structured document URI of the structured document received from the structured document creation client 1 in the process of step B11 ( In step B13), an update completion notification is issued to the structured document creation client 1 by the client communication unit 2a (step B14).
The structured document creation client 1 receives the update completion notification from the structured document server 2 by the server communication unit 1b (step B15), and ends the update process.

一方、構造化文書サーバ２は、ステップＢ１６の処理の結果、近似度が低い場合は、更新後の非構造化データが元の非構造化データと全く異なるとみなして、この非構造化データは上書き対象ではなく置換対象のデータであると判別する（ステップＢ１７→Ｂ２５）。 On the other hand, if the degree of approximation is low as a result of the processing in step B16, the structured document server 2 considers that the unstructured data after the update is completely different from the original unstructured data. It is determined that the data is not the overwriting target but the replacement target data (step B17 → B25).

構造化文書サーバ２は、置換対象である非構造化データを置換の指示とともに非構造化データサーバ通信部２ｂにより非構造化データサーバ３に送信する（ステップＢ２６）。 The structured document server 2 transmits the unstructured data to be replaced to the unstructured data server 3 through the unstructured data server communication unit 2b together with a replacement instruction (step B26).

非構造化データサーバ３は、構造化文書サーバ２からの置換対象の非構造化データなどをリクエスト処理部３ａにより受信すると（ステップＢ２７）、この非構造化データの非構造化データＩＤを非構造化データＩＤ発行部３ｂにより新規に発行する（ステップＢ２８）。 When the unstructured data server 3 receives the unstructured data to be replaced from the structured document server 2 by the request processing unit 3a (step B27), the unstructured data ID of the unstructured data is unstructured. A new data ID issuance unit 3b issues a new one (step B28).

非構造化データＤＢアクセス部３ｃは、この非構造化データＩＤをステップＢ２７の処理で受信済みの非構造化データと関連付けて非構造化データＤＢ５に新規保存し（ステップＢ２９）、この非構造化データＩＤから非構造化データＵＲＩを計算して、当該計算した非構造化データＵＲＩをリクエスト処理部３ａにより構造化文書サーバ２に返信する（ステップＢ３０）。 The unstructured data DB access unit 3c newly stores the unstructured data ID in the unstructured data DB 5 in association with the unstructured data received in the process of step B27 (step B29). The unstructured data URI is calculated from the data ID, and the calculated unstructured data URI is returned to the structured document server 2 by the request processing unit 3a (step B30).

構造化文書サーバ２の非構造化データサーバ通信部２ｂが、非構造化データサーバ３からの非構造化データＵＲＩを受信すると（ステップＢ３１）、非構造化データＵＲＩ変換部２ｄは、ステップＢ１１の処理で構造化文書作成クライアント１から受信した構造化文書に含まれる非構造化データＵＲＩを、ステップＢ３１の処理で非構造化データサーバ３から受信した新たな非構造化データＵＲＩに書き換える（ステップＢ３２）。 When the unstructured data server communication unit 2b of the structured document server 2 receives the unstructured data URI from the unstructured data server 3 (step B31), the unstructured data URI conversion unit 2d The unstructured data URI included in the structured document received from the structured document creation client 1 in the process is rewritten with the new unstructured data URI received from the unstructured data server 3 in the process of step B31 (step B32). ).

さらに、構造化文書サーバ２の構造化文書ＤＢアクセス部２ｇは、構造化文書ＤＢ４に保存される構造化文書のうち、受信した構造化文書の構造化文書ＵＲＩから計算される構造化文書ＩＤに対応する構造化文書を、ステップＢ３２の処理により非構造化データＵＲＩが書き換えられた構造化文書に更新する（ステップＢ３３）。 Further, the structured document DB access unit 2g of the structured document server 2 uses the structured document ID calculated from the structured document URI of the received structured document among the structured documents stored in the structured document DB 4. The corresponding structured document is updated to a structured document in which the unstructured data URI has been rewritten by the process of step B32 (step B33).

その次に、構造化文書サーバ２は、当該非構造化データへ参照する他の構造化文書があるか否かをチェックし（ステップＢ３４）、当該他の構造化文書がない場合は、構造化文書の更新完了通知をクライアント通信部２ａにより構造化文書作成クライアント１に発行する（ステップＢ１４）。
そして構造化文書作成クライアント１が構造化文書の更新完了通知をサーバ通信部１ｂにより受け取り（ステップＢ１５）、更新処理が終了する。 Next, the structured document server 2 checks whether or not there is another structured document that refers to the unstructured data (step B34). If there is no other structured document, the structured document server 2 checks the structured document. A document update completion notification is issued to the structured document creation client 1 by the client communication unit 2a (step B14).
The structured document creation client 1 receives the update completion notification of the structured document by the server communication unit 1b (step B15), and the update process ends.

一方、構造化文書サーバ２は、前述した他の構造化文書がある場合は、これらの構造化文書のプロパティを調査し、この調査したプロパティが更新済みの構造化文書と類似しているか否かなどの判別結果をもとに、調査済みの構造化文書に含まれる非構造化データＵＲＩを、ステップＢ３１の処理で新たに受信した非構造化データＵＲＩに書き換え（ステップＢ３５）、更新完了通知をクライアント通信部２ａにより構造化文書作成クライアント１に発行する（ステップＢ１４）。
そして構造化文書作成クライアント１が構造化文書の更新完了通知をサーバ通信部１ｂにより受け取り（ステップＢ１５）、更新処理が終了する。 On the other hand, when there is another structured document described above, the structured document server 2 investigates the properties of these structured documents, and whether or not the investigated properties are similar to the updated structured document. The unstructured data URI included in the investigated structured document is rewritten to the newly received unstructured data URI in the process of step B31 (step B35), and an update completion notification is sent. Issued to the structured document creation client 1 by the client communication unit 2a (step B14).
The structured document creation client 1 receives the update completion notification of the structured document by the server communication unit 1b (step B15), and the update process ends.

以下は、非構造化データの具体例が画像である場合の本発明の実施形態を説明する。図４は、本発明の実施形態における情報処理システムの構成例を示すブロック図である。
図４に示すように、本発明の実施形態における情報処理システムは、構造化文書作成クライアント１０、構造化文書サーバ２０、画像サーバ３０、文書ＤＢ４０および画像ＤＢ５０を備える。構造化文書作成クライアント１０、構造化文書サーバ２０、画像サーバ３０は、それぞれの間で通信可能に接続される。 The following describes an embodiment of the present invention where a specific example of unstructured data is an image. FIG. 4 is a block diagram showing a configuration example of the information processing system in the embodiment of the present invention.
As illustrated in FIG. 4, the information processing system according to the embodiment of the present invention includes a structured document creation client 10, a structured document server 20, an image server 30, a document DB 40, and an image DB 50. The structured document creation client 10, the structured document server 20, and the image server 30 are communicably connected to each other.

また、文書ＤＢ４０は構造化文書サーバ２０と通信可能に接続され、画像ＤＢ５０は画像サーバ３０と通信可能に接続される。また、構造化文書サーバ２０は文書ＤＢ４０を内蔵する形態であってもよく、画像サーバ３０は画像ＤＢ５０を内蔵する形態であってもよい。 The document DB 40 is communicably connected to the structured document server 20, and the image DB 50 is communicably connected to the image server 30. Further, the structured document server 20 may have a form in which the document DB 40 is built, and the image server 30 may have a form in which the image DB 50 is built.

文書ＤＢ４０は、例えばハードディスクドライブや不揮発性メモリなどの記憶媒体であり、構造化文書の管理のためのデータベース機能を有する。
画像ＤＢ５０は、例えばハードディスクドライブや不揮発性メモリなどの記憶媒体であり、画像の管理のためのデータベース機能を有する。 The document DB 40 is a storage medium such as a hard disk drive or a non-volatile memory, and has a database function for managing structured documents.
The image DB 50 is a storage medium such as a hard disk drive or a nonvolatile memory, and has a database function for image management.

構造化文書作成クライアント１０は、構造化文書編集ＵＩ部１１、サーバ通信部１２、画像ハッシュ計算部１３、ハッシュキャッシュ部１４、ハッシュ一致判別部１５を有する。
構造化文書編集ＵＩ部１１は、画像を含む構造化データである構造化文書を編集するためのユーザインタフェース機能を有し、キーボードやマウスなどの入力装置、およびディスプレイ装置などの出力装置により実現される。
サーバ通信部１２は、構造化文書サーバ２０や画像サーバ３０と通信するためのクライアントサイド機能を有する。 The structured document creation client 10 includes a structured document editing UI unit 11, a server communication unit 12, an image hash calculation unit 13, a hash cache unit 14, and a hash match determination unit 15.
The structured document editing UI unit 11 has a user interface function for editing a structured document that is structured data including an image, and is realized by an input device such as a keyboard and a mouse and an output device such as a display device. The
The server communication unit 12 has a client side function for communicating with the structured document server 20 and the image server 30.

画像ハッシュ計算部１３は、画像のハッシュ値を計算する。ハッシュキャッシュ部１４は、不揮発性メモリなどの記憶媒体であり、画像ハッシュ計算部１３によって得られたハッシュ値を記憶する。 The image hash calculation unit 13 calculates a hash value of the image. The hash cache unit 14 is a storage medium such as a nonvolatile memory, and stores the hash value obtained by the image hash calculation unit 13.

ハッシュ一致判別部１５は、ハッシュキャッシュ部１４に記憶されるハッシュ値と、この記憶後に構造化文書が更新された際に、当該構造化文書が参照する画像に関して画像ハッシュ計算部１３によって新たに得られたハッシュ値とが一致するか否かを判別する。 The hash match determination unit 15 newly obtains the hash value stored in the hash cache unit 14 and the image referred to by the structured document when the structured document is updated after the storage by the image hash calculation unit 13. It is determined whether or not the obtained hash value matches.

構造化文書サーバ２０は、クライアント通信部２１、画像サーバ通信部２２、文書ＩＤ発行部２３、画像ＵＲＩ変換部２４、画像近似度計算部２５、画像マルチ参照処理部２６および構造化文書ＤＢアクセス部２７を有する。 The structured document server 20 includes a client communication unit 21, an image server communication unit 22, a document ID issuing unit 23, an image URI conversion unit 24, an image approximation degree calculation unit 25, an image multi-reference processing unit 26, and a structured document DB access unit. 27.

クライアント通信部２１は、構造化文書作成クライアント１０と通信を行なうためのサーバサイド機能を有する。
画像サーバ通信部２２は、画像サーバ３０と通信を行なうためのクライアントサイド機能を有する。
文書ＩＤ発行部２３は、構造化文書を識別するための文書ＩＤを発行する。 The client communication unit 21 has a server-side function for communicating with the structured document creation client 10.
The image server communication unit 22 has a client side function for communicating with the image server 30.
The document ID issuing unit 23 issues a document ID for identifying the structured document.

画像ＵＲＩ変換部２４は、構造化文書が参照する画像が更新されて当該画像のＩＤが新たに発行された場合、構造化文書の中に書かれた画像参照情報である画像ＵＲＩを書き換える。
画像近似度計算部２５は、構造化文書が参照する画像が更新された場合に、更新前の構造化文書が参照する画像と更新後の構造化文書が参照する画像との近似度を計算する。 When the image referred to by the structured document is updated and an ID of the image is newly issued, the image URI conversion unit 24 rewrites the image URI that is image reference information written in the structured document.
When the image referred to by the structured document is updated, the image approximation degree calculation unit 25 calculates the degree of approximation between the image referred to by the structured document before update and the image referred to by the updated structured document. .

画像マルチ参照処理部２６は、ある画像を参照する構造化文書が複数存在する場合で当該画像が更新された場合に、各構造化文書の中に書かれた当該画像参照情報である画像ＵＲＩを書き換える。
構造化文書ＤＢアクセス部２７は、文書ＤＢ４０にアクセスするための機能を有し、この文書ＤＢ４０に対する構造化文書の作成、参照、更新、削除といった機能を有する。 When there are a plurality of structured documents that refer to an image and the image is updated, the image multi-reference processing unit 26 obtains an image URI that is the image reference information written in each structured document. rewrite.
The structured document DB access unit 27 has a function for accessing the document DB 40, and has a function of creating, referring to, updating, and deleting a structured document with respect to the document DB 40.

画像サーバ３０は、リクエスト処理部３１、画像ＩＤ発行部３２および画像ＤＢアクセス部３３を有する。
リクエスト処理部３１は、構造化文書作成クライアント１０や構造化文書サーバ２０と通信を行なうためのサーバサイド機能を有する。 The image server 30 includes a request processing unit 31, an image ID issuing unit 32, and an image DB access unit 33.
The request processing unit 31 has a server-side function for communicating with the structured document creation client 10 and the structured document server 20.

画像ＩＤ発行部３２は、画像を識別するための画像ＩＤを発行する。
画像ＤＢアクセス部３３は、画像ＤＢ５０にアクセスするための機能を有し、この画像ＤＢ５０に対する画像の作成、参照、更新、削除といった機能を有する。 The image ID issuing unit 32 issues an image ID for identifying an image.
The image DB access unit 33 has a function for accessing the image DB 50, and has a function of creating, referring to, updating, and deleting an image with respect to the image DB 50.

次に、画像を含む構造化文書を編集作成する場合の動作について説明する。
図５は、本発明の実施形態における情報処理システムによる構造化文書の作成の手順を示す図である。
図６は、本発明の実施形態における情報処理システムによる画像付き構造化文書の伝送処理の一例を示すフローチャートである。
図７は、本発明の実施形態における情報処理システムによる構造化文書の作成画面の一例を示す図である。
図８は、本発明の実施形態における情報処理システムによる構造化文書のソースコードの一例を示す図である。
構造化文書作成クライアント１０の構造化文書編集ＵＩ部１１により表示する各種画面は、Ｗｅｂブラウザやスタンドアロンアプリケーションプログラムにより表示される。編集者は構造化文書編集ＵＩ部１１への入力操作により、図７に示した作成画面を表示させて、この画面上で文書のタイトル、作成者名、作成日、本文を編集し、かつ当該文書に対する画像の添付のための入力操作を行なうことで、図８に示したようなＸＭＬ形式の画像付き構造化文書の作成がなされる（ステップＳ１）。 Next, an operation for editing and creating a structured document including an image will be described.
FIG. 5 is a diagram showing a procedure for creating a structured document by the information processing system according to the embodiment of the present invention.
FIG. 6 is a flowchart illustrating an example of transmission processing of a structured document with images by the information processing system according to the embodiment of the present invention.
FIG. 7 is a diagram showing an example of a structured document creation screen by the information processing system according to the embodiment of the present invention.
FIG. 8 is a diagram showing an example of the source code of the structured document by the information processing system according to the embodiment of the present invention.
Various screens displayed by the structured document editing UI unit 11 of the structured document creation client 10 are displayed by a Web browser or a stand-alone application program. The editor displays the creation screen shown in FIG. 7 by an input operation to the structured document editing UI unit 11, edits the title, creator name, creation date, and text of the document on this screen, and By performing an input operation for attaching an image to the document, a structured document with an image in the XML format as shown in FIG. 8 is created (step S1).

作成済みの構造化文書には、当該構造化文書に対する画像を参照するための画像参照情報が含まれる。図８に示した例では、構造化文書における画像参照情報は画像ファイル名「waterlilies.jpg」である。 The created structured document includes image reference information for referring to an image with respect to the structured document. In the example shown in FIG. 8, the image reference information in the structured document is an image file name “waterlilies.jpg”.

すると、画像ハッシュ計算部１３は、作成済みの画像付き構造化文書における画像のハッシュ値を計算し（ステップＳ２）、このハッシュ値をハッシュキャッシュ部１４に保存する（ステップＳ３）。 Then, the image hash calculation unit 13 calculates the hash value of the image in the created structured document with an image (step S2), and stores this hash value in the hash cache unit 14 (step S3).

図９は、本発明の実施形態における情報処理システムによる構造化文書の画像ハッシュ値の保存後のソースコードの一例を示す図である。
画像のハッシュ値の保存方法はいくつか考えられるが、例えば図９に示すように、作成済みの構造化文書のソースコード中にハッシュ値を含ませて、この文書をハッシュキャッシュ部１４に保存することが挙げられる。 FIG. 9 is a diagram showing an example of the source code after storing the image hash value of the structured document by the information processing system according to the embodiment of the present invention.
There are several methods for storing the hash value of the image. For example, as shown in FIG. 9, the hash value is included in the source code of the structured document that has been created, and the document is stored in the hash cache unit 14. Can be mentioned.

本実施形態では、構造化文書作成クライアント１０と構造化文書サーバ２０と画像サーバ３０と間の通信は、REST（Representational State Transfer）に基づき、HTTP上で行われる。
通信時のインタフェースは、構造化文書の作成時は「HTTP POST」で、構造化文書の参照時は「HTTP GET」で、構造化文書の更新時は「HTTP PUT」で、構造化文書の削除時は「HTTP DELETE」である。 In this embodiment, communication among the structured document creation client 10, the structured document server 20, and the image server 30 is performed on HTTP based on REST (Representational State Transfer).
The interface for communication is "HTTP POST" when creating a structured document, "HTTP GET" when referring to a structured document, and "HTTP PUT" when updating a structured document. Deleting a structured document The time is “HTTP DELETE”.

よって、構造化文書作成クライアント１０のサーバ通信部１２は、文書の新規作成時は、リクエスト「POST http://document.example.com/server」を用いて、図９に示したような構造化文書のＸＭＬと画像をマルチパートで構造化文書サーバ２０に送信する（ステップＳ４）。このリクエスト中の「http://document.example.com」は構造化文書サーバ２０のサーバ名である。 Therefore, the server communication unit 12 of the structured document creation client 10 uses the request “POST http://document.example.com/server” when creating a new document, and performs the structured process as shown in FIG. The XML and image of the document are transmitted to the structured document server 20 in multipart (step S4). “Http://document.example.com” in this request is the server name of the structured document server 20.

図１０は、本発明の実施形態における情報処理システムによる構造化文書の送信情報の一例を示す図である。
構造化文書サーバ２０のクライアント通信部２１が、構造化文書作成クライアント１０からの画像付きの構造化文書を受信すると（ステップＳ５）、画像サーバ通信部２２は、この受信した画像をリクエスト「POST http://picture.example.com/server」用いて画像サーバ３０に送信する（ステップＳ６）。このリクエスト中の「http://picture.example.com」は画像サーバ３０のサーバ名である。 FIG. 10 is a diagram showing an example of transmission information of a structured document by the information processing system according to the embodiment of the present invention.
When the client communication unit 21 of the structured document server 20 receives a structured document with an image from the structured document creation client 10 (step S5), the image server communication unit 22 requests the received image “POST http”. </picture.example.com/server "is transmitted to the image server 30 (step S6). “Http://picture.example.com” in this request is the server name of the image server 30.

画像サーバ３０は、構造化文書サーバ２０から送られてきた画像をリクエスト処理部３１により受信し、この画像を識別する画像ＩＤを画像ＩＤ発行部３２により発行する。ここでは発行された画像ＩＤは「picId」とする。そして、画像ＤＢアクセス部３３は、受信済みの画像を発行済みの画像ＩＤと関連付けて画像ＤＢ５０に登録する（ステップＳ７）。 The image server 30 receives the image sent from the structured document server 20 by the request processing unit 31 and issues an image ID for identifying the image by the image ID issuing unit 32. Here, the issued image ID is “picId”. Then, the image DB access unit 33 registers the received image in the image DB 50 in association with the issued image ID (step S7).

そして、画像サーバ３０のリクエスト処理部３１は、発行済みの画像ＩＤをもとに画像ＵＲＩを計算して、この画像ＵＲＩを構造化文書サーバ２０に返信する（ステップＳ８）。ここでは、画像ＵＲＩは「http://picture.example.com/picId」であり、当該ＵＲＩの末尾は前述した画像ＩＤである。 Then, the request processing unit 31 of the image server 30 calculates an image URI based on the issued image ID, and returns this image URI to the structured document server 20 (step S8). Here, the image URI is “http://picture.example.com/picId”, and the end of the URI is the above-described image ID.

そして、構造化文書サーバ２０の画像サーバ通信部２２が、画像サーバ３０からの画像ＵＲＩを受信すると（ステップＳ９）、画像ＵＲＩ変換部２４は、ステップＳ５の処理で前述したように構造化文書作成クライアント１０から受信した構造化文書の中の画像参照情報を、ステップＳ９の処理で受信済みの非構造化データＵＲＩに書き換える（ステップＳ１０）。 When the image server communication unit 22 of the structured document server 20 receives the image URI from the image server 30 (step S9), the image URI conversion unit 24 creates a structured document as described above in step S5. The image reference information in the structured document received from the client 10 is rewritten to the unstructured data URI that has been received in step S9 (step S10).

そして、構造化文書サーバ２０の文書ＩＤ発行部２３は、ステップＳ５の処理で受信した構造化文書の構造化文書ＩＤを発行する。ここでは、発行された構造化文書ＩＤは「docId」である。 Then, the document ID issuing unit 23 of the structured document server 20 issues the structured document ID of the structured document received in the process of step S5. Here, the issued structured document ID is “docId”.

そして、構造化文書サーバ２０は、発行された文書ＩＤをステップＳ５の処理で受信済みの構造化文書に書き込み、この構造化文書を文書ＤＢアクセス部２７により、文書ＤＢ４０に登録する（ステップＳ１１）。 Then, the structured document server 20 writes the issued document ID in the structured document that has been received in step S5, and registers this structured document in the document DB 40 by the document DB access unit 27 (step S11). .

図１１は、本発明の実施形態における情報処理システムによる構造化文書の画像参照の書き換え後のソースコードの一例を示す図である。
構造化文書サーバ２０は、最後に、前述のように発行された構造化文書ＩＤ「docId」をもとに構造化文書のＵＲＩである文書ＵＲＩ「http://document.example.com/docId」を計算し、この文書ＵＲＩをステップＳ９の処理で画像サーバ３０から受信済みである、画像ＵＲＩ「http://picture.example.com/picId」とともにクライアント通信部２１により構造化文書作成クライアント１０に返信する（ステップＳ１２）。 FIG. 11 is a diagram showing an example of the source code after rewriting the image reference of the structured document by the information processing system according to the embodiment of the present invention.
Finally, the structured document server 20 uses the document URI “http://document.example.com/docId” which is the URI of the structured document based on the structured document ID “docId” issued as described above. And the client URI 21 together with the image URI “http://picture.example.com/picId”, which has been received from the image server 30 in the process of step S9, is sent to the structured document creation client 10. A reply is made (step S12).

構造化文書作成クライアント１０は、構造化文書サーバ２０からの文書ＵＲＩおよび画像ＵＲＩをサーバ通信部１２により受信すると、これらを内部記憶装置に保存し、かつ、画像ＵＲＩを前述したようにハッシュキャッシュ部１４に記憶されたハッシュ値に関連付けるとともに、受信済みの文書ＵＲＩおよび画像ＵＲＩを構造化文書編集ＵＩ部１１を用いて画面表示させる。 When the structured document creation client 10 receives the document URI and the image URI from the structured document server 20 by the server communication unit 12, the structured document creation client 10 stores them in the internal storage device and stores the image URI as described above. The received document URI and image URI are displayed on the screen using the structured document editing UI unit 11.

編集者は、これらの構造化文書ＵＲＩと画像ＵＲＩを確認して、後にこれらの構造化文書ＵＲＩや画像ＵＲＩを構造化文書編集ＵＩ部１１により指定することで、構造化文書や画像をそれぞれ参照することができる。 The editor confirms the structured document URI and the image URI, and later designates the structured document URI and the image URI by the structured document editing UI unit 11, thereby referring to the structured document and the image, respectively. can do.

構造化文書ＩＤや画像ＩＤの発行に関しては、様々なアルゴリズムを利用することが可能である。例えば、連番、例えばdoc001、doc002、doc003、…のように、UUID（Universally Unique Identifier）などを利用することができる。 Various algorithms can be used for issuing structured document IDs and image IDs. For example, serial numbers such as UUID (Universally Unique Identifier) such as doc001, doc002, doc003,... Can be used.

また、文書ＵＲＩと画像のＵＲＩは、それぞれ発行された文書ＩＤや画像ＩＤを用いて計算することが可能である。
本実施形態では例えば、構造化文書ＩＤと構造化文書サーバ２０のサーバ名をもとに、文書ＵＲＩは「http://構造化文書サーバ名/{文書ＩＤ}」であると計算することができる。
また、例えば、画像ＩＤと画像サーバ３０のサーバ名をもとに、画像ＵＲＩは「http://画像サーバ名/{画像ＩＤ}」であると計算することができる。 The document URI and the image URI can be calculated using the issued document ID and image ID, respectively.
In the present embodiment, for example, based on the structured document ID and the server name of the structured document server 20, the document URI can be calculated as “http: // structured document server name / {document ID}”. it can.
For example, based on the image ID and the server name of the image server 30, the image URI can be calculated as “http: // image server name / {image ID}”.

例えば、構造化文書ＩＤが「doc001」の場合で、前述したように構造化文書サーバ２０のサーバ名が「http://document.example.com」である場合は、構造化文書のＵＲＩは「http://document.example.com/doc001」となる。 For example, when the structured document ID is “doc001” and the server name of the structured document server 20 is “http://document.example.com” as described above, the URI of the structured document is “ http://document.example.com/doc001 ".

また、画像ＩＤが「pic001」の場で、前述したように画像サーバ３０のサーバ名が「http://picture.example.com」である場合は、画像のＵＲＩは「http://picture.example.com/pic001」となる。 When the image ID is “pic001” and the server name of the image server 30 is “http://picture.example.com” as described above, the URI of the image is “http: // picture. example.com/pic001 ".

次に、作成済みの構造化文書の参照について説明する。
図１２は、本発明の実施形態における情報処理システムによる構造化文書の参照の手順を示す図である。
図１３は、本発明の実施形態における情報処理システムによる画像付き構造化文書の参照処理の一例を示すフローチャートである。
まず、構造化文書作成クライアント１０は、編集者による構造化文書編集ＵＩ部１１への入力操作により参照対象の構造化文書の構造化文書ＵＲＩを指定することで（ステップＳ２１）、この構造化文書ＵＲＩを用いて構造化文書の参照のリクエストをサーバ通信部１２により構造化文書サーバ２０に送信する（ステップＳ２２）。
例えば、指定された構造化文書ＵＲＩが「http://document.example.com/docId」である場合には、参照のリクエストは「GET http://document.example.com/docId」となる。 Next, reference to a created structured document will be described.
FIG. 12 is a diagram showing a procedure for referring to a structured document by the information processing system according to the embodiment of the present invention.
FIG. 13 is a flowchart showing an example of a reference process for a structured document with an image by the information processing system according to the embodiment of the present invention.
First, the structured document creation client 10 designates the structured document URI of the structured document to be referred to by an input operation to the structured document editing UI unit 11 by the editor (step S21). A request for referring to the structured document is transmitted to the structured document server 20 by the server communication unit 12 using the URI (step S22).
For example, when the specified structured document URI is “http://document.example.com/docId”, the reference request is “GET http://document.example.com/docId”.

構造化文書サーバ２０は、構造化文書作成クライアント１０からの参照のリクエストをクライアント通信部２１により受信すると（ステップＳ２３）、この構造化文書ＵＲＩから、参照対象の構造化文書の構造化文書ＩＤを計算する。
参照のリクエストが前述した「GET http://document.example.com/docId」である場合は、構造化文書ＩＤは当該リクエストの末尾の「docId」となる。 When the structured document server 20 receives a request for reference from the structured document creation client 10 by the client communication unit 21 (step S23), the structured document ID of the structured document to be referenced is obtained from the structured document URI. calculate.
When the reference request is “GET http://document.example.com/docId”, the structured document ID is “docId” at the end of the request.

そして、文書ＤＢアクセス部２７は、文書ＤＢ４０に保存される構造化文書のうち、計算済みの構造化文書ＩＤに対応する構造化文書を取得する。この構造化文書はクライアント通信部２１により構造化文書作成クライアント１０へ送信される（ステップＳ２４）。 Then, the document DB access unit 27 acquires a structured document corresponding to the calculated structured document ID among the structured documents stored in the document DB 40. This structured document is transmitted to the structured document creation client 10 by the client communication unit 21 (step S24).

構造化文書作成クライアント１０は、構造化文書サーバ２０からの構造化文書をサーバ通信部１２により受信すると（ステップＳ２５）、この構造化文書の中に記述される画像参照用の画像ＵＲＩを認識し（ステップＳ２６）、この認識した画像ＵＲＩをもとにした画像参照のリクエストをサーバ通信部１２により画像サーバ３０へ送信する（ステップＳ２７）。
例えば、認識した画像ＵＲＩが「http://picture.example.com/picId」である場合には、画像参照のリクエストは「GET http://picture.example.com/picId」となる。 When the structured document creation client 10 receives the structured document from the structured document server 20 by the server communication unit 12 (step S25), the structured document creation client 10 recognizes an image URI for image reference described in the structured document. (Step S26), the server communication unit 12 transmits an image reference request based on the recognized image URI to the image server 30 (Step S27).
For example, when the recognized image URI is “http://picture.example.com/picId”, the image reference request is “GET http://picture.example.com/picId”.

画像サーバ３０は、構造化文書サーバ２０からの画像参照のリクエストをリクエスト処理部３１により受信すると（ステップＳ２８）、この画像ＵＲＩから、参照対象の画像の画像ＩＤを計算する。
参照のリクエストが前述した「GET http://picture.example.com/picId」である場合は、画像ＩＤは当該リクエストの末尾の「picId」となる。 When the image processing request is received by the request processing unit 31 from the structured document server 20 (step S28), the image server 30 calculates the image ID of the reference target image from the image URI.
If the reference request is “GET http://picture.example.com/picId”, the image ID is “picId” at the end of the request.

そして、画像ＤＢアクセス部３３は、画像ＤＢ５０に保存される画像のうち、計算済みの画像ＩＤに対応する画像を取得する。この画像はリクエスト処理部３１により構造化文書作成クライアント１０へ送信される（ステップＳ２９）。 Then, the image DB access unit 33 acquires an image corresponding to the calculated image ID among the images stored in the image DB 50. This image is transmitted to the structured document creation client 10 by the request processing unit 31 (step S29).

構造化文書作成クライアント１０は、画像サーバ３０からの画像をサーバ通信部１２により受信すると（ステップＳ３０）、この画像を構造化文書編集ＵＩ部１１により表示する（ステップＳ３１）。以上で参照処理が終了する。 When the structured document creation client 10 receives an image from the image server 30 by the server communication unit 12 (step S30), the structured document creation client 10 displays the image by the structured document editing UI unit 11 (step S31). This completes the reference process.

次に、参照済みの構造化文書を更に編集することによる更新処理について説明する。
図１４は、本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第１の例を示す図である。
図１５は、本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第１の例を示すフローチャートである。
更新処理は２通りあり、まず、更新後の構造化文書の画像のハッシュ値と更新前の同じ構造化文書の画像のハッシュ値とが一致する場合の更新処理について説明する。 Next, update processing by further editing the referenced structured document will be described.
FIG. 14 is a diagram showing a first example of a procedure for updating a structured document by the information processing system in the embodiment of the present invention.
FIG. 15 is a flowchart illustrating a first example of the update process of the structured document with image by the information processing system according to the embodiment of the present invention.
There are two types of update processing. First, update processing in the case where the hash value of the image of the structured document after update matches the hash value of the image of the same structured document before update will be described.

まず、前述した参照のための操作により構造化文書作成クライアント１０の構造化文書編集ＵＩ部１１により表示される構造化文書が編集者による入力操作により更新されると（ステップＳ４１）、画像ハッシュ計算部１３は、この更新された構造化文書に添付される画像のハッシュ値を計算する（ステップＳ４２）。 First, when the structured document displayed by the structured document editing UI unit 11 of the structured document creation client 10 is updated by the input operation by the editor by the above-described reference operation (step S41), the image hash calculation is performed. The unit 13 calculates the hash value of the image attached to the updated structured document (step S42).

前述したハッシュ値は、元となるデータに対してハッシュ計算アルゴリズムを用いて計算される。ハッシュ計算アルゴリズムにはMDA、SHA1などがある。
ハッシュの特徴としては以下の四つが挙げられる。第１の特徴は、ハッシュ値の長さは元データのデータサイズに関わらず固定長になる点である。第２の特徴は、ハッシュ値の長さは、ハッシュ計算のために用いたアルゴリズムによって異なる点である。 The hash value described above is calculated using a hash calculation algorithm on the original data. There are HDA algorithms such as MDA and SHA1.
The hash has the following four features. The first feature is that the hash value has a fixed length regardless of the data size of the original data. The second feature is that the length of the hash value differs depending on the algorithm used for the hash calculation.

第３の特徴は、元データが異なれば、その計算結果であるハッシュ値も異なり、同じハッシュ値になることは極めてまれである点である。第４の特徴は、ハッシュ値とその計算アルゴリズムが判明しても元データを特定することができない点である。 The third feature is that if the original data is different, the hash value as the calculation result is also different, and it is extremely rare that the same hash value is obtained. The fourth feature is that the original data cannot be specified even if the hash value and its calculation algorithm are known.

前述したように、ハッシュキャッシュ部１４に記憶されたハッシュ値には当該ハッシュ値の画像を参照する構造化文書の文書ＵＲＩ「http://document.example.com/docId」が関連付けられる。ハッシュ一致判別部１５は、既にハッシュキャッシュ部１４に保存されるハッシュ値のうち、前述した参照のために指定した構造化文書ＵＲＩが関連付けられるハッシュ値である元のハッシュ値を取得し（ステップＳ４３）、この取得したハッシュ値とステップＳ４２の処理により計算済みのハッシュ値とを比較することで両者が一致するか否かを判別する（ステップＳ４４）。 As described above, the hash value stored in the hash cache unit 14 is associated with the document URI “http://document.example.com/docId” of the structured document that refers to the image of the hash value. The hash match determination unit 15 acquires an original hash value, which is a hash value associated with the structured document URI designated for reference, from among the hash values already stored in the hash cache unit 14 (step S43). ) By comparing the acquired hash value with the hash value already calculated by the process of step S42, it is determined whether or not they match (step S44).

構造化文書作成クライアント１０は、ハッシュ一致判別部１５によりハッシュ値同士が一致すると判別した場合は（ステップＳ４５）、更新された構造化文書の画像が更新されていないと判別して、構造化文書を当該文書の構造化文書ＵＲＩ「http://document.example.com/docId」をもとにした更新リクエスト「PUT http://document.example.com/docId」を用いてサーバ通信部１２により構造化文書サーバ２０に送信する一方で、前述した更新されていないと判別した画像は送信対象としない（ステップＳ４６）。 When the hash match determination unit 15 determines that the hash values match (step S45), the structured document creation client 10 determines that the image of the updated structured document has not been updated, and the structured document By the server communication unit 12 using the update request “PUT http://document.example.com/docId” based on the structured document URI “http://document.example.com/docId” of the document. While transmitting to the structured document server 20, the image determined to be not updated as described above is not a transmission target (step S46).

構造化文書サーバ２０は、構造化文書作成クライアント１０からの構造化文書の更新リクエストをクライアント通信部２１により受信すると（ステップＳ４７）、文書ＤＢ４０に保存される構造化文書のうち、当該受信したリクエストで示される構造化文書ＵＲＩ「http://document.example.com/docId」から計算される構造化文書ＩＤ「docId」に対応する構造化文書を当該受信した構造化文書に更新する処理を構造化文書ＤＢアクセス部２７により行ない（ステップＳ４８）、更新完了通知をクライアント通信部２１により構造化文書作成クライアント１０に発行する（ステップＳ４９）。
構造化文書作成クライアント１０は、構造化文書サーバ２０からの更新完了通知をサーバ通信部１２により受領し、更新処理を終了する。 When the structured document server 20 receives a structured document update request from the structured document creation client 10 by the client communication unit 21 (step S47), the structured request stored in the structured document stored in the document DB 40 is received. The process of updating the structured document corresponding to the structured document ID “docId” calculated from the structured document URI “http://document.example.com/docId” shown in FIG. This is performed by the structured document DB access unit 27 (step S48), and an update completion notification is issued to the structured document creation client 10 by the client communication unit 21 (step S49).
The structured document creation client 10 receives the update completion notification from the structured document server 20 by the server communication unit 12 and ends the update process.

このように、更新された構造化文書の画像が更新されていない場合には、当該構造化文書の画像は構造化文書作成クライアント１０から構造化文書サーバ２０に送信されないので、ネットワークの負荷を低減することができる。 As described above, when the image of the updated structured document is not updated, the image of the structured document is not transmitted from the structured document creation client 10 to the structured document server 20, thereby reducing the network load. can do.

次に、更新後の構造化文書の画像のハッシュ値と更新前の同じ構造化文書の画像のハッシュ値とが一致しない場合の更新処理について説明する。
図１６は、本発明の実施形態における情報処理システムによる処理対象の画像データ同士の近似度の高低を示す図である。
この、ハッシュ値同士が一致しない場合の更新処理は、図１６に示すように更新後の構造化文書の画像と更新前の同じ構造化文書の画像との近似度が高い場合と低い場合とで異なる。まず、更新後の構造化文書の画像と更新前の同じ構造化文書の画像との近似度が高い場合の更新処理について説明する。 Next, an update process when the hash value of the image of the structured document after the update and the hash value of the image of the same structured document before the update do not match will be described.
FIG. 16 is a diagram showing the degree of approximation between image data to be processed by the information processing system according to the embodiment of the present invention.
The update processing in the case where the hash values do not match is shown in FIG. 16 when the degree of approximation between the image of the structured document after update and the image of the same structured document before update is high or low. Different. First, update processing when the degree of approximation between an image of a structured document after update and an image of the same structured document before update is high will be described.

図１７は、本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第２の例を示す図である。
図１８は、本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第２の例を示すフローチャートである。
まず、前述したステップＳ４１〜Ｓ４４までの処理、つまり画像付き構造化文書の更新からハッシュ比較までの処理がなされた上で、構造化文書作成クライアント１０は、ハッシュ一致判別部１５によりハッシュ値同士が一致しないと判別した場合は（ステップＳ５１）、更新された構造化文書の画像が元の画像から更新されたことを意味する。 FIG. 17 is a diagram showing a second example of the procedure for updating the structured document by the information processing system according to the embodiment of the present invention.
FIG. 18 is a flowchart showing a second example of the updating process of the structured document with image by the information processing system according to the embodiment of the present invention.
First, after the processing from the above-described steps S41 to S44, that is, the processing from the update of the structured document with image to the hash comparison, the structured document creation client 10 uses the hash match determination unit 15 to compare the hash values. If it is determined that they do not match (step S51), it means that the image of the updated structured document has been updated from the original image.

この場合、画像ハッシュ計算部１３は、更新された構造化文書に関わる画像のハッシュ値を新たに計算する。さらに、画像ハッシュ計算部１３は、ハッシュキャッシュ部１４に保存されるハッシュ値のうち、更新された構造化文書の構造化文書ＵＲＩが関連付けられるハッシュ値を前述したように新たに計算したハッシュ値に更新する（ステップＳ５２）。 In this case, the image hash calculation unit 13 newly calculates a hash value of an image related to the updated structured document. Furthermore, the image hash calculation unit 13 converts the hash value stored in the hash cache unit 14 to the hash value newly calculated as described above for the hash value associated with the structured document URI of the updated structured document. Update (step S52).

そして、構造化文書作成クライアント１０は、更新された構造化文書および画像を図１０に示したように、マルチパートにより、当該文書の構造化文書ＵＲＩ「http://document.example.com/docId」をもとにした更新リクエスト「PUT http://document.example.com/docId」を用いてサーバ通信部１２により構造化文書サーバ２０に送信する（ステップＳ５３）。ここでの作成時との違いは、更新時はPUTメソッド「PUT http://document.example.com/docId」を利用する点である。 Then, the structured document creation client 10 uses the multi-part structured document URI “http://document.example.com/docId” for the updated structured document and image as shown in FIG. Is transmitted to the structured document server 20 by the server communication unit 12 using the update request “PUT http://document.example.com/docId” based on “” (step S53). The difference from the creation here is that the PUT method “PUT http://document.example.com/docId” is used when updating.

構造化文書サーバ２０は、クライアント通信部２１により、構造化文書作成クライアント１０からの画像付き構造化文書を更新リクエストとともに受信すると（ステップＳ５４）、構造化文書作成クライアント１０から受信した構造化文書の中で示される画像ＵＲＩ「http://picture.example.com/picId」を認識して、この画像ＵＲＩをもとに更新対象の画像の画像ＩＤ「picId」を認識する。 When the structured document server 20 receives the structured document with image from the structured document creation client 10 together with the update request from the structured document creation client 10 by the client communication unit 21 (step S54), the structured document server 20 receives the structured document received from the structured document creation client 10. The image URI “http://picture.example.com/picId” shown therein is recognized, and the image ID “picId” of the image to be updated is recognized based on this image URI.

この画像ＩＤ「picId」は元の画像の画像ＩＤでもある。よって、構造化文書サーバ２０は、この画像ＩＤをもとに、元の画像の参照リクエスト「GET http://picture.example.com/picId」を画像サーバ通信部２２を介して画像サーバ３０に送信する（ステップＳ５５）。 This image ID “picId” is also the image ID of the original image. Therefore, the structured document server 20 sends an original image reference request “GET http://picture.example.com/picId” to the image server 30 via the image server communication unit 22 based on the image ID. Transmit (step S55).

画像サーバ３０は、構造化文書サーバ２０からの元の画像の参照リクエストをリクエスト処理部３１により受信すると、画像ＤＢ５０に保存される画像のうち、受信した参照リクエスト「GET http://picture.example.com/picId」から計算される画像ＩＤ「picId」に対応する元の画像を画像ＤＢアクセス部３３により取得し（ステップＳ５６）、この元の画像をリクエスト処理部３１により構造化文書サーバ２０に送信する（ステップＳ５７）。 When the request processing unit 31 receives the original image reference request from the structured document server 20, the image server 30 receives the received reference request “GET http: //picture.example” from among the images stored in the image DB 50. The original image corresponding to the image ID “picId” calculated from “.com / picId” is acquired by the image DB access unit 33 (step S56), and this original image is sent to the structured document server 20 by the request processing unit 31. Transmit (step S57).

構造化文書サーバ２０は、画像サーバ３０からの元の画像を画像サーバ通信部２２により受信する（ステップＳ５８）。すると、画像近似度計算部２５は、ステップＳ５４の処理で受信した更新後の画像とステップＳ５８の処理で受信した元の画像とを比較することで、両者の近似度を判別する（ステップＳ５９）。 The structured document server 20 receives the original image from the image server 30 by the image server communication unit 22 (step S58). Then, the image approximation calculation unit 25 compares the updated image received in the process of step S54 with the original image received in the process of step S58, thereby determining the degree of approximation of both (step S59). .

なお、画像の近似度計算は公知の技術を用いて行なう。
画像近似度については、例えば下記の論文や特許文献に開示されている。
（１）岡山理科大学松若光一、榎本拓郎、鴉裕孝「画像類似度に関する研究」 [online]、[２００９年４月１０日検索]、インターネット＜ＵＲＬ：http://gemini.mis.ous.ac.jp/Rejume/2005/sim.pdf＞
（２）早稲田大学「類似画像検索方法および類似画像検索装置」、特開２００６−２８５５７０号公報、[online]、[２００９年４月１０日検索]、インターネット＜ＵＲＬ：http://jstore.jst.go.jp/cgibin/patent/advanced/pat/detail_pat.cgi?patid=13404&detail_id=15474＞
構造化文書サーバ２０は、近似度が所定値以上、つまり近似度が高いと画像近似度計算部２５により判別した場合（ステップＳ６０）、更新前後の画像が同じものとみなして、この画像は上書き対象のデータであると判別し、構造化文書作成クライアント１０からの構造化文書の中で示される画像ＵＲＩ「http://picture.example.com/picId」を取得する。 Note that the degree of approximation of an image is calculated using a known technique.
The image approximation degree is disclosed in, for example, the following papers and patent documents.
(1) Okayama University of Science Koichi Matsuwaka, Takuro Enomoto, Hirotaka Tsuji “Studies on Image Similarity” [online], [Search April 10, 2009], Internet <URL: http://gemini.mis.ous.ac .jp / Rejume / 2005 / sim.pdf>
(2) Waseda University “Similar Image Retrieval Method and Similar Image Retrieval Device”, Japanese Patent Laid-Open No. 2006-285570, [online], [Retrieval on April 10, 2009], Internet <URL: http: //jstore.jst .go.jp / cgibin / patent / advanced / pat / detail_pat.cgi? patid = 13404 & detail_id = 15474>
If the structured document server 20 determines that the degree of approximation is equal to or greater than a predetermined value, that is, the degree of approximation is high by the image approximation degree calculation unit 25 (step S60), the structured document server 20 regards the images before and after the update as the same and overwrites this image. It is determined that the data is the target data, and the image URI “http://picture.example.com/picId” indicated in the structured document from the structured document creation client 10 is acquired.

構造化文書サーバ２０は、上書き対象である画像を取得済みの画像ＵＲＩをもとにした画像上書き更新リクエスト「PUT http://picture.example.com/picId」を用いて画像サーバ通信部２２により画像サーバ３０に送信する（ステップＳ６１）。 The structured document server 20 uses the image overwriting update request “PUT http://picture.example.com/picId” based on the image URI from which the image to be overwritten has been acquired. It transmits to the image server 30 (step S61).

画像サーバ３０のリクエスト処理部３１が、構造化文書サーバ２０からの画像上書き更新リクエストを上書き対象の画像とともに受信すると、画像データＤＢアクセス部３３は、画像ＤＢ５０に保存される画像のうち、受信した画像上書き更新リクエスト中の画像ＩＤ「picId」に対応する画像を、画像上書き更新リクエストとともに受信した画像に上書き更新する（ステップＳ６２）。そして、リクエスト処理部３１は、当該画像の更新完了通知を構造化文書サーバ２０に返信する。
つまり、画像サーバ３０側で画像の更新処理はなされるが、その際に当該画像の画像ＩＤはそのままである。したがって、当該画像の画像ＵＲＩもそのままで変更されないことになる。 When the request processing unit 31 of the image server 30 receives the image overwrite update request from the structured document server 20 together with the image to be overwritten, the image data DB access unit 33 receives the image stored in the image DB 50. The image corresponding to the image ID “picId” in the image overwrite update request is overwritten and updated on the image received together with the image overwrite update request (step S62). Then, the request processing unit 31 returns an update completion notification of the image to the structured document server 20.
That is, the image update process is performed on the image server 30 side, but the image ID of the image remains unchanged at that time. Therefore, the image URI of the image is not changed as it is.

構造化文書サーバ２０は、画像サーバ３０からの更新完了通知を画像サーバ通信部２２により受信すると、文書ＤＢ４０に保存される構造化文書のうち、受信済みの更新リクエストで示される構造化文書ＵＲＩ「http://document.example.com/docId」から計算される構造化文書ＩＤ「docId」に対応する構造化文書をステップＳ５４の処理で受信した構造化文書に更新する処理を構造化文書ＤＢアクセス部２７により行ない（ステップＳ６３）、更新完了通知をクライアント通信部２１により構造化文書作成クライアント１０に発行する（ステップＳ６４）。
構造化文書作成クライアント１０は、構造化文書サーバ２０からの更新完了通知をサーバ通信部１２により受領し、更新処理を終了する。 When the structured document server 20 receives the update completion notification from the image server 30 by the image server communication unit 22, the structured document URI “indicated by the received update request among the structured documents stored in the document DB 40. Structured document DB access processing for updating the structured document corresponding to the structured document ID “docId” calculated from “http://document.example.com/docId” to the structured document received in step S54. This is performed by the unit 27 (step S63), and an update completion notification is issued to the structured document creation client 10 by the client communication unit 21 (step S64).
The structured document creation client 10 receives the update completion notification from the structured document server 20 by the server communication unit 12 and ends the update process.

また、その際に、構造化文書サーバ２０において、当該画像へ参照する構造化文書が複数存在する場合、つまりマルチ参照である場合で、かつ、前述したように画像ＵＲＩが更新されない場合には、構造化文書内の画像参照情報を書き換えるなどの処理を行なう必要はない。 At that time, in the structured document server 20, when there are a plurality of structured documents referred to the image, that is, in the case of multi-reference, and when the image URI is not updated as described above, There is no need to perform processing such as rewriting image reference information in a structured document.

図１９は、本発明の実施形態における情報処理システムによる画像付き構造化文書の上書き処理後の参照関係の一例を示す図である。
図１９は、マルチ参照において画像の上書き更新処理を行った後の画像参照関係を示すものである。図１９に示した例では、構造化文書である「文書１」を参照する画像である「画像１」が「画像１’」に上書き更新した場合で、この画像を参照する構造化文書が「文書１」および「文書２」である場合、「文書１」内の「画像１’」の画像ＵＲＩおよび「文書２」内の「画像１’」の画像ＵＲＩは、ともに「ＵＲＩ１」のままで変更されていない。 FIG. 19 is a diagram showing an example of the reference relationship after the overwriting process of the structured document with image by the information processing system according to the embodiment of the present invention.
FIG. 19 shows an image reference relationship after performing an image overwrite update process in multi-reference. In the example shown in FIG. 19, when “Image 1” that is an image referring to “Document 1” that is a structured document is overwritten and updated to “Image 1 ′”, the structured document that refers to this image is “ In the case of “document 1” and “document 2”, both the image URI of “image 1 ′” in “document 1” and the image URI of “image 1 ′” in “document 2” remain “URI1”. It has not changed.

次に、更新後の構造化文書の画像と更新前の同じ構造化文書の画像との近似度が低い場合の更新処理について説明する。
図２０は、本発明の実施形態における情報処理システムによる構造化文書の更新の手順の第３の例を示す図である。
図２１は、本発明の実施形態における情報処理システムによる画像付き構造化文書の更新処理の第３の例を示すフローチャートである。
まず、前述したステップＳ４１〜Ｓ４４の処理、およびステップＳ５１〜Ｓ５９の処理、つまり画像付き構造化文書の更新から画像近似度計算までの処理がなされた上で、構造化文書サーバ２０は、近似度が所定値未満、つまり近似度が低いと画像近似度計算部２５により判別した場合は（ステップＳ７１）、更新後の画像が元の画像と全く異なるとみなして、この画像データは置換対象のデータであると判別し、構造化文書作成クライアント１０からの構造化文書の中で示される画像ＵＲＩ「http://picture.example.com/picId」を取得する。 Next, update processing when the degree of approximation between the image of the structured document after update and the image of the same structured document before update is low will be described.
FIG. 20 is a diagram showing a third example of the procedure for updating the structured document by the information processing system according to the embodiment of the present invention.
FIG. 21 is a flowchart showing a third example of the updating process of the structured document with image by the information processing system according to the embodiment of the present invention.
First, after the processing in steps S41 to S44 and the processing in steps S51 to S59, that is, the processing from the update of the structured document with image to the calculation of the image approximation, the structured document server 20 performs the approximation. Is less than a predetermined value, that is, the degree of approximation is low (step S71), the updated image is regarded as completely different from the original image, and this image data is the data to be replaced. And the image URI “http://picture.example.com/picId” shown in the structured document from the structured document creation client 10 is acquired.

構造化文書サーバ２０は、置換対象である画像を当該画像の新規登録リクエスト「POST http://picture.example.com/server」を用いて画像サーバ通信部２２により画像サーバ３０に送信する（ステップＳ７２）。 The structured document server 20 transmits the image to be replaced to the image server 30 by the image server communication unit 22 by using the new registration request “POST http://picture.example.com/server” of the image (step). S72).

画像サーバ３０のリクエスト処理部３１が、構造化文書サーバ２０からの新規登録リクエストを置換対象の画像とともに受信すると、この置換対象の画像の新たな画像ＩＤを画像ＩＤ発行部３２により新規に発行し、この画像ＩＤを受信済みの画像と関連付けて画像ＤＢ５０に新規保存する（ステップＳ７３）。ここでは、新規の画像ＩＤは「picId2」である。 When the request processing unit 31 of the image server 30 receives the new registration request from the structured document server 20 together with the replacement target image, the image ID issuing unit 32 newly issues a new image ID of the replacement target image. The image ID is newly stored in the image DB 50 in association with the received image (step S73). Here, the new image ID is “picId2”.

リクエスト処理部３１は、この新規の画像ＩＤ「picId2」から新たな画像ＵＲＩ「http://picture.example.com/picId2」を計算して構造化文書サーバ２０に返信する（ステップＳ７４）。 The request processing unit 31 calculates a new image URI “http://picture.example.com/picId2” from the new image ID “picId2” and returns it to the structured document server 20 (step S74).

構造化文書サーバ２０は、画像サーバ３０からの新たな画像ＵＲＩを画像サーバ通信部２２により受信すると（ステップＳ７５）、ステップＳ５４の処理で既に受信した構造化文書に含まれる画像ＵＲＩ「http://picture.example.com/picId」を、ステップＳ７５の処理で新たに受信した画像ＵＲＩ「http://picture.example.com/picId2」に書き換える（ステップＳ７６）。 When the structured document server 20 receives a new image URI from the image server 30 by the image server communication unit 22 (step S75), the structured document server 20 receives the image URI “http: / included in the structured document already received in the process of step S54. "/picture.example.com/picId" is rewritten to the image URI "http://picture.example.com/picId2" newly received in the process of step S75 (step S76).

そして、構造化文書ＤＢアクセス部２７は、文書ＤＢ４０に保存される構造化文書のうち、既に受信した構造化文書の構造化文書ＵＲＩ「http://document.example.com/docId」から計算される構造化文書ＩＤ「docId」に対応する構造化文書を、ステップＳ７６の処理により画像ＵＲＩが書き換えられた構造化文書に更新する（ステップＳ７７）。 Then, the structured document DB access unit 27 is calculated from the structured document URI “http://document.example.com/docId” of the already received structured document among the structured documents stored in the document DB 40. The structured document corresponding to the structured document ID “docId” is updated to the structured document in which the image URI has been rewritten by the process of step S76 (step S77).

そして、構造化文書サーバ２０は、前述のように更新された構造化文書のＵＲＩ「http://document.example.com/docId」を、ステップＳ７５の処理で画像サーバ３０から受信済みである、新たな画像ＵＲＩ「http://picture.example.com/picId2」とともにクライアント通信部２１により構造化文書作成クライアント１０に返信する（ステップＳ７８）。 The structured document server 20 has received the URI “http://document.example.com/docId” of the structured document updated as described above from the image server 30 in the process of step S75. A new image URI “http://picture.example.com/picId2” is returned to the structured document creation client 10 by the client communication unit 21 (step S78).

構造化文書作成クライアント１０は、構造化文書サーバ２０からの文書ＵＲＩおよび新たな画像ＵＲＩを受信すると、これらを内部記憶装置に保存し、かつ、新たな画像ＵＲＩをステップＳ５２の処理でハッシュキャッシュ部１４において更新されたハッシュ値に関連付けるとともに、文書ＵＲＩや新たな画像ＵＲＩを構造化文書編集ＵＩ部１１を用いて画面表示させる。
編集者は、これらの構造化文書ＵＲＩと新たな画像ＵＲＩを確認して、後にこれらの構造化文書ＵＲＩと画像ＵＲＩを構造化文書編集ＵＩ部１１により指定することで、文書と画像をそれぞれ参照することができる。 When the structured document creation client 10 receives the document URI and the new image URI from the structured document server 20, the structured document creation client 10 stores them in the internal storage device, and stores the new image URI in the hash cache unit in step S52. 14, the document URI and the new image URI are displayed on the screen using the structured document editing UI unit 11.
The editor confirms the structured document URI and the new image URI, and later designates the structured document URI and the image URI by the structured document editing UI unit 11, thereby referring to the document and the image, respectively. can do.

また、その際に、構造化文書サーバ２０において、当該置換後の画像を参照する構造化文書が複数存在する場合、つまり前述したマルチ参照である場合で、かつ、画像ＵＲＩが前述したように新たに発行された場合は、それぞれの構造化文書の中に記述される画像参照の書き換えなどの処理を行なうか否かを判別する必要がある。 At that time, in the structured document server 20, when there are a plurality of structured documents that refer to the replaced image, that is, in the case of the above-described multi-reference, and the image URI is newly added as described above. Is issued, it is necessary to determine whether or not to perform processing such as rewriting of image references described in each structured document.

図２２は、本発明の実施形態における情報処理システムによる画像付き構造化文書の置換処理後の参照関係の一例を示す図である。
画像参照の書き換え処理については、構造化文書サーバ２０は、同じ画像参照を有するそれぞれの構造化文書のプロパティを照合することにより、当該書き換え処理を行なうか否かの判断を決定する。 FIG. 22 is a diagram showing an example of the reference relationship after the replacement process of the structured document with image by the information processing system according to the embodiment of the present invention.
For the image reference rewriting process, the structured document server 20 determines whether to perform the rewriting process by checking the properties of the structured documents having the same image reference.

図２２に示した例では、更新対象の構造化文書が「文書１」、その他の構造化文書が「文書２」、これらの構造化文書が画像ＵＲＩ「ＵＲＩ１により」参照する更新前の画像が「画像１」で、更新後の画像が「画像２」であるとする。 In the example shown in FIG. 22, the document to be updated refers to “document 1” as the structured document to be updated, “document 2” as the other structured document, and the images before updating that these structured documents refer to the image URI “using URI1”. Assume that “image 1” and the updated image is “image 2”.

例えば、構造化文書サーバ２０は、図２２に示すように、同じ画像ＵＲＩ「ＵＲＩ１」を含む複数の構造化文書「文書１」、「文書２」のうち画像を「画像１」から「画像２」に更新した構造化文書「文書１」とは異なる構造化文書「文書２」を選択する。
そして、構造化文書サーバ２０は、この文書の作成日プロパティ、例えば図８に示したようなdateタグをチェックし、この作成日が更新対象の構造化文書「文書１」の作成日と近ければ、当該選択した構造化文書「文書２」の画像ＵＲＩ「ＵＲＩ１」を更新対象の構造化文書「文書１」において前述のように書き換えられた画像ＵＲＩ「ＵＲＩ２」に書き換え、つまり図２２に示した形態２に移行する。 For example, as shown in FIG. 22, the structured document server 20 converts the images from “image 1” to “image 2” among the plurality of structured documents “document 1” and “document 2” including the same image URI “URI1”. The structured document “Document 2” different from the structured document “Document 1” updated to “” is selected.
The structured document server 20 checks the creation date property of this document, for example, a date tag as shown in FIG. 8, and if this creation date is close to the creation date of the structured document “document 1” to be updated. Then, the image URI “URI1” of the selected structured document “document 2” is rewritten to the image URI “URI2” rewritten as described above in the structured document “document 1” to be updated, that is, as shown in FIG. Transition to Form 2.

また、選択済みの構造化文書「文書２」の作成日が更新対象の構造化文書「文書１」の作成日と近くない場合には、当該選択した構造化文書「文書１」の画像ＵＲＩ「ＵＲＩ１」の書き換えを行なわずに、つまり図２２に示した形態１に移行する。また、画像参照の書き換えの有無は他の拡張プロパティを用いて判別するようにしてもよい。 If the creation date of the selected structured document “Document 2” is not close to the creation date of the structured document “Document 1” to be updated, the image URI “ Without rewriting “URI1”, that is, the mode 1 shown in FIG. 22 is entered. Further, whether or not image reference is rewritten may be determined using other extended properties.

次に、構造化文書の削除について説明する。
図２３は、本発明の実施形態における情報処理システムによる構造化文書の削除の手順の一例を示す図である。
図２４は、本発明の実施形態における情報処理システムによる画像付き構造化文書の削除処理の第３の例を示すフローチャートである。
まず、構造化文書作成クライアント１０は、編集者による構造化文書編集ＵＩ部１１への操作により削除対象の構造化文書の構造化文書ＵＲＩ、ここでは「http://document.example.com/docId」が指定されると（ステップＳ８１）、この構造化文書ＵＲＩをもとにした削除リクエスト「DELETE http://document.example.com/docId」をサーバ通信部１２により構造化文書サーバ２０に送信する（ステップＳ８２）。 Next, deletion of a structured document will be described.
FIG. 23 is a diagram illustrating an example of a procedure for deleting a structured document by the information processing system according to the embodiment of this invention.
FIG. 24 is a flowchart showing a third example of the deletion process of the structured document with image by the information processing system according to the embodiment of the present invention.
First, the structured document creation client 10 performs a structured document URI of a structured document to be deleted by an operation on the structured document editing UI unit 11 by an editor, in this case “http://document.example.com/docId”. ”Is designated (step S81), a delete request“ DELETE http://document.example.com/docId ”based on the structured document URI is transmitted to the structured document server 20 by the server communication unit 12. (Step S82).

構造化文書サーバ２０は、構造化文書作成クライアント１０からの削除リクエスト「http://document.example.com/docId」をクライアント通信部２１により受信すると（ステップＳ８３）、この削除リクエストが示す構造化文書ＵＲＩから、削除対象の構造化文書の構造化文書ＩＤ「docId」を計算する。 When the structured document server 20 receives the delete request “http://document.example.com/docId” from the structured document creation client 10 by the client communication unit 21 (step S83), the structured document indicated by the delete request is displayed. From the document URI, the structured document ID “docId” of the structured document to be deleted is calculated.

そして、文書ＤＢアクセス部２７は、文書ＤＢ４０に保存される構造化文書のうち、計算済みの構造化文書ＩＤ「docId」に対応する構造化文書を取得して、この構造化文書の中に記述される画像ＵＲＩ「http://picture.example.com/picId」を認識し（ステップＳ８４）、この認識した画像ＵＲＩをもとにした画像削除のリクエスト「DELETE http://picture.example.com/picId」を画像サーバ通信部２２を介して画像サーバ３０へ送信する（ステップＳ８５）。 Then, the document DB access unit 27 acquires a structured document corresponding to the calculated structured document ID “docId” among the structured documents stored in the document DB 40 and describes the structured document in the structured document. The image URI “http://picture.example.com/picId” is recognized (step S84), and a request for deleting an image “DELETE http://picture.example.com” based on the recognized image URI. / picId "is transmitted to the image server 30 via the image server communication unit 22 (step S85).

画像サーバ３０は、構造化文書サーバ２０からの画像削除のリクエストをリクエスト処理部３１により受信すると（ステップＳ８６）、このリクエストで示される画像ＵＲＩから、削除対象の画像の画像ＩＤ「picId」を計算する。 When the image processing request is received by the request processing unit 31 from the structured document server 20 (step S86), the image server 30 calculates the image ID “picId” of the image to be deleted from the image URI indicated by this request. To do.

そして、画像ＤＢアクセス部３３は、画像ＤＢ５０に保存される画像のうち、計算済みの画像ＩＤ「picId」に対応する画像を削除する（ステップＳ８７）。
そして、リクエスト処理部３１は、画像削除の完了通知を構造化文書サーバ２０に送信する（ステップＳ８８）。 Then, the image DB access unit 33 deletes the image corresponding to the calculated image ID “picId” from the images stored in the image DB 50 (step S87).
Then, the request processing unit 31 transmits an image deletion completion notification to the structured document server 20 (step S88).

構造化文書サーバ２０の画像サーバ通信部２２が画像サーバ３０からの削除完了通知を受信すると、文書ＤＢアクセス部２７は、文書ＤＢ４０に保存される構造化文書のうち、前述したように計算済みである構造化文書ＩＤ「docId」に対応する構造化文書を削除する（ステップＳ８９）。そして、クライアント通信部２１は、構造化文書および画像の削除完了通知を構造化文書作成クライアント１０に返信する（ステップＳ９０）。
なお、これまでの例では、一つの画像を含む構造化文書を例として説明したが、この構造化文書は複数の画像を含むものであってもよい。 When the image server communication unit 22 of the structured document server 20 receives the deletion completion notification from the image server 30, the document DB access unit 27 has already calculated as described above among the structured documents stored in the document DB 40. A structured document corresponding to a certain structured document ID “docId” is deleted (step S89). Then, the client communication unit 21 returns a structured document and image deletion completion notification to the structured document creation client 10 (step S90).
In the above examples, the structured document including one image has been described as an example. However, the structured document may include a plurality of images.

以上のように、本発明の実施形態における情報処理システムでは、画像を例とした非構造化データの参照情報を参照する構造化文書が更新された場合に、当該構造化文書の非構造化データが更新されているか否かを判別し、更新されていない場合には、非構造化データの参照情報を有する構造化文書を構造化文書サーバに送信する一方で、当該非構造化データは送信しないようにするので、ネットワークトラフィックを大幅に軽減する効果を得ることができる。 As described above, in the information processing system according to the embodiment of the present invention, when a structured document that refers to reference information of unstructured data using an image as an example is updated, the unstructured data of the structured document is updated. If it is not updated, the structured document having the reference information of the unstructured data is transmitted to the structured document server, but the unstructured data is not transmitted. As a result, the effect of greatly reducing network traffic can be obtained.

また、非構造化データ更新がなされた場合、更新前の非構造化データと更新後の非構造化データとの近似度の高低により、更新された画像の上書きまたは置換といった更新処理を行なうので、画像の適切な更新処理を行なうことができる。 When unstructured data is updated, update processing such as overwriting or replacement of the updated image is performed depending on the degree of approximation between the unstructured data before the update and the unstructured data after the update. Appropriate image update processing can be performed.

また、複数の構造化文書が同じ非構造化データを参照している場合において、前述した複数の構造化文書のうち何れかの構造化文書の非構造化データが更新されたら、各構造化文書のプロパティなどの情報を元に、同じ非構造化データを参照する他の構造化文書における非構造化データへの参照情報を更新するので、各構造化文書の参照情報を適切なものとすることができる。 In addition, when a plurality of structured documents refer to the same unstructured data, when the unstructured data of any one of the plurality of structured documents is updated, each structured document is updated. Because the reference information to unstructured data in other structured documents that refer to the same unstructured data is updated based on information such as the properties of the structure, the reference information of each structured document should be appropriate Can do.

次に、本実施形態における情報処理システムの変形例を説明する。
前述した実施形態では、構造化文書の更新がなされた場合で、この構造化文書を参照する画像が上書き更新用画像であるか置換用画像であるかの判別を、構造化文書の更新にともなって自動的に行なっていたが、この変形例では、更新した構造化文書を参照する画像が上書き更新用画像であるか置換用画像であるかの判別を、当該構造化文書に関する、編集者による指定内容にしたがって判別することにより、編集者の意向にしたがった画像更新を行なう機能を有する。 Next, a modification of the information processing system in the present embodiment will be described.
In the above-described embodiment, when the structured document is updated, the determination of whether the image referring to the structured document is an overwrite update image or a replacement image is accompanied by the update of the structured document. However, in this modification, it is determined by the editor regarding the structured document whether the image referring to the updated structured document is an overwrite update image or a replacement image. By discriminating according to the designated contents, it has a function of updating the image according to the editor's intention.

図２５は、本発明の実施形態における情報処理システムの構成の変形例を示すブロック図である。
図２５に示すように、この変形例では、図４に示した形態と比較して、構造化文書作成クライアント１０は、変更種別判別部１６をさらに備える。
この変更種別判別部１６は、構造化文書の更新がなされた場合で、この構造化文書を参照する画像が上書き更新用画像であるか置換用画像であるかを判別する。 FIG. 25 is a block diagram showing a modification of the configuration of the information processing system in the embodiment of the present invention.
As shown in FIG. 25, in this modification, the structured document creation client 10 further includes a change type determination unit 16 as compared with the form shown in FIG.
The change type determination unit 16 determines whether the image that refers to the structured document is an overwrite update image or a replacement image when the structured document is updated.

図２６は、本発明の実施形態の変形例における情報処理システムによる構造化文書の変更の手順の一例を示す図である。
図２７は本発明の実施形態の変形例における情報処理システムによる画像付き構造化文書の変更処理の一例を示すフローチャートである。
この処理では、前述したステップＳ４１の処理、つまり構造化文書が更新された場合において、当該構造化文書の画像の変更種別が上書き更新用であるか置換であるかを定めるための編集者による入力操作を構造化文書編集ＵＩ部１１により受け付け、この受け付けた内容を構造化文書作成クライアント１０の内部記憶装置に記憶するものとする。 FIG. 26 is a diagram showing an example of a procedure for changing a structured document by the information processing system in the modification of the embodiment of the present invention.
FIG. 27 is a flowchart showing an example of a process for changing the structured document with image by the information processing system according to the modification of the embodiment of the present invention.
In this process, when the structured document is updated in step S41 described above, that is, an input by the editor for determining whether the change type of the image of the structured document is for overwriting update or replacement. Assume that the structured document editing UI unit 11 accepts an operation and stores the accepted content in the internal storage device of the structured document creation client 10.

そして、ステップＳ４２〜Ｓ４４の処理、およびステップＳ５１，Ｓ５２の処理、つまり画像付き構造化文書の画像ハッシュの計算からハッシュ比較の処理、および当該比較対象のハッシュが一致しないと判別して、ハッシュキャッシュ部１４に保存される画像ハッシュが更新された場合において、構造化文書作成クライアント１０の変更種別判別部１６は、前述したように内部記憶装置に記憶される、構造化文書編集ＵＩ部１１により受け付けた操作内容で示される変更種別が上書き更新であるか置換であるかを判別する（ステップＳ１０１）。 Then, it is determined that the processing of steps S42 to S44 and the processing of steps S51 and S52, that is, the hash comparison processing from the calculation of the image hash of the structured document with image, and the comparison target hash do not match, and the hash cache When the image hash stored in the unit 14 is updated, the change type determination unit 16 of the structured document creation client 10 receives the structured document editing UI unit 11 stored in the internal storage device as described above. It is determined whether the change type indicated by the operation content is overwrite update or replacement (step S101).

変更種別が上書き更新であった場合には（ステップＳ１０２のＹＥＳ）、前述したステップＳ７２からＳ７８までの処理、つまり置換用画像の画像サーバ３０への送信から、当該画像サーバ３０による文書ＵＲＩや画像ＵＲＩの構造化文書作成クライアント１０への返信までの処理がなされる。 If the change type is overwrite update (YES in step S102), the processing from step S72 to S78 described above, that is, the transmission of the replacement image to the image server 30, the document URI or image by the image server 30 is displayed. Processing up to the reply to the URI structured document creation client 10 is performed.

一方、変更種別が上書き更新であった場合には（ステップＳ１０２のＮＯ）、前述したステップＳ６１からＳ６３までの処理、つまり置換用画像の画像サーバ３０への送信から構造化文書サーバ２０における構造化文書の保存までの処理がなされる。 On the other hand, when the change type is overwrite update (NO in step S102), the processing in steps S61 to S63, that is, the transmission of the replacement image to the image server 30 to the structured document server 20 is structured. Processing up to the storage of the document is performed.

また、前述したように更新した構造化文書の画像を参照する構造化文書が他にも存在する場合、つまり前述したマルチ参照である場合における、当該他の構造化文書の画像の種別については、更新した構造化文書の画像について指定された変更種別をそのまま採用してもよいし、構造化文書を更新する際に、当該構造化文書の画像を参照する他の文書の一覧を表示させて、それぞれの構造化文書について、当該構造化文書の画像の更新形態が上書き更新か置換であるかを定めるための指定の入力操作を構造化文書編集ＵＩ部１１により受け付けた上で、この受け付けた内容にしたがって、それぞれの構造化文書の画像を上書き更新または置換するようにしてもよい。
また、本実施形態では非構造化データは画像であるとして説明したが、これに限らず、例えば音声や映像であってもよい。 In addition, when there is another structured document that refers to the image of the structured document updated as described above, that is, in the case of the multi-reference described above, the type of the image of the other structured document is as follows. The change type specified for the image of the updated structured document may be adopted as it is, or when updating the structured document, a list of other documents that refer to the image of the structured document is displayed, For each structured document, the structured document editing UI unit 11 receives a specified input operation for determining whether the image update form of the structured document is overwrite update or replacement, and the received contents Accordingly, the image of each structured document may be overwritten or updated or replaced.
In the present embodiment, the unstructured data is described as an image. However, the present invention is not limited to this, and may be audio or video, for example.

なお、この発明は前記実施形態そのままに限定されるものではなく実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、前記実施形態に開示されている複数の構成要素の適宜な組み合わせにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を省略してもよい。更に、異なる実施形態に亘る構成要素を適宜組み合せてもよい。 The present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. Various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment. For example, some components may be omitted from all the components shown in the embodiment. Furthermore, you may combine suitably the component covering different embodiment.

１，１０…構造化文書作成クライアント、１ａ，１１…構造化文書編集ＵＩ部、１ｂ，１２…サーバ通信部、１ｃ…非構造化データハッシュ計算部、１ｄ…ハッシュキャッシュ部、１ｅ，１５…ハッシュ一致判別部、１６…変更種別判別部、２，２０…構造化文書サーバ、２ａ，２１…クライアント通信部、２ｂ…非構造化データサーバ通信部、２ｃ…構造化文書ＩＤ発行部、２ｄ…非構造化データＵＲＩ変換部、２ｅ…非構造化データ近似度計算部、２ｆ…非構造化データマルチ参照処理部、２ｇ…構造化文書ＤＢアクセス部、３…非構造化データサーバ、３ａ，３１…リクエスト処理部、３ｂ…非構造化データＩＤ発行部、３ｃ…非構造化データＤＢアクセス部、４…構造化文書ＤＢ、５…非構造化データＤＢ、１３…画像ハッシュ計算部、１４…画像ハッシュキャッシュ部、２２…画像サーバ通信部、２３…画像ＩＤ発行部、２４…画像ＵＲＩ変換部、２５…画像近似度計算部、２６…画像マルチ参照処理部、２７…文書ＤＢアクセス部、３０…画像サーバ、３２…画像ＩＤ発行部、３３…画像ＤＢアクセス部、４０…文書ＤＢ、５０…画像ＤＢ。 DESCRIPTION OF SYMBOLS 1,10 ... Structured document creation client, 1a, 11 ... Structured document editing UI part, 1b, 12 ... Server communication part, 1c ... Unstructured data hash calculation part, 1d ... Hash cache part, 1e, 15 ... Hash Match determination unit, 16 ... change type determination unit, 2, 20 ... structured document server, 2a, 21 ... client communication unit, 2b ... unstructured data server communication unit, 2c ... structured document ID issuing unit, 2d ... non- Structured data URI conversion unit, 2e... Unstructured data approximation degree calculation unit, 2f... Unstructured data multi-reference processing unit, 2g ... Structured document DB access unit, 3 ... Unstructured data server, 3a, 31. Request processing unit, 3b ... unstructured data ID issuing unit, 3c ... unstructured data DB access unit, 4 ... structured document DB, 5 ... unstructured data DB, 13 ... image hash calculation , 14 ... Image hash cache unit, 22 ... Image server communication unit, 23 ... Image ID issuing unit, 24 ... Image URI conversion unit, 25 ... Image approximation calculation unit, 26 ... Image multi-reference processing unit, 27 ... Document DB access 30: Image server 32: Image ID issuing unit 33: Image DB access unit 40: Document DB 50: Image DB

Claims

クライアント装置、構造化データサーバ装置、および非構造化データサーバ装置を有する情報処理システムであって、
前記クライアント装置は、
構造化データおよび当該構造化データに関わる非構造化データの作成のための入力操作を受け付ける入力手段と、
前記入力手段により作成された構造化データに関わる非構造化データのハッシュ値を計算する第１の計算手段と、
前記入力手段により作成された構造化データおよび当該構造化データに関わる非構造化データを前記構造化データサーバ装置に送信する第１の送信手段と、
前記作成された構造化データが前記入力手段への入力操作により更新された場合に、当該更新された構造化データに関わる非構造化データのハッシュ値を計算する第２の計算手段と、
前記第２の計算手段により計算したハッシュ値と前記第１の計算手段により計算した、更新前の構造化データに関わる非構造化データのハッシュ値とが一致するか否かを判別する判別手段と、
前記ハッシュ値が一致すると前記判別手段により判別した場合に、前記更新された構造化データを前記構造化データサーバ装置に送信し、前記更新された構造化データに関わる非構造化データは送信しない第２の送信手段とを有し、
前記構造化データサーバ装置は、
前記構造化データを記憶するための構造化データ記憶装置と、
前記クライアント装置からの前記作成時で更新前の構造化データに関わる非構造化データを前記非構造化データサーバ装置に送信する送信手段と、
前記クライアント装置からの前記作成時で更新前の構造化データに、前記非構造化データサーバ装置からの参照情報を含ませて前記構造化データ記憶装置に記憶する第１の記憶制御手段と、
前記構造化データ記憶装置に記憶される前記更新前の構造化データを、前記クライアント装置からの前記更新された構造化データに書き換える第２の記憶制御手段とを有し、
前記非構造化データサーバ装置は、
前記非構造化データを記憶するための非構造化データ記憶装置と、
前記構造化データサーバ装置からの前記作成時で更新前の構造化データに関わる非構造化データを前記非構造化データ記憶装置に記憶する記憶制御手段と、
前記作成時で更新前における、前記構造化データサーバ装置からの非構造化データを前記更新前の構造化データから参照するための参照情報を前記構造化データサーバ装置に送信する送信手段とを備えた
ことを特徴とする情報処理システム。 An information processing system having a client device, a structured data server device, and an unstructured data server device,
The client device is
An input means for accepting an input operation for creating structured data and unstructured data related to the structured data;
First calculation means for calculating a hash value of unstructured data related to structured data created by the input means;
First transmission means for transmitting structured data created by the input means and unstructured data related to the structured data to the structured data server device;
Second calculation means for calculating a hash value of unstructured data related to the updated structured data when the created structured data is updated by an input operation to the input means;
Discrimination means for discriminating whether or not the hash value calculated by the second calculation means matches the hash value of the unstructured data related to the structured data before update calculated by the first calculation means; ,
When the determination means determines that the hash values match, the updated structured data is transmitted to the structured data server device, and unstructured data related to the updated structured data is not transmitted. 2 transmission means,
The structured data server device comprises:
A structured data storage device for storing the structured data;
Transmitting means for transmitting unstructured data related to structured data before update at the time of creation from the client device to the unstructured data server device;
First storage control means for including in the structured data storage device the reference data from the unstructured data server device in the structured data before update at the time of creation from the client device;
Second storage control means for rewriting the pre-update structured data stored in the structured data storage device with the updated structured data from the client device;
The unstructured data server device
An unstructured data storage device for storing the unstructured data;
Storage control means for storing unstructured data related to structured data before update at the time of creation from the structured data server device in the unstructured data storage device;
Transmission means for transmitting, to the structured data server device, reference information for referring to unstructured data from the structured data server device before the update at the time of creation from the structured data before the update. An information processing system characterized by that.

前記クライアント装置は、
前記ハッシュ値が一致しないと前記判別手段により判別した場合に、前記更新された構造化データを当該構造化データに関わる非構造化データとともに前記構造化データサーバ装置に送信する第３の送信手段をさらに備え、
前記構造化データサーバ装置は、
前記クライアント装置の第３の送信手段からの前記更新された構造化データを受信した場合に、当該構造化データに含まれる前記参照情報をもとに前記非構造化データサーバ装置に対して前記更新前の非構造化データの送信リクエストを行なうリクエスト手段と、
前記リクエスト手段によるリクエストに伴う、前記非構造化データサーバ装置の第２の送信手段からの前記更新前の非構造化データと前記クライアント装置の第３の送信手段からの前記更新された構造化データに関わる非構造化データとの近似度を計算する近似度計算手段と、
前記近似度計算手段により計算した近似度が所定値以上である場合に、前記クライアント装置からの、前記更新された構造化データに関わる非構造化データを前記非構造化データサーバに送信する第２の送信手段と、
前記近似度計算手段により計算した近似度が所定値未満である場合に、前記クライアント装置からの、前記更新された構造化データに関わる非構造化データを前記非構造化データサーバに送信する第３の送信手段と、
前記非構造化データサーバ装置の第３の送信手段からの前記非構造化データの新たな参照情報を前記クライアント装置に送信する第４の送信手段と、
前記クライアント装置からの前記更新された構造化データに含まれる前記参照情報を前記非構造化データサーバ装置の第３の送信手段からの新たな参照情報に書き換える参照情報書き換え手段とをさらに備え、
前記非構造化データサーバ装置は、
前記構造化データサーバ装置からのリクエストに従って前記更新前の非構造化データを前記構造化データサーバ装置に送信する第２の送信手段をさらに備え、
前記構造化データサーバ装置の前記第２の送信手段からの前記非構造化データを前記更新前の非構造化データに替えて前記非構造化データ記憶装置に記憶する第２の記憶制御手段と、
前記構造化データサーバ装置の前記第３の送信手段からの非構造化データを前記更新前の非構造化データとは別に前記非構造化データ記憶装置に記憶する第３の制御記憶手段と、
前記構造化データサーバ装置の前記第３の送信手段からの非構造化データの新たな参照情報を前記構造化データサーバ装置に送信する第３の送信手段とをさらに備えた
ことを特徴とする請求項１に記載の情報処理システム。 The client device is
A third transmitting unit configured to transmit the updated structured data to the structured data server device together with unstructured data related to the structured data when the determining unit determines that the hash values do not match; In addition,
The structured data server device comprises:
When the updated structured data is received from the third transmission unit of the client device, the update is performed on the unstructured data server device based on the reference information included in the structured data. A request means for making a request to send the previous unstructured data;
The unstructured data before the update from the second transmission unit of the unstructured data server device and the updated structured data from the third transmission unit of the client device, accompanying the request by the request unit. A degree of approximation calculating means for calculating a degree of approximation with unstructured data related to
A second unit for transmitting unstructured data related to the updated structured data from the client device to the unstructured data server when the degree of approximation calculated by the degree-of-approximation calculating unit is a predetermined value or more; Means for sending
A third unit for transmitting unstructured data related to the updated structured data from the client device to the unstructured data server when the degree of approximation calculated by the degree-of-approximation calculating unit is less than a predetermined value; Means for sending
Fourth transmission means for transmitting new reference information of the unstructured data from the third transmission means of the unstructured data server device to the client device;
Reference information rewriting means for rewriting the reference information included in the updated structured data from the client device with new reference information from a third sending means of the unstructured data server device,
The unstructured data server device
A second transmission means for transmitting the unstructured data before the update to the structured data server device in accordance with a request from the structured data server device;
Second storage control means for storing the unstructured data from the second transmission means of the structured data server device in the unstructured data storage device in place of the unstructured data before the update;
Third control storage means for storing unstructured data from the third transmission means of the structured data server device in the unstructured data storage device separately from the unstructured data before the update;
And further comprising: third transmission means for transmitting new reference information of unstructured data from the third transmission means of the structured data server device to the structured data server device. Item 4. The information processing system according to Item 1.

前記構造化データサーバ装置の参照情報書き換え手段は、
前記更新された構造化データに関わる非構造化データを参照する他の構造化文書に含まれる前記参照情報を前記非構造化データサーバ装置からの新たな参照情報に書き換えることを特徴とする請求項２に記載の情報処理システム。 The reference information rewriting means of the structured data server device is:
The referential information included in another structured document that refers to unstructured data related to the updated structured data is rewritten with new reference information from the unstructured data server device. 2. The information processing system according to 2.

構造化データおよび当該構造化データに関わる非構造化データの作成のための入力操作を受け付ける入力手段と、
前記入力手段により作成された構造化データに関わる非構造化データのハッシュ値を計算する第１の計算手段と、
前記入力手段により作成された構造化データを構造化データサーバ装置に記憶するための構造化データとして送信し、当該構造化データに関わる非構造化データを非構造化データサーバ装置に記憶するための非構造化データとして送信する第１の送信手段と、
前記作成されて前記構造化データサーバに記憶された構造化データが前記入力手段への入力操作により更新された場合に、当該構造化データに関わる非構造化データのハッシュ値を計算する第２の計算手段と、
前記第２の計算手段により計算したハッシュ値と前記第１の計算手段により計算した、前記更新前の構造化データに関わる非構造化データのハッシュ値とが一致するか否かを判別する判別手段と、
前記ハッシュ値が一致すると前記判別手段により判別した場合に、前記更新された構造化データを前記構造化データサーバ装置に記憶される更新前の構造化データに替えて記憶するための構造化データとして送信し、前記更新された構造化データに関わる非構造化データは送信しない第２の送信手段とを備えたことを特徴とする情報処理装置。 An input means for accepting an input operation for creating structured data and unstructured data related to the structured data;
First calculation means for calculating a hash value of unstructured data related to structured data created by the input means;
Transmitting structured data created by the input means as structured data to be stored in the structured data server device, and storing unstructured data related to the structured data in the unstructured data server device First transmitting means for transmitting as unstructured data;
A second value for calculating a hash value of unstructured data related to the structured data when the structured data created and stored in the structured data server is updated by an input operation to the input means; A calculation means;
Discriminating means for discriminating whether or not the hash value calculated by the second calculating means matches the hash value of the unstructured data related to the structured data before update calculated by the first calculating means. When,
As structured data for storing the updated structured data in place of the structured data before update stored in the structured data server device when the determining means determines that the hash values match. An information processing apparatus comprising: a second transmission unit that transmits and does not transmit unstructured data related to the updated structured data.