JP2010267081A

JP2010267081A - Information search method, device and program

Info

Publication number: JP2010267081A
Application number: JP2009117964A
Authority: JP
Inventors: Toshibumi Enomoto; 俊文榎本; Nobuyuki Kobayashi; 伸幸小林; Gengo Suzuki; 源吾鈴木; Masashi Yamamuro; 雅司山室; Noburo Taniguchi; 展郎谷口
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2009-05-14
Filing date: 2009-05-14
Publication date: 2010-11-25

Abstract

<P>PROBLEM TO BE SOLVED: To efficiently perform connection search to an XML data group that is structured data. <P>SOLUTION: An inquiry tree is developed to an execution tree by use of node information, an execution node is selected according to the execution tree, connection is performed about a connection key node group of a connection condition of the execution node, a connection ID is imparted in each connected pair and is stored in a search storage means, node information of a search node group that is a unit of the search is obtained from a node storage means, a corresponding node is matched to the search node group based on a range label of node information of the connection key node group, the matched search node group is obtained together with the connection ID of the connection key node group, the search node group and the connection key node group are connected and are stored in the search storage means, and a result is generated from information stored in the search storage means based on designation of a search query. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、情報検索方法及び装置及びプログラムに係り、特に、構造化データである多数のＸＭＬデータを蓄積及び検索するデータベース及びインデクスの分野で、結合検索実行時の処理を効率化し、高速に行う情報検索方法及び装置及びプログラムに関する。 The present invention relates to an information search method, apparatus, and program, and in particular, in the field of databases and indexes for storing and searching a large number of XML data as structured data, the processing at the time of executing a combined search is performed efficiently and at high speed. The present invention relates to an information search method, apparatus, and program.

ＸＭＬとは、マークアップ言語の一つで、ＸＭＬで記述されたデータは構造化され、構造に意味を持ったデータである。図１３は、ＸＭＬデータとその構造を示した図である。ＸＭＬデータを構成する要素を識別するために使われるマークをタグと呼び、タグに挟まれた間に値（テキスト）を記述する。タグは入れ子にすることができる。 XML is one of markup languages, and data described in XML is structured and has meaning in the structure. FIG. 13 is a diagram showing XML data and its structure. A mark used to identify an element constituting XML data is called a tag, and a value (text) is described between the tags. Tags can be nested.

したがって、ＸＭＬデータはモデル的に木構造で表すことができ、各節をノードと呼ぶ。特に、値のノードをルート要素、値（記述内容）をテキストノード、タグ中に記述されたものを属性と呼ぶ。 Therefore, XML data can be represented in a tree structure as a model, and each node is called a node. In particular, a value node is called a root element, a value (description content) is called a text node, and an element described in a tag is called an attribute.

各ノードは、"／"の後ろにルート要素からのタグを"／"でつなげた「パス」で指定する。例えば、本のタイトルを示したい場合、パスを「/book/title」と指定する。 Each node is specified by a “path” in which a tag from the root element is connected by “/” after “/”. For example, if you want to show the title of a book, specify the path as “/ book / title”.

ＸＭＬデータに対して効率的な検索を実現しようとした場合、構造の任意の場所にある記述内容を高速に取り出すことが可能でなければならない。一般的には、ＸＭＬデータの構造と記述内容を記述したインデクス（構造インデクス）を予め構築しておく。構造インデクスの構築は様々な方法が存在するが、多くの場合、ＸＭＬデータの持つ全てのパスとパスに指定される記述内容を関係づけるインデクスが構築される。以上のようなＸＭＬデータの効率的な検索を実現する一手法として、ＸＭＬデータの要約情報に範囲ラベルを付与して保持し、検索時に、問い合わせを木構造に変換して値の部分を削除した問い合わせ木の構造部を生成し、中間問い合わせ木と要約情報を照合して中間要約木の集合を生成し、ＸＭＬデータに対して中間要約木の集合を実体化したデータを含む実体中間木の集合に変換し、ＸＭＬデータに対して、実体中間も木の集合に対応する値の部分を用いてフィルタ処理を行うことで、ＸＭＬデータの効率的な検索を実現する技術がある（例えば、特許文献１参照）。 When an efficient search is to be performed on XML data, it is necessary to be able to retrieve the description contents at an arbitrary place in the structure at high speed. In general, an index (structure index) describing the structure and description contents of XML data is constructed in advance. There are various methods for constructing a structure index. In many cases, an index that associates all the paths of XML data with the description contents specified in the path is constructed. As a technique for realizing efficient search of XML data as described above, a range label is attached to XML data summary information and retained, and at the time of search, the query is converted to a tree structure and the value part is deleted A query tree structure part is generated, a set of intermediate summary trees is generated by collating the summary information with the intermediate query tree, and a set of entity intermediate trees including data obtained by materializing the set of intermediate summary trees with respect to XML data There is a technique that realizes an efficient search of XML data by converting the XML data into a filter process using the portion of the value corresponding to the set of trees in the entity intermediate (see, for example, Patent Documents). 1).

特開２００６−２２８１５５号公報JP 2006-228155 A

検索の一パターンとして、２種類（同一種類でも可）のＸＭＬデータ群の結合がある。例えば、図１４に示されるような「クラス情報ＸＭＬ」「受講者ＸＭＬ」の２種類のＸＭＬ群がある場合、受講者のＩＤを示す「クラス情報ＸＭＬ」の
"/course/students/student_id/text()"("text()"はテキストノードを示す)
と「受講者ＸＭＬ」の As a search pattern, there is a combination of two types (or the same type) of XML data groups. For example, when there are two types of XML groups “class information XML” and “student XML” as shown in FIG. 14, the “class information XML” indicating the ID of the student is displayed.
"/ course / students / student_id / text ()"("text()" indicates a text node)
And “Student XML”

をキーに２つの情報を結合させ、図１５に示すような受講者のプロファイル情報を加えたクラス情報を取得したといった検索が考えられる。

A search may be considered in which two pieces of information are combined using the key and the class information including the student's profile information as shown in FIG. 15 is acquired.

この例の結合の検索クエリは、ＸＭＬ問い合わせ言語１つであるXQueryでは、図１６のように記述できる。 In this example, the combined search query can be described as shown in FIG. 16 in XQuery, which is one XML query language.

単純な検索処理手法としては、各ＸＭＬデータのルート要素から順次子ノードを辿っていき、検索条件に合ったノード群を探し出す方法があるが、大きなデータや多数のデータ群の場合や、パスが深いところを指している場合などには、走査に時間がかってしまう。そこで、前述の特許文献１に示すような、処理に必要なノード群だけを直接取り出し、それらの親子及び先祖子孫関係を判定することで、効率的な検索が行われる。 As a simple search processing method, there is a method in which child nodes are sequentially traced from the root element of each XML data to find a node group that matches the search condition. However, in the case of large data or a large number of data groups, the path is When it points to a deep place, scanning takes time. Therefore, as shown in the above-mentioned Patent Document 1, only a node group necessary for processing is directly taken out, and an efficient search is performed by determining a parent-child and ancestor-descendant relationship.

しかし、当該手法は、結合検索にまでは言及されておらず、効率的な手法は提案されていない。したがって、結合検索となる双方のＸＭＬデータ群を特許文献１に示すような効率的な検索を行った後、結合処理を単純な手法で一つ一つ突合せていく方法が考えられるが、図１６の例のように結合条件
（"$a/students/student_id/text()=$b/@id"）
以外に条件がない場合などは、中間結果が多くなりすぎて、効率化にはつながらないケースがある。 However, this method is not mentioned even in the join search, and an efficient method has not been proposed. Therefore, after performing efficient search as shown in Patent Document 1 for both XML data groups to be combined search, a method of matching the combination processing one by one with a simple method can be considered. Join condition ("$ a / students / student_id / text () = $ b / @ id")
If there are no other conditions, there are cases where the number of intermediate results is too large to improve efficiency.

また、結合処理については、関係データベース（ＲＤＢ：Relational Data Base）においては効率的な手法が実現されているが、ＸＭＬデータの結合は以下の点で異なるため、そのままでは適用できない。 In addition, regarding the joining process, an efficient technique is realized in a relational database (RDB), but the joining of XML data differs in the following points, and thus cannot be applied as it is.

ＲＤＢの場合、結合の単位（タプル）と結合条件となるキーは１対１関係にある（図１７参照）が、ＸＭＬの場合は１対Ｎ関係となる。例えば、上記の例の場合、「クラス情報ＸＭＬ」の結合の単位は、
"/course"
以下の部分木、つまり文書全体であるが、結合条件となるキー
"/course/students/students_id"
は文書内に複数個ある。 In the case of RDB, the unit (tuple) of the bond and the key serving as the join condition have a one-to-one relationship (see FIG. 17), but in the case of XML, the key has a one-to-N relationship. For example, in the case of the above example, the unit of combination of “class information XML” is
"/ course"
The following subtree, that is, the entire document, but the key that becomes the join condition
"/ course / students / students_id"
There are several in the document.

つまり、Ｎ個のキー同士のどれかのペアがマッチすれば「真」という結合が必要になる。 In other words, a combination of “true” is required if any pair of N keys matches.

本発明は、上記の点に鑑みなされたもので、効率的に結合検索を行うことが可能な情報検索方法及び装置及びプログラムを提供することを目的とする。 The present invention has been made in view of the above points, and an object of the present invention is to provide an information search method, apparatus, and program capable of efficiently performing a combined search.

図１は、本発明の原理を説明するための図である。 FIG. 1 is a diagram for explaining the principle of the present invention.

本発明（請求項１）は、構造化データであるＸＭＬデータ群に対して、蓄積及び検索する装置における情報検索方法であって、
入力された検索クエリで指定された検索パス及び検索キーワードに基づいて、ＸＭＬの構造をパス単位で格納したパスインデクス記憶手段と、ＸＭＬの各ノード間の関係を保持するための、親ノードが子ノードを含むノードＩＤで表される範囲ラベルをノードＩＤとするノード情報を格納したノード記憶手段を参照して、問い合わせ木を生成し（ステップ１）、該問い合わせ木の処理対象となるパスに対し、ノード群を取得する処理、それらの先祖子孫関係の判定処理、ノード群の結合処理、最終的な結合結果を生成する結果生成処理の各処理種別のノードを含む実行木を生成する（ステップ２）実行木生成ステップと、
実行木の各実行ノードにおいて、子ノードがある場合は該子ノードの結果を取得し、当該ノードの処理種別に応じた処理を行い（ステップ３）、最終的な結果を出力する（ステップ４）検索ステップと、を行う。 The present invention (Claim 1) is an information search method in an apparatus for storing and searching XML data groups that are structured data,
Based on the search path and the search keyword specified by the input search query, the path index storage means for storing the XML structure in units of paths and the parent node for maintaining the relationship between each node of the XML are children. A query tree is generated by referring to node storage means storing node information whose node ID is a range label represented by a node ID including a node (step 1). Then, an execution tree including nodes of each processing type is generated (step 2), a process for acquiring a node group, a process for determining their ancestor-descendant relationship, a node group combining process, and a result generating process for generating a final combined result (step 2). ) Execution tree generation step;
If there is a child node in each execution node of the execution tree, the result of the child node is acquired, the process according to the processing type of the node is performed (step 3), and the final result is output (step 4). And a search step.

また、本発明（請求項２）は、検索ステップにおいて、
実行木の実行ノードが結合条件を有しているとき、該結合条件で指定されているノード群（結合キーノード群）について結合を行い、結合したペア毎に結合ＩＤを付与し、検索記憶手段に格納し、
ノード記憶手段から、検索の単位となるノード群（検索ノード群）のノード情報を取得し、該検索ノード群に対し、結合キーノード群のノード情報の範囲ラベルに基づいて該当するものをマッチングし、マッチする検索ノード群を当該結合キーノード群の結合ＩＤと共に取得し、該検索ノード群と該結合キーノード群を結合し、検索記憶手段に格納し、
検索クエリの指定に基づいて検索記憶手段に格納されている情報から結果を生成する。 Further, the present invention (Claim 2), in the search step,
When an execution node of an execution tree has a join condition, the nodes specified in the join condition (join key node group) are joined, and a join ID is assigned to each joined pair and stored in the search storage means. And
Obtain node information of a node group (search node group) that is a unit of search from the node storage means, and match the corresponding one based on the range label of the node information of the combined key node group to the search node group, A matching search node group is acquired together with a combination ID of the combination key node group, the search node group and the combination key node group are combined, and stored in a search storage unit;
Based on the specification of the search query, a result is generated from the information stored in the search storage means.

図２は、本発明の原理構成図である。 FIG. 2 is a principle configuration diagram of the present invention.

本発明（請求項３）は、構造化データであるＸＭＬデータ群に対して、蓄積及び検索する装置における情報検索装置であって、
ＸＭＬの構造をパス単位で格納したパスインデクス記憶手段１と、
ＸＭＬの各ノード間の関係を保持するための、親ノードが子ノードを含むノードＩＤで表される範囲ラベルをノードＩＤとするノード情報を格納したノード記憶手段２と、
入力された検索クエリで指定された検索パス及び検索キーワードに基づいて、パスインデクス記憶手段とノード記憶手段を参照して、問い合わせ木を生成し、該問い合わせ木の処理対象となるパスに対し、ノード群を取得する処理、それらの先祖子孫関係の判定処理、ノード群の結合処理、最終的な結合結果を生成する結果生成処理の各処理種別のノードを含む実行木を生成する実行木生成手段４１と、
実行木の各実行ノードにおいて、子ノードがある場合は該子ノードの結果を取得し、当該ノードの処理種別に応じた処理を行い、最終的な結果を出力する検索実行手段４２と、を有する。 The present invention (Claim 3) is an information search device in an apparatus for storing and searching XML data groups that are structured data.
Path index storage means 1 storing the XML structure in units of paths;
A node storage unit 2 for storing node information in which a node ID is a range label represented by a node ID in which a parent node includes a child node, in order to maintain a relationship between XML nodes;
Based on the search path and the search keyword specified by the input search query, a query tree is generated with reference to the path index storage means and the node storage means, and a node is obtained for the path to be processed by the query tree. Execution tree generation means 41 for generating an execution tree including nodes of each processing type of a process for acquiring a group, a determination process for their ancestor-descendant relationship, a node group combination process, and a result generation process for generating a final combination result When,
Each execution node of the execution tree has a search execution means 42 that acquires the result of the child node if there is a child node, performs processing according to the processing type of the node, and outputs the final result .

また、本発明（請求項４）の検索実行手段４２は、
実行木の実行ノードが結合条件を有しているとき、該結合条件で指定されているノード群（結合キーノード群）について結合を行い、結合したペア毎に結合ＩＤを付与し、検索記憶手段に格納する手段と、
ノード記憶手段２から、検索の単位となるノード群（検索ノード群）のノード情報を取得し、該検索ノード群に対し、結合キーノード群のノード情報の前記範囲ラベルに基づいて該当するものをマッチングし、マッチする検索ノード群を当該結合キーノード群の結合ＩＤと共に取得し、該検索ノード群と該結合キーノード群を結合し、検索記憶手段に格納する手段と、
検索クエリの指定に基づいて検索記憶手段に格納されている情報から結果を生成する手段と、を含む。 The search execution means 42 of the present invention (Claim 4)
When an execution node of an execution tree has a join condition, the nodes specified in the join condition (join key node group) are joined, and a join ID is assigned to each joined pair and stored in the search storage means. Means to
Node information of a node group (search node group) that is a unit of search is acquired from the node storage means 2, and the corresponding one is matched to the search node group based on the range label of the node information of the join key node group Obtaining a matching search node group together with a combination ID of the combination key node group, combining the search node group and the combination key node group, and storing in the search storage unit;
Means for generating a result from information stored in the search storage means based on the designation of the search query.

本発明（請求項５）は、請求項３または４記載の情報検索装置を構成する各手段としてコンピュータを機能させるための情報検索プログラムである。 The present invention (Claim 5) is an information search program for causing a computer to function as each means constituting the information search apparatus according to claim 3 or 4.

上記のように本発明によれば、ＸＭＬの階層構造を順次辿る中間処理を行うことなく、必要最小限のノード群のみを直接取り扱うことで、効率的に結合検索を行うことができる。 As described above, according to the present invention, it is possible to efficiently perform a combined search by directly handling only the minimum necessary node group without performing an intermediate process for sequentially tracing the hierarchical structure of XML.

本発明の原理を説明するための図である。It is a figure for demonstrating the principle of this invention. 本発明の原理構成図である。It is a principle block diagram of this invention. 本発明の一実施の形態における情報検索装置の構成図である。It is a block diagram of the information search device in one embodiment of the present invention. 本発明の一実施の形態におけるＸＭＬの結合例である。It is the example of a coupling | bonding of XML in one embodiment of this invention. 本発明の一実施の形態におけるXQueryの例である。It is an example of XQuery in one embodiment of this invention. 本発明の一実施の形態におけるパス情報とノード管理のノード情報の格納例である。It is an example of storing path information and node information of node management in an embodiment of the present invention. 本発明の一実施の形態におけるノード管理部の範囲ラベルの例である。It is an example of the range label of the node management part in one embodiment of this invention. 本発明の一実施の形態における検索処理のフローチャートである。It is a flowchart of the search process in one embodiment of this invention. 本発明の一実施の形態におけるノード取得処理のフローチャートである。It is a flowchart of the node acquisition process in one embodiment of this invention. 本発明の一実施の形態における検索部で生成される問い合わせ木の例である。It is an example of the inquiry tree produced | generated by the search part in one embodiment of this invention. 本発明の一実施の形態における検索部で生成される実行木の例である。It is an example of the execution tree produced | generated by the search part in one embodiment of this invention. 本発明の一実施の形態における実行例での中間結果である。It is an intermediate result in the example of execution in one embodiment of the present invention. ＸＭＬデータとその構造の例である。It is an example of XML data and its structure. ２つのＸＭＬデータの例である。It is an example of two XML data. ＸＭＬデータの結合例である。This is an example of combining XML data. 結合のXQuery例である。This is an XQuery example of join. ＲＤＢでの結合例である。This is an example of bonding in RDB.

以下、図面と共に本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

本発明は、ＸＭＬの階層構造を順次辿ることなく、必要最低限のノード群のみを直接取り扱うことで、効率的に結合検索を行う手法（例えば、特許文献１）をベースに、ＲＤＢで実現されている結合処理を組み合わせることで、ＸＭＬデータ群に対する効率的な結合処理を実現するものであある。 The present invention is realized in RDB based on a technique (for example, Patent Document 1) that efficiently performs a join search by directly handling only the minimum necessary node group without sequentially following the hierarchical structure of XML. By combining the combination processing, the efficient combination processing for the XML data group is realized.

図３は、本発明の一実施の形態における情報検索装置の構成を示す。 FIG. 3 shows the configuration of the information retrieval apparatus in one embodiment of the present invention.

同図に示す情報検索装置は、ＸＭＬの構造をパス単位で保持するパスインデクス部１、ＸＭＬの各ノード間のＸＭＬ構造における関係を把握するノード管理部２、各ノードの値のインデクスを構築及び管理する値インデクス部３と、（結合）検索を実行する検索部４とを有する。パスインデクス部１とノード管理部２は、前述の背景技術の項における構造インデクスにあたる。 The information retrieval apparatus shown in FIG. 1 includes a path index unit 1 that holds the XML structure in units of paths, a node management unit 2 that grasps the relationship in the XML structure between each node of the XML, and constructs an index of the value of each node. It has a value index unit 3 to be managed and a search unit 4 that executes a (join) search. The path index unit 1 and the node management unit 2 correspond to the structure index in the above-mentioned background art section.

パスインデクス部１は、ＸＭＬデータの持つ全てのパスとパスに指定されるノードを関係付けるインデクスを構築し、内部の記憶手段に格納する。 The path index unit 1 constructs an index that associates all the paths of XML data with nodes specified in the path, and stores the index in an internal storage unit.

ノード管理部２は、各ノード間のＸＭＬデータの構造上の関係を内部の記憶手段に保持する。一般的には、構造上の関係を管理するために、ＸＭＬデータの各ノードにＩＤを付与し、ノード間の関係（先祖子孫関係など）を範囲ラベルとして保持する。 The node management unit 2 holds the structural relationship of XML data between nodes in an internal storage unit. In general, in order to manage the structural relationship, an ID is assigned to each node of the XML data, and the relationship between nodes (such as an ancestor-descendant relationship) is held as a range label.

値インデクス部３は、テキストノード及び属性の値に対するインデクス（B−Tree、ハッシュ等）を構築し、内部の記憶手段に保持する。検索するだけであれば不要のものであるが、高速検索を実現するためには必要となる。 The value index unit 3 constructs an index (B-Tree, hash, etc.) for the text node and attribute value, and holds it in the internal storage means. This is unnecessary if only searching, but is necessary to achieve high-speed searching.

検索部４の処理は以下の手順で行う。以下、用語として、検索の単位となるノード（群）を「検索ノード（群）」、結合条件となるノード（群）を「結合キーノード（群）」と呼ぶこととする。 The processing of the search unit 4 is performed according to the following procedure. Hereinafter, as a term, a node (group) serving as a search unit is referred to as a “search node (group)”, and a node (group) serving as a join condition is referred to as a “join key node (group)”.

前述の図１６のクエリの例の場合、"$a"、"$b"にセットされた全データの"/course"及び"/profile"がそれぞれの検索ノード群であり、where句で条件とされている
"/course/students/student_id/text()"
及び、 In the example of the query in FIG. 16 described above, “/ course” and “/ profile” of all the data set in “$ a” and “$ b” are the respective search nodes, and the conditions in the where clause Has been
"/ course / students / student_id / text ()"
as well as,

が結合キーノード群となる。

Is a group of combined key nodes.

（１）双方の結合キーノード群について、そのノード群の単位で結合処理を行う（以降、当該処理を「キー結合」処理と呼ぶ）。 (1) For both combined key node groups, the combined process is performed in units of the node groups (hereinafter, this process is referred to as “key combined” process).

これは、ＲＤＢと同等の処理方法で実現できる。その際、マッチしたペアを識別できる結合ＩＤ（双方もしくは片方のノードＩＤでも可）を付与し、当該検索部４内のメモリ（図示せず）に保持する。 This can be realized by a processing method equivalent to RDB. At this time, a binding ID (both or one of the node IDs) that can identify the matched pair is assigned and held in a memory (not shown) in the search unit 4.

（２）双方の検索ノード群を取得する。（その際、（１）で得た結合キーノード群を含むXMLデータのみに限定して取得するとより効率的である）。 (2) Acquire both search node groups. (At that time, it is more efficient to obtain only the XML data including the combination key node group obtained in (1)).

（３）XMLデータ群ごとに（２）で得た検索ノード群に対し、（１）で得た結合キーノード群から、（ノードIDによる親子及び先祖子孫関係判定を用い）該当するものをマッチングし、マッチする検索ノード群を、結合ＩＤ（群）と共に取得する。 (3) For each XML data group, the search node group obtained in (2) is matched with the corresponding one from the join key node group obtained in (1) (using parent-child and ancestor-descendant relationship determination by node ID). , A matching search node group is acquired together with a binding ID (group).

（４）双方の検索ノード群の対応を結合ＩＤ（群）により判定し、結合を行う。但し、この場合、結果の単位（まとめ方）はクエリ指定に依存する。 (4) The correspondence between both search node groups is determined by the combination ID (group), and the connection is performed. However, in this case, the result unit (how to summarize) depends on the query specification.

（５）検索ノードを基軸とし、結果に必要な（return句に指定されている）ノードに取得し、結果を生成し、返却する。 (5) Using the search node as a base, obtain the node necessary for the result (specified in the return clause), generate the result, and return it.

すなわち、結合条件について結合キーノードのレベルでRDBの結合処理と同等の処理を行うことで、候補を絞り込むことができ、更に結合ＩＤを保存しておくことで、最終的な検索ノード同士の結合処理も効率化を図ることができる。 In other words, candidates can be narrowed down by performing the same process as the RDB join process at the join key node level for the join condition, and by further saving the join ID, the final join process between search nodes Can also improve efficiency.

上記の（４）について補足すると、図１６のクエリの場合、"$a"は"for"文で指定されているため、検索ノード"$a"の単位で結果が纏められる（図１５）。一方、図４のように"$b"の単位で結果を得ることも可能で、その場合のXQueryは、図５のように"$b"も"for"文で指定することで、内側の"for"である検索ノード"$b"の単位で結果が纏められる。 Supplementing the above (4), in the case of the query of FIG. 16, since “$ a” is specified by the “for” statement, the results are collected in units of the search node “$ a” (FIG. 15). On the other hand, as shown in Fig. 4, it is also possible to obtain the result in units of "$ b". In that case, the XQuery specifies the "$ b" in the "for" statement as shown in Fig. Results are collected in units of search node "$ b" which is "for".

また、結合条件が複数ある場合は、それぞれの結合条件に対し、結合ＩＤ(群)を保持し、その組み合わせの判定を（４）において行うことで対応できる。 Further, when there are a plurality of binding conditions, it can be dealt with by holding a binding ID (group) for each binding condition and determining the combination in (4).

また、図５のクエリのように、他の絞り込み条件（"$a/meta/grade='初級クラス'"）がある場合は、（１）〜（３）及びその前後の適切なタイミングで絞り込みを行うことで対応できる（適切な順序は、絞り込み条件と対象データ群により変化する）。 In addition, if there are other narrowing conditions ("$ a / meta / grade = 'Beginning class'") as shown in the query in FIG. 5, narrowing down at appropriate timings before and after (1) to (3). (The appropriate order varies depending on the narrowing-down condition and the target data group).

以下に、実現方法の一形態として、特許文献１の手法をベースした例を説明する。 Hereinafter, an example based on the technique of Patent Document 1 will be described as one form of the realizing method.

図３に示す各構成要素について詳細に説明する。 Each component shown in FIG. 3 will be described in detail.

パスインデクス部１は、パス情報を内部の記憶手段で管理するモジュールで、パスとパスに対応する各データのノードへの関係の保持も含まれる。パス情報の格納例を図６（Ａ）に示す。パスとそのパスに対するＩＤを保持する。 The path index unit 1 is a module that manages path information by an internal storage unit, and includes the maintenance of the relationship between the path and each data corresponding to the path to the node. A storage example of path information is shown in FIG. Holds the path and the ID for that path.

ノード管理部２は、各ノードの情報を内部の記憶手段に格納し管理する。ノードの情報とは、図６（Ｂ）に示すように、パスインデクス部１のパスＩＤに対応するパスＩＤ、データ毎のＩＤ（文書ＩＤ）、ノードＩＤ（範囲ラベル）、タグ、値等から構成される。範囲ラベルは、図７に示すようにノード毎に２値で表され、親ノードが子ノードを含むように付与する。これにより、任意のノード間で親子及び先祖子孫関係を判定可能となる。この例では、文書ＩＤ１０１と２０１の例のみを示しているが、今回の場合は、各ＸＭＬ群で同一の構造であるため、他のＸＭＬに対しても同様の範囲ラベルが付与される（一般的には、同じパスに対しても同一の範囲ラベルとは限らない）。 The node management unit 2 stores and manages information of each node in an internal storage unit. As shown in FIG. 6B, the node information includes a path ID corresponding to the path ID of the path index unit 1, an ID (document ID) for each data, a node ID (range label), a tag, a value, and the like. Composed. The range label is represented by a binary value for each node as shown in FIG. 7, and is given so that the parent node includes a child node. This makes it possible to determine the parent-child and ancestor-descendant relationship between arbitrary nodes. In this example, only the document IDs 101 and 201 are shown. However, in this case, since the XML groups have the same structure, the same range labels are assigned to other XMLs (general). (In other words, the same range label is not always the same for the same path).

値インデクス部３は、テキストノード及び属性の値に対するインデクス（B-Tree、ハッシュ等）を構築し、内部の記憶手段に格納・管理する。少なくともパスとその値の制限を条件に、検索が可能な形とする。例えば、パスと値の両方をキーに全てを対象としたインデクスを１つ作成してもよいし、パス毎に別々のインデクス（値がキーとなる）を作成してもよい。 The value index unit 3 constructs an index (B-Tree, hash, etc.) for the text node and attribute value, and stores and manages it in the internal storage means. At least the path and its value are limited, and the search is possible. For example, one index that covers all paths and values as keys may be created, or a separate index (value is a key) may be created for each path.

検索部４は、検索クエリを受け取り、指定された検索パス及び検索キーワードにマッチするノードを、パスインデクス部１、ノードインデクス部２、値インデクス部３の各記憶手段から特定し、必要であればデータベース等のデータ管理部（図示せず）から返却結果を取得し、返却する。 The search unit 4 receives the search query, specifies a node that matches the specified search path and search keyword from the storage means of the path index unit 1, the node index unit 2, and the value index unit 3, and if necessary, A return result is acquired from a data management unit (not shown) such as a database and returned.

検索部４における結合検索の処理を以下に示す。 The combined search process in the search unit 4 is shown below.

図８は、本発明の一実施の形態における結合検索の処理のフローチャートである。 FIG. 8 is a flowchart of join search processing in one embodiment of the present invention.

ステップ１１０）検索部４は取得した検索クエリから、問い合わせ木を生成する。 Step 110) The search unit 4 generates a query tree from the acquired search query.

ステップ１２０）次に、ノード管理部２のノード情報を用いて問い合わせ木に対する実行木を作成する。 Step 120) Next, using the node information of the node manager 2, an execution tree for the query tree is created.

ステップ１３０）実行木に従って、ノード種別「ノード取得」「関係判定」「結合」「結果生成」等の実行ノードを選択する。 Step 130) According to the execution tree, an execution node such as node type “node acquisition”, “relationship determination”, “join”, “result generation”, etc. is selected.

ステップ１４０）実行ノード種別が「ノード取得」である場合は、ステップ１５０に移行し、「関係判定」である場合はステップ１６０に移行し、「結合」の場合はステップ１０７に移行し、「結果生成」の場合はステップ１８０に移行する。 Step 140) If the execution node type is “node acquisition”, the process proceeds to step 150; if it is “relationship determination”, the process proceeds to step 160; if it is “join”, the process proceeds to step 107; In the case of “generation”, the process proceeds to step 180.

ステップ１５０）実行ノード種別が「ノード取得」である場合は、パスインデクス部１、ノード管理部２からノード情報を取得し、ステップ１９０に移行する。 Step 150) When the execution node type is “node acquisition”, node information is acquired from the path index unit 1 and the node management unit 2, and the process proceeds to step 190.

ステップ１６０）実行ノード種別が「関係判定」である場合は、ノード管理部２からノード情報を取得し、範囲ラベルに従って親子／先祖子孫関係を判定し（文書ＩＤの突き合わせ、範囲ラベルによる構造ジョイン）、ステップ１９０に移行する。 Step 160) When the execution node type is “relationship determination”, the node information is acquired from the node management unit 2, and the parent-child / ancestor-descendant relationship is determined according to the range label (document ID matching, structure join by range label). The process proceeds to step 190.

ステップ１７０）実行ノード種別が「結合」である場合は、ノード管理部２からノード情報を取得し、値インデクスも必要に応じ利用し結合を行うと共に、結合ＩＤを付与する。また、同じ結合ＩＤ同士を結合し、ステップ１９０に移行する。 Step 170) When the execution node type is “join”, the node information is acquired from the node management unit 2, the value index is also used as necessary, and the join ID is given. Further, the same combination IDs are combined and the process proceeds to step 190.

ステップ１８０）実行ノード種別が「結果生成」である場合は、返却するための結果を生成し、ステップ１９０に移行する。 Step 180) If the execution node type is “result generation”, a result to be returned is generated, and the process proceeds to step 190.

ステップ１９０）全ての実行ノードについて、ステップ１３０以降の処理を繰り返す。 Step 190) The processing after Step 130 is repeated for all execution nodes.

次に、上記のステップ１５０について詳細に説明する
図９は、本発明の一実施の形態におけるノード取得処理のフローチャートである。 Next, step 150 will be described in detail. FIG. 9 is a flowchart of the node acquisition process according to the embodiment of the present invention.

ステップ１５１）実行ノード種別が「ノード取得」である場合、結合条件を持つかを判定し、持つ場合には、ステップ１５２に移行し、持たない場合はステップ１５６に移行する。 Step 151) When the execution node type is “node acquisition”, it is determined whether or not there is a join condition. If so, the process proceeds to step 152. If not, the process proceeds to step 156.

ステップ１５２）キー結合が実行済みである場合はステップ１５５に移行し、まだ実行していない場合はステップ１５３に移行する。 Step 152) If the key combination has been executed, the process goes to Step 155, and if not yet executed, the process goes to Step 153.

ステップ１５３）パスインデクス部１及びノード管理部２から該当する双方の結合キーノード情報を取得する。 Step 153) The corresponding combined key node information is acquired from the path index unit 1 and the node management unit 2.

ステップ１５４）ステップ１５３で取得した結合キーノード情報に基づいて値インデクス部３に問い合わせ、結合キーノード情報に対応するキーＩＤを取得し、キー結合処理を行う。 Step 154) The value index unit 3 is inquired based on the combined key node information acquired in step 153, a key ID corresponding to the combined key node information is acquired, and key combination processing is performed.

ステップ１５５）キー結合結果から結合ＩＤと共に、ノード情報を取得し、当該処理を終了する。 Step 155) The node information is acquired together with the combination ID from the key combination result, and the process is terminated.

ステップ１５６）結合条件を持たない場合は、パスインデクス部１、ノード管理部２から該当するノード情報（群）を取得する。 Step 156) If there is no join condition, the corresponding node information (group) is acquired from the path index unit 1 and the node management unit 2.

ステップ１５７）値インデクス部１に問い合わせ、値の制約を満たすものだけに絞り込み、当該処理を終了する。 Step 157) The value index unit 1 is inquired, only those satisfying the value constraints are narrowed down, and the process is terminated.

以下に、上記の図８、図９の処理の具体的な例として図１４のデータ群が図７のラベル付けがされ検索部４のメモリ（図示せず）に格納されている場合に、図５のクエリを実行する手順を示す。 In the following, as a specific example of the processing of FIGS. 8 and 9, when the data group of FIG. 14 is labeled in FIG. 7 and stored in the memory (not shown) of the search unit 4, FIG. 5 shows a procedure for executing the query 5.

検索部４は、問い合わせ木を生成する。ここで作成する問い合わせ木は、図１０のようになる。同図において二重線の部分（「course」「profile」）が処理対象となるパスを表しており、破線の矢印が指されているパスには値に制約が付いていることを示している。 The search unit 4 generates a query tree. The query tree created here is as shown in FIG. In the figure, the double line part ("course" "profile") represents the path to be processed, and the path indicated by the dashed arrow indicates that the value is constrained .

次に、検索部４は、図１１に示すような実行木を作成する。問い合わせ木の処理対象となるパスに対し、ノード群を取得する処理（ノード取得）を葉とし、それらの先祖子孫関係の判定を節にする。ノード取得には、値の制約が付与される場合があり、結合条件も制約の一種として双方の葉に共通的に付与する。実行木の根には、最終的な結合結果を生成する結果生成処理が配置され、その直下に、クエリ例では、"$a"、"$b"にセットされた２つの検索ノード群の結合処理とする。 Next, the search unit 4 creates an execution tree as shown in FIG. For a path to be processed of the query tree, a process for acquiring a node group (node acquisition) is set as a leaf, and the determination of the ancestor-descendant relationship is set as a clause. In node acquisition, a value constraint may be given, and a joint condition is also commonly given to both leaves as a kind of constraint. At the root of the execution tree, there is a result generation process that generates the final combined result. In the query example, there are two search node groups set to "$ a" and "$ b" in the query example. To do.

木の形は実行順序を表している。複数の選択肢があるため、何らかの実行計画の最適化処理を行うことが望ましいが、本発明では対象外であるのでその説明を省略する。実行ノードを選択し、実行木の葉から根に向かって処理を進めていく。 The tree shape represents the execution order. Since there are a plurality of choices, it is desirable to perform some kind of execution plan optimization process, but since it is out of scope in the present invention, its description is omitted. Select an execution node and proceed from the leaves of the execution tree toward the root.

ノードＩＤを［文書ＩＤ，(範囲ラベル)］の形式で表すと、以下の手順で処理される。 When the node ID is expressed in the format of [document ID, (range label)], it is processed in the following procedure.

（１）Ｐ５："/course/meta/grade/text()制約：＝'初級クラス'"のノード取得
結合条件は持っていないため、パスインデクス部１、ノード管理部２からパスに該当するノード情報を取得し、値インデクス部３により制約を満たすノードのみに絞り込む。"初級クラス"という値を該当パスに持つものは、文書１０１、及び１０２のノードとなるので、以下が得られる。 (1) P5: "/ course / meta / grade / text () Constraint: =" Beginner class "node acquisition Since there is no join condition, the node corresponding to the path from the path index part 1 and the node management part 2 Information is acquired, and the value index unit 3 narrows down to only nodes that satisfy the constraint. Those having the value of “beginning class” in the corresponding path become nodes of the documents 101 and 102, so that the following is obtained.

'初級クラス'[101,(6,7)]，'初級クラス'[10２,(6,7)]
（２）Ｐ６："/course"のノード取得
結合条件は持っていないため、パスインデクス部１、ノード管理部２から、パスに該当するノード情報を取得する（制約を持たないため、値インデクス部３により絞り込んでも同じ）全ての文書に該当パスを持つノードが存在するため、以下が得られる。 'Beginner class' [101, (6,7)], 'Beginner class' [102, (6,7)]
(2) Node acquisition of P6: “/ course” Since there is no join condition, node information corresponding to the path is acquired from the path index unit 1 and the node management unit 2 (because there is no constraint, the value index unit Since the same path exists in all documents), the following is obtained.

[101,(1,36)]，[102,(1,36)]，[103,(1,36)]*
ただし、上記の[103,(1,36)]*のノードは、（１）の結果から文書ＩＤを利用し、この段階で取り除くことも可能である。なお、以降も、事前の処理結果の文書ＩＤを利用し、同様に取り除くことが可能なノードについては、「＊」を付与する。 [101, (1,36)], [102, (1,36)], [103, (1,36)] *
However, the node [103, (1, 36)] * can be removed at this stage using the document ID from the result of (1). In the following, “*” is given to nodes that can be similarly removed using the document ID of the previous processing result.

（３）Ｐ４：Ｐ５とＰ６の先祖子孫関係判定
この例では、先祖側のＰ６のノードを残す形で、Ｐ６が先祖でＰ５が子孫の関係のノードを取得する。 (3) P4: Determination of ancestor-descendant relationship between P5 and P6 In this example, P6 is an ancestor and P5 is a descendant node in the form of leaving the ancestor P6 node.

[101,(1,36)](←子孫101,[6,7])，[102,(1,36)](←子孫102,[6,7])
（４）Ｐ７："/course/students/student_id/text()制約：キー結合"のノード取得
結合条件を持っており、まだキー結合が実行されていないため、まずキー結合を実行する。パスインデクス部１、ノード管理部２からＰ７とＰ９双方のノード情報を取得し、値インデクス部３も必要に応じ利用し、結合を行うと共に、結合ＩＤも付与する。キー結合結果として、以下が得られる。 [101, (1, 36)] (← offspring 101, [6, 7]), [102, (1, 36)] (← offspring 102, [6, 7])
(4) P7: “/ course / students / student_id / text () constraint: key join” node acquisition Since it has a join condition and the key join has not yet been executed, the key join is executed first. The node information of both P7 and P9 is acquired from the path index unit 1 and the node management unit 2, and the value index unit 3 is also used as necessary to perform the combination and also give the combination ID. As a result of key combination, the following is obtained.

本例では、結合ＩＤとしてＰ７の［文書ＩＤ，範囲ラベルの第１値］を利用することとしている（こうすることで、後述の（９）．Ｐ２の結合処理時に効率的な処理が期待できる）。

In this example, [document ID, first value of range label] of P7 is used as the combination ID (this makes it possible to expect an efficient process at the time of (9) .P2 combination process described later. ).

次に、得られたキー結合結果から、Ｐ７のノードＩＤと結合ＩＤを取り出し、ノード取得結果が得られる。 Next, the node ID and the combination ID of P7 are extracted from the obtained key combination result, and the node acquisition result is obtained.

（５）Ｐ３：Ｐ４とＰ７の先祖子孫関係判定
先祖側のＰ４のノードを残す形で、Ｐ４が先祖でＰ７が子孫の関係のノードを取得する。その際、結合ＩＤも検索部４の内部のメモリ（図示せず）に保持しておく。

(5) P3: Determination of ancestor-descendant relationship between P4 and P7 In the form of leaving the ancestor P4 node, P4 is an ancestor and P7 is a descendant. At that time, the combination ID is also stored in a memory (not shown) inside the search unit 4.

（６）Ｐ９："/profile/@id制約：キー結合"のノード取得
結合条件を持っており、キー結合結果も（１）で得られているため、キー結合結果から、Ｐ９のノードＩＤと結合ＩＤを取り出し、ノード取得結果を得る。

(6) Node acquisition of P9: “/ profile / @ id constraint: key join” Since it has a join condition and the key join result is also obtained in (1), the node ID of P9 is obtained from the key join result. The binding ID is taken out and the node acquisition result is obtained.

（７）Ｐ１０："/profile"のノード取得
結合条件は持っていないため、パスインデクス部１、ノード管理部２から、パスに該当するノード情報を取得する（制約を持たないため、値インデクス部３により絞り込んでも同じ）全ての文書に該当パスを持つノードが存在するため、以下が得られる。

(7) P10: Node acquisition of “/ profile” Since there is no join condition, node information corresponding to the path is acquired from the path index unit 1 and the node management unit 2 (there is no constraint, so the value index unit Since the same path exists in all documents), the following is obtained.

[201,(1,2)]，[202,(1,12)]，[203,(1,12)]，[204,(1,12)]*，[205,(1,12)]*，[206,(1,12)]*，
[207,(,12)]
（８）Ｐ８：Ｐ９とＰ１０の先祖子孫関係判定
先祖側のＰ１０のノードを残す形で、パスインデクス部１、ノード管理部２からＰ１０が先祖でＰ９が子孫の関係のノードを取得する。その際、結合ＩＤも保持しておく。 [201, (1,2)], [202, (1,12)], [203, (1,12)], [204, (1,12)] *, [205, (1,12)] *, [206, (1,12)] *,
[207, (, 12)]
(8) P8: Determination of ancestor-descendant relationship between P9 and P10 In the form of leaving the ancestor-side P10 node, the node of the path index unit 1 and node management unit 2 acquires a node in which P10 is an ancestor and P9 is a descendant. At that time, the binding ID is also held.

（９）Ｐ２：Ｐ３とＰ８の結合
同じ結合ＩＤを含むもの同士で結合を行うが、本例では、"$b"にあたるＰ８側の単位で結合する。

(9) P2: Binding of P3 and P8 Bindings that include the same binding ID are performed with each other, but in this example, they are combined in units of the P8 side corresponding to “$ b”.

参考までに、個々までの処理の中間結果を記したものを図１２に示す。

For reference, FIG. 12 shows intermediate results of individual processes.

（１０）Ｐ１：結果生成
最後のＰ１でreturn句に従って返却結果を作成すると、図４のようになる。返却対象のノードの取得も、上記手順と類似した方法で行うことができるが、本発明では対象外であるため、その説明を省略する。 (10) P1: Result generation When a return result is created according to the return phrase in the last P1, it is as shown in FIG. Acquisition of the node to be returned can also be performed by a method similar to the above procedure, but since it is out of scope in the present invention, description thereof is omitted.

なお、図３に示す検索部４の動作をプログラムとして構築し、情報検索装置として利用されるコンピュータにインストールして実行させる、または、ネットワークを介して流通させることが可能である。 It is possible to construct the operation of the search unit 4 shown in FIG. 3 as a program, install it on a computer used as an information search device and execute it, or distribute it via a network.

また、構築されたプログラムをハードディスクや、フレキシブルディスク・ＣＤ−ＲＯＭ等の可搬記憶媒体に格納し、コンピュータにインストールする、または、配布することが可能である。 Further, the constructed program can be stored in a portable storage medium such as a hard disk, a flexible disk, or a CD-ROM, and can be installed or distributed in a computer.

なお、本発明は、上記の実施の形態及び実施例に限定されることなく、特許請求の範囲内において種々変更・応用が可能である。 The present invention is not limited to the above-described embodiments and examples, and various modifications and applications can be made within the scope of the claims.

本発明は、構造化データである多数のＸＭＬデータの蓄積や検索技術に適用可能である。 The present invention can be applied to a technique for accumulating and retrieving a large number of XML data as structured data.

１パスインデクス記憶手段、パスインデクス部
２ノード記憶手段、ノード管理部
３値インデクス記憶手段、値インデクス部
４検索部
４１実行木生成手段
４２検索実行手段
４５メモリ 1 path index storage means, path index section 2 node storage means, node management section 3 value index storage means, value index section 4 search section 41 execution tree generation means 42 search execution means 45 memory

Claims

構造化データであるＸＭＬデータ群に対して、蓄積及び検索する装置における情報検索方法であって、
入力された検索クエリで指定された検索パス及び検索キーワードに基づいて、ＸＭＬの構造をパス単位で格納したパスインデクス記憶手段と、ＸＭＬの各ノード間の関係を保持するための、親ノードが子ノードを含むノードＩＤで表される範囲ラベルをノードＩＤとするノード情報を格納したノード記憶手段を参照して、問い合わせ木を生成し、該問い合わせ木の処理対象となるパスに対し、ノード群を取得する処理、それらの先祖子孫関係の判定処理、ノード群の結合処理、最終的な結合結果を生成する結果生成処理の各処理種別のノードを含む実行木を生成する実行木生成ステップと、
前記実行木の各実行ノードにおいて、子ノードがある場合は該子ノードの結果を取得し、当該ノードの処理種別に応じた処理を行い、最終的な結果を出力する検索ステップと、
を行うことを特徴とする情報検索方法。 An information retrieval method in an apparatus for storing and retrieving XML data that is structured data,
Based on the search path and the search keyword specified by the input search query, the path index storage means for storing the XML structure in units of paths and the parent node for maintaining the relationship between each node of the XML are children. A query storage is generated by referring to a node storage unit that stores node information whose node ID is a range label represented by a node ID including a node, and a node group is determined for a path to be processed by the query tree. An execution tree generation step for generating an execution tree including nodes of each processing type of a process to acquire, a determination process of their ancestor-descendant relationship, a node group combination process, and a result generation process for generating a final combination result;
In each execution node of the execution tree, if there is a child node, a result of the child node is obtained, processing according to the processing type of the node is performed, and a final result is output;
An information retrieval method characterized by:

前記検索ステップは、
前記実行木の実行ノードが結合条件を有しているとき、該結合条件で指定されているノード群（結合キーノード群）について結合を行い、結合したペア毎に結合ＩＤを付与し、検索記憶手段に格納し、
前記ノード記憶手段から、検索の単位となるノード群（検索ノード群）のノード情報を取得し、該検索ノード群に対し、前記結合キーノード群のノード情報の前記範囲ラベルに基づいて該当するものをマッチングし、マッチする検索ノード群を当該結合キーノード群の結合ＩＤと共に取得し、該検索ノード群と該結合キーノード群を結合し、検索記憶手段に格納し、
前記検索クエリの指定に基づいて前記検索記憶手段に格納されている情報から結果を生成する
請求項１記載の情報検索方法。 The search step includes
When the execution node of the execution tree has a join condition, the join group is joined with respect to the node group (join key node group) designated by the join condition, and a join ID is assigned to each joined pair, and the search storage means Store and
Node information of a node group (search node group) as a unit of search is acquired from the node storage means, and the corresponding one based on the range label of the node information of the combined key node group is obtained for the search node group A matching search node group is obtained together with a combination ID of the combination key node group, the search node group and the combination key node group are combined, and stored in a search storage unit;
The information search method according to claim 1, wherein a result is generated from information stored in the search storage unit based on designation of the search query.

構造化データであるＸＭＬデータ群に対して、蓄積及び検索する装置における情報検索装置であって、
ＸＭＬの構造をパス単位で格納したパスインデクス記憶手段と、
ＸＭＬの各ノード間の関係を保持するための、親ノードが子ノードを含むノードＩＤで表される範囲ラベルをノードＩＤとするノード情報を格納したノード記憶手段と、
入力された検索クエリで指定された検索パス及び検索キーワードに基づいて、前記パスインデクス記憶手段と前記ノード記憶手段を参照して、問い合わせ木を生成し、該問い合わせ木の処理対象となるパスに対し、ノード群を取得する処理、それらの先祖子孫関係の判定処理、ノード群の結合処理、最終的な結合結果を生成する結果生成処理の各処理種別のノードを含む実行木を生成する実行木生成手段と、
前記実行木の各実行ノードにおいて、子ノードがある場合は該子ノードの結果を取得し、当該ノードの処理種別に応じた処理を行い、最終的な結果を出力する検索実行手段と、
を有することを特徴とする情報検索装置。 An information search apparatus in an apparatus for storing and searching XML data group that is structured data,
A path index storage means for storing the XML structure in units of paths;
Node storage means for storing node information whose node ID is a range label represented by a node ID whose parent node includes a child node, in order to maintain a relationship between nodes of XML;
Based on the search path and the search keyword specified by the input search query, the query index is generated by referring to the path index storage means and the node storage means, and the query tree is processed. An execution tree generation that generates an execution tree including nodes of each processing type of a node group acquisition process, a determination process of their ancestor-descendant relationship, a node group combination process, and a result generation process that generates a final combination result Means,
In each execution node of the execution tree, if there is a child node, a result of the child node is obtained, processing according to the processing type of the node is performed, and a search execution unit that outputs a final result;
An information retrieval apparatus comprising:

前記検索実行手段は、
前記実行木の実行ノードが結合条件を有しているとき、該結合条件で指定されているノード群（結合キーノード群）について結合を行い、結合したペア毎に結合ＩＤを付与し、検索記憶手段に格納する手段と、
前記ノード記憶手段から、検索の単位となるノード群（検索ノード群）のノード情報を取得し、該検索ノード群に対し、前記結合キーノード群のノード情報の前記範囲ラベルに基づいて該当するものをマッチングし、マッチする検索ノード群を当該結合キーノード群の結合ＩＤと共に取得し、該検索ノード群と該結合キーノード群を結合し、検索記憶手段に格納する手段と、
前記検索クエリの指定に基づいて前記検索記憶手段に格納されている情報から結果を生成する手段と、
を含む請求項３記載の情報検索装置。 The search execution means includes
When the execution node of the execution tree has a join condition, the join group is joined with respect to the node group (join key node group) designated by the join condition, and a join ID is assigned to each joined pair, and the search storage means Means for storing;
Node information of a node group (search node group) as a unit of search is acquired from the node storage means, and the corresponding one based on the range label of the node information of the combined key node group is obtained for the search node group Means for matching, obtaining a matching search node group together with a combination ID of the combination key node group, combining the search node group and the combination key node group, and storing in the search storage unit;
Means for generating a result from information stored in the search storage means based on the designation of the search query;
The information search device according to claim 3, comprising:

請求項３または４記載の情報検索装置を構成する各手段としてコンピュータを機能させるための情報検索プログラム。 An information search program for causing a computer to function as each means constituting the information search device according to claim 3 or 4.