JP2697651B2

JP2697651B2 - Database search result evaluation method

Info

Publication number: JP2697651B2
Application number: JP6336641A
Authority: JP
Inventors: 秀雄島津
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1994-12-26
Filing date: 1994-12-26
Publication date: 1998-01-14
Anticipated expiration: 2013-01-14
Also published as: JPH08180071A

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、データベースの検索を
行うデータベース検索システムに関し、特にデータベー
スの検索における検索結果（検索されたレコード）の評
価（検索結果の間の類似度の検出や例外的な検索結果の
提示）を自動的に行うデータベース検索結果評価方式に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a database search system for searching a database, and more particularly to evaluation of search results (searched records) in database search (detection of similarity between search results and exceptional search). (Presentation of search results) automatically.

【０００２】[0002]

【従来の技術】データベース検索システムは、利用者
（検索者）に指定された条件（検索条件）を満たすレコ
ード（データベース内のレコード）を検索するシステム
である。利用者は、自分の入手したいデータを見つける
ために、自分の検索目的を示す検索条件をデータベース
検索式に変換して検索を行う。データベース検索システ
ムは、そのデータベース検索式に基づいてデータベース
の検索を行い、検索結果を利用者に提示する。2. Description of the Related Art A database search system is a system for searching for records (records in a database) that satisfy conditions (search conditions) specified by a user (searcher). A user performs a search by converting search conditions indicating his / her search purpose into a database search formula in order to find data he / she wants to obtain. The database search system searches the database based on the database search formula, and presents the search results to the user.

【０００３】ここで、従来のデータベース検索システム
では、検索結果が妥当なもの（検索条件を満たすレコー
ドの中の通常のもの）でも例外的なもの（検索条件を満
たし、かつ他の検索結果に比べて特異なもの）でも区別
なく、検索結果の提示が行われていた（参考文献として
は、例えば、「マイクロソフトアクセス１．０（Ｍｉ
ｃｒｏｓｏｆｔＡＣＣＥＳＳＶｅｒｓｉｏｎ１．
０）マニュアル，日本マイクロソフト社，１９９３
年」がある）。Here, in the conventional database search system, a search result that is valid (a normal one in a record that satisfies the search condition) is exceptional (a search result that satisfies the search condition and is compared with other search results). The search results were presented without distinction even if they were unique. (For example, as a reference, "Microsoft Access 1.0 (Mi
Microsoft ACCESS Version 1.
0) Manual, Microsoft Japan, 1993
Year ").

【０００４】すなわち、従来のデータベース検索システ
ムには、例外的な検索結果を自動的に提示したり、その
ような提示の前提として２つの検索結果の間の類似度を
検出したりする「データベース検索結果評価方式」が存
在しなかった。[0004] In other words, the conventional database search system automatically presents an exceptional search result, or detects a similarity between two search results as a premise of such presentation. There was no "result evaluation method".

【０００５】[0005]

【発明が解決しようとする課題】上述した従来のデータ
ベース検索システムでは、例外的な検索結果を自動的に
提示したり、そのような提示の前提として２つの検索結
果の間の類似度を検出したりする「データベース検索結
果評価方式」が存在せず、検索結果が妥当なものでも例
外的なものでも区別なく検索結果の提示が行われている
ので、利用者にとって「多くの検索結果の中から例外的
なレコード（検索結果）を見つけること」が困難とな
り、利用者がそのような例外的なレコードの存在に気づ
かなくて例外的で重要なレコードを見落とす可能性があ
るという問題点があった。In the above-described conventional database search system, an exceptional search result is automatically presented, or a similarity between two search results is detected as a premise of such presentation. There is no "database search result evaluation method", and the search results are presented without distinction whether the search results are valid or exceptional. It is difficult to find exceptional records (search results) ", and users may overlook exceptional and important records without noticing the existence of such exceptional records. .

【０００６】ここで、例えば、データベース検索システ
ムとして、架空の航空券予約システムを考えてみる。こ
の航空券予約システムは、利用者が日付や予定運賃や行
き先を検索条件として入力すると、それに対応したフラ
イトスケジュールを検索結果として提示するデータベー
ス検索システムである。Here, for example, consider a fictional airline ticket reservation system as a database search system. The airline ticket reservation system is a database search system that, when a user inputs a date, a scheduled fare, and a destination as search conditions, presents a flight schedule corresponding to the search condition as a search result.

【０００７】仮に、利用者により「１２／２２（１２月
２２日を意味する。以下同様）から１２／２５の間に運
賃３０万円以下でパリに行くフライト」という検索条件
が入力されると、その検索条件を満足するフライトのレ
コードが検索結果として提示される。[0007] If the user inputs a search condition of "flight to Paris with a fare of 300,000 yen or less between December 22 (which means December 22; the same applies hereinafter) and December 25", the user enters the search condition. The record of the flight that satisfies the search condition is presented as a search result.

【０００８】提示されたレコードのうち、１２／２３，
１２／２４および１２／２５に出発するフライトは３０
件あり、それらの運賃は全て３０万円以下ではあるが、
２０万円以上であったとする。一方、１２／２２に出発
するフライトは、１件しかなかったが、その運賃は１２
万円であったとする。[0008] Of the presented records, 12/23,
30 departures on 12/24 and 12/25
There are cases, all of these fares are less than 300,000 yen,
Suppose that it was 200,000 yen or more. On the other hand, there was only one flight departing on December 22, but the fare was 12
Suppose it was 10,000 yen.

【０００９】このような場合に、この航空券予約システ
ムが「他に比べて特に安い運賃のフライトのレコードを
例外的なレコードとして特別に利用者に提示してやる仕
組み」を持っていれば、利用者にとって有益な場合が多
い。In such a case, if the airline ticket reservation system has a mechanism for presenting specially to the user a record of a flight with a fare lower than others as an exceptional record, Often useful for

【００１０】もちろん、このような示唆（データベース
検索システムによる例外的な検索結果の提示）がいつも
利用者にとって役に立つとは限らない。例えば、上述の
例で、利用者はできるだけ出発日を遅らせて出発したく
てそのことの方が運賃が安いことよりも重要な場合もあ
り、そのような場合には上述の「特に安い運賃のフライ
トのレコードの示唆」は利用者にとってあまり役に立た
ない。しかし、そのような場合があるとしても、例外的
な検索結果（レコード）をデータベース検索システムが
自動的に利用者に提示することは、多くの場合に有益で
ある。Of course, such suggestions (exception of an exceptional search result by the database search system) are not always useful for the user. For example, in the above example, the user may want to depart as late as possible, which may be more important than the cheaper fare. The "flight record suggestion" is not very useful to users. However, even in such cases, it is often useful for the database search system to automatically present exceptional search results (records) to the user.

【００１１】従来のデータベース検索システムでは、こ
のように利用者にとって有益と考えられる「例外的な検
索結果を提示する仕組み」は存在しなかった。[0011] In the conventional database search system, there is no "mechanism for presenting an exceptional search result" which is considered useful for the user.

【００１２】本発明の目的は、上述の点に鑑み、例外的
な検索結果を自動的に提示することができ、そのような
提示の前提として２つの検索結果の間の類似度を検出す
ることを可能にするデータベース検索結果評価方式を提
供することにある。In view of the above, it is an object of the present invention to automatically present an exceptional search result, and to detect a similarity between two search results as a premise of such presentation. It is an object of the present invention to provide a database search result evaluation method which enables the following.

【００１３】[0013]

【課題を解決するための手段】本発明のデータベース検
索結果評価方式は、属性とその属性値との組の集合で１
つのレコードが構成され、複数のレコードの集合からな
るデータベースに関するデータベース検索結果評価方式
において、データベースから検索された１つのレコード
を保持する基本レコード保持部と、データベースから検
索された別のレコードを保持する比較レコード保持部
と、データベースの属性毎に定義されており当該各属性
における属性値間の類似度が定義されている類似度定義
と、データベースの属性毎に定義されており類似度計算
における当該各属性の重要度を示す重みが定義されてい
る重み定義と、前記基本レコード保持部内のレコードと
前記比較レコード保持部内のレコードとについて、各属
性の前記類似度定義を参照して各属性の属性値間の類似
度を計算し、各属性の前記重み定義を参照して各属性に
関する類似度を各属性の重みを加味して加重平均し、そ
の計算結果を総合類似度として出力する類似度計算部と
を有する。According to the present invention, there is provided a database search result evaluation method, wherein a set of an attribute and its attribute value is one set.
In a database search result evaluation method for a database in which one record is composed and a set of a plurality of records, a basic record holding unit that holds one record searched from the database and another record that is searched from the database A comparison record holding unit, a similarity definition defined for each attribute of the database and defining a similarity between attribute values in each attribute, and a similarity definition defined for each attribute of the database and For the weight definition in which the weight indicating the importance of the attribute is defined, and for the record in the basic record storage unit and the record in the comparison record storage unit, refer to the similarity definition of each attribute to determine the attribute value of each attribute. The similarity between each attribute is calculated, and the similarity regarding each attribute is calculated with reference to the weight definition of each attribute. Weighted average in consideration of the weight of, and a similarity calculation unit for outputting the calculation result as a general similarity.

【００１４】[0014]

【作用】本発明のデータベース検索結果評価方式では、
属性とその属性値との組の集合で１つのレコードが構成
され複数のレコードの集合からなるデータベースに関す
るデータベース検索結果評価方式において、データベー
スの属性毎に定義されており当該各属性における属性値
間の類似度が定義されている類似度定義が設けられ、デ
ータベースの属性毎に定義されており類似度計算におけ
る当該各属性の重要度を示す重みが定義されている重み
定義が設けられ、基本レコード保持部がデータベースか
ら検索された１つのレコードを保持し、比較レコード保
持部がデータベースから検索された別のレコードを保持
し、類似度計算部が基本レコード保持部内のレコードと
比較レコード保持部内のレコードとについて各属性の類
似度定義を参照して各属性の属性値間の類似度を計算し
各属性の重み定義を参照して各属性に関する類似度を各
属性の重みを加味して加重平均しその計算結果を総合類
似度として出力する。According to the database search result evaluation method of the present invention,
In a database search result evaluation method for a database composed of a set of a plurality of records in which one record is composed of a set of an attribute and its attribute value, each record is defined for each attribute of the database. A similarity definition that defines the similarity is provided, and a weight definition that is defined for each attribute of the database and defines a weight indicating the importance of the attribute in the similarity calculation is provided. Unit holds one record retrieved from the database, the comparison record retention unit retains another record retrieved from the database, and the similarity calculation unit retrieves a record in the basic record retention unit and a record in the comparison record retention unit. Calculate the similarity between the attribute values of each attribute by referring to the similarity definition of each attribute, and define the weight of each attribute Referring to the similarity for each attribute weighted average in consideration of the weights of each attribute and outputs the calculation result as a general similarity.

【００１５】[0015]

【実施例】次に、本発明について図面を参照して詳細に
説明する。なお、本発明のデータベース検索結果評価方
式が適用されるデータベース検索システムでは、属性と
その属性値との組の集合で１つのレコードが構成されて
複数のレコードの集合からなるデータベースに対する検
索が行われる。Next, the present invention will be described in detail with reference to the drawings. In the database search system to which the database search result evaluation method of the present invention is applied, one record is configured by a set of a set of an attribute and its attribute value, and a search is performed on a database including a set of a plurality of records. .

【００１６】図１は、本発明の第１の実施例（請求項１
記載の発明に対応する実施例）に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 1 shows a first embodiment of the present invention.
FIG. 11 is a block diagram showing a configuration of a database search result evaluation method according to an embodiment (corresponding to the described invention).

【００１７】本実施例のデータベース検索結果評価方式
は、データベースから検索された１つのレコードを保持
（格納）する基本レコード保持部１と、データベースか
ら検索された別のレコードを保持する比較レコード保持
部２と、データベースの属性（ここでは、属性ｉ，ｊお
よびｋ）毎に定義されており当該各属性における属性値
間の類似度が定義されている類似度定義３（ここでは、
属性ｉ，ｊおよびｋの類似度定義。後述する他の実施例
でも同様）と、データベースの属性毎に定義されており
類似度計算における当該各属性の重要度を示す重みが定
義されている重み定義４（ここでは、属性ｉ，ｊおよび
ｋの重み定義。後述する他の実施例でも同様）と、基本
レコード保持部１内のレコードと比較レコード保持部２
内のレコードとの間の総合類似度を計算してその値を出
力する類似度計算部５とを含んで構成されている。ここ
で、基本レコード保持部１と比較レコード保持部２と
は、通常のメモリで構成される。The database search result evaluation method of this embodiment includes a basic record holding unit 1 for holding (storing) one record searched from the database, and a comparison record holding unit for holding another record searched from the database. 2 and a similarity definition 3 (here, the definition is defined for each attribute of the database (attributes i, j, and k), and the similarity between the attribute values of each attribute is defined).
Definition of similarity between attributes i, j and k. Weight definition 4 (here, attributes i, j, and i) are defined for each attribute of the database and define the weight indicating the importance of each attribute in the similarity calculation. k, the same applies to other embodiments described later), the records in the basic record holding unit 1 and the comparison record holding unit 2
And a similarity calculation unit 5 that calculates the total similarity between the records within the group and outputs the calculated value. Here, the basic record holding unit 1 and the comparison record holding unit 2 are configured by ordinary memories.

【００１８】図２は、類似度定義３における類似度定義
表現の一例を示す図である（図２および図３の類似度定
義表現は、先に言及した航空券予約システムにおける類
似度定義表現を想定している）。図２に示す類似度定義
表現は、データベースの属性（データベース内のレコー
ドの属性）の１つである「行き先」に関するものであ
り、その類似度が概念階層木の形で定義されている。FIG. 2 is a diagram showing an example of a similarity definition expression in the similarity definition 3. (The similarity definition expressions in FIGS. 2 and 3 are similar to the similarity definition expressions in the airline ticket reservation system mentioned above. Assuming). The similarity definition expression shown in FIG. 2 relates to “destination”, which is one of the attributes of the database (the attribute of the record in the database), and the similarity is defined in the form of a conceptual hierarchical tree.

【００１９】例えば、グアムとサイパンとの間の類似度
は、両者が「日本から近いマリンリゾート」という点で
共通のグループに属しているので、０．７となる。ま
た、パリ，ミラノおよびフランクフルトの間の類似度
は、三者が「中欧部で日本からのフライトの多い都市」
という点で共通のグループに属しているので、０．５と
なる。For example, the similarity between Guam and Saipan is 0.7 because they belong to a common group in terms of “Marine resorts close to Japan”. In addition, the similarity between Paris, Milan and Frankfurt is based on the fact that "Central Europe is a city with many flights from Japan"
Because it belongs to a common group in this respect, it is 0.5.

【００２０】図３は、類似度定義３における類似度定義
表現の他の例を示す図である。図３に示す類似度定義表
現は、データベースの属性の１つである「運賃」に関す
るものであり、判断尺度と類似度との対応情報の形で定
義されている。FIG. 3 is a diagram showing another example of the similarity definition expression in the similarity definition 3. The similarity definition expression shown in FIG. 3 relates to “fare” which is one of the attributes of the database, and is defined in the form of correspondence information between the judgment scale and the similarity.

【００２１】ここでは、「金額の差が２万円以下」の場
合の類似度が０．８であり、「金額の差が２万円超過で
５万円以下」の場合の類似度が０．３であり、「金額の
差が５万円超過」の場合の類似度が０であるという定義
がなされている。Here, the similarity is 0.8 when the difference is less than 20,000 yen and the similarity is 0 when the difference is more than 50,000 yen and less than 50,000 yen. .3, and the similarity is defined to be 0 in the case of “the difference in the amount exceeds 50,000 yen”.

【００２２】なお、各属性の重み定義４には、それぞれ
の属性の重要度（レコード間の総合的な類似度を判断す
る上での重要度）に応じて重みの値が与えられている。
ここでは、重みの値の範囲は、０から１の間の値に正規
化されている（請求項５記載の発明参照）。ただし、本
発明を実現する上で、重みの値がそのような範囲の値に
限定されるものではない。The weight definition 4 for each attribute is assigned a weight value according to the importance of each attribute (importance in determining the overall similarity between records).
Here, the range of the weight value is normalized to a value between 0 and 1 (see the invention according to claim 5). However, in realizing the present invention, the weight value is not limited to a value in such a range.

【００２３】図４は、類似度計算部５の処理を示す流れ
図である。この処理は、分子用記憶領域および分母用記
憶領域確保・初期化ステップ４００と、基本レコード保
持部属性値取出しステップ４０１と、比較レコード保持
部属性値取出しステップ４０２と、類似度読取りステッ
プ４０３と、分子用記憶領域および分母用記憶領域更新
ステップ４０４と、全属性値取出し終了判定ステップ４
０５と、総合類似度出力ステップ４０６とからなる。FIG. 4 is a flowchart showing the processing of the similarity calculating section 5. This processing includes a numerator storage area and a denominator storage area securing / initializing step 400, a basic record holding unit attribute value extracting step 401, a comparison record holding unit attribute value extracting step 402, a similarity reading step 403, Numerator storage area and denominator storage area update step 404, and all attribute value extraction end determination step 4
05 and an overall similarity output step 406.

【００２４】次に、このように構成された本実施例のデ
ータベース検索結果評価方式の動作について説明する。Next, the operation of the database retrieval result evaluation method of the embodiment constructed as described above will be described.

【００２５】基本レコード保持部１は、データベースか
ら検索された１つのレコードを自己の内部に保持する。The basic record holding unit 1 holds one record retrieved from the database in itself.

【００２６】また、比較レコード保持部２は、データベ
ースから検索された別のレコードを自己の内部に保持す
る。The comparison record holding unit 2 holds another record retrieved from the database in itself.

【００２７】類似度計算部５は、以下に示すような処理
を行う（図４参照）。The similarity calculator 5 performs the following processing (see FIG. 4).

【００２８】総合類似度の値を計算するための分子用記
憶領域と分母用記憶領域とを確保し、両方に値０（初期
値）をセットする（ステップ４００）。この分子用記憶
領域は後述する計算式におけるＡに対応し、この分母用
記憶領域は同式におけるＢに対応する。A storage area for the numerator and a storage area for the denominator for calculating the value of the overall similarity are secured, and the value 0 (initial value) is set in both of them (step 400). The storage area for the numerator corresponds to A in a calculation formula described later, and the storage area for the denominator corresponds to B in the same equation.

【００２９】基本レコード保持部１内のレコードの任意
の１つの属性の属性値を取り出す（ステップ４０１）。An attribute value of any one attribute of a record in the basic record holding unit 1 is extracted (step 401).

【００３０】比較レコード保持部２内のレコードの同一
の属性（ステップ４０１で属性値を取り出した属性と同
一の属性）の属性値を取り出す（ステップ４０２）。An attribute value of the same attribute of the record in the comparison record holding unit 2 (the same attribute as the attribute whose attribute value is extracted in step 401) is extracted (step 402).

【００３１】上述のステップ４０１および４０２の処理
が全ての属性について繰り返されて、総合類似度の計算
に必要な各属性値が順次に取り出されていく。The processing of steps 401 and 402 described above is repeated for all the attributes, and the attribute values required for calculating the overall similarity are sequentially extracted.

【００３２】当該属性に対応する類似度定義３を参照し
て、ステップ４０１で取り出した属性値とステップ４０
２で取り出した属性値との間の類似度を読み取る（ステ
ップ４０３）。Referring to the similarity definition 3 corresponding to the attribute, the attribute value extracted in step 401 and the
The degree of similarity with the attribute value extracted in step 2 is read (step 403).

【００３３】当該属性に対応する重み定義４の値（重み
の値）をステップ４０３で読み取った値に積算した値を
ステップ４００で確保した分子用記憶領域の値に加えた
値を、分子用記憶領域にセットする。一方、その重み定
義４の値をステップ４００で確保した分母用記憶領域の
値に加えた値を、分母用記憶領域にセットする（ステッ
プ４０４）。このようにして、分子用記憶領域の値およ
び分母用記憶領域の値の更新を行う。The value obtained by adding the value of the weight definition 4 (the value of the weight) corresponding to the attribute to the value read in step 403 and the value in the molecule storage area secured in step 400 is added to the value of the molecule storage area. Set to the area. On the other hand, a value obtained by adding the value of the weight definition 4 to the value of the denominator storage area secured in step 400 is set in the denominator storage area (step 404). Thus, the value of the numerator storage area and the value of the denominator storage area are updated.

【００３４】基本レコード保持部１および比較レコード
保持部２内のレコードについて、全ての属性（図１で
は、属性ｉ，ｊおよびｋ）の属性値を取り出したか否か
を調べる（ステップ４０５）。It is checked whether the attribute values of all the attributes (attributes i, j and k in FIG. 1) have been extracted from the records in the basic record holding unit 1 and the comparison record holding unit 2 (step 405).

【００３５】ステップ４０５で「全ての属性の属性値を
取り出した」と判定した場合には、分子用記憶領域の値
を分母用記憶領域の値で割算（除算）した値を総合類似
度として出力する（ステップ４０６）。If it is determined in step 405 that "all attribute values have been extracted", the value obtained by dividing (dividing) the value of the numerator storage area by the value of the denominator storage area is used as the overall similarity. Output (Step 406).

【００３６】ステップ４０５で「全ての属性の属性値を
取り出してはいない」と判定した場合には、ステップ４
０１の処理に戻る。If it is determined in step 405 that "the attribute values of all attributes have not been extracted",
It returns to the process of 01.

【００３７】このように、類似度計算部５は、基本レコ
ード保持部１内のレコードと比較レコード保持部２内の
レコードとについて、各属性の類似度定義３を参照して
各属性の属性値間の類似度を計算し、各属性の重み定義
４を参照して各属性に関する類似度を各属性の重みを加
味して加重平均し、その計算結果を総合類似度として出
力する。As described above, the similarity calculation unit 5 refers to the similarity definition 3 of each attribute for the record in the basic record storage unit 1 and the record in the comparison record storage unit 2 to determine the attribute value of each attribute. The similarity between the attributes is calculated, the similarity of each attribute is weighted by taking the weight of each attribute into consideration with reference to the weight definition 4 of each attribute, and the calculation result is output as the overall similarity.

【００３８】ここで、総合類似度の値を計算式で表す
と、以下のようになる。この計算式においては、属性の
数がｎ（正整数）であるものとする。Here, the value of the total similarity is represented by a calculation formula as follows. In this formula, it is assumed that the number of attributes is n (positive integer).

【００３９】[0039]

【数１】 (Equation 1)

【００４０】図５は、本発明の第２の実施例（請求項２
記載の発明に対応する実施例）に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 5 shows a second embodiment of the present invention.
FIG. 11 is a block diagram showing a configuration of a database search result evaluation method according to an embodiment (corresponding to the described invention).

【００４１】本実施例のデータベース検索結果評価方式
は、データベースから検索された１つのレコードを保持
（格納）する基本レコード保持部１１と、データベース
から検索された別のレコードを保持する比較レコード保
持部１２と、データベースの属性（ここでは、属性ｉ，
ｊおよびｋ）毎に定義されており当該各属性における属
性値間の類似度が定義されている類似度定義１３と、デ
ータベースの属性毎に定義されており類似度計算におけ
る当該各属性の重要度を示す重みが定義されている重み
定義１４と、前もって類似度計算に使う属性の名前（こ
こでは、属性ｉおよびｋの名前とする）を保持する対象
属性保持部１５と、対象属性保持部１５に名前が保持さ
れている属性だけに限定して基本レコード保持部１１内
のレコードと比較レコード保持部１２内のレコードとの
間の総合類似度の計算および出力を行う属性限定型類似
度計算部１６とを含んで構成されている。ここで、基本
レコード保持部１１，比較レコード保持部１２，類似度
定義１３および重み定義１４は、図１中の基本レコード
保持部１，比較レコード保持部２，類似度定義３および
重み定義４と同一の構成要素である。また、基本レコー
ド保持部１１，比較レコード保持部１２および対象属性
保持部１５は、通常のメモリで実現される。なお、属性
限定型類似度計算部１６は請求項２における「類似度計
算部」に該当する。The database search result evaluation method of the present embodiment includes a basic record holding unit 11 for holding (storing) one record searched from the database, and a comparison record holding unit for holding another record searched from the database. 12 and the attributes of the database (here, attributes i,
j and k), a similarity definition 13 defining the similarity between attribute values in each attribute, and the importance of each attribute in the similarity calculation defined for each attribute in the database , A target attribute holding unit 15 for holding in advance the names of attributes (here, the names of attributes i and k) used for similarity calculation, and a target attribute holding unit 15 Attribute-limited similarity calculation unit that calculates and outputs the overall similarity between the record in the basic record storage unit 11 and the record in the comparison record storage unit 12 only for the attributes whose names are stored in 16 are included. Here, the basic record holding unit 11, the comparison record holding unit 12, the similarity definition 13 and the weight definition 14 are the same as the basic record holding unit 1, the comparison record holding unit 2, the similarity definition 3 and the weight definition 4 in FIG. The same components. Further, the basic record holding unit 11, the comparison record holding unit 12, and the target attribute holding unit 15 are realized by a normal memory. The attribute-limited similarity calculation unit 16 corresponds to the “similarity calculation unit” in claim 2.

【００４２】図６は、属性限定型類似度計算部１６の処
理を示す流れ図である。この処理は、分子用記憶領域お
よび分母用記憶領域確保・初期化ステップ６００と、基
本レコード保持部属性値取出しステップ６０１と、比較
レコード保持部属性値取出しステップ６０２と、類似度
読取りステップ６０３と、分子用記憶領域および分母用
記憶領域更新ステップ６０４と、対象属性全属性値取出
し終了判定ステップ６０５と、総合類似度出力ステップ
６０６とからなる。FIG. 6 is a flowchart showing the processing of the attribute-limited type similarity calculation unit 16. This processing includes a numerator storage area and a denominator storage area securing / initializing step 600, a basic record holding unit attribute value extracting step 601, a comparison record holding unit attribute value extracting step 602, a similarity reading step 603, It comprises a numerator storage area and denominator storage area update step 604, a target attribute all attribute value extraction end determination step 605, and a total similarity output step 606.

【００４３】次に、このように構成された本実施例のデ
ータベース検索結果評価方式の動作について説明する。Next, the operation of the database retrieval result evaluation method of the embodiment configured as described above will be described.

【００４４】基本レコード保持部１１および比較レコー
ド保持部１２の動作は、第１の実施例における基本レコ
ード保持部１および比較レコード保持部２の動作と同様
である。The operations of the basic record holding unit 11 and the comparison record holding unit 12 are the same as the operations of the basic record holding unit 1 and the comparison record holding unit 2 in the first embodiment.

【００４５】対象属性保持部１５は、利用者からの指示
により、データベースの検索において当該利用者が重要
と考える属性の名前、すなわち類似度計算に使う属性の
名前を自己の内部に保持する。The target attribute holding unit 15 holds the name of the attribute considered important by the user in the database search, that is, the name of the attribute used for calculating the similarity, in accordance with an instruction from the user.

【００４６】属性限定型類似度計算部１６は、以下に示
すような処理を行う（図６参照）。The attribute limited type similarity calculation section 16 performs the following processing (see FIG. 6).

【００４７】総合類似度の値を計算するための分子用記
憶領域と分母用記憶領域とを確保し、両方に値０（初期
値）をセットする（ステップ６００）。この分子用記憶
領域は後述する計算式におけるＣに対応し、この分母用
記憶領域は同式におけるＤに対応する。A storage area for the numerator and a storage area for the denominator for calculating the value of the overall similarity are secured, and a value 0 (initial value) is set in both (step 600). The storage area for the numerator corresponds to C in a calculation formula described later, and the storage area for the denominator corresponds to D in the same equation.

【００４８】基本レコード保持部１１内のレコードの属
性（対象属性保持部１５に名前が格納されている属性の
うちの任意の１つの属性）の属性値を取り出す（ステッ
プ６０１）。The attribute value of the attribute of the record in the basic record holding unit 11 (arbitrary one of the attributes whose names are stored in the target attribute holding unit 15) is extracted (step 601).

【００４９】比較レコード保持部１２内のレコードの同
一の属性（ステップ６０１で属性値を取り出した属性と
同一の属性）の属性値を取り出す（ステップ６０２）。An attribute value of the same attribute of the record in the comparison record holding unit 12 (the same attribute as the attribute whose attribute value is extracted in step 601) is extracted (step 602).

【００５０】上述のステップ６０１および６０２の処理
が対象属性保持部１５に名前が格納（保持）されている
全ての属性について繰り返されて、総合類似度の計算に
必要な各属性値が順次に取り出されていく。The processing of steps 601 and 602 described above is repeated for all the attributes whose names are stored (held) in the target attribute holding unit 15, and the attribute values required for calculating the overall similarity are sequentially extracted. I will go.

【００５１】当該属性に対応する類似度定義１３を参照
して、ステップ６０１で取り出した属性値とステップ６
０２で取り出した属性値との間の類似度を読み取る（ス
テップ６０３）。Referring to the similarity definition 13 corresponding to the attribute, the attribute value extracted in step 601 and the attribute value
The degree of similarity with the attribute value extracted in step 02 is read (step 603).

【００５２】当該属性に対応する重み定義１４の値（重
みの値）をステップ６０３で読み取った値に積算した値
をステップ６００で確保した分子用記憶領域の値に加え
た値を、分子用記憶領域にセットする。一方、その重み
定義１４の値をステップ６００で確保した分母用記憶領
域の値に加えた値を、分母用記憶領域にセットする（ス
テップ６０４）。このようにして、分子用記憶領域の値
および分母用記憶領域の値の更新を行う。The value obtained by adding the value of the weight definition 14 (weight value) corresponding to the attribute to the value read in step 603 and the value in the numerator storage area secured in step 600 is added to the numerator storage value. Set to the area. On the other hand, a value obtained by adding the value of the weight definition 14 to the value of the denominator storage area secured in step 600 is set in the denominator storage area (step 604). Thus, the value of the numerator storage area and the value of the denominator storage area are updated.

【００５３】基本レコード保持部１１および比較レコー
ド保持部１２内のレコードについて、対象属性保持部１
５に名前が格納されている全ての属性（図５では、属性
ｉおよびｋ）の属性値を取り出したか否かを調べる（ス
テップ６０５）。For the records in the basic record holding unit 11 and the comparison record holding unit 12, the target attribute holding unit 1
It is checked whether or not the attribute values of all the attributes (attributes i and k in FIG. 5) whose names are stored in 5 are extracted (step 605).

【００５４】ステップ６０５で「対象属性保持部１５に
名前が格納されている全ての属性の属性値を取り出し
た」と判定した場合には、分子用記憶領域の値を分母用
記憶領域の値で割算（除算）した値を総合類似度として
出力する（ステップ６０６）。If it is determined in step 605 that the attribute values of all the attributes whose names are stored in the target attribute holding unit 15 have been extracted, the value of the numerator storage area is replaced with the value of the denominator storage area. The divided (divided) value is output as the overall similarity (step 606).

【００５５】ステップ６０５で「対象属性保持部１５に
名前が格納されている全ての属性の属性値を取り出して
はいない」と判定した場合には、ステップ６０１の処理
に戻る。If it is determined in step 605 that the attribute values of all the attributes whose names are stored in the target attribute holding unit 15 have not been extracted, the process returns to step 601.

【００５６】このように、類似度計算部１６は、対象属
性保持部１５に名前が保持されている属性だけに限定し
て、基本レコード保持部１１内のレコードと比較レコー
ド保持部１２内のレコードとについて、各属性の類似度
定義１３を参照して各属性の属性値間の類似度を計算
し、各属性の重み定義１４を参照して各属性に関する類
似度を各属性の重みを加味して加重平均し、その計算結
果を総合類似度として出力する。As described above, the similarity calculation unit 16 limits the records in the basic record storage unit 11 and the records in the comparison record storage unit 12 to only the attributes whose names are stored in the target attribute storage unit 15. , The similarity between the attribute values of each attribute is calculated by referring to the similarity definition 13 of each attribute, and the similarity of each attribute is added to the weight of each attribute by referring to the weight definition 14 of each attribute. Weighted average, and outputs the calculation result as the overall similarity.

【００５７】ここで、総合類似度の値を計算式で表す
と、以下のようになる。この計算式においては、属性の
数がｎ（正整数）であるものとする。Here, the value of the total similarity is represented by a calculation formula as follows. In this formula, it is assumed that the number of attributes is n (positive integer).

【００５８】[0058]

【数２】 (Equation 2)

【００５９】この計算式において、「ｍ番目の属性の選
択フラグ値」とは、基本レコード保持部１１および比較
レコード保持部１２内のレコードの属性の数と同じ数の
要素数を持つ１次元のマスク行列の各要素の値であり、
対象属性保持部１５内に名前が格納されている属性（類
似度計算の対象として選択されている属性）に対応する
マスク行列の要素には１が格納されており、そうでない
属性に対応する要素には０が格納されている。この「選
択フラグ値」の存在によって、対象属性保持部１５内に
名前が格納されている属性の属性値だけが選択されて総
合類似度が計算される。なお、図６に示す処理において
は、ステップ６０１および６０２で対象属性（対象属性
保持部１５内に名前が格納されている属性）の属性値だ
けが取り出されることによって、上述の計算式における
「ｍ番目の属性の選択フラグ値の積算」が実現される。In this formula, the “selection flag value of the m-th attribute” is a one-dimensional element having the same number of elements as the number of attributes of the records in the basic record holding unit 11 and the comparison record holding unit 12. The value of each element of the mask matrix,
The element of the mask matrix corresponding to the attribute whose name is stored in the target attribute holding unit 15 (the attribute selected as the target of similarity calculation) stores 1; Stores 0. Due to the presence of the “selection flag value”, only the attribute value of the attribute whose name is stored in the target attribute holding unit 15 is selected, and the overall similarity is calculated. In the processing shown in FIG. 6, by extracting only the attribute value of the target attribute (the attribute whose name is stored in the target attribute holding unit 15) in steps 601 and 602, “m” in the above-described calculation formula is obtained. "Integration of the selection flag value of the second attribute" is realized.

【００６０】図７は、本発明の第３の実施例（請求項３
記載の発明に対応する実施例）に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 7 shows a third embodiment of the present invention.
FIG. 11 is a block diagram showing a configuration of a database search result evaluation method according to an embodiment (corresponding to the described invention).

【００６１】本実施例のデータベース検索結果評価方式
は、データベースから検索された１つのレコードを保持
（格納）する基本レコード保持部２１と、データベース
から検索された別のレコードを保持する比較レコード保
持部２２と、データベースの属性（ここでは、属性ｉ，
ｊおよびｋ）毎に定義されており当該各属性における属
性値間の類似度が定義されている類似度定義２３と、デ
ータベースの属性毎に定義されており類似度計算におけ
る当該各属性の重要度を示す重みが定義されている重み
定義２４と、基本レコード保持部２１内のレコードと比
較レコード保持部２２内のレコードとの間の総合類似度
を計算してその値を出力する類似度計算部２５と、前も
って定義された例外レコード判定用閾値を保持する閾値
保持部２６と、類似度計算部２５の出力結果が閾値保持
部２６に保持されている例外レコード判定用閾値よりも
小さい場合に比較レコード保持部２２内のレコードを例
外レコード（例外的なレコード）と判定して提示する例
外レコード判定部２７とを含んで構成されている。ここ
で、基本レコード保持部２１，比較レコード保持部２
２，類似度定義２３，重み定義２４および類似度計算部
２５は、図１中の基本レコード保持部１，比較レコード
保持部２，類似度定義３，重み定義４および類似度計算
部５と同一の構成要素である。また、基本レコード保持
部２１，比較レコード保持部２２および閾値保持部２６
は、通常のメモリで実現される。The database search result evaluation method of the present embodiment includes a basic record holding unit 21 for holding (storing) one record searched from the database, and a comparison record holding unit for holding another record searched from the database. 22 and attributes of the database (here, attributes i,
j and k), a similarity definition 23 defining the similarity between attribute values in each attribute, and the importance of each attribute in the similarity calculation defined for each attribute in the database And a similarity calculating unit that calculates a total similarity between a record in the basic record storage unit 21 and a record in the comparison record storage unit 22 and outputs the value. 25, a threshold holding unit 26 holding a predefined exception record determination threshold, and a comparison when the output result of the similarity calculation unit 25 is smaller than the exception record determination threshold held in the threshold holding unit 26. An exception record determination unit 27 that determines and presents a record in the record holding unit 22 as an exception record (exceptional record). Here, the basic record holding unit 21 and the comparison record holding unit 2
2, the similarity definition 23, the weight definition 24, and the similarity calculator 25 are the same as the basic record holder 1, the comparison record holder 2, the similarity definition 3, the weight definition 4, and the similarity calculator 5 in FIG. It is a component of. Further, the basic record holding unit 21, the comparison record holding unit 22, and the threshold value holding unit 26
Is realized by a normal memory.

【００６２】図８は、例外レコード判定部２７の処理を
示す流れ図である。この処理は、例外レコード判定用閾
値取出しステップ８０１と、類似度計算部出力受取りス
テップ８０２と、例外レコード判定用閾値・総合類似度
比較ステップ８０３と、例外レコード情報出力ステップ
８０４とからなる。FIG. 8 is a flowchart showing the processing of the exception record determination unit 27. This process includes an exception record determination threshold extracting step 801, a similarity calculation unit output receiving step 802, an exception record determination threshold / total similarity comparison step 803, and an exception record information output step 804.

【００６３】次に、このように構成された本実施例のデ
ータベース検索結果評価方式の動作について説明する。Next, the operation of the database retrieval result evaluation method of the present embodiment configured as described above will be described.

【００６４】基本レコード保持部２１，比較レコード保
持部２２および類似度計算部２５の動作は、第１の実施
例における基本レコード保持部１，比較レコード保持部
２および類似度計算部５の動作と同様である。なお、基
本レコード保持部２１に保持されているレコードは、デ
ータベースの検索結果として妥当なレコードと考えられ
るレコードである。The operations of the basic record storage unit 21, comparison record storage unit 22, and similarity calculation unit 25 are the same as those of the basic record storage unit 1, comparison record storage unit 2, and similarity calculation unit 5 in the first embodiment. The same is true. The record held in the basic record holding unit 21 is a record that is considered to be a valid record as a database search result.

【００６５】例外レコード判定部２７は、類似度計算部
２５により総合類似度が出力されていることを前提とし
て、以下に示すような処理を行う（図８参照）。The exception record determination section 27 performs the following processing on the assumption that the overall similarity has been output from the similarity calculation section 25 (see FIG. 8).

【００６６】閾値保持部２６に格納されている例外レコ
ード判定用閾値を取り出す（ステップ８０１）。The exception record determination threshold stored in the threshold holding unit 26 is extracted (step 801).

【００６７】類似度計算部２５の出力（総合類似度）を
受け取る（ステップ８０２）。The output (total similarity) of the similarity calculator 25 is received (step 802).

【００６８】ステップ８０１で取り出した値とステップ
８０２で受け取った値とを比較し、後者（類似度計算部
２５の出力結果である総合類似度）の方が小さいか否か
を判定する（ステップ８０３）。The value extracted in step 801 is compared with the value received in step 802, and it is determined whether or not the latter (the overall similarity output from the similarity calculator 25) is smaller (step 803). ).

【００６９】ステップ８０３で「類似度計算部２５の出
力結果の方が小さい」と判定した場合には、比較レコー
ド保持部２２内のレコードが「例外レコード」であるこ
と（基本レコード保持部２１内の妥当なレコードに比べ
て比較レコード保持部２２内のレコードが例外的である
こと）を示す情報を出力する（ステップ８０４）。If it is determined in step 803 that “the output result of the similarity calculation unit 25 is smaller”, the record in the comparison record holding unit 22 is an “exception record” (in the basic record holding unit 21). Is output (step 804). The information indicating that the record in the comparison record holding unit 22 is exceptional in comparison with the appropriate record (step 804).

【００７０】ステップ８０３で「類似度計算部２５の出
力結果の方が小さくない」と判定した場合には、ステッ
プ８０４の出力を行わずに処理を終了する。これによっ
て、比較レコード保持部２２内のレコードが例外レコー
ドでないことを提示する。If it is determined in step 803 that “the output result of the similarity calculation unit 25 is not smaller”, the process ends without outputting the data in step 804. This indicates that the record in the comparison record holding unit 22 is not an exception record.

【００７１】図９は、本発明の第４の実施例（請求項４
記載の発明に対応する実施例）に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 9 shows a fourth embodiment of the present invention.
FIG. 11 is a block diagram showing a configuration of a database search result evaluation method according to an embodiment (corresponding to the described invention).

【００７２】本実施例のデータベース検索結果評価方式
は、データベースから検索された１つのレコード（比較
制御部３９によって渡される検索レコード保持部３８内
のレコード）を保持する基本レコード保持部３１と、デ
ータベースから検索された別のレコード（比較制御部３
９によって渡される検索レコード保持部３８内のレコー
ド）を保持する比較レコード保持部３２と、データベー
スの属性（ここでは、属性ｉ，ｊおよびｋ）毎に定義さ
れており当該各属性における属性値間の類似度が定義さ
れている類似度定義３３と、データベースの属性毎に定
義されており類似度計算における当該各属性の重要度を
示す重みが定義されている重み定義３４と、基本レコー
ド保持部３１内のレコードと比較レコード保持部３２内
のレコードとの間の総合類似度を計算してその値を出力
する類似度計算部３５と、前もって定義された例外レコ
ード判定用閾値を保持する閾値保持部３６と、類似度計
算部３５の出力結果が閾値保持部３６に保持されている
例外レコード判定用閾値よりも小さい場合に比較レコー
ド保持部３２内のレコードを例外レコードと判定して提
示する例外レコード判定部３７と、データベースから検
索された１以上のレコードを保持する検索レコード保持
部３８と、検索レコード保持部３８内のレコード群の中
のレコードを基本レコード保持部３１および比較レコー
ド保持部３２に渡す比較制御部３９とを含んで構成され
ている。ここで、基本レコード保持部３１，比較レコー
ド保持部３２，類似度定義３３，重み定義３４，類似度
計算部３５，閾値保持部３６および例外レコード判定部
３７は、図７中の基本レコード保持部２１，比較レコー
ド保持部２２，類似度定義２３，重み定義２４，類似度
計算部２５，閾値保持部２６および例外レコード判定部
２７と同一の構成要素である。また、基本レコード保持
部３１，比較レコード保持部３２，閾値保持部３６およ
び検索レコード保持部３８は、通常のメモリで実現され
る。The database search result evaluation method according to the present embodiment includes a basic record holding unit 31 for holding one record (record in the search record holding unit 38 passed by the comparison control unit 39) searched from the database, Record (comparison control unit 3) retrieved from
9 and a comparison record holding unit 32 that holds the records in the search record holding unit 38 passed by the database 9 and the attribute values defined in each of the attributes (here, the attributes i, j, and k) of the database. A similarity definition 33, a weight definition 34 defined for each attribute of the database and defining a weight indicating the importance of each attribute in the similarity calculation, and a basic record holding unit. A similarity calculating unit 35 for calculating the total similarity between the record in the record 31 and the record in the comparison record holding unit 32 and outputting the calculated value; and a threshold hold for holding a previously defined exception record determination threshold. Unit 36 and the comparison record storage unit 32 when the output result of the similarity calculation unit 35 is smaller than the exception record determination threshold stored in the threshold storage unit 36. An exception record determination unit 37 that determines and presents the code as an exception record, a search record storage unit 38 that stores one or more records retrieved from the database, and a record in a record group in the search record storage unit 38 It includes a basic record holding unit 31 and a comparison control unit 39 to be passed to the comparison record holding unit 32. Here, the basic record storage unit 31, the comparison record storage unit 32, the similarity definition 33, the weight definition 34, the similarity calculation unit 35, the threshold storage unit 36, and the exception record determination unit 37 are the basic record storage unit in FIG. 21, the comparison record holding unit 22, the similarity definition 23, the weight definition 24, the similarity calculation unit 25, the threshold holding unit 26, and the exception record determination unit 27 are the same components. Further, the basic record holding unit 31, the comparison record holding unit 32, the threshold value holding unit 36, and the search record holding unit 38 are realized by a normal memory.

【００７３】図１０は、比較制御部３９の処理を示す流
れ図である。この処理は、基本レコード保持部レコード
引渡しステップ１０００と、比較レコード保持部レコー
ド引渡しステップ１００１と、例外レコード判定待機ス
テップ１００２と、全検索レコード取出し終了判定ステ
ップ１００３とからなる。FIG. 10 is a flowchart showing the processing of the comparison control unit 39. This processing includes a basic record storage unit record transfer step 1000, a comparison record storage unit record transfer step 1001, an exception record determination standby step 1002, and an all search record retrieval end determination step 1003.

【００７４】次に、このように構成された本実施例のデ
ータベース検索結果評価方式の動作について説明する。Next, the operation of the database retrieval result evaluation method of the present embodiment configured as described above will be described.

【００７５】本実施例のデータベース検索結果評価方式
は、第３の実施例と比べて、検索レコード保持部３８の
存在を前提とする比較制御部３９の処理が加わっている
点で異なる。The database search result evaluation method of the present embodiment is different from the third embodiment in that a processing of a comparison control unit 39 on the assumption that a search record holding unit 38 exists is added.

【００７６】比較制御部３９は、以下に示すような処理
を行う（図１０参照）。The comparison control section 39 performs the following processing (see FIG. 10).

【００７７】検索レコード保持部３８に保持されている
レコード群の中から、利用者により指定された基準によ
って選択される１つのレコード（例外レコードの判定の
ための基準となるレコードとして適切なレコード）を取
り出し、そのレコードを基本レコード保持部３１に渡す
（ステップ１０００）。基本レコード保持部３１は、以
降、そのレコードを保持する。One record selected from the group of records held in the search record holding unit 38 based on the criterion specified by the user (a record suitable as a reference record for judging an exceptional record) And passes the record to the basic record holding unit 31 (step 1000). The basic record holding unit 31 thereafter holds the record.

【００７８】検索レコード保持部３８に保持されている
レコード群（ステップ１００１で基本レコード保持部３
１に渡されたレコードは除外してもよい）のうちの任意
の１つのレコードを順次に取り出し、そのレコードを比
較レコード保持部３２に渡す（ステップ１００１）。比
較レコード保持部３２は、比較制御部３９によって次の
レコードが渡されるまで、そのレコードを保持する。The record group held in the search record holding unit 38 (the basic record holding unit 3
(The records passed to 1 may be excluded.) One record is sequentially extracted, and the record is passed to the comparison record holding unit 32 (step 1001). The comparison record holding unit 32 holds the record until the next record is passed by the comparison control unit 39.

【００７９】類似度計算部３５によって基本レコード保
持部３１内のレコードと比較レコード保持部３２内のレ
コードとの間の総合類似度が出力され、その総合類似度
に基づき例外レコード判定部３７によって比較レコード
保持部３２内のレコードが例外レコードであるかどうか
が判定されるのを待つ（ステップ１００２）。The similarity calculation unit 35 outputs the total similarity between the record in the basic record storage unit 31 and the record in the comparison record storage unit 32, and compares the total similarity with the exception record determination unit 37 based on the total similarity. The process waits until it is determined whether the record in the record holding unit 32 is an exception record (step 1002).

【００８０】例外レコード判定部３７による判定の出力
が行われると、検索レコード保持部３８から全てのレコ
ードを取り出したか否かを調べる（ステップ１００
３）。When the judgment record is output by the exception record judgment unit 37, it is checked whether or not all records have been extracted from the search record holding unit 38 (step 100).
3).

【００８１】ステップ１００３で「検索レコード保持部
３８から全てのレコードを取り出してはいない」と判定
した場合には、ステップ１００１の処理に戻る。すなわ
ち、検索レコード保持部３８から新たなレコードを１つ
取り出して比較レコード保持部３２に渡す。これによっ
て、検索レコード保持部３８内の未処理のレコードがな
くなるまで図１０に示す処理が繰り返される。If it is determined in step 1003 that "all records have not been retrieved from the search record holding unit 38", the process returns to step 1001. That is, one new record is taken out from the search record holding unit 38 and passed to the comparison record holding unit 32. As a result, the process shown in FIG. 10 is repeated until there is no unprocessed record in the search record holding unit 38.

【００８２】ステップ１００３で「検索レコード保持部
３８から全てのレコードを取り出した」と判定した場合
には、処理を終了する。If it is determined in step 1003 that "all records have been retrieved from the search record holding unit 38", the processing is terminated.

【００８３】以上のような比較制御部３９の処理によっ
て、例外レコード判定部３７は複数の検索レコード（検
索レコード保持部３８内のレコード群）に関する例外レ
コードかどうかの判定（利用者によって指定された基準
で選択された１つの検索レコードに対する各検索レコー
ドの例外性の判定）が可能となる。By the processing of the comparison control section 39 as described above, the exception record determination section 37 determines whether or not the record is an exception record relating to a plurality of search records (record group in the search record holding section 38) (designated by the user). (Exception of each search record with respect to one search record selected based on the standard) can be performed.

【００８４】[0084]

【発明の効果】以上説明したように本発明は、データベ
ース検索システムにおいて、あらかじめ各属性の類似度
および重みを定義して類似度計算を行うことにより、例
外的な検索結果を自動的に提示することができ（そのよ
うな提示の前提として２つの検索結果の間の類似度を検
出することを可能にし）、検索されたレコードの中から
「例外的なレコード」を自動的に検出してそれを利用者
に提示することができるという効果を有する。As described above, according to the present invention, an exceptional search result is automatically presented in a database search system by defining the similarity and weight of each attribute in advance and calculating the similarity. (Which makes it possible to detect the similarity between two search results as a premise of such a presentation) and automatically detects "exceptional records" from among the searched records Can be presented to the user.

【００８５】このような効果によって、データベース検
索システムの利用者が見落としがちでありながら重要な
レコードの存在を的確に提示でき、いわゆる「気がつ
く」・「気のきいた」データベース検索システムを構築
することができる。With the above effects, it is possible for the user of the database search system to accurately present the existence of important records that are apt to be overlooked, and to construct a so-called “noticeable” / “nice” database search system. Can be.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の第１の実施例に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a database search result evaluation method according to a first embodiment of the present invention.

【図２】図１中の類似度定義における類似度定義表現の
一例を示す図である。FIG. 2 is a diagram showing an example of a similarity definition expression in the similarity definition in FIG. 1;

【図３】図１中の類似度定義における類似度定義表現の
他の例を示す図である。FIG. 3 is a diagram showing another example of a similarity definition expression in the similarity definition in FIG. 1;

【図４】図１中の類似度計算部の処理を示す流れ図であ
る。FIG. 4 is a flowchart showing a process of a similarity calculation unit in FIG. 1;

【図５】本発明の第２の実施例に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 5 is a block diagram illustrating a configuration of a database search result evaluation method according to a second embodiment of the present invention.

【図６】図５中の属性限定型類似度計算部の処理を示す
流れ図である。FIG. 6 is a flowchart showing processing of an attribute-limited similarity calculation unit in FIG. 5;

【図７】本発明の第３の実施例に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 7 is a block diagram showing a configuration of a database search result evaluation method according to a third embodiment of the present invention.

【図８】図７中の例外レコード判定部の処理を示す流れ
図である。FIG. 8 is a flowchart showing a process of an exception record determination unit in FIG. 7;

【図９】本発明の第４の実施例に係るデータベース検索
結果評価方式の構成を示すブロック図である。FIG. 9 is a block diagram showing a configuration of a database search result evaluation method according to a fourth embodiment of the present invention.

【図１０】図９中の比較制御部の処理を示す流れ図であ
る。FIG. 10 is a flowchart showing processing of a comparison control unit in FIG. 9;

【符号の説明】[Explanation of symbols]

１，１１，２１，３１基本レコード保持部２，１２，２２，３２比較レコード保持部３，１３，２３，３３類似度定義４，１４，２４，３４重み定義５，２５，３５類似度計算部１５対象属性保持部１６属性限定型類似度計算部２６，３６閾値保持部２７，３７例外レコード判定部３８検索レコード保持部３９比較制御部 1,11,21,31 Basic record holding unit 2,12,22,32 Comparison record holding unit 3,13,23,33 Similarity definition 4,14,24,34 Weight definition 5,25,35 Similarity calculation unit 15 Target attribute storage unit 16 Attribute limited similarity calculation unit 26, 36 Threshold storage unit 27, 37 Exception record determination unit 38 Search record storage unit 39 Comparison control unit

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】属性とその属性値との組の集合で１つの
レコードが構成され、複数のレコードの集合からなるデ
ータベースに関するデータベース検索結果評価方式にお
いて、データベースから検索された１つのレコードを保持する
基本レコード保持部と、データベースから検索された別のレコードを保持する比
較レコード保持部と、データベースの属性毎に定義されており当該各属性にお
ける属性値間の類似度が定義されている類似度定義と、データベースの属性毎に定義されており類似度計算にお
ける当該各属性の重要度を示す重みが定義されている重
み定義と、前記基本レコード保持部内のレコードと前記比較レコー
ド保持部内のレコードとについて、各属性の前記類似度
定義を参照して各属性の属性値間の類似度を計算し、各
属性の前記重み定義を参照して各属性に関する類似度を
各属性の重みを加味して加重平均し、その計算結果を総
合類似度として出力する類似度計算部とを有することを
特徴とするデータベース検索結果評価方式。1. A record is composed of a set of an attribute and its attribute value, and holds one record retrieved from a database in a database retrieval result evaluation method for a database composed of a plurality of records. A basic record storage unit, a comparison record storage unit that stores another record retrieved from the database, and a similarity definition that is defined for each attribute of the database and defines a similarity between attribute values in each attribute. A weight definition that is defined for each attribute of the database and defines a weight indicating the importance of each attribute in the similarity calculation; and a record in the basic record holding unit and a record in the comparison record holding unit. Calculating the similarity between the attribute values of each attribute by referring to the similarity definition of each attribute, And a similarity calculator for outputting a calculation result as a total similarity by performing a weighted average of similarities related to the respective attributes with reference to the weight definitions and adding weights of the respective attributes. Evaluation method.

【請求項２】前もって類似度計算に使う属性の名前を
保持する対象属性保持部と、類似度計算で、全ての属性を対象にするのではなく、前
記対象属性保持部に名前が保持されている属性だけを対
象とする類似度計算部とを有することを特徴とする請求
項１記載のデータベース検索結果評価方式。2. A target attribute holding unit that holds an attribute name used for similarity calculation in advance, and a name is held in the target attribute holding unit instead of targeting all attributes in the similarity calculation. 2. The database search result evaluation method according to claim 1, further comprising a similarity calculation unit for only the attribute that exists.

【請求項３】前もって定義された例外レコード判定用
閾値を保持する閾値保持部と、類似度計算部の出力結果が前記閾値保持部に保持されて
いる例外レコード判定用閾値よりも小さい場合に比較レ
コード保持部内のレコードを例外レコードと判定して提
示する例外レコード判定部とを有することを特徴とする
請求項１または請求項２記載のデータベース検索結果評
価方式。3. A threshold value holding unit for holding a previously defined exception record determination threshold value, and a comparison is made when the output result of the similarity calculation unit is smaller than the exception record determination threshold value held in the threshold value holding unit. 3. The database search result evaluation method according to claim 1, further comprising: an exception record determination unit that determines a record in the record holding unit as an exception record and presents the exception record.

【請求項４】データベースから検索された１以上のレ
コードを保持する検索レコード保持部と、この検索レコード保持部に保持されているレコード群の
中から利用者により指定された基準によって選択された
１つのレコードを取り出してそのレコードを基本レコー
ド保持部に渡し、前記検索レコード保持部に保持されて
いるレコード群を１つずつ比較レコード保持部に渡す比
較制御部とを有することを特徴とする請求項３記載のデ
ータベース検索結果評価方式。4. A search record holding unit for holding one or more records searched from a database, and a record selected by a user from a group of records held in the search record holding unit. A comparison control unit that extracts one record, passes the record to a basic record holding unit, and transfers a record group held in the search record holding unit to a comparison record holding unit one by one. 3. The database search result evaluation method described in 3.

【請求項５】重み定義における重みの値が０から１の
間の値に正規化されていることを特徴とする請求項１，
請求項２，請求項３または請求項４記載のデータベース
検索結果評価方式。5. The method according to claim 1, wherein the weight value in the weight definition is normalized to a value between 0 and 1.
5. The database search result evaluation method according to claim 2, claim 3 or claim 4.