JPH1153475A

JPH1153475A - Image recognition device, image recognition method and storage medium

Info

Publication number: JPH1153475A
Application number: JP9223054A
Authority: JP
Inventors: Hisataka Yamagishi; 久高山岸; Seikou Rou; 世紅労
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1997-08-04
Filing date: 1997-08-04
Publication date: 1999-02-26
Anticipated expiration: 2017-08-04
Also published as: JP3475422B2

Abstract

PROBLEM TO BE SOLVED: To provide an image recognition device, an image recognition method and a recording medium for establishing recognized results incapable of deciding whether it is the correct one or a wrong one to the several kinds of results, improving a reception correct answer rate and eliminating an erroneous recogni tion rate. SOLUTION: First, second and third preceding stage knowledge 112 is prepared by a first, second and third preceding knowledge preparation means 111 and first and second post stage knowledge 114 is prepared by a first and second post stage knowledge preparation means 113 respectively. In a first, second and third preceding stage judgement means /15, the recognized results are divided into the two of 'correct answer' and 'unknown' and established for the recognized results. Then, in a first and second post stage judgement means 116, data put in an OK cluster are decided as 'correct answer' data belonging to a category, they are defined as 'unknown' data in the case of not being put in any OK cluster and then, they are defined as 'incorrect answer' data in the case of being put in at least one of NG clusters further.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、画像認識装置、画
像認識方法および該画像認識方法を実行させるためのプ
ログラムを記録した記録媒体に係り、特に、複数のカテ
ゴリを持ち、入力データに対する各カテゴリの一致度
（若しくはそれに類する情報）を算出する画像認識にお
いて、一致度が高いとされた結果が正しいものか、間違
ったものか決定できない認識結果を、「認識結果が正解
と確信できるため受け付ける」，「認識結果が正解と確
信できないためリジェクトする」，「認識結果が未確定
のためリジェクトする」或いは「認識結果が不正解のた
めリジェクトする」等といった数種類の結果に確定し、
受付け正解率を向上させ、かつ誤認識率を０［％］に近
づけた画像認識装置、画像認識方法および記録媒体に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image recognition apparatus, an image recognition method, and a recording medium on which a program for executing the image recognition method is recorded. In image recognition that calculates the degree of coincidence (or similar information), a recognition result that cannot be determined whether the result determined to be high is correct or incorrect is "accepted because the recognition result is confident that the answer is correct." , "Rejected because the recognition result is not confident that it is correct", "Rejected because the recognition result is indeterminate" or "Rejected because the recognition result is incorrect"
The present invention relates to an image recognition device, an image recognition method, and a recording medium that improve the acceptance correct answer rate and reduce the false recognition rate to 0%.

【０００２】[0002]

【従来の技術】従来、複数のカテゴリを持ち、入力デー
タに対する各カテゴリの一致度（若しくはそれに類する
情報）を算出する手段を備えた画像認識システムにおい
ては、画像認識の認識結果は、該認識結果が正解である
か或いは誤認識であるかを人がチェックするまで確定で
きなかった。2. Description of the Related Art Conventionally, in an image recognition system having a plurality of categories and having means for calculating the degree of coincidence (or similar information) of each category with input data, the recognition result of image recognition is determined by the recognition result. Could not be determined until a person checked whether the answer was correct or incorrect.

【０００３】つまり、従来の画像認識システムにおける
運用方法は、以下のような手順で行われていた。すなわ
ち、まず第１に、画像認識システムで認識対象データを
認識させ、次に第２に、コンピュータ等の画面に認識結
果を表示させ、認識対象と認識結果を人が突き合わせて
確認をし、そして第３には、誤認識している結果を修正
し、さらに第４には、認識対象について最後まで突き合
わせ確認が終了した後、認識結果を確定するというもの
であった。That is, the operation method in the conventional image recognition system is performed in the following procedure. That is, first, the recognition target data is recognized by the image recognition system, and secondly, the recognition result is displayed on a screen of a computer or the like, and the recognition target and the recognition result are checked by a person, and Thirdly, the result of misrecognition is corrected, and fourthly, the recognition result is determined after the matching check has been completed for the recognition target.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記従
来の画像認識システムの運用方法にあっては、全文を人
手によって入力することに比べると省力化が図られてい
るものの、第２の人手による認識対象と認識結果の突き
合わせ確認処理に手間がかかるという問題点があった。
例えば、一度に数百、数千の帳票などを認識させる場
合、１枚１枚を認識させた後、人がチェックするのでは
自動化を図るとは言い難いため、画像認識システムの導
入が見送られることも多かった。However, in the operation method of the above-mentioned conventional image recognition system, although the labor saving is achieved as compared with the case where the whole text is input manually, the recognition by the second human is performed. There is a problem that it takes time to check the matching between the target and the recognition result.
For example, when hundreds or thousands of forms are recognized at a time, it is difficult to say that if a person checks each sheet and then checks it, it is hard to say that automation is intended. Therefore, introduction of an image recognition system is postponed. There were many things.

【０００５】また、認識性能にも問題があり、従来の画
像認識システムによって認識した結果をノーチェックで
全て受け付けたとすると、正解率は８０〜９５［％］程
度であるのが現状である。すなわち、５〜２０［％］は
誤認識しているにもかかわらず正解として受付けてしま
うのである。[0005] In addition, there is a problem in recognition performance, and if all the results recognized by the conventional image recognition system are accepted without checking, the correct answer rate is currently about 80 to 95 [%]. In other words, 5 to 20% is accepted as a correct answer even though it is erroneously recognized.

【０００６】本発明は、このような従来の問題点に着目
してなされたもので、その目的とするところは、複数の
カテゴリを持ち、入力データに対する各カテゴリの一致
度（若しくはそれに類する情報）を算出する画像認識に
おいて、一致度が高いとされた結果が正しいものか、間
違ったものか決定できない認識結果を、「認識結果が正
解と確信できるため受け付ける」，「認識結果が正解と
確信できないためリジェクトする」，「認識結果が未確
定のためリジェクトする」或いは「認識結果が不正解の
ためリジェクトする」等といった数種類の結果に確定
し、受付け正解率を向上させ、かつ誤認識率を０［％］
に近づけた画像認識装置、画像認識方法および記録媒体
を提供することにある。The present invention has been made in view of such conventional problems, and has as its object the purpose of having a plurality of categories and matching degree of each category with input data (or similar information). In the image recognition for calculating the recognition result, the recognition result that cannot be determined whether the result determined to be high in the correctness is correct or incorrect is "accepted because the recognition result is correct" and "the recognition result is not correct." Rejection "," rejection due to unrecognized recognition result "or" rejection due to incorrect recognition result ", etc., to improve the acceptance correct answer rate and reduce the false recognition rate to 0. [%]
To provide an image recognition device, an image recognition method, and a recording medium which are close to the above.

【０００７】[0007]

【課題を解決するための手段】上記課題を解決するため
に、本出願の請求項１に記載の発明は、複数カテゴリの
認識対象を持ち、登録画像パタンと入力画像パタンとの
パタンマッチングにおける適合度（距離）に基づき入力
画像の認識を行う画像認識装置において、画像認識の認
識結果およびその付随情報を含むデータから、認識結果
として同じ第１候補のカテゴリを持つデータを集め、該
集めたデータから特定の候補順位における類似カテゴリ
の出現率の高いものを抽出して、第１候補カテゴリに対
する類似カテゴリの関連を示す第１前段知識として保持
する第１前段知識作成手段と、入力画像の認識結果とし
て特定候補順位のカテゴリおよび類似カテゴリが得られ
るとき、該第１候補のカテゴリおよび特定候補順位の類
似カテゴリと前記第１前段知識における関連との比較に
より、該第１候補を”正解”または”不明”と判定する
第１前段判定手段とを具備することを特徴とする画像認
識装置にある。In order to solve the above-mentioned problems, the invention according to claim 1 of the present application has recognition targets of a plurality of categories, and is suitable for pattern matching between a registered image pattern and an input image pattern. In an image recognition apparatus for recognizing an input image based on a degree (distance), data having the same first candidate category as a recognition result is collected from data including a recognition result of image recognition and its accompanying information. A first pre-stage knowledge generating means for extracting a high similarity rate of the similar category in a specific candidate rank from the first candidate category and holding the extracted first pre-stage knowledge indicating the relation of the similar category to the first candidate category; and a recognition result of the input image. When the category of the specific candidate rank and the similar category are obtained as, the category of the first candidate and the similar category of the specific candidate rank are Comparison with related in 1 preceding knowledge, in the image recognition apparatus characterized by comprising a first pre-determination means determines that the first candidate "correct" or "unknown".

【０００８】また、本出願の請求項２に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位にお
ける適合度（距離）から、該特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさを算定し、第２前
段知識として保持する第２前段知識作成手段と、入力画
像の認識結果として特定候補順位のカテゴリおよび類似
カテゴリが得られるとき、該特定候補順位のカテゴリま
たは類似カテゴリにおける適合度（距離）の大きさと前
記第２前段知識における適合度（距離）の大きさとの比
較により、該特定順位の候補を”正解”または”不明”
と判定する第２前段判定手段とを具備することを特徴と
する画像認識装置にある。[0008] The invention described in claim 2 of the present application is:
In an image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and its accompanying information , Data having the same category of the first candidate as the recognition result is collected, and from the collected data, the candidate of the specific rank is determined as “correct answer” based on the fitness (distance) in the specific candidate rank.
Calculating the magnitude of the degree of conformity (distance) that can be determined as the second preceding knowledge, and when the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, By comparing the magnitude of the fitness (distance) in the category of the specific candidate rank or the similar category with the magnitude of the fitness (distance) in the second preceding knowledge, the candidate of the specific rank is “correct answer” or “unknown”.
And a second pre-stage judging means for judging.

【０００９】また、本出願の請求項３に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位のカ
テゴリまたは類似カテゴリ間における適合度（距離）差
から、該第１候補を”正解”と確定し得る適合度（距
離）差の大きさを算定し、第３前段知識として保持する
第３前段知識作成手段と、入力画像の認識結果として特
定候補順位のカテゴリおよび類似カテゴリが得られると
き、該特定の候補順位のカテゴリまたは類似カテゴリ間
における適合度（距離）差の大きさと前記第３前段知識
における適合度（距離）差の大きさとの比較により、該
第１候補を”正解”または”不明”と判定する第３前段
判定手段とを具備することを特徴とする画像認識装置に
ある。The invention described in claim 3 of the present application provides
In an image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and its accompanying information , Data having the same first candidate category as a recognition result is collected, and from the collected data, the first candidate is determined as “correct” based on a difference in the degree of matching (distance) between a category of a specific candidate rank or a similar category. A third pre-stage knowledge creating means for calculating the degree of the adaptability (distance) difference that can be determined and holding the same as the third pre-stage knowledge; and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, The magnitude of the difference in the degree of fitness (distance) between the category of the particular candidate rank or the similar category and the degree of fitness (distance) in the third preceding knowledge By comparison with the magnitude of the difference, in the image recognition apparatus characterized by comprising a third front determination means determines that the first candidate "correct" or "unknown".

【００１０】また、本出願の請求項４に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タをサンプル収集し、あるカテゴリで認識結果が”正
解”とされているデータから、前記データの集合をクラ
スタリングするときの軸を、上位候補に出現率の高いカ
テゴリを該軸として、下位候補に出現率の高いカテゴリ
を該軸として、上位候補に出現率の高いカテゴリおよび
下位候補に出現率の高いカテゴリをあわせて軸として、
中位候補に出現率の高いカテゴリを該軸として、相関係
数の高いカテゴリ同士を該軸として、或いは、相関係数
の低いカテゴリ同士を該軸として、抽出する第１軸抽出
手段と、前記第１軸抽出手段で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＯＫクラスタとして生成し、生成されたＯＫクラスタの
形状を第１後段知識として保持する第１後段知識作成手
段と、入力画像の認識結果として認識結果およびその付
随情報を含むデータが得られるとき、該データが前記第
１後段知識のあるカテゴリに所属するＯＫクラスタの中
に含まれるか否かをチェックして、該ＯＫクラスタに含
まれる場合には該ＯＫクラスタが所属するカテゴリを前
記入力画像の認識結果として”正解”と判定し、何れの
ＯＫクラスタにも含まれていない場合には”未確定”と
判定する第１後段判定手段とを具備することを特徴とす
る画像認識装置にある。Further, the invention described in claim 4 of the present application is
In an image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and its accompanying information From the data for which the recognition result is “correct” in a certain category, the axis when clustering the set of data is set as the axis, and the category having a high appearance rate as the upper candidate is used as the axis to appear in the lower candidate. Using the category with a high rate as the axis, the category with a high appearance rate as the top candidate and the category with a high appearance rate as the lower candidate are used as axes,
A first axis extracting means for extracting a category having a high appearance rate as a middle candidate as the axis, categories having a high correlation coefficient as the axis, or categories having a low correlation coefficient as the axis, Clustering of the set of data based on the axis extracted by the first axis extracting means, generating an OK cluster of the category, and holding the shape of the generated OK cluster as first second-stage knowledge; Means, and when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, check whether the data is included in an OK cluster belonging to the category having the first post-stage knowledge. If the category is included in the OK cluster, the category to which the OK cluster belongs is determined to be “correct” as the recognition result of the input image, and any of the OK clusters is determined. Rare in the case not is in the image recognition apparatus characterized by comprising a first second-stage determination means determines that the "undetermined".

【００１１】また、本出願の請求項５に記載の発明は、
前記第１後段判定手段は、前記データがあるカテゴリに
所属するＯＫクラスタに含まれる場合に、どの程度の確
信を持って”正解”といえるかを示す指標である確信度
を計算する確信度算出手段を具備し、前記データが複数
の別カテゴリに所属するＯＫクラスタに入った場合に
は、前記確信度に基づいてどのカテゴリを”正解”とす
るかを決定することを特徴とする請求項４に記載の画像
認識装置にある。The invention described in claim 5 of the present application is
The first post-stage determination means calculates a certainty factor that is an index indicating how much certainty can be said to be “correct” when the data is included in an OK cluster belonging to a certain category. 5. The method according to claim 4, further comprising: when the data is in an OK cluster belonging to a plurality of different categories, determine which category is “correct” based on the certainty factor. In the image recognition device described in (1).

【００１２】また、本出願の請求項６に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識装置におい
て、画像認識の認識結果およびその付随情報を含むデー
タをサンプル収集し、あるカテゴリで認識結果が”誤
認”とされているデータから、前記データの集合をクラ
スタリングするときの軸を、上位候補に出現率の高いカ
テゴリを該軸として、下位候補に出現率の高いカテゴリ
を該軸として、上位候補に出現率の高いカテゴリおよび
下位候補に出現率の高いカテゴリをあわせて軸として、
中位候補に出現率の高いカテゴリを該軸として、相関係
数の高いカテゴリ同士を該軸として、或いは、相関係数
の低いカテゴリ同士を該軸として、抽出する第２軸抽出
手段と、前記第２軸抽出手段で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＮＧクラスタとして生成し、生成されたＮＧクラスタの
形状を第２後段知識として保持する第２後段知識作成手
段と、入力画像の認識結果として認識結果およびその付
随情報を含むデータが得られるとき、該データが前記第
２後段知識のあるカテゴリに所属するＮＧクラスタの中
に含まれるか否かをチェックして、該ＮＧクラスタに含
まれる場合には該ＮＧクラスタが所属するカテゴリを前
記入力画像の認識結果として”不正解”と判定し、何れ
のＮＧクラスタにも含まれていない場合には”正解”と
判定する第２後段判定手段とを具備することを特徴とす
る画像認識装置にある。Further, the invention described in claim 6 of the present application is
In an image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and its accompanying information From the data for which the recognition result is "misidentified" in a certain category, the axis when clustering the set of data is used as the axis, and the category with a high appearance rate as the upper candidate is used as the axis to appear in the lower candidate. Using the category with a high rate as the axis, the category with a high appearance rate as the top candidate and the category with a high appearance rate as the lower candidate are used as axes,
A second axis extracting means for extracting categories having a high appearance rate as middle axes as the axis, categories having a high correlation coefficient as the axis, or categories having a low correlation coefficient as the axis, Clustering the set of data based on the axis extracted by the second axis extracting means, generating an NG cluster of the category, and holding the shape of the generated NG cluster as second knowledge of the second stage. Means, and when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, check whether or not the data is included in an NG cluster belonging to the category having the second subsequent knowledge. If the NG cluster is included in the NG cluster, the category to which the NG cluster belongs is determined as “incorrect” as a recognition result of the input image, and Included when not is the image recognition apparatus characterized by comprising a second second-stage determination means determines that the "correct answer".

【００１３】また、本出願の請求項７に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位にお
ける類似カテゴリの出現率の高いものを抽出して、第１
候補カテゴリに対する類似カテゴリの関連を示す第１前
段知識として保持する第１前段知識作成ステップと、入
力画像の認識結果として特定候補順位のカテゴリおよび
類似カテゴリが得られるとき、該第１候補のカテゴリお
よび特定候補順位の類似カテゴリと前記第１前段知識に
おける関連との比較により、該第１候補を”正解”また
は”不明”と判定する第１前段判定ステップとを具備す
ることを特徴とする画像認識方法にある。The invention described in claim 7 of the present application is
In an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and accompanying information thereof , Data having the same first candidate category as the recognition result is collected, and the data having a high appearance rate of the similar category in the specific candidate rank is extracted from the collected data,
A first pre-knowledge creating step of retaining as a first pre-knowledge indicating the association of the similar category to the candidate category; and when the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, the category of the first candidate and Image recognition characterized by comprising a first pre-stage determination step of judging the first candidate as “correct” or “unknown” by comparing the similar category of the specific candidate rank with the association in the first pre-stage knowledge. In the way.

【００１４】また、本出願の請求項８に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位にお
ける適合度（距離）から、該特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさを算定し、第２前
段知識として保持する第２前段知識作成ステップと、入
力画像の認識結果として特定候補順位のカテゴリおよび
類似カテゴリが得られるとき、該特定候補順位のカテゴ
リまたは類似カテゴリにおける適合度（距離）の大きさ
と前記第２前段知識における適合度（距離）の大きさと
の比較により、該特定順位の候補を”正解”または”不
明”と判定する第２前段判定ステップとを具備すること
を特徴とする画像認識方法にある。[0014] Further, the invention described in claim 8 of the present application provides:
In an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and accompanying information thereof , Data having the same category of the first candidate as the recognition result is collected, and from the collected data, the candidate of the specific rank is determined as “correct answer” based on the fitness (distance) in the specific candidate rank.
Calculating the magnitude of the degree of conformity (distance) that can be determined as the second preceding knowledge, and when the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, By comparing the magnitude of the fitness (distance) in the category of the specific candidate rank or the similar category with the magnitude of the fitness (distance) in the second preceding knowledge, the candidate of the specific rank is determined as “correct” or “unknown”. And a second precedent determination step.

【００１５】また、本出願の請求項９に記載の発明は、
複数カテゴリの認識対象を持ち、登録画像パタンと入力
画像パタンとのパタンマッチングにおける適合度（距
離）に基づき入力画像の認識を行う画像認識方法におい
て、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位のカ
テゴリまたは類似カテゴリ間における適合度（距離）差
から、該第１候補を”正解”と確定し得る適合度（距
離）差の大きさを算定し、第３前段知識として保持する
第３前段知識作成ステップと、入力画像の認識結果とし
て特定候補順位のカテゴリおよび類似カテゴリが得られ
るとき、該特定の候補順位のカテゴリまたは類似カテゴ
リ間における適合度（距離）差の大きさと前記第３前段
知識における適合度（距離）差の大きさとの比較によ
り、該第１候補を”正解”または”不明”と判定する第
３前段判定ステップとを具備することを特徴とする画像
認識方法にある。Further, the invention described in claim 9 of the present application is
In an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern, data including a recognition result of image recognition and accompanying information thereof , Data having the same first candidate category as a recognition result is collected, and from the collected data, the first candidate is determined as “correct” based on a difference in the degree of matching (distance) between a category of a specific candidate rank or a similar category. Calculating a magnitude of the degree of conformity (distance) difference that can be determined, and holding a third precedent knowledge as a third precedent knowledge; and obtaining a category of a specific candidate rank and a similar category as a recognition result of the input image, The magnitude of the fitness (distance) difference between the category of the specific candidate rank or the similar category, and the fitness ( By comparison with the size of the release) difference in image recognition method, characterized by comprising a third front determination step of determining that the first candidate "correct" or "unknown".

【００１６】また、本出願の請求項１０に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データから、認識結果として同じ第１候補のカテゴリを
持つデータを集め、該集めたデータから特定の候補順位
における類似カテゴリの出現率の高いものを抽出して、
第１候補カテゴリに対する類似カテゴリの関連を示す第
１前段知識として保持する第１前段知識作成ステップ
と、画像認識の認識結果およびその付随情報を含むデー
タから、認識結果として同じ第１候補のカテゴリを持つ
データを集め、該集めたデータから特定の候補順位にお
ける適合度（距離）から、該特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさを算定し、第２前
段知識として保持する第２前段知識作成ステップと、画
像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成ステップと、入力画像の認識結果として特
定候補順位のカテゴリおよび類似カテゴリが得られると
き、該第１候補のカテゴリおよび特定候補順位の類似カ
テゴリと前記第１前段知識における関連との比較によ
り、該第１候補を”正解”または”不明”と判定する第
１前段判定ステップと、前記第１前段判定ステップで”
正解”と判定された入力画像について、該特定候補順位
のカテゴリまたは類似カテゴリにおける適合度（距離）
の大きさと前記第２前段知識における適合度（距離）の
大きさとの比較により、該第１候補を”正解”または”
不明”と判定する第２前段判定ステップと、前記第２前
段判定ステップで”正解”と判定された入力画像につい
て、該特定の候補順位のカテゴリまたは類似カテゴリ間
における適合度（距離）差の大きさと前記第３前段知識
における適合度（距離）差の大きさとの比較により、該
第１候補を”正解”または”不明”と判定する第３前段
判定ステップとを具備することを特徴とする画像認識方
法にある。The invention according to claim 10 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information, and the appearance rate of the similar category in the specific candidate rank is determined from the collected data. Extract the tall ones,
From the first pre-knowledge creating step of retaining as the first pre-knowledge indicating the association of the similar category to the first candidate category, and from the data including the recognition result of the image recognition and its accompanying information, the same first candidate category as the recognition result is obtained. The data having the specified rank is collected from the collected data, and the candidate of the specific rank is determined as “correct” from the fitness (distance) in the specific candidate rank from the collected data.
From the data including the recognition result of image recognition and its accompanying information, a second pre-knowledge creating step of calculating the magnitude of the degree of conformity (distance) that can be determined as the second pre-knowledge and the same as the recognition result Data having one candidate category is collected, and from the collected data, a fitness (distance) that can determine the first candidate as “correct” from a fitness (distance) difference between a category of a specific candidate rank or a similar category. )
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A pre-knowledge generating step, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, a comparison between the category of the first candidate and the similar category of the specific candidate rank and an association in the first pre-knowledge is performed. A first precedent determination step of determining the first candidate to be “correct” or “unknown”; and
For the input image determined to be "correct", the degree of fitness (distance) in the category of the specific candidate rank or the similar category
The first candidate is determined to be “correct” or “correct” by comparing the size of
The magnitude of the degree of conformity (distance) difference between the category of the specific candidate rank or the similar category for the input image determined to be “correct” in the second preceding determination step of determining “unknown” and the input image determined to be “correct” in the second preceding determination step And a third precedent determining step of determining the first candidate as “correct answer” or “unknown” by comparing the first candidate with the magnitude of the degree of matching (distance) difference in the third precedent knowledge. In the recognition method.

【００１７】また、本出願の請求項１１に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
正解”とされているデータから、前記データの集合をク
ラスタリングするときの軸を、上位候補に出現率の高い
カテゴリを該軸として、下位候補に出現率の高いカテゴ
リを該軸として、上位候補に出現率の高いカテゴリおよ
び下位候補に出現率の高いカテゴリをあわせて軸とし
て、中位候補に出現率の高いカテゴリを該軸として、相
関係数の高いカテゴリ同士を該軸として、或いは、相関
係数の低いカテゴリ同士を該軸として、抽出する第１軸
抽出ステップと、前記第１軸抽出ステップで抽出された
軸に基づき前記データの集合のクラスタリングを行い、
該カテゴリのＯＫクラスタとして生成し、生成されたＯ
Ｋクラスタの形状を第１後段知識として保持する第１後
段知識作成ステップと、入力画像の認識結果として認識
結果およびその付随情報を含むデータが得られるとき、
該データが前記第１後段知識のあるカテゴリに所属する
ＯＫクラスタの中に含まれるか否かをチェックして、該
ＯＫクラスタに含まれる場合には該ＯＫクラスタが所属
するカテゴリを前記入力画像の認識結果として”正解”
と判定し、何れのＯＫクラスタにも含まれていない場合
には”未確定”と判定する第１後段判定ステップとを具
備することを特徴とする画像認識方法にある。Further, the invention according to claim 11 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the image recognition method, data including the recognition result of the image recognition and the accompanying information is sampled, and the recognition result in a certain category is "
From the data that is “correct”, the axis when clustering the set of data is set as an axis with the category with a high appearance rate as the top candidate as the axis, the category with a high appearance rate as the lower candidate as the axis, and Categories with a high appearance rate and categories with a high appearance rate as lower candidates are used as axes, categories with a high appearance rate as middle candidates are used as the axis, categories with high correlation coefficients are used as the axis, or a phase relationship is used. A first axis extraction step of extracting the low-numbered categories as the axes, and clustering the set of data based on the axes extracted in the first axis extraction step,
O generated as an OK cluster of the category
A first posterior knowledge generating step of retaining the shape of the K cluster as the first posterior knowledge, and when data including a recognition result and its accompanying information are obtained as a recognition result of the input image,
It is checked whether or not the data is included in an OK cluster belonging to the category having the first subsequent knowledge. If the data is included in the OK cluster, the category to which the OK cluster belongs is determined by the category of the input image. "Correct" as recognition result
And a first latter-stage determination step of determining that the data is not determined in any of the OK clusters.

【００１８】また、本出願の請求項１２に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法に
おいて、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
誤認”とされているデータから、前記データの集合をク
ラスタリングするときの軸を、上位候補に出現率の高い
カテゴリを該軸として、下位候補に出現率の高いカテゴ
リを該軸として、上位候補に出現率の高いカテゴリおよ
び下位候補に出現率の高いカテゴリをあわせて軸とし
て、中位候補に出現率の高いカテゴリを該軸として、相
関係数の高いカテゴリ同士を該軸として、或いは、相関
係数の低いカテゴリ同士を該軸として、抽出する第２軸
抽出ステップと、前記第２軸抽出ステップで抽出された
軸に基づき前記データの集合のクラスタリングを行い、
該カテゴリのＮＧクラスタとして生成し、生成されたＮ
Ｇクラスタの形状を第２後段知識として保持する第２後
段知識作成ステップと、入力画像の認識結果として認識
結果およびその付随情報を含むデータが得られるとき、
該データが前記第２後段知識のあるカテゴリに所属する
ＮＧクラスタの中に含まれるか否かをチェックして、該
ＮＧクラスタに含まれる場合には該ＮＧクラスタが所属
するカテゴリを前記入力画像の認識結果として”不正
解”と判定し、何れのＮＧクラスタにも含まれていない
場合には”正解”と判定する第２後段判定ステップとを
具備することを特徴とする画像認識方法にある。The invention according to claim 12 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the image recognition method, data including the recognition result of the image recognition and the accompanying information is sampled, and the recognition result in a certain category is "
From the data that is regarded as “misidentification”, the axis when clustering the set of data is set as the axis with the category with a high appearance rate as the upper candidate as the axis, and the category with the higher appearance rate as the axis with the lower candidate as the axis. Categories with a high appearance rate and categories with a high appearance rate as lower candidates are used as axes, categories with a high appearance rate as middle candidates are used as the axis, categories with high correlation coefficients are used as the axis, or a phase relationship is used. A second axis extraction step of extracting the low number of categories as the axes, and clustering of the set of data based on the axes extracted in the second axis extraction step,
N is generated as an NG cluster of the category, and the generated N
A second posterior knowledge generating step of retaining the shape of the G cluster as the second posterior knowledge, and when data including a recognition result and its accompanying information is obtained as a recognition result of the input image,
It is checked whether or not the data is included in the NG cluster belonging to the category having the second subsequent knowledge. If the data is included in the NG cluster, the category to which the NG cluster belongs is determined by the input image. An image recognition method characterized by comprising a second post-stage determination step of determining "incorrect" as a recognition result and determining "correct" if not included in any NG cluster.

【００１９】また、本出願の請求項１３に記載の発明
は、前記第１後段判定ステップは、前記データがあるカ
テゴリに所属するＯＫクラスタに含まれる場合に、どの
程度の確信を持って”正解”といえるかを示す指標であ
る確信度を計算する確信度算出ステップを具備し、前記
データが複数の別カテゴリに所属するＯＫクラスタに入
った場合には、前記確信度に基づいてどのカテゴリを”
正解”とするかを決定することを特徴とする請求項１１
に記載の画像認識方法にある。Further, in the invention according to claim 13 of the present application, in the first latter-stage determination step, when the data is included in an OK cluster belonging to a certain category, with certainty, A confidence level calculation step of calculating a certainty level which is an index indicating whether the data belongs to an OK cluster belonging to a plurality of different categories. "
12. The method according to claim 11, wherein it is determined whether the answer is correct.
In the image recognition method described above.

【００２０】また、本出願の請求項１４に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から特定の候補順位における類似カテゴリの出現率の高
いものを抽出して、第１候補カテゴリに対する類似カテ
ゴリの関連を示す第１前段知識として保持する第１前段
知識作成ステップと、入力画像の認識結果として特定候
補順位のカテゴリおよび類似カテゴリが得られるとき、
該第１候補のカテゴリおよび特定候補順位の類似カテゴ
リと前記第１前段知識における関連との比較により、該
第１候補を”正解”または”不明”と判定する第１前段
判定ステップとを具備することを特徴とする記録媒体に
ある。Further, the invention according to claim 14 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium recorded as a program for causing a computer to execute the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information. A first pre-knowledge generating step of extracting a high appearance rate of the similar category in the specific candidate rank from the data obtained and storing the extracted data as first pre-knowledge indicating the association of the similar category with the first candidate category; When the category of the specific candidate rank and the similar category are obtained as the recognition result,
A first pre-stage determination step of judging the first candidate as “correct” or “unknown” by comparing the category of the first candidate and the similar category of the specific candidate rank with the association in the first pre-stage knowledge. A recording medium characterized by the above-mentioned.

【００２１】また、本出願の請求項１５に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から特定の候補順位における適合度（距離）から、該特
定順位の候補を”正解”と確定し得る適合度（距離）の
大きさを算定し、第２前段知識として保持する第２前段
知識作成ステップと、入力画像の認識結果として特定候
補順位のカテゴリおよび類似カテゴリが得られるとき、
該特定候補順位のカテゴリまたは類似カテゴリにおける
適合度（距離）の大きさと前記第２前段知識における適
合度（距離）の大きさとの比較により、該特定順位の候
補を”正解”または”不明”と判定する第２前段判定ス
テップとを具備することを特徴とする記録媒体にある。Further, the invention according to claim 15 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium recorded as a program for causing a computer to execute the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information. From the data obtained, the magnitude of the fitness (distance) that can determine the candidate of the specific rank as a “correct answer” is calculated from the fitness (distance) in the specific candidate rank, and stored as the second preceding knowledge. When the knowledge creation step and the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image,
By comparing the magnitude of the fitness (distance) in the category of the specific candidate rank or the similar category with the magnitude of the fitness (distance) in the second preceding knowledge, the candidate of the specific rank is determined as “correct” or “unknown”. A second precedent determination step for the determination.

【００２２】また、本出願の請求項１６に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から特定の候補順位のカテゴリまたは類似カテゴリ間に
おける適合度（距離）差から、該第１候補を”正解”と
確定し得る適合度（距離）差の大きさを算定し、第３前
段知識として保持する第３前段知識作成ステップと、入
力画像の認識結果として特定候補順位のカテゴリおよび
類似カテゴリが得られるとき、該特定の候補順位のカテ
ゴリまたは類似カテゴリ間における適合度（距離）差の
大きさと前記第３前段知識における適合度（距離）差の
大きさとの比較により、該第１候補を”正解”または”
不明”と判定する第３前段判定ステップとを具備するこ
とを特徴とする記録媒体にある。The invention according to claim 16 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on the degree of matching (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium recorded as a program for causing a computer to execute the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information. From the data obtained, a magnitude of a fitness (distance) difference that can determine the first candidate as a “correct answer” is calculated from the fitness (distance) difference between a category of a specific candidate rank or a similar category, and A third precedent knowledge creation step of retaining as knowledge, and a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image Then, the first candidate is determined to be "correct" by comparing the magnitude of the fitness (distance) difference between the category of the specific candidate rank or the similar category and the magnitude of the fitness (distance) difference in the third preceding knowledge. Or "
A third precedent determination step of determining “unknown”.

【００２３】また、本出願の請求項１７に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータから、認識結果として同じ第
１候補のカテゴリを持つデータを集め、該集めたデータ
から特定の候補順位における類似カテゴリの出現率の高
いものを抽出して、第１候補カテゴリに対する類似カテ
ゴリの関連を示す第１前段知識として保持する第１前段
知識作成ステップと、画像認識の認識結果およびその付
随情報を含むデータから、認識結果として同じ第１候補
のカテゴリを持つデータを集め、該集めたデータから特
定の候補順位における適合度（距離）から、該特定順位
の候補を”正解”と確定し得る適合度（距離）の大きさ
を算定し、第２前段知識として保持する第２前段知識作
成ステップと、画像認識の認識結果およびその付随情報
を含むデータから、認識結果として同じ第１候補のカテ
ゴリを持つデータを集め、該集めたデータから特定の候
補順位のカテゴリまたは類似カテゴリ間における適合度
（距離）差から、該第１候補を”正解”と確定し得る適
合度（距離）差の大きさを算定し、第３前段知識として
保持する第３前段知識作成ステップと、入力画像の認識
結果として特定候補順位のカテゴリおよび類似カテゴリ
が得られるとき、該第１候補のカテゴリおよび特定候補
順位の類似カテゴリと前記第１前段知識における関連と
の比較により、該第１候補を”正解”または”不明”と
判定する第１前段判定ステップと、前記第１前段判定ス
テップで”正解”と判定された入力画像について、該特
定候補順位のカテゴリまたは類似カテゴリにおける適合
度（距離）の大きさと前記第２前段知識における適合度
（距離）の大きさとの比較により、該第１候補を”正
解”または”不明”と判定する第２前段判定ステップ
と、前記第２前段判定ステップで”正解”と判定された
入力画像について、該特定の候補順位のカテゴリまたは
類似カテゴリ間における適合度（距離）差の大きさと前
記第３前段知識における適合度（距離）差の大きさとの
比較により、該第１候補を”正解”または”不明”と判
定する第３前段判定ステップとを具備することを特徴と
する記録媒体にある。Further, the invention according to claim 17 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium recorded as a program for causing a computer to execute the image recognition method, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information. A first pre-knowledge generating step of extracting a similar category having a high appearance rate in a specific candidate rank from the data and holding the same as first pre-knowledge indicating the association of the similar category with the first candidate category; Data having the same first candidate category as the recognition result from the data including the recognition result and its accompanying information From the collected data, the degree of fitness (distance) at which the candidate of the specific rank can be determined as "correct" is calculated from the fitness (distance) in the specific candidate rank from the collected data, and stored as the second preceding knowledge. From the data including the recognition result of image recognition and its accompanying information, collect data having the same first candidate category as the recognition result, and select a category or a specific candidate rank category from the collected data. Calculating a magnitude of a fitness (distance) difference that can determine the first candidate as a “correct answer” from a fitness (distance) difference between similar categories, and generating a third preceding knowledge storing step as a third preceding knowledge; When the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, the category of the first candidate and the similar category of the specific candidate rank and the first preceding knowledge The first candidate determining step determines that the first candidate is “correct” or “unknown” by comparing the first candidate with the related image. The second candidate for which the first candidate is determined as “correct” or “unknown” by comparing the magnitude of the fitness (distance) in the rank category or the similar category with the magnitude of the fitness (distance) in the second preceding knowledge. For the input image determined to be "correct" in the previous-stage determination step and the second previous-stage determination step, the magnitude of the degree of fitness (distance) difference between the category of the particular candidate rank or the similar category and the third-level knowledge A third pre-determination step of determining the first candidate to be “correct” or “unknown” by comparing with the degree of fitness (distance) difference. In the body.

【００２４】また、本出願の請求項１８に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータをサンプル収集し、あるカテ
ゴリで認識結果が”正解”とされているデータから、前
記データの集合をクラスタリングするときの軸を、上位
候補に出現率の高いカテゴリを該軸として、下位候補に
出現率の高いカテゴリを該軸として、上位候補に出現率
の高いカテゴリおよび下位候補に出現率の高いカテゴリ
をあわせて軸として、中位候補に出現率の高いカテゴリ
を該軸として、相関係数の高いカテゴリ同士を該軸とし
て、或いは、相関係数の低いカテゴリ同士を該軸とし
て、抽出する第１軸抽出ステップと、前記第１軸抽出ス
テップで抽出された軸に基づき前記データの集合のクラ
スタリングを行い、該カテゴリのＯＫクラスタとして生
成し、生成されたＯＫクラスタの形状を第１後段知識と
して保持する第１後段知識作成ステップと、入力画像の
認識結果として認識結果およびその付随情報を含むデー
タが得られるとき、該データが前記第１後段知識のある
カテゴリに所属するＯＫクラスタの中に含まれるか否か
をチェックして、該ＯＫクラスタに含まれる場合には該
ＯＫクラスタが所属するカテゴリを前記入力画像の認識
結果として”正解”と判定し、何れのＯＫクラスタにも
含まれていない場合には”未確定”と判定する第１後段
判定ステップとを具備することを特徴とする記録媒体に
ある。Further, the invention according to claim 18 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Samples of data including the recognition result of image recognition and its accompanying information are collected and sampled on a recording medium recorded as a program for causing a computer to execute the image recognition method, and data in which the recognition result is "correct" in a certain category From the above, the axis when clustering the set of data is used as the axis with the category having a high appearance rate as the top candidate as the axis, the category with the high appearance rate as the lower candidate as the axis, and the category with the high appearance rate and the lower Correlation coefficient, with categories with high appearance rates as candidates and axes with high appearance rates as middle axes A first axis extracting step of extracting high categories as the axes or low correlation coefficients of the categories as the axes, and clustering the set of data based on the axes extracted in the first axis extracting step , Generating as an OK cluster of the category, a first second-stage knowledge creating step of holding the shape of the generated OK cluster as first second-stage knowledge, and data including a recognition result and its accompanying information as a recognition result of the input image. Is obtained, it is checked whether or not the data is included in an OK cluster belonging to the category having the first subsequent knowledge. If the data is included in the OK cluster, the category to which the OK cluster belongs is checked. Is determined as “correct answer” as a recognition result of the input image, and is determined as “unconfirmed” when it is not included in any of the OK clusters. In recording medium characterized by comprising a first second-stage determination step.

【００２５】また、本出願の請求項１９に記載の発明
は、複数カテゴリの認識対象を持ち、登録画像パタンと
入力画像パタンとのパタンマッチングにおける適合度
（距離）に基づき入力画像の認識を行う画像認識方法
を、コンピュータに実行させるためのプログラムとして
記録した記録媒体において、画像認識の認識結果および
その付随情報を含むデータをサンプル収集し、あるカテ
ゴリで認識結果が”誤認”とされているデータから、前
記データの集合をクラスタリングするときの軸を、上位
候補に出現率の高いカテゴリを該軸として、下位候補に
出現率の高いカテゴリを該軸として、上位候補に出現率
の高いカテゴリおよび下位候補に出現率の高いカテゴリ
をあわせて軸として、中位候補に出現率の高いカテゴリ
を該軸として、相関係数の高いカテゴリ同士を該軸とし
て、或いは、相関係数の低いカテゴリ同士を該軸とし
て、抽出する第２軸抽出ステップと、前記第２軸抽出ス
テップで抽出された軸に基づき前記データの集合のクラ
スタリングを行い、該カテゴリのＮＧクラスタとして生
成し、生成されたＮＧクラスタの形状を第２後段知識と
して保持する第２後段知識作成ステップと、入力画像の
認識結果として認識結果およびその付随情報を含むデー
タが得られるとき、該データが前記第２後段知識のある
カテゴリに所属するＮＧクラスタの中に含まれるか否か
をチェックして、該ＮＧクラスタに含まれる場合には該
ＮＧクラスタが所属するカテゴリを前記入力画像の認識
結果として”不正解”と判定し、何れのＮＧクラスタに
も含まれていない場合には”正解”と判定する第２後段
判定ステップとを具備することを特徴とする記録媒体に
ある。Further, the invention according to claim 19 of the present application has recognition targets of a plurality of categories, and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Samples of data including the recognition result of image recognition and its accompanying information are collected and sampled on a recording medium recorded as a program for causing a computer to execute the image recognition method, and data in which the recognition result is "misrecognized" in a certain category From the above, the axis when clustering the set of data is used as the axis with the category having a high appearance rate as the top candidate as the axis, the category with the high appearance rate as the lower candidate as the axis, and the category with the high appearance rate and the lower Correlation coefficient, with categories with high appearance rates as candidates and axes with high appearance rates as middle axes A second axis extracting step of extracting high categories as the axes or low correlation coefficients of the categories as the axes, and clustering the set of data based on the axes extracted in the second axis extracting step And generating the NG cluster of the category, and holding the shape of the generated NG cluster as the second post-knowledge; and data including the recognition result and its accompanying information as the recognition result of the input image. Is obtained, it is checked whether or not the data is included in the NG cluster belonging to the category having the second subsequent knowledge. If the data is included in the NG cluster, the category to which the NG cluster belongs is checked. Is determined to be "incorrect" as a recognition result of the input image, and is determined to be "correct" if not included in any of the NG clusters. In recording medium characterized by comprising a second second-stage determination step.

【００２６】また、本出願の請求項２０に記載の発明
は、前記第１後段判定ステップは、前記データがあるカ
テゴリに所属するＯＫクラスタに含まれる場合に、どの
程度の確信を持って”正解”といえるかを示す指標であ
る確信度を計算する確信度算出ステップを具備し、前記
データが複数の別カテゴリに所属するＯＫクラスタに入
った場合には、前記確信度に基づいてどのカテゴリを”
正解”とするかを決定することを特徴とする請求項１８
に記載の記録媒体にある。According to a twentieth aspect of the present invention, in the above-mentioned first post-determination step, when the data is included in an OK cluster belonging to a certain category, with certainty, A confidence level calculation step of calculating a certainty level which is an index indicating whether the data belongs to an OK cluster belonging to a plurality of different categories. "
19. Determine whether the answer is "correct".
Above.

【００２７】そして、この請求項１、７または１４に記
載の発明によれば、第１前段知識作成手段（第１前段知
識作成ステップ）により、画像認識の認識結果およびそ
の付随情報を含むデータから、認識結果として同じ第１
候補のカテゴリを持つデータを集め、該集めたデータか
ら特定の候補順位における類似カテゴリの出現率の高い
ものを抽出して、第１候補カテゴリに対する類似カテゴ
リの関連を示す第１前段知識として保持し、第１前段判
定手段（第１前段判定ステップ）では、入力画像の認識
結果として特定候補順位のカテゴリおよび類似カテゴリ
が得られるとき、該第１候補のカテゴリおよび特定候補
順位の類似カテゴリと前記第１前段知識における関連と
の比較により、該第１候補を”正解”または”不明”と
判定するようにしている。According to the first, seventh or fourteenth aspect of the present invention, the first pre-knowledge generating means (first pre-knowledge generating step) converts the data including the recognition result of the image recognition and its accompanying information. , The same as the recognition result
Data having candidate categories is collected, and data having a high appearance rate of similar categories in a specific candidate rank is extracted from the collected data, and held as first precedent knowledge indicating the association of similar categories with the first candidate category. In the first precedent determination means (first precedent determination step), when the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, the similar category of the first candidate and the specific candidate rank and the similar category The first candidate is determined to be "correct" or "unknown" by comparing with the association in the preceding knowledge.

【００２８】ここで、「データ」とは、画像認識システ
ムにおける認識結果とそれに付随する情報を指す。また
「カテゴリ」とは、同じ文字の集まりを指す。また、本
発明が適用される画像認識システムは、例えば、類似し
たカテゴリのうち、登録画像パタンと入力画像パタンと
のパタンマッチング（適合演算）における適合度（距
離）が最も高いものを認識システムの認識結果（第１候
補）として出力し、該認識結果について”正解”、”正
解と確信できない”、”不正解”または”不明”を確定
するものである。尚、適合度の演算方法によっては、認
識結果として適合度（距離）が最も低いものを出力する
場合もある。また、本発明では、画像認識システムの認
識結果とそれに付随する情報を含むデータを得て、該デ
ータを基に認識結果の確定処理を行うが、「付随情報」
とは、類似カテゴリの候補とその候補で演算されたカテ
ゴリ辞書（登録画像パタン）への距離である。Here, "data" refers to a recognition result in the image recognition system and information accompanying the result. The “category” refers to a group of the same characters. In addition, the image recognition system to which the present invention is applied determines, for example, among similar categories, those having the highest degree of matching (distance) in pattern matching (matching calculation) between a registered image pattern and an input image pattern. This is output as a recognition result (first candidate), and for the recognition result, "correct answer", "cannot be sure of correct answer", "incorrect answer" or "unknown" is determined. It should be noted that depending on the method of calculating the degree of conformity, the recognition result with the lowest degree of conformity (distance) may be output. Further, in the present invention, data including a recognition result of the image recognition system and information associated therewith are obtained, and the recognition result is determined based on the data.
Is the distance to a similar category candidate and a category dictionary (registered image pattern) calculated with the candidate.

【００２９】つまり、本発明の構成は、画像認識システ
ムにおいて、画像認識手段の後段に配置されるものであ
る。That is, the configuration of the present invention is arranged in the image recognition system after the image recognition means.

【００３０】本発明によれば、正解として確定できたも
のについては人がチェックする必要がなくなり、正解と
確定できなかったものについてのみチェックするだけで
よくなることから、受付け誤認識率を低下させることが
でき、画像認識システムにおける省力化をさらに図るこ
とができる。According to the present invention, there is no need for a person to check a correct answer, and only a check for a correct answer cannot be made. Therefore, labor saving in the image recognition system can be further achieved.

【００３１】また、請求項２、８または１５に記載の発
明によれば、第２前段知識作成手段（第２前段知識作成
ステップ）により、画像認識の認識結果およびその付随
情報を含むデータから、認識結果として同じ第１候補の
カテゴリを持つデータを集め、該集めたデータから特定
の候補順位における適合度（距離）から、該特定順位の
候補を”正解”と確定し得る適合度（距離）の大きさを
算定し、第２前段知識として保持し、第２前段判定手段
（第２前段判定ステップ）では、入力画像の認識結果と
して特定候補順位のカテゴリおよび類似カテゴリが得ら
れるとき、該特定候補順位のカテゴリまたは類似カテゴ
リにおける適合度（距離）の大きさと前記第２前段知識
における適合度（距離）の大きさとの比較により、該特
定順位の候補を”正解”または”不明”と判定するよう
にしている。これにより、正解として確定できたものに
ついては人がチェックする必要がなくなり、正解と確定
できなかったものについてのみチェックするだけでよく
なることから、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図ること
ができる。According to the second, eighth or fifteenth aspect of the present invention, the second pre-knowledge generating means (the second pre-knowledge generating step) converts the data including the recognition result of the image recognition and the accompanying information thereof. Data having the same first candidate category as a recognition result is collected, and from the collected data, the fitness (distance) at which the candidate of the specific rank can be determined as "correct" from the fitness (distance) in the specific candidate rank. Is calculated and held as second precedent knowledge. In the second precedent determination means (second precedent determination step), when the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, the identification is performed. By comparing the magnitude of the fitness (distance) in the category of the candidate rank or the similar category with the magnitude of the fitness (distance) in the second preceding knowledge, the candidate of the specific rank is identified as " So that it is determined that the solution "or" unknown ". This eliminates the need for a person to check what was determined as a correct answer, and only needs to check for those that could not be determined as a correct answer. Can further save labor.

【００３２】また、請求項３、９または１６に記載の発
明によれば、第３前段知識作成手段（第３前段知識作成
ステップ）により、画像認識の認識結果およびその付随
情報を含むデータから、認識結果として同じ第１候補の
カテゴリを持つデータを集め、該集めたデータから特定
の候補順位のカテゴリまたは類似カテゴリ間における適
合度（距離）差から、該第１候補を”正解”と確定し得
る適合度（距離）差の大きさを算定して、第３前段知識
として保持し、第３前段判定手段（第３前段判定ステッ
プ）では、入力画像の認識結果として特定候補順位のカ
テゴリおよび類似カテゴリが得られるとき、該特定の候
補順位のカテゴリまたは類似カテゴリ間における適度
（距離）差の大きさと前記第３前段知識における適合度
（距離）差の大きさとの比較により、該第１候補を”正
解”または”不明”と判定するようにしている。これに
より、正解として確定できたものについては人がチェッ
クする必要がなくなり、正解と確定できなかったものに
ついてのみチェックするだけでよくなることから、受付
け誤認識率を低下させることができ、画像認識システム
における省力化をさらに図ることができる。According to the third, ninth or sixteenth aspect of the present invention, the third pre-stage knowledge generation means (third pre-stage knowledge generation step) converts the data including the recognition result of the image recognition and its accompanying information into Data having the same first candidate category is collected as a recognition result, and the first candidate is determined as "correct" from the collected data based on a difference in the degree of fitness (distance) between a category of a specific candidate rank or a similar category. The magnitude of the difference in the degree of matching (distance) to be obtained is calculated and stored as third precedent knowledge, and the third precedent determination means (third precedent determination step) determines the category of the specific candidate rank and the similarity as the recognition result of the input image. When a category is obtained, the magnitude of the appropriateness (distance) difference between the category of the specific candidate rank or the similar category and the magnitude of the fitness (distance) difference in the third preceding knowledge By comparison, so as to determine the said first candidate "correct" or "unknown". This eliminates the need for a person to check what was determined as a correct answer, and only needs to check for those that could not be determined as a correct answer. Can further save labor.

【００３３】また、請求項１０または１７に記載の発明
によれば、予め、第１前段知識作成手段（第１前段知識
作成ステップ）により第１前段知識（第１候補カテゴリ
に対する類似カテゴリの関連）を、第２前段知識作成手
段（第２前段知識作成ステップ）により第２前段知識
（特定順位の候補を”正解”と確定し得る適合度（距
離）の大きさ）を、第３前段知識作成手段（第３前段知
識作成ステップ）により第３前段知識（第１候補を”正
解”と確定し得る適合度（距離）差の大きさ）を、それ
ぞれ作成しておき、第１前段判定手段（第１前段判定ス
テップ）、第２前段判定手段（第２前段判定ステップ）
および第３前段判定手段（第３前段判定ステップ）で
は、これら第１、第２および第３前段知識を用いて、入
力画像の認識結果について、”正解”または”不明”
（どのカテゴリにも属さないデータ）の２つに認識結果
を分別して確定するようにしている。これにより、受付
け誤認識率を低下させることができ、画像認識システム
における省力化をさらに図ることができる。また、帳票
読み込みの自動化を図ることができる。すなわち、例え
ば、１枚の帳票を認識対象エリアとするとき、確定結果
がすべて正解の場合はチェックを行うことなく結果を保
存することが可能となり、逆に、正解と確信できるもの
以外の結果が１つでも含まれる場合は、その帳票全体の
認識結果を保存せずに、後に人の入力をまとめて求める
といった運用が可能になる。According to the tenth or seventeenth aspect of the present invention, the first pre-stage knowledge (the relation of the similar category to the first candidate category) is determined in advance by the first pre-stage knowledge generation means (first pre-stage knowledge generation step). The second pre-knowledge generation means (second pre-knowledge generation step) converts the second pre-knowledge knowledge (the degree of fitness (distance) at which a candidate of a specific rank can be determined to be “correct”) to the third pre-knowledge generation Means (third first-stage knowledge creating step), the third first-stage knowledge (the magnitude of the degree of fitness (distance) difference that can determine the first candidate as “correct”) is created in advance, and the first first-stage determination means ( A first preceding-stage determining step), a second preceding-stage determining means (second preceding-stage determining step)
The third pre-stage determination means (third pre-stage determination step) uses the first, second, and third pre-stage knowledge to determine whether the recognition result of the input image is “correct” or “unknown”.
(Data that does not belong to any category) and decides the recognition result separately. As a result, the reception error recognition rate can be reduced, and labor saving in the image recognition system can be further achieved. Further, it is possible to automate the reading of the form. That is, for example, when one form is set as the recognition target area, if all the finalized results are correct, it is possible to save the results without performing a check, and conversely, results other than those that can be convinced are correct. When even one is included, it becomes possible to operate such that the input result of the person is collectively obtained later without storing the recognition result of the entire form.

【００３４】また、請求項４、１１または１８に記載の
発明によれば、第１軸抽出手段（第１軸抽出ステップ）
により、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
正解”とされているデータから、前記データの集合をク
ラスタリングするときの軸を抽出し、第１後段知識作成
手段（第１後段知識作成ステップ）により、第１軸抽出
手段（第１軸抽出ステップ）で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＯＫクラスタとして生成し、生成されたＯＫクラスタの
形状を第１後段知識として保持し、第１後段判定手段
（第１後段判定ステップ）では、入力画像の認識結果と
して認識結果およびその付随情報を含むデータが得られ
るとき、該データが第１後段知識のあるカテゴリに所属
するＯＫクラスタの中に含まれるか否かをチェックし
て、該ＯＫクラスタに含まれる場合には該ＯＫクラスタ
が所属するカテゴリを入力画像の認識結果として”正
解”と判定し、何れのＯＫクラスタにも含まれていない
場合には”未確定”と判定する。尚、第１軸抽出手段
（第１軸抽出ステップ）による軸抽出には、例えば、上
位候補に出現率の高いカテゴリを該軸として、下位候補
に出現率の高いカテゴリを該軸として、上位候補に出現
率の高いカテゴリおよび下位候補に出現率の高いカテゴ
リをあわせて軸として、中位候補に出現率の高いカテゴ
リを該軸として、相関係数の高いカテゴリ同士を該軸と
して、或いは、相関係数の低いカテゴリ同士を該軸とし
て抽出する方法等が考えられる。これにより、正解とし
て確定できたものについては人がチェックする必要がな
くなり、正解と確定できなかったものについてのみチェ
ックするだけでよくなることから、受付け誤認識率を低
下させることができ、画像認識システムにおける省力化
をさらに図ることができる。According to the invention described in claim 4, 11 or 18, the first axis extracting means (first axis extracting step).
Sampled data including the recognition result of image recognition and its accompanying information, and the recognition result was "
From the data determined to be "correct", an axis for clustering the set of data is extracted, and the first second-stage knowledge creating means (first second-stage knowledge creating step) uses first axis extracting means (first axis extracting step). ), The data set is clustered based on the axes extracted in step (c), the data is generated as an OK cluster of the category, and the shape of the generated OK cluster is held as first post-stage knowledge. In the latter-stage determination step), when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, it is determined whether or not the data is included in an OK cluster belonging to a category having the first second-stage knowledge. Checking, if the OK cluster is included, the category to which the OK cluster belongs is determined as “correct” as the recognition result of the input image, and any O If it is not included in the cluster, it is determined as “undecided.” In the axis extraction by the first axis extraction means (first axis extraction step), for example, a category having a high appearance rate as a top candidate is determined. As an axis, a category having a high appearance rate is set as the lower candidate, and a category having a high appearance rate is set as the upper candidate, and a category having a high appearance rate is set as the axis. As the axis, a method of extracting categories having a high correlation coefficient as the axis or extracting categories having a low correlation coefficient as the axis can be considered. It is no longer necessary to check the correct answer and only check those that could not be determined as correct. It can be further achieved that labor saving.

【００３５】また、請求項６、１２または１９に記載の
発明によれば、第２軸抽出手段（第２軸抽出ステップ）
により、画像認識の認識結果およびその付随情報を含む
データをサンプル収集し、あるカテゴリで認識結果が”
誤認”とされているデータから、前記データの集合をク
ラスタリングするときの軸を抽出し、第２後段知識作成
手段（第２後段知識作成ステップ）により、第２軸抽出
手段（第２軸抽出ステップ）で抽出された軸に基づき前
記データの集合のクラスタリングを行い、該カテゴリの
ＮＧクラスタとして生成し、生成されたＮＧクラスタの
形状を第２後段知識として保持し、第２後段判定手段
（第２後段判定ステップ）では、入力画像の認識結果と
して認識結果およびその付随情報を含むデータが得られ
るとき、該データが第２後段知識のあるカテゴリに所属
するＮＧクラスタの中に含まれるか否かをチェックし
て、該ＮＧクラスタに含まれる場合には該ＮＧクラスタ
が所属するカテゴリを前記入力画像の認識結果として”
不正解”と判定し、何れのＮＧクラスタにも含まれてい
ない場合には”正解”と判定するようにしている。尚、
第２軸抽出手段（第２軸抽出ステップ）における軸抽出
には、上位候補に出現率の高いカテゴリを該軸として、
下位候補に出現率の高いカテゴリを該軸として、上位候
補に出現率の高いカテゴリおよび下位候補に出現率の高
いカテゴリをあわせて軸として、中位候補に出現率の高
いカテゴリを該軸として、相関係数の高いカテゴリ同士
を該軸として、或いは、相関係数の低いカテゴリ同士を
該軸として抽出する方法等が考えられる。According to the invention of claim 6, 12 or 19, the second axis extracting means (second axis extracting step).
Sampled data including the recognition result of image recognition and its accompanying information, and the recognition result was "
An axis for clustering the set of data is extracted from the data which is determined to be "misidentified", and the second axis extraction means (second axis extraction step) ), The set of data is clustered based on the axes extracted in step (c), the cluster is generated as an NG cluster of the category, and the shape of the generated NG cluster is stored as second post-stage knowledge. In the latter-stage determination step), when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, it is determined whether or not the data is included in an NG cluster belonging to a category having the second latter-stage knowledge. Check that if the NG cluster is included, the category to which the NG cluster belongs is used as the recognition result of the input image.
It is determined to be "incorrect", and if it is not included in any of the NG clusters, it is determined to be "correct."
In the axis extraction in the second axis extraction means (second axis extraction step), a category having a high appearance rate as a top candidate is set as the axis,
Using the category with a high appearance rate in the lower candidate as the axis, the category with the high appearance rate in the upper candidate and the category with the higher appearance rate in the lower candidate as the axis, and the category with the higher appearance rate in the middle candidate as the axis, A method of extracting categories with high correlation coefficients as the axis, or categories with low correlation coefficient as the axis, or the like can be considered.

【００３６】これにより、正解として確定できたものに
ついては人がチェックする必要がなくなり、正解と確定
できなかったものについてのみチェックするだけでよく
なることから、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図ること
ができる。As a result, it is not necessary for a person to check a correct answer, and it is only necessary to check only a correct answer. Therefore, the false recognition rate can be reduced. Labor saving in the image recognition system can be further achieved.

【００３７】また、請求項５、１３または２０に記載の
発明によれば、第１後段判定手段（第１後段判定ステッ
プ）において、前記データがあるカテゴリに所属するＯ
Ｋクラスタに含まれる場合に、どの程度の確信を持っ
て”正解”といえるかを示す指標である確信度を確信度
算出手段（確信度算出ステップ）により計算するように
し、前記データが複数の別カテゴリに所属するＯＫクラ
スタに入った場合には、確信度に基づいてどのカテゴリ
を”正解”とするかを決定し、例えば、確信度の大きさ
が所定値以下の場合には”正解と確信できない”と判定
するようにしている。これにより、あるカテゴリのＯＫ
クラスタに入ったデータをそのカテゴリに属するデータ
と決定付け、逆にどのＯＫクラスタにも入らない場合
は”不明”（どのカテゴリにも属さない）データとされ
るが、あるカテゴリに属するデータと決定付けられた場
合に、認識結果の確信度合いを算出することにより、”
不明”、”正解”、”正解と確信できない”の３つに認
識結果を分別して確定されることとなる。また、さら
に、ＮＧクラスタの少なくとも１つに入った場合は”不
正解”データと決定され、”不明”、”正解”、”不正
解”、”正解と確信できない”の４つに認識結果を分別
して確定されることとなる。According to the fifth, thirteenth, or twentieth aspect of the present invention, in the first second-stage determining means (first second-stage determining step), the data belonging to a category belonging to a certain category is selected.
When included in the K cluster, a certainty factor, which is an index indicating how much certainty can be said to be “correct”, is calculated by a certainty factor calculation means (certainty factor calculation step), and the data is stored in a plurality of clusters. When entering the OK cluster belonging to another category, it is determined which category is “correct” based on the certainty. For example, when the magnitude of the certainty is equal to or less than a predetermined value, “correct” is determined. I'm not sure. " With this, OK of a certain category
Data entered into a cluster is determined as data belonging to that category. Conversely, if the data does not belong to any OK cluster, it is determined to be "unknown" (not belonging to any category), but data belonging to a certain category is determined. By calculating the degree of certainty of the recognition result,
Recognition results are classified into three types: "unknown", "correct", and "unsure of correctness", and if at least one of the NG clusters is included, "incorrect" data It is determined, and the recognition result is classified into four, namely, "unknown", "correct answer", "incorrect answer", and "cannot be confident that the answer is correct".

【００３８】[0038]

【発明の実施の形態】以下、本発明の画像認識装置、画
像認識方法および記録媒体の実施の形態について、図面
を参照して詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of an image recognition apparatus, an image recognition method, and a recording medium according to the present invention will be described in detail with reference to the drawings.

【００３９】図１は本発明の一実施形態に係る画像認識
装置の構成図である。FIG. 1 is a configuration diagram of an image recognition device according to an embodiment of the present invention.

【００４０】図１において、本実施形態の画像認識装置
は、おおまかにＯＣＲ認識処理部１０１および認識結果
確定部１０２を備えて構成されており、認識結果確定部
１０２は、知識生成部１０３、前段判定部（第１前段判
定手段、第２前段判定手段および第３前段判定手段）１
１５、後段判定部（第１後段判定手段および第２後段判
定手段）１１６、ルール判定部１１７、認識確定部１１
８およびリジェクト部１１９を備えて構成されている。
また、知識生成部１０３は、前段知識の作成部（第１前
段知識作成手段、第２前段知識作成手段および第３前段
知識作成手段）１１１、前段知識ベース（第１前段知
識、第２前段知識および第３前段知識）１１２、後段知
識の作成部（第１後段知識作成手段および第２後段知識
作成手段）１１３および後段知識ベース（第１後段知識
および第２後段知識）１１４を備えた構成である。Referring to FIG. 1, the image recognition apparatus of the present embodiment roughly includes an OCR recognition processing unit 101 and a recognition result determination unit 102. The recognition result determination unit 102 includes a knowledge generation unit 103, Judgment unit (first precedent judgment means, second precedent judgment means and third precedent judgment means) 1
15, second-stage determination unit (first second-stage determination unit and second second-stage determination unit) 116, rule determination unit 117, recognition determination unit 11
8 and a reject unit 119.
The knowledge generation unit 103 includes a pre-stage knowledge generation unit (first pre-stage knowledge generation unit, second pre-stage knowledge generation unit and third pre-stage knowledge generation unit) 111, and a pre-stage knowledge base (first pre-stage knowledge, second pre-stage knowledge). And a third pre-stage knowledge) 112, a post-knowledge generation unit (first post-stage knowledge generation unit and second post-stage knowledge generation unit) 113, and a post-stage knowledge base (first post-stage knowledge and second post-stage knowledge) 114 is there.

【００４１】また、図２は、本実施形態の画像認識装置
の全体的な処理の概略を説明する説明図である。同図に
示すように、本実施形態の画像認識装置は、複数カテゴ
リの認識対象を持ち、登録画像パタンと入力画像パタン
とのパタンマッチングにおける適合度（距離）に基づき
入力画像の認識を行うものであるが、認識結果確定部１
０２に特徴を持つものであり、ＯＣＲ認識処理部１０１
からの認識結果と付随情報１５１ｂについて、前段知識
ベース１１２および後段知識ベース１１４に基づく前段
判定部１１５および後段判定部１１６の判定を行うこと
によって、認識確定部１１８により、該認識結果が”未
確定（不正解）”、”正解”或いは”正解と確信が持て
ない”の何れであるかを確定するものである。FIG. 2 is an explanatory diagram for explaining the outline of the overall processing of the image recognition apparatus according to the present embodiment. As shown in the figure, the image recognition device of the present embodiment has recognition targets of a plurality of categories and recognizes an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. , But the recognition result determination unit 1
OCR recognition processing unit 101
Of the recognition result and the accompanying information 151b by the first-stage determination unit 115 and the second-stage determination unit 116 based on the first-stage knowledge base 112 and the second-stage knowledge base 114, the recognition determination unit 118 determines that the recognition result is “undetermined”. (Incorrect answer), "correct answer" or "cannot be sure of the correct answer".

【００４２】本実施形態の画像認識装置を構成する各構
成要素および各構成要素間の引き渡しデータについて簡
単に説明する。尚、本実施形態の画像認識装置では、説
明を簡単にするため、認識対象の文字画像として、数字
（０〜９）並びに３種の記号（”（”，”）”，”
−”）、合計１３種の文字を扱うものとする。The components constituting the image recognition apparatus of this embodiment and the data transferred between the components will be briefly described. In the image recognition device of the present embodiment, for the sake of simplicity, numerals (0 to 9) and three types of symbols ("(", ")", ") are used as character images to be recognized.
-"), For a total of 13 types of characters.

【００４３】まず、ＯＣＲ認識処理部１０１では、類似
した文字のうち、各文字辞書に対して適合度演算を行
う。ここで、適合度演算は、登録画像パタンと入力画像
パタンとのパタンマッチングにおいて行われるもので、
適合度（以下、距離という）の高いもの順に類似文字を
並べ、距離が最も高いものをＯＣＲの認識結果として出
力する。First, the OCR recognition processing unit 101 performs a fitness calculation for each character dictionary among similar characters. Here, the adaptability calculation is performed in pattern matching between the registered image pattern and the input image pattern.
Similar characters are arranged in descending order of the degree of matching (hereinafter referred to as distance), and the character having the highest distance is output as the OCR recognition result.

【００４４】また、認識結果１５１ａ，１５１ｂは、画
像認識の認識結果およびその付随情報を含むデータであ
り、該データは、１文字あたり１３個の候補と各々の候
補辞書に対する距離により構成されている（図６参
照）。特に、知識生成用認識結果１５１ａは、知識生成
のために利用するサンプル画像について実際の運用前に
ＯＣＲ認識処理部１０１によって取得し、知識生成部１
０３に渡される。また特に、認識結果１５１ｂは、実際
のシステム運用時に、ＯＣＲ認識処理部１０１から１文
字単位で前段判定部１１５に供給される。The recognition results 151a and 151b are data including the recognition result of the image recognition and its accompanying information, and are composed of 13 candidates per character and the distance to each candidate dictionary. (See FIG. 6). In particular, the knowledge generation recognition result 151a is obtained by the OCR recognition processing unit 101 before the actual operation of the sample image used for the knowledge generation, and the knowledge generation unit 1
03 is passed. In particular, the recognition result 151b is supplied from the OCR recognition processing unit 101 to the preceding-stage determination unit 115 on a character-by-character basis during actual system operation.

【００４５】前段知識の作成部１１１は、上記のよう
に、第１前段知識作成手段、第２前段知識作成手段およ
び第３前段知識作成手段を備えて構成され、第１前段知
識作成手段では、画像認識の認識結果およびその付随情
報を含むデータから、認識結果として同じ第１候補のカ
テゴリを持つデータを集め、該集めたデータから特定の
候補順位における類似カテゴリの出現率の高いものを抽
出して、これを第１候補カテゴリに対する類似カテゴリ
の関連を示す第１前段知識として前段知識ベース１１２
に登録する。As described above, the pre-knowledge generating section 111 is provided with the first pre-knowledge generating means, the second pre-knowledge generating means and the third pre-knowledge generating means. Data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information, and the data having a high appearance rate of the similar category in the specific candidate rank is extracted from the collected data. This is used as the first precedent knowledge indicating the relation of the similar category to the first candidate category,
Register with.

【００４６】また、第２前段知識作成手段では、画像認
識の認識結果およびその付随情報を含むデータから、認
識結果として同じ第１候補のカテゴリを持つデータを集
め、該集めたデータから特定の候補順位における適合度
（距離）から、該特定順位の候補を”正解”と確定し得
る適合度（距離）の大きさを算定し、第２前段知識とし
て前段知識ベース１１２に登録する。The second pre-stage knowledge generating means collects data having the same first candidate category as the recognition result from the data including the recognition result of the image recognition and the accompanying information, and designates a specific candidate from the collected data. Based on the degree of matching (distance) in the rank, the magnitude of the degree of matching (distance) that can determine the candidate of the specific rank as a “correct answer” is calculated and registered in the preceding knowledge base 112 as the second preceding knowledge.

【００４７】さらに、第３前段知識作成手段では、画像
認識の認識結果およびその付随情報を含むデータから、
認識結果として同じ第１候補のカテゴリを持つデータを
集め、該集めたデータから特定の候補順位のカテゴリま
たは類似カテゴリ間における適合度（距離）差から、該
第１候補を”正解”と確定し得る適合度（距離）差の大
きさを算定して、第３前段知識として前段知識ベース１
１２に登録する。Further, in the third pre-stage knowledge generating means, the data including the recognition result of the image recognition and the accompanying information thereof is obtained.
Data having the same first candidate category is collected as a recognition result, and the first candidate is determined as "correct" from the collected data based on a difference in the degree of fitness (distance) between a category of a specific candidate rank or a similar category. The magnitude of the obtained fitness (distance) difference is calculated, and the preceding knowledge base 1 is used as the third preceding knowledge.
Register in 12.

【００４８】本実施形態では、認識対象の文字はカテゴ
リが１３個あり、ＯＣＲ認識処理部１０１から出力され
るデータは、その１３個の全てが必ず候補文字の中に含
まれている。そのうち第２、第３候補文字データで出現
率の高い文字（第１前段知識）を取得すると共に、第１
候補の距離データで正解と判別できる最低値（第２前段
知識）、並びに、第１候補距離と第２候補距離の差のデ
ータから正解と判別できる最低値（第３前段知識）を算
出する。ここで作成された知識は、前段判定部１１５で
使用されることとなる。尚、１３種の数字および記号以
外の文字の場合については、数千種のカテゴリがある
が、この場合、ＯＣＲ認識処理部１０１から出力される
データは、距離の大きいカテゴリの上位数十種類とし
て、これらから同様にして、第Ｘ（Ｘは任意に選択可
能）候補に出現の多いものを選択していくことになろ
う。In the present embodiment, there are 13 categories of characters to be recognized, and all of the 13 data output from the OCR recognition processing unit 101 are always included in the candidate characters. Among them, a character having a high appearance rate (first precedent knowledge) in the second and third candidate character data is acquired, and the first and second candidate character data are acquired.
The lowest value that can be determined as a correct answer from the candidate distance data (second preceding knowledge) and the lowest value that can be determined as a correct answer from the data of the difference between the first candidate distance and the second candidate distance (third preceding knowledge) are calculated. The knowledge created here is used by the preceding-stage determination unit 115. Note that there are thousands of categories for characters other than the 13 types of numbers and symbols, but in this case, the data output from the OCR recognition processing unit 101 is classified into the top tens of types of the category having a large distance. In the same manner, a candidate having a large number of occurrences in the Xth (X can be arbitrarily selectable) candidate will be selected.

【００４９】次に、後段知識の作成部１１３は、上記の
ように、第１後段知識作成手段および第２後段知識作成
手段を備えて構成され、第１後段知識作成手段に含まれ
る第１軸抽出手段により、画像認識の認識結果およびそ
の付随情報を含むデータをサンプル収集し、あるカテゴ
リで認識結果が”正解”とされているデータから、デー
タの集合をクラスタリングするときの軸を抽出し、第１
後段知識作成手段により、抽出した軸に基づきデータの
集合のクラスタリングを行い、該カテゴリのＯＫクラス
タとして生成し、生成されたＯＫクラスタの形状を第１
後段知識として後段知識ベース１１４に登録する。Next, the second-stage knowledge creating unit 113 is provided with the first second-stage knowledge creating unit and the second second-stage knowledge creating unit as described above, and includes the first axis included in the first second-stage knowledge creating unit. The extraction means collects a sample of the data including the recognition result of the image recognition and the accompanying information, and extracts an axis for clustering a set of data from data for which the recognition result is “correct” in a certain category, First
The subsequent-knowledge creating means clusters the set of data based on the extracted axes, generates an OK cluster of the category, and sets the shape of the generated OK cluster to the first.
It is registered in the second-stage knowledge base 114 as second-stage knowledge.

【００５０】また、第２後段知識作成手段に含まれる第
２軸抽出手段により、画像認識の認識結果およびその付
随情報を含むデータをサンプル収集し、あるカテゴリで
認識結果が”誤認”とされているデータから、データの
集合をクラスタリングするときの軸を抽出し、第２後段
知識作成手段により、抽出した軸に基づきデータの集合
のクラスタリングを行い、該カテゴリのＮＧクラスタと
して生成し、生成されたＮＧクラスタの形状を第２後段
知識として後段知識ベース１１４に登録する。The second axis extracting means included in the second post-stage knowledge creating means collects a sample of the data including the recognition result of the image recognition and its accompanying information, and determines that the recognition result is "misrecognized" in a certain category. From the existing data, an axis for clustering the set of data is extracted, and the second set of knowledge creating means performs clustering of the set of data based on the extracted axis to generate an NG cluster of the category. The shape of the NG cluster is registered in the second-stage knowledge base 114 as the second second-stage knowledge.

【００５１】尚、第１および第２軸抽出手段における軸
抽出には、上位候補に出現率の高いカテゴリを該軸とし
て（誤認識し易いカテゴリを選択）、下位候補に出現率
の高いカテゴリを該軸として（誤認識し難いカテゴリを
選択）、上位候補に出現率の高いカテゴリおよび下位候
補に出現率の高いカテゴリをあわせて軸として（誤認識
し易いカテゴリとし難いカテゴリを選択）、中位候補に
出現率の高いカテゴリを該軸として（誤認識し易くもし
難くも無いカテゴリを選択）、相関係数の高いカテゴリ
同士を該軸として、或いは、相関係数の低いカテゴリ同
士を該軸として抽出する方法等が考えられる。In the axis extraction by the first and second axis extracting means, a category having a high appearance rate is selected as the upper candidate (select a category which is easily misrecognized), and a category having a high appearance rate is selected as the lower candidate. As the axis (select a category that is difficult to misrecognize), a category with a high appearance rate as a top candidate and a category with a high appearance rate as a lower candidate are combined (select a category that is difficult to misrecognize a category) and A category with a high appearance rate as a candidate is set as the axis (select a category that is not easily or easily misrecognized), and categories with a high correlation coefficient are set as the axis, or categories with a low correlation coefficient are set as the axis. An extraction method or the like can be considered.

【００５２】つまり、クラスタを生成するために軸を選
択し、その軸データを利用してクラスタを生成する。数
字であれば、カテゴリが１３個あり、候補にはその１３
個のカテゴリへの距離が大きいもの順に出力される。そ
の中から、認識確定に利用出来値特徴のある軸を選択す
ることになる。本実施形態では、上位候補に出現率が高
い文字（すなわち、誤認識し易い文字である）上位３つ
をクラスタ使用軸として選択した。そして、認識結果１
５１ａを用いてクラスタリングを実施した。作成される
クラスタ形状は楕円体となる。また１カテゴリ当たり、
複数のＯＫクラスタと複数のＮＧクラスタが存在するこ
ととなる。That is, an axis is selected to generate a cluster, and a cluster is generated using the axis data. If it is a number, there are 13 categories, and 13
The categories are output in descending order of the distance to the category. From among them, an axis having an available value feature for recognition confirmation is selected. In the present embodiment, the top three characters having a high appearance rate in the top candidates (that is, characters that are easily misrecognized) are selected as cluster use axes. And recognition result 1
Clustering was performed using 51a. The created cluster shape is an ellipsoid. Also, per category,
There are a plurality of OK clusters and a plurality of NG clusters.

【００５３】次に、前段判定部１１５は、上記のよう
に、第１前段判定手段、第２前段判定手段および第３前
段判定手段を備えて構成され、第１前段判定手段では、
入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと第１前段知識にお
ける関連との比較により、該第１候補を”正解”また
は”不明”と判定する。Next, as described above, the first-stage determining unit 115 includes the first first-stage determining unit, the second first-stage determining unit, and the third first-stage determining unit.
When the category of the specific candidate rank and the similar category are obtained as the recognition result of the input image, the first candidate is determined by comparing the category of the first candidate and the similar category of the specific candidate rank with the association in the first preceding knowledge. Judge as "correct" or "unknown".

【００５４】また、第２前段判定手段では、該特定候補
順位のカテゴリまたは類似カテゴリにおける適合度（距
離）の大きさと第２前段知識における適合度（距離）の
大きさとの比較により、該特定順位の候補を”正解”ま
たは”不明”と判定する。さらに、第３前段判定手段で
は、該特定の候補順位のカテゴリまたは類似カテゴリ間
における適合度（距離）差の大きさと第３前段知識にお
ける適合度（距離）差の大きさとの比較により、該第１
候補を”正解”または”不明”と判定する。Further, the second preceding stage determining means compares the magnitude of the degree of matching (distance) in the category of the specific candidate rank or the similar category with the degree of matching (distance) in the second preceding stage knowledge to determine the specific ranking. Are determined to be “correct” or “unknown”. Further, the third preceding-stage determining means compares the magnitude of the fitness (distance) difference between the category of the specific candidate rank or the similar category with the magnitude of the fitness (distance) difference in the third preceding-stage knowledge, and compares the magnitude of the difference. 1
The candidate is determined as “correct” or “unknown”.

【００５５】本実施形態では、認識結果１５１ｂにおけ
る第２候補および第３候補の文字と、前段知識ベース１
１２の知識とを比較する。例えば、第１候補が”１”で
あった場合には、第２候補および第３候補にそれぞれ”
６”および”）”という文字が出現する率が高い。つま
り、第１前段知識には、第２候補および第３候補に出現
する文字が、”６”および”）”のとき、第１候補が”
１”であるものを正解とする知識を所有しており、まず
認識結果１５１ｂの第２候補および第３候補の文字種類
をチェックするのである。In the present embodiment, the characters of the second and third candidates in the recognition result 151b are
Compare with 12 knowledge. For example, if the first candidate is “1”, “2” and “3”
The probability that the characters “6” and “)” appear is high. In other words, the first preamble contains the first candidate when the characters appearing in the second and third candidates are “6” and “)”. But"
The user has the knowledge that the answer is “1”, and first checks the character types of the second and third candidates of the recognition result 151b.

【００５６】次に、前段知識ベース１１２で保持する第
２前段知識（第１候補の距離データで正解と判別できる
最低値）を参照して、該最低値以上の値が第１候補の距
離について算出されているかをチェックし、最低値以上
なら”正解”とする。さらに、第１候補および第２候補
の距離が非常に近い値であると、誤認識している可能性
があるため、前段知識ベース１１２で保持する第３前段
知識（第１候補距離と第２候補距離の差のデータで正解
と判別できる最低値）を参照して、該最低値以上の値で
あるかをチェックし、最低値以上なら”正解”とする。
以上の３つのチェック全てで”正解”と判断されたデー
タについてのみ、認識確定部１１８に結果（確定データ
１５５）が渡されることとなる。Next, with reference to the second precedent knowledge (the lowest value that can be determined as a correct answer based on the distance data of the first candidate) held in the precedent knowledge base 112, a value equal to or more than the minimum value is determined for the distance of the first candidate. It is checked whether it has been calculated, and if it is more than the minimum value, it is determined as "correct answer". Further, if the distance between the first candidate and the second candidate is a very short value, there is a possibility that the distance may be erroneously recognized. Therefore, the third preceding knowledge stored in the preceding knowledge base 112 (the first candidate distance and the second With reference to the candidate distance difference data (the lowest value that can be determined to be a correct answer), it is checked whether the value is equal to or greater than the minimum value.
Only for data determined to be “correct” in all three checks, the result (determined data 155) is passed to the recognition determining unit 118.

【００５７】したがって、確定データ１５５は前段判定
部１１５で正解と判断されたデータであり、不明データ
１５４は前段判定部１１５で正解と判断されなかったデ
ータである。Therefore, the fixed data 155 is data determined to be correct by the preceding-stage determining unit 115, and the unknown data 154 is data not determined to be correct by the preceding-stage determining unit 115.

【００５８】次に、後段判定部１１６は、上記のよう
に、第１後段判定手段および第２後段判定手段を備えて
構成され、第１後段判定手段では、入力画像の認識結果
として認識結果およびその付随情報を含むデータが得ら
れるとき、該データが第１後段知識のあるカテゴリに所
属するＯＫクラスタの中に含まれるか否かをチェックし
て、該ＯＫクラスタに含まれる場合には該ＯＫクラスタ
が所属するカテゴリを入力画像の認識結果として”正
解”と判定し、何れのＯＫクラスタにも含まれていない
場合には”未確定”と判定する。また、第２後段判定
手段では、該データが第２後段知識のあるカテゴリに所
属するＮＧクラスタの中に含まれるか否かをチェックし
て、該ＮＧクラスタに含まれる場合には該ＮＧクラスタ
が所属するカテゴリを前記入力画像の認識結果として”
不正解”と判定し、何れのＮＧクラスタにも含まれてい
ない場合には”正解”と判定する。Next, the second-stage determination unit 116 is provided with the first second-stage determination unit and the second second-stage determination unit as described above, and the first second-stage determination unit recognizes the recognition result and the input image as the recognition result of the input image. When data including the accompanying information is obtained, it is checked whether or not the data is included in an OK cluster belonging to a category having the first second-stage knowledge. The category to which the cluster belongs is determined as “correct” as the recognition result of the input image, and if not included in any of the OK clusters, it is determined as “undetermined”. The second subsequent-stage determination unit checks whether the data is included in an NG cluster belonging to a category having second second-stage knowledge, and if the data is included in the NG cluster, the NG cluster is determined. The category to which the user belongs belongs as the recognition result of the input image.
It is determined to be “incorrect”, and if it is not included in any of the NG clusters, it is determined to be “correct”.

【００５９】つまり、前段判定部１１５で未確定とされ
たデータ１５４を対象に、各カテゴリで生成されたクラ
スタを用いて判定を行う。この後段判定部１１６には、
ＯＫクラスタおよびＮＧクラスタの知識が後段知識ベー
ス１１４から供給され、ＯＫクラスタに入るかどうかの
チェック、並びに、ＮＧクラスタに入るかどうかのチェ
ックという２段階の判定を行う。That is, for the data 154 which has not been determined by the preceding-stage determination unit 115, a determination is made using clusters generated for each category. In the latter-stage determination unit 116,
The knowledge of the OK cluster and the NG cluster is supplied from the subsequent knowledge base 114, and a two-stage determination of whether to enter the OK cluster and whether to enter the NG cluster is performed.

【００６０】第１後段判定手段において、データがある
カテゴリに所属するＯＫクラスタに含まれる場合には、
どの程度の確信を持って”正解”といえるかを示す指標
である確信度を当該後段判定部１１６に含まれる確信度
算出手段により計算し、データが複数の別カテゴリに所
属するＯＫクラスタに入った場合には、該確信度に基づ
いてどのカテゴリを”正解”とするかを決定する。例え
ば、確信度の大きさが所定値以下の場合には”正解と確
信できない”と判定する。If the data is included in an OK cluster belonging to a certain category,
A certainty factor, which is an index indicating how much certainty can be said to be a “correct answer”, is calculated by the certainty factor calculation means included in the latter-stage determination unit 116, and the data enters an OK cluster belonging to a plurality of different categories. In such a case, it is determined which category is the “correct answer” based on the certainty factor. For example, when the degree of certainty is equal to or smaller than a predetermined value, it is determined that “the user cannot be sure of the correct answer”.

【００６１】尚、確信度算出手段における確信度算出方
法には、データが、該ＯＫクラスタの重心からどの位の
距離に位置しているかを演算し、重心から近ければ高い
確信度を、重心から遠ければ低い確信度をそれぞれ算出
する第１の方法と、また、該ＯＫクラスタを構成するデ
ータの分布に基づいて、データが密度の高いエリアに存
在する場合は高い確信度を、密度の低いエリアに存在す
る場合は低い確信度をそれぞれ算出する第２の方法とが
ある。The certainty calculating method in the certainty calculating means calculates how far the data is located from the center of gravity of the OK cluster. If the data is closer to the center of gravity, the higher certainty is calculated from the center of gravity. A first method of calculating a low certainty factor if the data is far away, and a high certainty factor if the data exists in a high-density area based on the distribution of data forming the OK cluster; , There is a second method of calculating each of the low certainty factors.

【００６２】また、第２後段判定手段において、データ
があるカテゴリに所属するＮＧクラスタに含まれる場合
には、当該後段判定部１１６に含まれる確信度更新手段
により、ＯＫクラスタに含まれることで計算された確信
度を低くする処理が行われる。When the data is included in an NG cluster belonging to a certain category in the second subsequent-stage determining means, the data is calculated by being included in the OK cluster by the certainty factor updating means included in the second-stage determining unit 116. A process for lowering the confidence level is performed.

【００６３】したがって、確定データ１５６は、少なく
とも１つのＯＫクラスタに含まれ、確信度が計算された
データであり、未確定データ１５７は、後段判定部１１
６で、どのＯＫクラスタにも含まれなかった場合のデー
タであり、認識結果をどのカテゴリにも確定することが
できなかったデータである。Therefore, the confirmed data 156 is data that is included in at least one OK cluster and for which the degree of certainty has been calculated.
6 is data that is not included in any of the OK clusters, and is data for which the recognition result could not be determined for any category.

【００６４】次に、リジェクト部１１９は、後段判定部
１１６で未確定とされたデータ１５７、或いは、ルール
判定部１１７で正確と確信できないと判定されたデータ
１５９を受ける部分である。Next, the reject section 119 is a section for receiving the data 157 which has not been determined by the subsequent-stage determination section 116 or the data 159 which has been determined by the rule determination section 117 to be uncertain.

【００６５】また、ルール判定部１１７は、後段判定部
１１６で算出された確信度に基づいて、正解と確信でき
ないデータとしてリジェクトするか、確定データとして
受け付けるか判定する部分である。実際には、ユーザの
設定した値との大小比較で”確定”、”正解と確信でき
ない”を分けている。The rule determining section 117 is a section for determining whether to reject as data for which the user cannot be sure of a correct answer or to accept it as finalized data, based on the degree of certainty calculated by the subsequent-stage determining section 116. Actually, “determined” and “cannot be sure of the correct answer” are classified based on the magnitude comparison with the value set by the user.

【００６６】したがって、確定データ１５８は、ルール
判定部１１７でユーザの設定した値より確信度が大きい
データであり、正解と確信できないデータ１５９は、ル
ール判定部１１７でユーザの設定した値より確信度が小
さいデータである。Therefore, the fixed data 158 is data having a certainty greater than the value set by the user in the rule determining unit 117, and the data 159 for which the user cannot be sure of the correct answer is a data having a certainty greater than the value set by the user in the rule determining unit 117. Is small data.

【００６７】さらに、認識確定部１１８は、前段判定部
１１５またはルール判定部１１７で認識結果を確定でき
たデータ１５５，１５８を受ける部分である。Further, the recognition determining section 118 is a section that receives the data 155 and 158 whose recognition result has been determined by the preceding-stage determining section 115 or the rule determining section 117.

【００６８】以上説明した各構成要素により、本実施形
態の画像認識装置では、概略的に以下のような動作が行
われる。図３、図４および図５には、本実施形態の画像
認識装置の概略的動作を説明する概略フローチャートを
示す。The following operations are generally performed in the image recognition device of the present embodiment by the above-described components. FIGS. 3, 4 and 5 show schematic flowcharts for explaining the schematic operation of the image recognition device of the present embodiment.

【００６９】まず、図３は知識作成処理の概略フローチ
ャートである。知識作成処理では、まずステップ３０１
で、サンプルデータについて、ＯＣＲ認識処理部１０１
により知識生成用の認識結果１５１ａの収集が行われ
る。First, FIG. 3 is a schematic flowchart of the knowledge creation processing. In the knowledge creation process, first, at step 301
Then, for the sample data, the OCR recognition processing unit 101
, The recognition result 151a for knowledge generation is collected.

【００７０】次に、ステップ３０２では、知識生成部１
０３の前段知識の作成部１１１により前段知識ベース１
１２が構築される。すなわち、第１前段知識作成手段に
より第１前段知識（第１候補カテゴリに対する類似カテ
ゴリの関連）が、第２前段知識作成手段により第２前段
知識（特定順位の候補を”正解”と確定し得る適合度
（距離）の大きさ）が、第３前段知識作成手段により第
３前段知識（第１候補を”正解”と確定し得る適合度
（距離）差の大きさ）が、それぞれ作成され登録され
る。Next, at step 302, the knowledge generation unit 1
03, the pre-knowledge base 1 by the pre-knowledge creation unit 111
12 is constructed. In other words, the first pre-knowledge generating means can determine the first pre-knowledge (association of a similar category with the first candidate category) and the second pre-knowledge generating means can determine the second pre-knowledge (a candidate of a specific rank is “correct”). The degree of fitness (distance)) is created and registered by the third-level knowledge creating means, and the degree of fitness (distance) difference that can determine the first candidate as “correct” is created and registered. Is done.

【００７１】さらに、ステップ３０３では、知識生成部
１０３の後段知識の作成部１１３により後段知識ベース
１１４が構築される。すなわち、第１後段知識作成手段
により第１後段知識（ＯＫクラスタの形状）が、第２後
段知識作成手段により第２後段知識（ＮＧクラスタの形
状）がそれぞれ作成され登録される。Further, in step 303, the post-stage knowledge base 114 is constructed by the post-stage knowledge creation unit 113 of the knowledge generation unit 103. That is, first second-stage knowledge (shape of OK cluster) is created and registered by the first second-stage knowledge creation means, and second second-stage knowledge (shape of NG cluster) is created and registered by the second second-stage knowledge creation means.

【００７２】次に、図４および図５は、システム運用時
の処理の概略フローチャートである。まず、ステップ４
０１で、実際の認識対象画像について、ＯＣＲ認識処理
部１０１により認識処理されて、認識結果１５１ｂが認
識結果確定部１０２の前段判定部１１５に供給される。Next, FIGS. 4 and 5 are schematic flowcharts of the processing at the time of operating the system. First, step 4
At 01, the actual recognition target image is subjected to recognition processing by the OCR recognition processing unit 101, and the recognition result 151 b is supplied to the preceding stage determination unit 115 of the recognition result determination unit 102.

【００７３】次に、前段判定部１１５による処理が行わ
れる。まず、ステップ４０２では、前段判定部１１５の
第１前段判定手段により、第１候補のカテゴリおよび特
定候補順位の類似カテゴリと第１前段知識における関連
とを比較して、知識と同じであれば該第１候補を”正
解”と判定してステップ４０３に進み、また知識と違っ
ていれば”不明”と判定して図５のステップ５０１に進
む。Next, the process by the preceding stage determination unit 115 is performed. First, in step 402, the first-stage determination unit of the first-stage determination unit 115 compares the category of the first candidate and the similar category of the specific candidate rank with the association in the first first-stage knowledge. The first candidate is determined to be "correct" and the process proceeds to step 403. If the first candidate is different from the knowledge, it is determined to be "unknown" and the process proceeds to step 501 in FIG.

【００７４】次に、ステップ４０３では、第２前段判定
手段により、該特定候補順位のカテゴリまたは類似カテ
ゴリにおける適合度（距離）の大きさと第２前段知識に
おける適合度（距離）の大きさとを比較して、知識の距
離の大きさより大きい値であれば該第１候補を”正解”
と判定してステップ４０４に進み、また知識の距離の大
きさ以下であれば”不明”と判定して図５のステップ５
０１に進む。Next, in step 403, the magnitude of the fitness (distance) in the category of the specific candidate rank or the similar category is compared with the magnitude of the fitness (distance) in the knowledge of the second preceding stage by the second preceding stage determining means. Then, if the value is larger than the magnitude of the distance of knowledge, the first candidate is determined to be “correct”.
And proceeds to step 404. If the distance is equal to or less than the distance of knowledge, it is determined to be "unknown" and step 5 in FIG.
Go to 01.

【００７５】さらに、ステップ４０４では、第３前段判
定手段により、該特定の候補順位のカテゴリまたは類似
カテゴリ間における適合度（距離）差の大きさと第３前
段知識における適合度（距離）差の大きさとを比較し
て、知識の距離差の大きさより大きい値であれば該第１
候補を”正解”と判定して図５のステップ５０８に進
み、また知識の距離差の大きさ以下であれば”不明”と
判定して図５のステップ５０１に進む。Further, in step 404, the magnitude of the degree of fitness (distance) difference between the category of the particular candidate rank or the similar category and the magnitude of the degree of fitness (distance) difference in the knowledge of the third level are determined by the third preceding stage determining means. And if the value is greater than the magnitude of the distance difference in knowledge, the first
The candidate is determined to be “correct” and the process proceeds to step 508 in FIG. 5. If the candidate is smaller than the magnitude of the distance difference between the knowledges, it is determined to be “unknown” and the process proceeds to step 501 in FIG.

【００７６】次に、図５の後段判定部１１６における処
理に進む。まず、ステップ５０１では、後段判定部１１
６の第１後段判定手段により、データが第１後段知識の
あるカテゴリに所属するＯＫクラスタの中に含まれるか
否かをチェックする。ここで、ＯＫクラスタの中に含ま
れていなければ、ステップ５０９でリジェクトされる。Next, the procedure proceeds to the processing in the subsequent-stage determination section 116 of FIG. First, in step 501, the subsequent-stage determination unit 11
The first second-stage determination means 6 checks whether the data is included in an OK cluster belonging to a category having the first second-stage knowledge. If it is not included in the OK cluster, it is rejected in step 509.

【００７７】また、ステップ５０１において、データが
あるカテゴリに所属するＯＫクラスタに含まれる場合に
は、ステップ５０２またはステップ５０３に進んで、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を確信度算出手段により計算する。If it is determined in step 501 that the data is included in an OK cluster belonging to a certain category, the process proceeds to step 502 or step 503, and an index indicating the degree of certainty that the data can be said to be “correct” Is calculated by the certainty calculating means.

【００７８】確信度算出手段による確信度算出は、ステ
ップ５０２のように、データが、該ＯＫクラスタの重心
からどの位の距離に位置しているかを演算し、重心から
近ければ高い確信度を、重心から遠ければ低い確信度を
それぞれ算出するか、或いは、ステップ５０３のよう
に、該ＯＫクラスタを構成するデータの分布に基づい
て、データが密度の高いエリアに存在する場合は高い確
信度を、密度の低いエリアに存在する場合は低い確信度
をそれぞれ算出するか、の方法で行われる。The confidence calculation by the confidence calculation means calculates the distance of the data from the center of gravity of the OK cluster as in step 502, and if the data is closer to the center of gravity, the higher confidence is calculated. If the data is located in a high-density area based on the distribution of data constituting the OK cluster, a high confidence is calculated based on the distribution of the data constituting the OK cluster, as in step 503. If it exists in an area with a low density, a low certainty factor is calculated, respectively.

【００７９】本実施形態では、図５に示すように、ステ
ップ５０２でクラスタ重心からの距離による確信度を求
め、ステップ５０３でクラスタ内座標値による確信度を
求め、ステップ５０４において、これら２つの算出され
た確信度を加算して最終的な確信度としている。In this embodiment, as shown in FIG. 5, certainty based on the distance from the cluster center of gravity is determined in step 502, certainty based on the intra-cluster coordinate value is determined in step 503, and these two calculated values are determined in step 504. The obtained confidences are added to make the final confidence.

【００８０】次に、ステップ５０５では、第２後段判定
手段により、該データが第２後段知識のあるカテゴリに
所属するＮＧクラスタの中に含まれるか否かをチェック
する。ここで、ＮＧクラスタの中に含まれていなければ
ステップ５０７に進む。Next, in step 505, the second post-stage determining means checks whether or not the data is included in the NG cluster belonging to the category having the second post-stage knowledge. Here, if it is not included in the NG cluster, the process proceeds to step 507.

【００８１】また、ステップ５０５において、データが
あるカテゴリに所属するＮＧクラスタに含まれる場合に
は、ステップ５０６に進んで、確信度更新手段により、
ＯＫクラスタに含まれることで計算された確信度を低く
するように、確信度を調整する。If it is determined in step 505 that the data is included in the NG cluster belonging to a certain category, the process proceeds to step 506, where the certainty factor updating unit executes
The certainty is adjusted so as to lower the certainty calculated by being included in the OK cluster.

【００８２】さらに、ステップ５０７では、ルール判定
部１１７により、後段判定部１１６で算出された確信度
とユーザ設定された値とを比較して、ユーザ設定値以下
の確信度の場合には、ステップ５０９で”正解と確信で
きない”データとしてリジェクトし、また、ユーザ設定
値より大きい確信度の場合には、ステップ５０８で”正
解”の確定データとして受け付ける。Further, in step 507, the rule determining section 117 compares the certainty calculated by the second-stage determining section 116 with the value set by the user. In step 509, the data is rejected as "cannot be convinced that the answer is correct". If the credibility is larger than the user set value, it is accepted in step 508 as the "correct answer" final data.

【００８３】次に、以下では、本実施形態の画像認識装
置を構成する各構成要素について、その機能および動作
並びに用いる手法等を各説明図を参照して詳細に説明し
ていく。Next, the function and operation of each component constituting the image recognition apparatus of the present embodiment, the method used, and the like will be described in detail with reference to the accompanying drawings.

【００８４】まず、ＯＣＲ認識処理部１０１から出力さ
れる認識結果について図６を参照して説明する。認識対
象１文字当たりに出力されるデータは、図６の各行に示
すように、各行上段の候補文字と、各行下段の各候補に
ついての距離である。First, the recognition result output from the OCR recognition processing unit 101 will be described with reference to FIG. As shown in each row of FIG. 6, the data output for each character to be recognized is the distance between the candidate character at the top of each row and each candidate at the bottom of each row.

【００８５】第１行目のデータを例にとり説明を加える
と、次のようになる。すなわち、第１候補は、”０”
で、”０”の距離は２３４である。第２候補は、”９”
で、”９”の距離は２１４である。…。第１３候補
は、”）”で、”）”の距離は１３５である。ここで、
距離が大きいほど、認識した文字への信頼度が高いこと
を意味する。つまり、第１行目のデータについて、ＯＣ
Ｒ認識処理部１０１は認識対象文字を”０”として認識
したことになる。The following is an explanation given by taking the data in the first row as an example. That is, the first candidate is “0”
The distance of “0” is 234. The second candidate is “9”
Thus, the distance of “9” is 214. …. The thirteenth candidate is “)”, and the distance of “)” is 135. here,
The greater the distance, the higher the reliability of the recognized character. That is, for the data in the first row, the OC
This means that the R recognition processing unit 101 has recognized the recognition target character as “0”.

【００８６】次に、前段知識の作成部１１１による前段
知識ベース１１２の構築について説明する。まず、第１
前段知識作成手段では、認識結果１５１ａ（図６参照）
のデータのうち、第２候補および第３候補に出現する候
補文字をカウントする。そのうち出現数の上位２つの候
補文字を第１前段知識として保存する。Next, the construction of the preceding knowledge base 112 by the preceding knowledge creating unit 111 will be described. First, the first
In the preceding knowledge creating means, the recognition result 151a (see FIG. 6)
, The candidate characters appearing in the second and third candidates are counted. Among them, the top two candidate characters having the number of appearances are stored as first precedent knowledge.

【００８７】次に、第２および第３前段知識作成手段で
は、認識結果１５１ａ（図６参照）のデータから、第１
候補距離の大きさ並びに（第１候補距離−第２候補距
離）の大きさの２つの特徴を用いて、認識結果の”確
定”、”未確定”を判断する第２および第３前段知識
（しきい値パラメータ）を作成する。Next, the second and third pre-stage knowledge generating means converts the data of the recognition result 151a (see FIG. 6) into the first
The second and third pre-stage knowledge ("K") for determining whether the recognition result is "determined" or "undetermined" using the two features of the magnitude of the candidate distance and the magnitude of (first candidate distance-second candidate distance). Threshold parameter).

【００８８】まず第１に、知識作成用データからパラメ
ータ決定に使用するデータの選択を行う（カテゴリ毎に
実施する）。具体的には、以下のイ）からニ）の手順で
行われる。First, data to be used for parameter determination is selected from the data for creating knowledge (performed for each category). Specifically, it is performed in the following steps a) to d).

【００８９】イ）（第１候補距離−第２候補距離）が、
０、１、２…の場合の正解サンプル数を調べる。B) (first candidate distance−second candidate distance) is
The number of correct answer samples in the case of 0, 1, 2, ... is examined.

【００９０】ロ）正解サンプル数と誤判別サンプル数の
比を調べる。B) Check the ratio of the number of correct answer samples to the number of incorrectly judged samples.

【００９１】ハ）誤判別サンプル数と正解サンプル数の
比が一定以上（例えば、４０％以上）になる距離差αを
見つける。C) Find a distance difference α at which the ratio of the number of incorrectly determined samples to the number of correct samples is equal to or more than a certain value (for example, 40% or more).

【００９２】ニ）（第１候補距離−第２候補距離）がα
以下のデータは、全てリジェクトの対象として（２）の
パラメータ決定の対象から除く。D) (first candidate distance−second candidate distance) is α
The following data are all rejected and excluded from the parameter determination in (2).

【００９３】次に（第２に）、文字（カテゴリ）ごとに
誤判別をリジェクトするのに必要な第１候補距離並びに
第１候補および第２候補の距離差のしきい値を見つけ
る。具体的には、以下のイ）からハ）の手順で行われ
る。Next (secondly), a first candidate distance necessary for rejecting erroneous determination for each character (category) and a threshold value of a distance difference between the first candidate and the second candidate are found. Specifically, it is performed in the following steps a) to c).

【００９４】イ）誤判別サンプルの第１候補距離と（第
１候補距離−第２候補距離）を調べる。A) Check the first candidate distance and (first candidate distance−second candidate distance) of the erroneously determined sample.

【００９５】ロ）正解のサンプルの第１候補距離と（第
１候補距離−第２候補距離）を調べる。B) Check the first candidate distance and (first candidate distance−second candidate distance) of the correct answer sample.

【００９６】ハ）誤判別をすべてリジェクトし、かつ正
解のリジェクトを最小限に抑えることのできる第１候補
と（第１候補距離−第２候補距離）のしきい値を決め
る。C) A threshold value of a first candidate and a (first candidate distance−second candidate distance) capable of rejecting all erroneous determinations and minimizing rejection of a correct answer is determined.

【００９７】以上のようにして得られたパラメータを図
７に示す。図中、「第１候補距離」が第２前段知識であ
り、「第１−第２候補距離」が第３前段知識であり、
「第２、第３候補に出現率の高い文字」が第１前段知識
である。FIG. 7 shows the parameters obtained as described above. In the figure, “first candidate distance” is second precedent knowledge, “first-second candidate distance” is third precedent knowledge,
"Characters having a high appearance rate in the second and third candidates" is the first preceding knowledge.

【００９８】次に、前段判定部１１５による判定方法に
ついて説明する。前段判定部１１５は、知識生成部１０
３で作成した候補文字の距離データによる前段知識ベー
ス１１２を使用して、ＯＣＲ認識処理部１０１の認識結
果１５１ｂを”確定”、”未確定”に分けるものであ
る。ここで、”確定”、”未確定”に分けるためのルー
ルは、以下に述べる通りである。Next, a description will be given of a determination method by the preceding-stage determination unit 115. The pre-stage determination unit 115 determines whether the knowledge generation unit 10
The recognition result 151b of the OCR recognition processing unit 101 is divided into "determined" and "undetermined" using the preceding knowledge base 112 based on the distance data of the candidate characters created in step 3. Here, the rules for dividing into “determined” and “undetermined” are as described below.

【００９９】まず第１に（手順１）、第１候補を参照
し、第１前段知識（図７参照）からチェックに使用する
パラメータを選択する。次に（手順２）、第２候補およ
び第３候補文字が第１前段知識（出現率が高いもの）に
一致しているかチェックする。一致しないデータが入っ
ている場合は”不明”データとする。次に（手順３）、
手順２で”不明”データとされなかったデータについ
て、第１候補距離が第２前段知識（図７の第１候補距離
パラメータ）より小さければ”不明”データとする。さ
らに（手順４）、手順３で”不明”データとされなかっ
たデータについて、（第１候補距離−第２候補距離）の
チェックを行う。すなわち、（第１候補距離−第２候補
距離）が第３前段知識（図７の（第１候補距離−第２候
補距離）パラメータ）より小さければ”不明”データと
する。First (procedure 1), referring to the first candidate, a parameter to be used for the check is selected from the first preceding knowledge (see FIG. 7). Next (procedure 2), it is checked whether the second candidate character and the third candidate character match the first preceding knowledge (the one having a high appearance rate). If data that does not match is included, it is regarded as "unknown" data. Next (step 3)
If the first candidate distance is smaller than the second preceding knowledge (the first candidate distance parameter in FIG. 7), the data that is not determined as “unknown” data in step 2 is determined as “unknown” data. Further (step 4), a check of (first candidate distance−second candidate distance) is performed on the data not determined as “unknown” data in step 3. That is, if (the first candidate distance−the second candidate distance) is smaller than the third preceding knowledge (the (first candidate distance−the second candidate distance) parameter in FIG. 7), the data is determined to be “unknown”.

【０１００】図８には、以上の手順をフローチャートと
して示す。すなわち、まずステップ８０１では、ＯＣＲ
認識処理部１０１の認識結果１５１ｂから第１候補を取
得する。次に、ステップ８０２では、第１候補を参照
し、第１前段知識から「第２、第３候補に出現率の高い
候補文字」を抽出する。そして、ステップ８０３では、
認識結果１５１ｂの第２、第３候補と第１前段知識の第
２、第３候補とが一致するかを判断する。ここで、一致
しない場合には、ステップ８０８に進んで、認識結果１
５１ｂの第１候補を”不明”とする。FIG. 8 is a flowchart showing the above procedure. That is, first, in step 801, the OCR
The first candidate is obtained from the recognition result 151b of the recognition processing unit 101. Next, in step 802, “a candidate character having a high appearance rate in the second and third candidates” is extracted from the first preceding knowledge with reference to the first candidate. Then, in step 803,
It is determined whether the second and third candidates of the recognition result 151b match the second and third candidates of the first preceding knowledge. If they do not match, the process proceeds to step 808, where the recognition result 1
The first candidate of 51b is set to “unknown”.

【０１０１】次に、ステップ８０３において一致した場
合には、ステップ８０４に進んで、第２および第３前段
知識から「第１候補距離」パラメータαおよび「第１−
第２候補距離差」パラメータβを取得する。そして、ス
テップ８０５では、認識結果１５１ｂの第１候補距離と
「第１候補距離」パラメータαとを比較して、第１候補
距離が「第１候補距離」パラメータαより小さければス
テップ８０８に進んで、認識結果１５１ｂの第１候補
を”不明”とする。Next, when they match in step 803, the process proceeds to step 804, where the "first candidate distance" parameter α and the "1-th
2nd candidate distance difference "parameter β is acquired. In step 805, the first candidate distance of the recognition result 151b is compared with the “first candidate distance” parameter α, and if the first candidate distance is smaller than the “first candidate distance” parameter α, the process proceeds to step 808. The first candidate of the recognition result 151b is set to “unknown”.

【０１０２】また、ステップ８０５において、第１候補
距離が「第１候補距離」パラメータαより大きければ、
ステップ８０６に進んで、認識結果１５１ｂの（第１候
補距離−第２候補距離）と「第１−第２候補距離差」パ
ラメータβとを比較して、（第１候補距離−第２候補距
離）が「第１−第２候補距離差」パラメータβより小さ
ければステップ８０８に進んで、認識結果１５１ｂの第
１候補を”不明”とする。In step 805, if the first candidate distance is larger than the "first candidate distance" parameter α,
Proceeding to step 806, the (first candidate distance−second candidate distance) of the recognition result 151b is compared with the “first-second candidate distance difference” parameter β to obtain (first candidate distance−second candidate distance). ) Is smaller than the “first-second candidate distance difference” parameter β, the process proceeds to step 808, and the first candidate of the recognition result 151b is set to “unknown”.

【０１０３】また、ステップ８０６において、（第１候
補距離−第２候補距離）が「第１−第２候補距離差」パ
ラメータβより大きければステップ８０７に進んで、認
識結果１５１ｂの第１候補を”正解”と確定する。If it is determined in step 806 that (first candidate distance−second candidate distance) is greater than the “first-second candidate distance difference” parameter β, the process proceeds to step 807, where the first candidate of the recognition result 151b is determined. Confirm "Correct".

【０１０４】尚、図８中で用いている「第１候補距離」
パラメータαおよび「第１−第２候補距離差」パラメー
タβは、外部から変動させることができる。つまり、図
７に示したパラメータは、各文字（カテゴリ）毎に用い
る標準的なパラメータであり、これを変動させることで
確定、未確定データ数の割合を調節できる。Note that the "first candidate distance" used in FIG.
The parameter α and the “first-second candidate distance difference” parameter β can be changed externally. That is, the parameters shown in FIG. 7 are standard parameters used for each character (category), and by varying these parameters, the ratio of the number of confirmed and undetermined data can be adjusted.

【０１０５】すなわち、「第１候補距離」パラメータα
および「第１−第２候補距離差」パラメータβは、次式
において、Ｋｄ、第１固定値および第２固定値を可変設
定することで調整可能である。That is, the “first candidate distance” parameter α
And the “first-second candidate distance difference” parameter β can be adjusted by variably setting Kd, the first fixed value, and the second fixed value in the following equation.

【０１０６】 α＝第１候補距離＋第１固定値×Ｋｄ（−１．０≦Ｋｄ≦１．０） β＝（第１候補距離−第２候補距離）＋第２固定値×Ｋ
ｄ（−１．０≦Ｋｄ≦１．０）ここで、第１固定値および第２固定値の決定法は、図７
のパラメータを基準に±２の変動幅を持たせる場合に
は、第１固定値＝第２固定値＝２とする。また、図７の
パラメータを基準値とすると、パラメータαおよびβが
共に大きいほど、図８における条件（ステップ８０５，
８０６）を通過できるデータ数は少なくなるから、Ｋｄ
が１．０に近くなればなるほど厳しい設定ということに
なる。逆に、Ｋｄが−１．０に近くなればなるほど緩い
設定ということになる。Α = first candidate distance + first fixed value × Kd (−1.0 ≦ Kd ≦ 1.0) β = (first candidate distance−second candidate distance) + second fixed value × K
d (−1.0 ≦ Kd ≦ 1.0) Here, the method of determining the first fixed value and the second fixed value is shown in FIG.
When a variation range of ± 2 is provided based on the above parameter, the first fixed value = the second fixed value = 2. Further, assuming that the parameters in FIG. 7 are reference values, the larger the parameters α and β are, the greater the condition in FIG.
806), the number of data that can pass
Is closer to 1.0, the stricter the setting. Conversely, the closer the Kd is to -1.0, the looser the setting.

【０１０７】次に、後段知識の作成部１１３による後段
知識ベース１１４の構築方法について説明する。本実施
形態では、上位候補に出現率が高い文字（すなわち、誤
認識し易い文字）の上位３つを、クラスタ使用軸として
選択し、３次元空間にクラスタを形成することとする。
つまり、図７における出現率の高い候補文字を軸として
選択したことになる。これらの軸を使用して、クラスタ
リングを実施するまでの過程を以下に簡単に述べる。Next, a method of constructing the second-stage knowledge base 114 by the second-stage knowledge creating unit 113 will be described. In the present embodiment, the top three characters having a high appearance rate as top candidates (that is, characters that are easily misrecognized) are selected as cluster use axes, and clusters are formed in a three-dimensional space.
That is, the candidate character having a high appearance rate in FIG. 7 is selected as an axis. The process of performing clustering using these axes will be briefly described below.

【０１０８】まず（手順１）、ＯＣＲ認識処理部１０１
から認識結果（知識生成用）１５１ａを受け取り、その
データをカテゴリ別（１３個）に分類する。次に（手順
２）、カテゴリ別に分類されたデータを、ＯＣＲ認識結
果正解データ（以下、甲群という）と誤認識結果（以
下、乙群という）に分類する。次に（手順３）、図７に
示した各カテゴリの「第２、第３候補に出現率の高い候
補文字」のデータ群を、甲群のデータから抽出して、こ
れらを出現率の高い順にＡ、ＢおよびＣとする。カテゴ
リ”０”を例に取ると、０（Ａ）、６（Ｂ）、９（Ｃ）
のカテゴリの甲群データを抽出する。First (procedure 1), the OCR recognition processing unit 101
Receives the recognition result (for knowledge generation) 151a, and classifies the data into 13 categories. Next (procedure 2), the data classified by category is classified into OCR recognition result correct answer data (hereinafter, group A) and misrecognition result (hereinafter, group B). Next (procedure 3), a data group of “candidate characters having a high appearance rate in the second and third candidates” of each category shown in FIG. 7 is extracted from the data of the instep group, and these are extracted with a high appearance rate. A, B, and C in this order. Taking category "0" as an example, 0 (A), 6 (B), 9 (C)
Extract the group A data of the category.

【０１０９】次に（手順４）、Ａのデータ群を対象に、
軸Ａ、軸Ｂおよび軸Ｃのデータを抽出してクラスタリン
グを実施する。ここで作成されたクラスタをａとする。
次に（手順５）、Ｂのデータ群を対象に、軸Ａ、軸Ｂお
よび軸Ｃのデータを抽出してクラスタリングを実施す
る。ここで作成されたクラスタをｂとする。さらに（手
順６）、Ｃのデータ群を対象に、軸Ａ、軸Ｂおよび軸Ｃ
のデータを抽出してクラスタリングを実施する。ここで
作成されたクラスタをｃとする。次に（手順７）、乙群
のデータに対して、軸Ａ、軸Ｂおよび軸Ｃのデータを抽
出してクラスタリングを実施する。Next (step 4), for the data group of A,
The data of the axes A, B and C are extracted and clustering is performed. The cluster created here is assumed to be a.
Next (procedure 5), data of the axis A, axis B, and axis C are extracted from the data group B, and clustering is performed. The cluster created here is assumed to be b. Further (procedure 6), for the data group of C, axis A, axis B and axis C
And perform clustering. The cluster created here is assumed to be c. Next (procedure 7), clustering is performed by extracting data of the axis A, axis B, and axis C from the data of the group B.

【０１１０】ここで、カテゴリは１３個あるので手順１
〜手順６および手順７はそれぞれ１３回実施することに
なる。また、甲群および乙群それぞれについて、複数の
クラスタが生成されることになる。尚、手順３〜手順６
で作成されたクラスタを「ＯＫクラスタ」（ａ、ｂ、ｃ
が含まれる）と呼び、手順７で作成されたクラスタを
「ＮＧクラスタ」と呼ぶ。Since there are 13 categories, the procedure 1
Steps 6 and 7 are each performed 13 times. Also, a plurality of clusters will be generated for each of the instep group and the second group. Steps 3 to 6
"OK cluster" (a, b, c)
Is included), and the cluster created in the procedure 7 is called an “NG cluster”.

【０１１１】次に、本実施形態で採用するクラスタリン
グ手法（超楕円体法）について説明する。クラスタ間距
離が近いものの同士を合併しクラスタを生成していく通
常のクラスタリング方法とは異なり、超楕円体法による
クラスタリングは、線型性（線型モデルをいかにして形
成できるかということを意味する）と、連続性（クラス
タを構成するデータの分布が連続的かどうかを意味す
る）とを考慮してクラスタを生成することができる。Next, the clustering method (hyperellipsoid method) used in this embodiment will be described. Unlike the usual clustering method in which clusters with short intercluster distances are merged to generate clusters, clustering by the hyperellipsoidal method is linear (means how a linear model can be formed). And the continuity (meaning whether or not the distribution of data constituting the cluster is continuous) can be generated.

【０１１２】通常のクラスタリングでは、距離という規
範のみで一度クラスタが合併されると、形状は変更でき
ない。つまり、合併後、クラスタを形成している一部分
を分離するための規範が無いのである。しかし、超楕円
体クラスタリングでは、線型性と連続性の考慮により、
合併対象クラスタの変更や、合併後のクラスタ構成デー
タの移動により、定義した規範に即したクラスタ群を構
築することができる。In ordinary clustering, once clusters are merged only based on the distance rule, the shape cannot be changed. That is, after the merger, there is no norm for separating the part forming the cluster. However, in superellipsoidal clustering, due to the consideration of linearity and continuity,
By changing the cluster to be merged or by moving the cluster configuration data after the merge, a cluster group conforming to the defined norm can be constructed.

【０１１３】図９に示すクラスタ生成を例に採ると、連
続度を重視した場合には、距離の近いクラスタＡとＢが
合併され、構成データ密度の高いクラスタが生成され
る。また、線型度を重視した場合には、クラスタＢとＣ
が合併され、さらに必要があれば一点移動をおこない、
線型モデルが構築し易いデータ分布のクラスタが生成さ
れることになる。尚、超楕円体クラスタリングの規範
は、各クラスタの体積（超積）の和を最小にすることで
ある。Taking the cluster generation shown in FIG. 9 as an example, when importance is placed on continuity, clusters A and B having short distances are merged, and a cluster having a high configuration data density is generated. When importance is placed on the linearity, clusters B and C
Are merged, and if necessary, move one point,
A cluster having a data distribution that is easy to construct a linear model is generated. The norm of the hyperellipsoidal clustering is to minimize the sum of the volume (super product) of each cluster.

【０１１４】次に、超楕円体クラスタリングの処理の流
れについて、以下、簡単に説明する。超楕円体法クラス
タリングの処理の概要は、図１０に示すフローチャート
の通りである。前述した通り、この処理を文字（カテゴ
リ）毎に行い、複数のクラスタ／カテゴリを作成する。Next, the flow of the processing of the hyperellipsoidal clustering will be briefly described below. The outline of the processing of the hyperellipsoid clustering is as shown in the flowchart of FIG. As described above, this process is performed for each character (category) to create a plurality of clusters / categories.

【０１１５】以下に、処理のステップを簡単に述べる。
まず、ステップ１００１では各種パラメータを設定す
る。次に、初期のクラスタは、１［データ／クラスタ］
で形成されており、そのクラスタを、クラスタ間距離が
近いもの同士で合併し、処理クラスタ群を形成する。こ
こで用いられるのがウォード法と呼ばれる従来のクラス
タリング手法（ステップ１００２）である。Hereinafter, the steps of the process will be briefly described.
First, in step 1001, various parameters are set. Next, the initial cluster is 1 [data / cluster]
, And the clusters are merged with each other with a short inter-cluster distance to form a processing cluster group. Here, a conventional clustering method called the Ward method (step 1002) is used.

【０１１６】次に、ステップ１００３では、複数［デー
タ／クラスタ］となった後（中には１［データ／クラス
タ］のものも存在する）、超楕円体の評価規範であるク
ラスタ体積和の増加を最小にするクラスタ同士を合併す
る（ステップ１００５）。ただし、合併対象のクラスタ
重心間距離は、δｕ以下でなければならない（ステップ
１００４）。Next, in step 1003, after a plurality of [data / clusters] (including one [data / cluster]), the cluster volume sum, which is the evaluation criterion of the hyperellipsoid, is increased. Are merged with each other (step 1005). However, the distance between the cluster centroids to be merged must be δu or less (step 1004).

【０１１７】次に、合併後、クラスタ数を変更せずに、
構成データユニットのクラスタ間移動を行う。ここで移
動対象となるのは、超楕円体の評価規範であるクラスタ
体積和の減少量が最大となるデータである（ステップ１
００７〜１０１０）。ただし、データの移動をおこなう
クラスタは、クラスタ重心間処理がδｍ以下でなければ
ならない（ステップ１００８）。Next, after merging, without changing the number of clusters,
The configuration data unit is moved between clusters. Here, the data to be moved is data in which the reduction amount of the cluster volume sum, which is the evaluation criterion of the hyperellipsoid, is the largest (step 1).
007-1010). However, for the cluster in which the data is to be moved, the processing between cluster centroids must be δm or less (step 1008).

【０１１８】最後に、終了判定を行う。クラスタの個数
が指定した最終クラスタ数であるか（ステップ１００
６）、或いは、規範増加量が指定値以上である（ステッ
プ１０１１）とき終了する。そうでなければ、ステップ
１００５〜１０１０を繰り返す。Finally, an end determination is made. Whether the number of clusters is the specified final cluster number (step 100
6) Alternatively, the process ends when the reference increase amount is equal to or more than the specified value (step 1011). Otherwise, steps 1005 to 1010 are repeated.

【０１１９】尚、上記説明の中で用いたパラメータδｕ
およびδｍには、次のような意味がある。すなわち、δ
ｕは合併範囲制限パラメータであり、δｕを大きくする
と線型度重視となり、δｕ小さくすると連続度重視とな
る。またδｍは移動範囲制限パラメータであり、δｍを
大きくすると線型度重視であり、δｍを小さくすると連
続度重視である。これらのパラメータδｕおよびδｍ
は、一意に決定できるものではないため、試行錯誤して
設定することが必要である。The parameter δu used in the above description
And δm have the following meanings. That is, δ
u is a merge range limit parameter. When δu is increased, importance is placed on linearity, and when δu is decreased, importance is placed on continuity. Further, δm is a movement range restriction parameter. When δm is increased, importance is placed on linearity, and when δm is reduced, importance is placed on continuity. These parameters δu and δm
Cannot be uniquely determined, and must be set by trial and error.

【０１２０】以上のようなクラスタリングを実行した結
果、後段知識としてファイル出力される場合のデータフ
ォーマットを図１１に示す。FIG. 11 shows a data format in the case where a file is output as subsequent knowledge as a result of performing the above clustering.

【０１２１】次に、楕円体と各パラメータの関係につい
て説明する。クラスタ構成データで生成される共分散行
列の固有ベクトルが楕円体の広がり方向を決め、その方
向の楕円体半径は共分散行列の固有値の平方根に相当す
る。例えば、１データが１３の成分からなり、全ての成
分を使用して超楕円体クラスタリングを行うと、１３×
１３の共分散行列が形成される。そして、その固有ベク
トル、固有値はそれぞれ１３個求めることができる。た
だし、図１１のクラスタデータフォーマットに示すよう
に、共分散行列は後段知識として保存しない。Next, the relationship between the ellipsoid and each parameter will be described. The eigenvectors of the covariance matrix generated by the cluster configuration data determine the direction in which the ellipsoid spreads, and the ellipsoid radius in that direction corresponds to the square root of the eigenvalue of the covariance matrix. For example, if one data is composed of 13 components and all the components are used to perform hyperellipsoid clustering, 13 ×
Thirteen covariance matrices are formed. Then, 13 eigenvectors and 13 eigenvalues can be obtained for each. However, as shown in the cluster data format of FIG. 11, the covariance matrix is not stored as subsequent knowledge.

【０１２２】図１２には、楕円体と固有値および固有ベ
クトルの関係を説明する説明図を示す。また、楕円体
式、楕円体体積は次式であらわすことができる。FIG. 12 is an explanatory diagram for explaining the relationship between an ellipsoid, eigenvalues, and eigenvectors. The ellipsoidal expression and the ellipsoidal volume can be expressed by the following expressions.

【０１２３】[0123]

【数１】 (Equation 1)

【数２】次に、後段判定部１１６による判定方法について説明す
る。後段判定部１１６は、前述のように、前段判定部１
１５で未確定とされた不明データ１５４について、確
定、未確定を再度チェックするものである。チェック
は、後段知識の作成部１１３で作成した後段知識ベース
１１４（ＯＫクラスタ）に対象データが含まれるか否か
で行う。もし、あるカテゴリのクラスタに含まれる場合
は、その確信度（信頼度）を計算する確信度算出手段を
持つ。(Equation 2) Next, a determination method by the latter-stage determination unit 116 will be described. The second-stage determination unit 116, as described above, uses the first-stage determination unit 1
The unknown and undetermined unknown data 154, which has been determined to be undetermined at 15, are checked again. The check is performed based on whether the target data is included in the second-stage knowledge base 114 (OK cluster) created by the second-stage knowledge creating unit 113. If it is included in a cluster of a certain category, it has a certainty calculating means for calculating the certainty (reliability).

【０１２４】後段判定部１１６における処理の流れを説
明するフローチャートを図１３に示す。まず、ステップ
１３０１では、確定対象データ１５４の第１候補を取得
し、その第１候補で利用する軸データ（３次元）を抽出
する。例えば、第１候補が”０”ならば、図７より、
０、６、９の軸を選択する。FIG. 13 is a flowchart for explaining the flow of the processing in the subsequent-stage determination section 116. First, in step 1301, a first candidate of the determination target data 154 is obtained, and axis data (three-dimensional) used for the first candidate is extracted. For example, if the first candidate is “0”, from FIG.
The axes 0, 6, and 9 are selected.

【０１２５】次に、ステップ１３０３では、後段知識と
して所有しているクラスタのうち、「ＯＫクラスタ」の
全てに対して、内部に含まれるかチェックを行う。Next, in step 1303, it is checked whether all the “OK clusters” among the clusters possessed as subsequent knowledge are included inside.

【０１２６】ステップ１３０３において、あるクラスタ
内部に含まれる場合には、ステップ１３０４に進んで、
クラスタ重心からの距離による確信度を算出し、続いて
ステップ１３０５で、クラスタ内座標値による確信度を
算出する。そして、ステップ１３０６では、これら２つ
の確信度を加算して最終的な確信度とする。以上の処理
を全クラスタ似付いて行う（ステップ１３０２〜１３０
７）。If it is determined in step 1303 that the data is included in a certain cluster, the flow advances to step 1304, and
A certainty factor based on the distance from the cluster center of gravity is calculated, and then in step 1305, a certainty factor based on the intra-cluster coordinate value is calculated. Then, in step 1306, these two certainty factors are added to obtain a final certainty factor. The above processing is performed for all clusters (steps 1302 to 130
7).

【０１２７】ここで、算出した確信度を、そのクラスタ
を作成する際に使用したデータ群（カテゴリ）に対する
所属度合いとする。つまり、後段知識に作成されている
ａというクラスタに入った場合はＡに対する所属度合
い、また、ｂというクラスタに入った場合はＢに対する
所属度合い、さらに、ｃというクラスタに入った場合は
Ｃに対する所属度合いとなる。Here, the calculated degree of certainty is defined as the degree of belonging to the data group (category) used in creating the cluster. In other words, if the user enters the cluster a created in the subsequent knowledge, the degree of affiliation to A, if the user enters the cluster b, the affiliation to B, and if the user enters the cluster c, the affiliation to C Degree.

【０１２８】”０”というカテゴリについて、更に具体
例を述べる。”０”というカテゴリのＯＫクラスタは、
０、６、９カテゴリのデータ群それぞれについて、０
軸、６軸、９軸を用いてクラスタが作成されている。こ
のうち、０のデータ群で作成されたクラスタに入った場
合は０に対する所属度合いとなる。また６のデータ群で
作成されたクラスタに入った場合は６に対する所属度合
い、９のデータ群で作成されたクラスタに入った場合は
９に対する所属度合いとなる。A more specific example will be described for the category "0". The OK cluster of the category “0” is
For each of the data groups of 0, 6, and 9 categories, 0
A cluster is created using axes, 6 axes, and 9 axes. Of these, when the cluster is created with a data group of 0, the degree of affiliation with 0 is obtained. In addition, when entering a cluster created with 6 data groups, the affiliation degree to 6 is given, and when entering a cluster created with 9 data groups, the affiliation degree to 9 is given.

【０１２９】次に、ステップ１３０８では、内部に含ま
れるクラスタの有無を判断する。複数のクラスタに含ま
れる場合は、ステップ１３０９で、最大確信度を得たク
ラスタのカテゴリを抽出し、そのクラスタを生成する際
に用いたデータ群を認識結果とする。Next, in step 1308, it is determined whether there is a cluster included therein. If it is included in a plurality of clusters, in step 1309, the category of the cluster that has obtained the maximum certainty is extracted, and the data group used when generating the cluster is used as the recognition result.

【０１３０】次に、ステップ１３１２では、知識として
所有しているクラスタのうち、ステップ１３０９で得ら
れたカテゴリ（認識結果）の所有するＮＧクラスタ全て
に対して、内部に含まれるかのチェックを行う。もしＮ
Ｇクラスタ内部に含まれていた場合には、ステップ１３
１３において、ステップ１３０９で得られた確信度を、
ある規則に従って小さくなるような方向に微調整する。
内部に含まれない場合は、ステップ１３０９の結果を確
定する。Next, in step 1312, it is checked whether all the NG clusters owned by the category (recognition result) obtained in step 1309 among the clusters owned as knowledge are included inside. . If N
If it is included in the G cluster, step 13
In 13, the confidence obtained in step 1309 is
Fine-tune in a direction to reduce according to a certain rule.
If not included, the result of step 1309 is determined.

【０１３１】ここで注目すべき点は、ステップ１３０９
の処理によって、ＯＣＲ認識処理部１０１による認識結
果が覆され、認識結果を置き直して正解とすることがで
きる点である。The point to be noted here is that step 1309
Is that the recognition result by the OCR recognition processing unit 101 is reversed, and the recognition result can be replaced to obtain a correct answer.

【０１３２】次に、「ＯＫクラスタ」のチェック（ステ
ップ１３０３）について詳しく説明する。確定対象デー
タがクラスタ内部に含まれるかどうかは、クラスタリン
グにおいて使用する３軸をクラスタの持つ固有ベクトル
で新座標に変換した後、楕円体クラスタを表す次の楕円
体式に代入して、算出された距離が楕円体クラスタのカ
バリング距離以下であるかどうかで判断する。尚、次式
において、γは３軸座標を固有ベクトルの座標に変換し
たものである。Next, the check of “OK cluster” (step 1303) will be described in detail. Whether the data to be determined is included in the cluster is determined by converting the three axes used in the clustering into new coordinates using the eigenvectors of the cluster, and then substituting it into the following ellipsoidal expression representing the ellipsoidal cluster, and calculating the calculated distance Is smaller than or equal to the covering distance of the ellipsoidal cluster. In the following equation, γ is obtained by converting three-axis coordinates into coordinates of eigenvectors.

【０１３３】[0133]

【数３】ここで、もしカバリング距離以下であれば、そのクラス
タに含まれることになり、確信度計算処理（ステップ１
３０４〜１３０６）へ進む。逆に、カバリング距離以上
であれば、他のＯＫクラスタのチェックへ進む。(Equation 3) Here, if the distance is equal to or less than the covering distance, it is included in the cluster, and the certainty factor calculation processing (step 1)
304 to 1306). On the other hand, if the distance is equal to or longer than the covering distance, the process proceeds to another OK cluster check.

【０１３４】次に、確信度の計算処理（ステップ１３０
４〜１３０６）について詳しく説明する。まず、認識結
果があるクラスタ内部に含まれると判断された時、算出
するクラスタ重心からの距離の確信度（ステップ１３０
４）についての説明を行う。Next, the calculation processing of the certainty factor (step 130)
4-1306) will be described in detail. First, when it is determined that the recognition result is included in a certain cluster, the certainty factor of the distance from the calculated cluster centroid (step 130).
4) will be described.

【０１３５】概要を簡単に述べると、クラスタ知識を生
成した際に確保されたファイル（図１１参照）の中で、
dis_max、dis_median、dis_min（以下、距離ポイントデ
ータという）を用いてメンバシップ関数を生成し、その
関数への適合度を計算するのである。ここで用いる距離
ポイントデータとは、クラスタを構成するデータ全てに
ついて中心からの距離を算出し、その中の最大、中央
値、最小値を保存したものである。平均値ではなく中央
値を採用したのは、あるかけ離れた値によって値の変動
が少ないデータという理由からである。ただし、現時点
ではdis_maxの値は保存しているだけで使用していな
い。Briefly speaking, in the file (see FIG. 11) secured when the cluster knowledge is generated,
A membership function is generated using dis_max, dis_median, and dis_min (hereinafter referred to as distance point data), and the fitness to the function is calculated. The distance point data used here is data in which the distance from the center is calculated for all the data constituting the cluster, and the maximum, median, and minimum values are stored. The reason why the median value is used instead of the average value is that data whose values vary little due to values that are far apart from each other is small. However, at present, the value of dis_max is only saved and not used.

【０１３６】採用するメンバシップ関数の数式は、次の
通りである。式のdisに認識対象文字の距離を入れるこ
とで距離が算出される。尚、ｔ＞０であり、dis＞dis_m
inならば適合度＝１である。The formula of the membership function to be adopted is as follows. The distance is calculated by putting the distance of the recognition target character into the dis of the expression. Note that t> 0 and dis> dis_m
If in, the degree of conformity = 1.

【０１３７】[0137]

【数４】また、中心からの距離の確信度算出で用いるメンバシッ
プ関数を図示すると図１４のようになる。ここで、図１
４の概略を述べておく。認識対象データのクラスタ中心
からの距離がdis_min以下の時、適合度＝１を与える。
メンバシップ関数の変極点は、dis_medianである。ｔの
値を変化させることで、算出される適合度の大きさを変
更できる。すなわち、ｔが大きくなるほど算出適合度は
大きくなる。(Equation 4) FIG. 14 illustrates a membership function used in calculating the certainty factor of the distance from the center. Here, FIG.
The outline of No. 4 will be described. When the distance of the recognition target data from the center of the cluster is equal to or less than dis_min, the fitness level = 1 is given.
The inflection point of the membership function is dis_median. By changing the value of t, the magnitude of the calculated degree of conformity can be changed. That is, the calculation suitability increases as t increases.

【０１３８】尚、後段判定部１１６で用いている「距
離」とは、楕円体の固有ベクトル方向の軸長さを考慮し
たものであり、メンバシップ関数と楕円体の関係は、図
１５の適合度と楕円の関係に示す如くなる。説明に用い
るのは２次元の楕円で、ｘ、ｙ軸方向への広がりを持
つ。ここで算出された適合度を、「クラスタ重心からの
確信度」という。The “distance” used in the subsequent-stage determination unit 116 takes into account the axial length of the ellipsoid in the direction of the eigenvector, and the relationship between the membership function and the ellipsoid is determined by the fitness shown in FIG. And an ellipse. A two-dimensional ellipse is used for the description and has a spread in the x and y axis directions. The degree of matching calculated here is referred to as “certainty from the center of gravity of the cluster”.

【０１３９】次に、クラスタ内座標からの確信度の算出
（ステップ１３０５）について説明する。前述のよう
に、後段知識の作成部１１３による後段知識ベース１１
４の構築において、クラスタを構成するデータの座標値
の頻度を計算している。その頻度によって確信度を割り
当てたものが、図１１に示した保存知識おけるfreqienc
yである。２次元の楕円体についての場合に説明を加え
る。Next, the calculation of the certainty factor from the intra-cluster coordinates (step 1305) will be described. As described above, the second-stage knowledge base 11 by the second-stage knowledge creating unit 113 is used.
In the construction of No. 4, the frequency of the coordinate values of the data constituting the cluster is calculated. The assignment of certainty according to the frequency is the freqienc in the stored knowledge shown in FIG.
y. A description will be given in the case of a two-dimensional ellipsoid.

【０１４０】図１６に示されるようなデータの分布でク
ラスタが構成される場合、確信度を求める関数は、頻度
の高い座標値付近で高い値を取るよう設定されている。
これは、このクラスタに属するデータであれば、どの座
標値に偏って集まるかを示していることになる。図１６
では、第１象限に偏って集まることが示されている。つ
まり、第１象限に確定対象データが含まれた場合には、
高い確信度を算出する。逆に、第３、第４象限では低い
確信度が算出されることとなる。When a cluster is composed of the data distribution as shown in FIG. 16, the function for obtaining the certainty factor is set to take a high value near the frequently-used coordinate value.
This indicates to which coordinate value the data belonging to this cluster are concentrated. FIG.
Shows that they gather in the first quadrant. That is, if the data to be determined is included in the first quadrant,
Calculate high confidence. Conversely, low confidence is calculated in the third and fourth quadrants.

【０１４１】尚、確信度関数のピーク値は、１を使用す
る軸数（クラスタ構成軸数）で割ったものを使用する。
つまり、２軸でクラスタを構成しているならば、各軸の
確信度関数のピーク値は０．５であり、３軸の場合は
０．３３である。よって、各軸で算出された確信度を足
し合わせたときに、最大で１を取るように設定する。The peak value of the certainty factor function is obtained by dividing the peak value by the number of axes using 1 (the number of axes constituting the cluster).
That is, if a cluster is formed by two axes, the peak value of the certainty function of each axis is 0.5, and in the case of three axes, it is 0.33. Therefore, when the certainty factors calculated for the respective axes are added, the maximum value is set to 1.

【０１４２】次に、最大確信度の抽出処理（ステップ１
３０９）について説明する。複数クラスタに確定対象デ
ータが含まれる場合が存在する。その場合、全ての算出
された確信度の中で、最大の確信度を与えるクラスタを
生成する際に用いたデータ群（カテゴリ）をこの時点で
有効とする。前述した通り、この時点で、ＯＣＲ認識処
理部１０１による認識結果の第１候補が有効でなくなる
場合があり得る。Next, the extraction process of the maximum certainty factor (step 1)
309) will be described. There are cases where data to be fixed is included in a plurality of clusters. In this case, the data group (category) used when generating the cluster that gives the maximum certainty among all the calculated certainties is valid at this time. As described above, at this time, the first candidate of the recognition result by the OCR recognition processing unit 101 may not be valid.

【０１４３】次に、ＮＧクラスタチェック（ステップ１
３０８）について説明する。最大確信度の抽出処理（ス
テップ１３０９）において、有効なカテゴリが算出され
た場合には、ＮＧクラスタチェックを行う。ＮＧクラス
タとは、前述した通り、ＯＣＲ認識処理部１０１で誤認
識しているデータのみを集めてクラスタを生成したもの
である。クラスタの生成法は、ＯＫクラスタと同様であ
る。Next, an NG cluster check (step 1)
308) will be described. If an effective category is calculated in the maximum certainty extraction process (step 1309), an NG cluster check is performed. As described above, the NG cluster is a cluster generated by collecting only data that has been erroneously recognized by the OCR recognition processing unit 101. The method of generating the cluster is the same as that of the OK cluster.

【０１４４】チェック対象のＮＧクラスタは、有効なカ
テゴリとして算出されたカテゴリに属するＮＧクラスタ
全てである。つまり、２が有効クラスタと判断された場
合は、本当は２であるのに他のカテゴリに誤って認識さ
れたしまったデータを集めて作成されたＮＧクラスタす
べてでチェックを行うのである。The NG clusters to be checked are all NG clusters belonging to the category calculated as valid categories. In other words, if it is determined that 2 is a valid cluster, the check is performed on all NG clusters created by collecting data that is actually 2 but was incorrectly recognized in another category.

【０１４５】次に、確信度を低くする処理（ステップ１
３１３）について説明する。ＮＧクラスタチェック（ス
テップ１３０８）で、あるＮＧクラスタに入った場合、
最大確信度の抽出処理（ステップ１３０９）において算
出された確信度を低くする処理を行う。ＮＧクラスタ
も、ＯＫクラスタと同様に、第２、第３候補に出現し易
い候補を軸として作成されている。つまり、ＯＫクラス
タ、ＮＧクラスタは、同じ空間（軸も同じ）内に存在す
ることになる。Next, the process of lowering the certainty factor (step 1)
313) will be described. If a certain NG cluster is entered in the NG cluster check (step 1308),
A process is performed to lower the certainty factor calculated in the maximum certainty factor extraction process (step 1309). Similarly to the OK cluster, the NG cluster is created around a candidate that easily appears as the second and third candidates as an axis. That is, the OK cluster and the NG cluster exist in the same space (the same axis).

【０１４６】図１７には、あるカテゴリのＯＫクラスタ
とＮＧクラスタの位置関係を示す。後段知識ベース１１
４の構築方法でも述べたように、Ａというカテゴリの第
２、第３候補に多く出現する文字（Ａ、Ｂ、Ｃ）を軸と
してとり、Ａという文字の正解データを集めたデータ群
の軸Ａ、軸Ｂ、軸Ｃから構成される「ＯＫクラスタａ」
と、Ｂという文字の正解データを集めたデータ群の軸
Ａ、軸Ｂ、軸Ｃから構成される「ＯＫクラスタｂ」と、
Ｃという文字の正解データを集めたデータ群の軸Ａ、軸
Ｂ、軸Ｃから構成される「ＯＫクラスタｃ」と、正解が
ＡなのにＢに間違えているデータ群の軸Ａ、軸Ｂ、軸Ｃ
から構成される「ＮＧクラスタ」とを、２次元に射影し
図示している。ただし、「ＯＫクラスタｃ」は、説明を
簡単にするために図示していない。FIG. 17 shows the positional relationship between an OK cluster and an NG cluster of a certain category. Post Knowledge Base 11
As described in the construction method 4 above, the axis of a data group in which the correct data of the letter A is collected with the letters (A, B, C) appearing in the second and third candidates of the category A as axes. "OK cluster a" composed of A, axis B and axis C
And "OK cluster b" composed of axes A, B, and C of a data group in which correct answer data of the character B is collected;
"OK cluster c" composed of the axes A, B, and C of the data group in which the correct answer data of the character "C" is collected, and the axes A, B, and A of the data group in which the correct answer is A but mistaken for B C
Are projected and shown two-dimensionally. However, “OK cluster c” is not shown for simplicity.

【０１４７】図１７に示すように、ＯＫクラスタａとＯ
Ｋクラスタｂの重なるエリアに、確定対象のデータが存
在し、ＯＫクラスタチェック（ステップ１３０３）にお
いて、ＯＫクラスタｂの確信度が最大を取る場合で、Ａ
をＢと誤認識してしまったデータがあると仮定する。こ
のときそのデータは、ＮＧクラスタにも含まれるため、
図中のＮＧクラスタ内部の軸上に示す関数により、算出
された結果をＢの確信度に掛け合わせて調節する。すな
わち、ＯＫクラスタａ、ｂの重なりが大きい領域Ｒ４
ではＢの確信度はそれほど低くならないが、他の領域Ｒ
１，Ｒ２，Ｒ３ではかなり低くなる。As shown in FIG. 17, OK clusters a and O
If there is data to be determined in the area where K cluster b overlaps, and the OK cluster check takes the maximum confidence in the OK cluster check (step 1303), A
Suppose that there is data that has been mistakenly recognized as B. At this time, since the data is also included in the NG cluster,
The calculated result is multiplied by the certainty factor of B and adjusted by the function shown on the axis inside the NG cluster in the figure. That is, the region where the overlap between the OK clusters a and b is large R4
Then, the confidence of B is not so low, but other regions R
For 1, R2 and R3, it is considerably lower.

【０１４８】次に、ルール判定部１１７による処理につ
いて詳しく説明する。ルール判定部１１７は、後段判定
部１１６の確信度算出手段によって算出された確信度を
元に、確定データとするか、或いは、未確定データとし
てリジェクト判定データとするか決定する処理を行うも
のである。図１８に、ルール判定部１１７による処理の
フローチャートを示す。Next, the processing by the rule determining unit 117 will be described in detail. The rule determination unit 117 performs a process of determining whether to use the certainty data calculated by the certainty calculation means of the subsequent determination unit 116 as fixed data or rejected data as unconfirmed data. is there. FIG. 18 shows a flowchart of a process performed by the rule determination unit 117.

【０１４９】処理の内容は簡単で、ステップ１８０１に
おいて、確信度算出手段によって算出された確信度が所
定値γを越えるときは、ステップ１８０２に進んで確定
データとし、それ以下の値のときは、ステップ１８０３
に進んで正解と確信できない判定データとし、リジェク
ト対象データにする。ここで用いた所定値は、ユーザが
設定でき任意に設定できる。所定値を大きくすればする
ほど確定データは減少するようなしくみである。The contents of the processing are simple. If the certainty calculated by the certainty calculating means in step 1801 exceeds a predetermined value γ, the flow advances to step 1802 to be determined data. Step 1803
Then, the judgment data is determined to be unreliable and the rejection data is set. The predetermined value used here can be set by the user and can be set arbitrarily. The larger the predetermined value is, the smaller the determined data is.

【０１５０】以上説明したように、本実施形態の画像認
識装置によれば、予め、前段知識の作成部１１１におい
て、第１前段知識作成手段により第１前段知識（第１候
補カテゴリに対する類似カテゴリの関連）を、第２前段
知識作成手段により第２前段知識（特定順位の候補を”
正解”と確定し得る適合度（距離）の大きさ）を、第３
前段知識作成手段により第３前段知識（第１候補を”正
解”と確定し得る適合度（距離）差の大きさ）をそれぞ
れ前段知識ベース１１２に作成しておき、また、後段知
識の作成部１１３において、第１後段知識作成手段によ
り第１後段知識（ＯＫクラスタの形状）を、第２後段知
識作成手段により第２後段知識（ＮＧクラスタの形状）
をそれぞれ後段知識ベース１１４に作成しておき、前段
判定部（第１前段判定手段、第２前段判定手段および第
３前段判定手段）１１５では、これら前段知識ベース１
１２の第１、第２および第３前段知識を用いて、入力画
像の認識結果について、”正解”または”不明”（どの
カテゴリにも属さないデータ）の２つに認識結果を分別
して確定し、また、ここで”不明”とされた入力画像の
認識結果については、後段判定部（第１後段判定手段お
よび第２後段判定手段）１１６により、後段知識ベース
１１４の第１および第２後段知識を用いて、あるカテゴ
リのＯＫクラスタに入ったデータを該カテゴリに属す
る”正解”データと決定付け、逆にどのＯＫクラスタに
も入らない場合は”不明”（どのカテゴリにも属さな
い）データとした後、さらに、ＮＧクラスタの少なくと
も１つに入った場合は”不正解”データと決定し、３つ
に認識結果を分別して確定するようにして、再チェック
を行うようにしている。これにより、受付け誤認識率を
低下させることができ、画像認識システムにおける省力
化をさらに図ることができ、また、帳票読み込みの自動
化を図ることができる。さらに、画像認識システムの認
識結果を否定して”正解”の認識結果に置き換えること
ができるので、認識性能を向上させることができる。As described above, according to the image recognition apparatus of the present embodiment, in the pre-stage knowledge creating unit 111, the first pre-stage knowledge creating means (the similar category of the first category Related), the second pre-stage knowledge generation means (the candidates of the specific
The degree of goodness of fit (distance) that can be determined as "correct answer"
The first-stage knowledge creating means creates third-stage knowledge (the magnitude of the degree of fitness (distance) difference that can determine the first candidate as "correct") in the first-stage knowledge base 112, respectively. At 113, the first second-stage knowledge creating unit converts the first second-stage knowledge (OK cluster shape) into the second second-stage knowledge creating unit (NG cluster shape).
Are created in the succeeding knowledge base 114, respectively, and the preceding-stage determining unit (first preceding-stage determining means, second preceding-stage determining means, and third preceding-stage determining means) 115 generates the preceding knowledge base 1
Twelve first, second, and third pre-knowledges are used to classify the input image recognition results into two types of "correct" or "unknown" (data that does not belong to any category) and determine them. In addition, the recognition result of the input image that is determined to be “unknown” is determined by a second-stage determination unit (first second-stage determination unit and second second-stage determination unit) 116 in the first and second second-stage knowledge bases of the second-stage knowledge base 114. Is determined as "correct" data belonging to a category by using "OK" data belonging to a certain category, and "unknown" (not belonging to any category) data if the data does not belong to any OK cluster. After that, if at least one of the NG clusters is entered, it is determined that the data is "incorrect", and the recognition result is classified into three and confirmed, and rechecking is performed. As a result, the reception error recognition rate can be reduced, the labor of the image recognition system can be further reduced, and the reading of the form can be automated. Furthermore, since the recognition result of the image recognition system can be negated and replaced with a "correct answer" recognition result, the recognition performance can be improved.

【０１５１】[0151]

【発明の効果】以上の説明から明らかなように、本発明
によれば、第１前段知識作成手段（第１前段知識作成ス
テップ）により、画像認識の認識結果およびその付随情
報を含むデータから、認識結果として同じ第１候補のカ
テゴリを持つデータを集め、該集めたデータから特定の
候補順位における類似カテゴリの出現率の高いものを抽
出して、第１候補カテゴリに対する類似カテゴリの関連
を示す第１前段知識として保持し、第１前段判定手段
（第１前段判定ステップ）では、入力画像の認識結果と
して特定候補順位のカテゴリおよび類似カテゴリが得ら
れるとき、該第１候補のカテゴリおよび特定候補順位の
類似カテゴリと前記第１前段知識における関連との比較
により、該第１候補を”正解”または”不明”と判定す
ることとしたので、正解として確定できたものについて
は人がチェックする必要がなくなり、正解と確定できな
かったものについてのみチェックするだけでよくなるこ
とから、受付け誤認識率を低下させることができ、画像
認識システムにおける省力化をさらに図り得る画像認識
装置、画像認識方法および記録媒体を提供することがで
きる。As is apparent from the above description, according to the present invention, the first pre-knowledge generating means (first pre-knowledge generating step) uses the data including the recognition result of the image recognition and the accompanying information thereof. Data having the same first candidate category is collected as a recognition result, and data having a high appearance rate of similar categories in a specific candidate rank is extracted from the collected data, and the second data indicating the association of the similar category with the first candidate category is extracted. In the first precedent determination means (first precedent determination step), when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the category of the first candidate and the specific candidate rank Since the first candidate is determined to be “correct” or “unknown” by comparing the similar category of with the association in the first precedent knowledge, There is no need for a person to check for a solution that can be determined as a solution, but only for those that cannot be determined as a correct answer.This can reduce the false recognition rate and reduce labor in image recognition systems. It is possible to provide an image recognition device, an image recognition method, and a recording medium that can further achieve the above.

【０１５２】また、本発明によれば、第２前段知識作成
手段（第２前段知識作成ステップ）により、画像認識の
認識結果およびその付随情報を含むデータから、認識結
果として同じ第１候補のカテゴリを持つデータを集め、
該集めたデータから特定の候補順位における適合度（距
離）から、該特定順位の候補を”正解”と確定し得る適
合度（距離）の大きさを算定し、第２前段知識として保
持し、第２前段判定手段（第２前段判定ステップ）で
は、入力画像の認識結果として特定候補順位のカテゴリ
および類似カテゴリが得られるとき、該特定候補順位の
カテゴリまたは類似カテゴリにおける適合度（距離）の
大きさと前記第２前段知識における適合度（距離）の大
きさとの比較により、該特定順位の候補を”正解”また
は”不明”と判定することとし、また、第３前段知識作
成手段（第３前段知識作成ステップ）により、画像認識
の認識結果およびその付随情報を含むデータから、認識
結果として同じ第１候補のカテゴリを持つデータを集
め、該集めたデータから特定の候補順位のカテゴリまた
は類似カテゴリ間における適合度（距離）差から、該第
１候補を”正解”と確定し得る適合度（距離）差の大き
さを算定して、第３前段知識として保持し、第３前段判
定手段（第３前段判定ステップ）では、入力画像の認識
結果として特定候補順位のカテゴリおよび類似カテゴリ
が得られるとき、該特定の候補順位のカテゴリまたは類
似カテゴリ間における適合度（距離）差の大きさと前記
第３前段知識における適合度（距離）差の大きさとの比
較により、該第１候補を”正解”または”不明”と判定
することとしたので、正解として確定できたものについ
ては人がチェックする必要がなくなり、正解と確定でき
なかったものについてのみチェックするだけでよくなる
ことから、受付け誤認識率を低下させることができ、画
像認識システムにおける省力化をさらに図り得る画像認
識装置、画像認識方法および記録媒体を提供することが
できる。Further, according to the present invention, the second first-stage knowledge creating means (second first-stage knowledge creating step) converts the data including the recognition result of image recognition and its accompanying information into the category of the same first candidate as the recognition result. Collect data with
From the collected data, the magnitude of the fitness (distance) that can determine the candidate of the specific rank as a “correct answer” is calculated from the fitness (distance) in the specific candidate rank, and held as the second preceding knowledge, In the second preceding-stage determining means (second preceding-stage determining step), when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the degree of fitness (distance) in the category of the specific candidate rank or the similar category is increased. By comparing this with the magnitude of the degree of matching (distance) in the second precedent knowledge, the particular rank candidate is determined to be “correct” or “unknown”. Knowledge creating step), data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information, and the collected data The magnitude of the fitness (distance) difference that can determine the first candidate as a “correct answer” is calculated from the fitness (distance) difference between the categories of the specific candidate rank or similar categories, and is calculated as the third preceding knowledge. The third precedent determination means (third precedent determination step) includes, when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the degree of matching between the category of the specific candidate rank or the similar category The first candidate is determined to be “correct” or “unknown” by comparing the magnitude of the (distance) difference with the magnitude of the fitness (distance) difference in the third preceding knowledge, so that the first candidate can be determined as a correct answer. For people who do not need to check for the correct answer, it is only necessary to check for those that could not be determined as the correct answer, lowering the false recognition rate Bets can be further aims to obtain an image recognition apparatus laborsaving in the image recognition system, it is possible to provide an image recognition method and a recording medium.

【０１５３】また、本発明によれば、予め、第１前段知
識作成手段（第１前段知識作成ステップ）により第１前
段知識（第１候補カテゴリに対する類似カテゴリの関
連）を、第２前段知識作成手段（第２前段知識作成ステ
ップ）により第２前段知識（特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさ）を、第３前段知
識作成手段（第３前段知識作成ステップ）により第３前
段知識（第１候補を”正解”と確定し得る適合度（距
離）差の大きさ）を、それぞれ作成しておき、第１前段
判定手段（第１前段判定ステップ）、第２前段判定手段
（第２前段判定ステップ）および第３前段判定手段（第
３前段判定ステップ）では、これら第１、第２および第
３前段知識を用いて、入力画像の認識結果について、”
正解”または”不明”（どのカテゴリにも属さないデー
タ）の２つに認識結果を分別して確定することとしたの
で、受付け誤認識率を低下させることができ、画像認識
システムにおける省力化をさらに図ることができ、ま
た、帳票読み込みの自動化を図り得る画像認識装置、画
像認識方法および記録媒体を提供することができる。Further, according to the present invention, the first pre-stage knowledge (the relation of the similar category to the first candidate category) and the second pre-stage knowledge generation are prepared in advance by the first pre-stage knowledge generation means (first pre-stage knowledge generation step). Means (second pre-knowledge creating step), the second pre-knowledge (specified rank candidates are "correct")
Is determined by the third pre-stage knowledge generating means (third pre-stage knowledge generation step) to the third pre-stage knowledge (the degree of conformity at which the first candidate can be determined to be "correct" (distance) ), The magnitude of the difference) is prepared in advance, and the first preceding-stage determining means (first preceding-stage determining step), the second preceding-stage determining means (second preceding-stage determining step), and the third preceding-stage determining means (third preceding-stage determining step) Step) uses the first, second, and third pre-knowledge to determine the recognition result of the input image.
Recognition results are classified into two types, "correct" or "unknown" (data that does not belong to any category), so that the rate of incorrect recognition can be reduced, and the labor of the image recognition system can be further reduced. It is possible to provide an image recognition device, an image recognition method, and a recording medium that can achieve the automatic reading of a form.

【０１５４】また、本発明によれば、第１軸抽出手段
（第１軸抽出ステップ）により、画像認識の認識結果お
よびその付随情報を含むデータをサンプル収集し、ある
カテゴリで認識結果が”正解”とされているデータか
ら、前記データの集合をクラスタリングするときの軸を
抽出し、第１後段知識作成手段（第１後段知識作成ステ
ップ）により、第１軸抽出手段（第１軸抽出ステップ）
で抽出された軸に基づき前記データの集合のクラスタリ
ングを行い、該カテゴリのＯＫクラスタとして生成し、
生成されたＯＫクラスタの形状を第１後段知識として保
持し、第１後段判定手段（第１後段判定ステップ）で
は、入力画像の認識結果として認識結果およびその付随
情報を含むデータが得られるとき、該データが第１後段
知識のあるカテゴリに所属するＯＫクラスタの中に含ま
れるか否かをチェックして、該ＯＫクラスタに含まれる
場合には該ＯＫクラスタが所属するカテゴリを入力画像
の認識結果として”正解”と判定し、何れのＯＫクラス
タにも含まれていない場合には”未確定”と判定するこ
ととしたので、受付け誤認識率を低下させることがで
き、画像認識システムにおける省力化をさらに図り得る
画像認識装置、画像認識方法および記録媒体を提供する
ことができる。Further, according to the present invention, the first axis extracting means (first axis extracting step) collects a sample of the data including the recognition result of the image recognition and the accompanying information, and determines that the recognition result is "correct" in a certain category. From the data indicated as "", an axis for clustering the set of data is extracted, and the first second-stage knowledge creating means (first second-stage knowledge creating step) uses first axis extracting means (first axis extracting step).
Clustering of the set of data based on the axis extracted in the above step, and generating an OK cluster of the category;
The shape of the generated OK cluster is stored as first second-stage knowledge, and the first second-stage determination means (first second-stage determination step) obtains a data including the recognition result and its accompanying information as the recognition result of the input image. It is checked whether or not the data is included in an OK cluster belonging to a category having the first subsequent knowledge, and if the data is included in the OK cluster, the category to which the OK cluster belongs is determined as a recognition result of the input image. Is determined to be “correct”, and if it is not included in any of the OK clusters, it is determined to be “undetermined”. Therefore, the false recognition rate can be reduced, and labor saving in the image recognition system can be achieved. It is possible to provide an image recognition device, an image recognition method, and a recording medium that can further achieve the above.

【０１５５】また、本発明によれば、第２軸抽出手段
（第２軸抽出ステップ）により、画像認識の認識結果お
よびその付随情報を含むデータをサンプル収集し、ある
カテゴリで認識結果が”誤認”とされているデータか
ら、前記データの集合をクラスタリングするときの軸を
抽出し、第２後段知識作成手段（第２後段知識作成ステ
ップ）により、第２軸抽出手段（第２軸抽出ステップ）
で抽出された軸に基づき前記データの集合のクラスタリ
ングを行い、該カテゴリのＮＧクラスタとして生成し、
生成されたＮＧクラスタの形状を第２後段知識として保
持し、第２後段判定手段（第２後段判定ステップ）で
は、入力画像の認識結果として認識結果およびその付随
情報を含むデータが得られるとき、該データが第２後段
知識のあるカテゴリに所属するＮＧクラスタの中に含ま
れるか否かをチェックして、該ＮＧクラスタに含まれる
場合には該ＮＧクラスタが所属するカテゴリを前記入力
画像の認識結果として”不正解”と判定し、何れのＮＧ
クラスタにも含まれていない場合には”正解”と判定す
ることとしたので、受付け誤認識率を低下させることが
でき、画像認識システムにおける省力化をさらに図り得
る画像認識装置、画像認識方法および記録媒体を提供す
ることができる。Further, according to the present invention, the second axis extracting means (second axis extracting step) collects a sample of the data including the recognition result of the image recognition and the accompanying information, and determines that the recognition result is “misrecognized” in a certain category. From the data indicated as "", an axis for clustering the set of data is extracted, and the second second-stage knowledge creating means (second second-stage knowledge creating step) uses the second axis extracting means (second axis extracting step).
Clustering of the set of data based on the axis extracted in the above step to generate an NG cluster of the category,
The shape of the generated NG cluster is retained as second post-stage knowledge, and the second post-stage determination means (second post-stage determination step) obtains data including the recognition result and its accompanying information as the recognition result of the input image. It is checked whether or not the data is included in an NG cluster belonging to a category having the second subsequent knowledge. If the data is included in the NG cluster, the category to which the NG cluster belongs is recognized by the input image. As a result, it was judged as "incorrect" and any NG
If it is not included in the cluster, it is determined to be “correct”, so that the false recognition rate can be reduced, and the image recognition apparatus, the image recognition method and the image recognition method can further save labor in the image recognition system. A recording medium can be provided.

【０１５６】また、本発明によれば、予め、第１後段知
識作成手段（第１後段知識作成ステップ）により第１後
段知識（ＯＫクラスタの形状）を、第２後段知識作成手
段（第２後段知識作成ステップ）により第２後段知識
（ＮＧクラスタの形状）を、それぞれ作成しておき、第
１後段判定手段（第１後段判定ステップ）および第２後
段判定手段（第２後段判定ステップ）では、これら第１
および第２後段知識を用いて、入力画像の認識結果につ
いて、あるカテゴリのＯＫクラスタに入ったデータを該
カテゴリに属する”正解”データと決定付け、逆にどの
ＯＫクラスタにも入らない場合は”不明”（どのカテゴ
リにも属さない）データとした後、さらに、ＮＧクラス
タの少なくとも１つに入った場合は”不正解”データと
決定され、３つに認識結果を分別して確定することとし
たので、受付け誤認識率を低下させることができ、画像
認識システムにおける省力化をさらに図ることができ、
また、帳票読み込みの自動化を図ることができ、さら
に、画像認識システムの認識結果を否定して”正解”の
認識結果に置き換えることができるので、認識性能を向
上させ得る画像認識装置、画像認識方法および記録媒体
を提供することができる。Further, according to the present invention, the first post-stage knowledge (the shape of the OK cluster) is first converted into the second post-stage knowledge generation unit (the second post-stage) by the first post-stage knowledge generation unit (first post-stage knowledge generation step). The second post-stage knowledge (the shape of the NG cluster) is created in advance by the knowledge generation step, and the first post-stage determination means (first post-stage determination step) and the second post-stage determination means (second post-stage determination step) These first
Using the second-stage knowledge and the recognition result of the input image, data included in an OK cluster of a certain category is determined as “correct answer” data belonging to the category. After the data is "unknown" (which does not belong to any category), if it falls into at least one of the NG clusters, the data is determined to be "incorrect", and the recognition results are classified into three and determined. Therefore, it is possible to reduce the false recognition rate and further reduce labor in the image recognition system,
In addition, it is possible to automate the reading of a form, and further, since the recognition result of the image recognition system can be negated and replaced with a “correct answer” recognition result, an image recognition apparatus and an image recognition method capable of improving recognition performance And a recording medium can be provided.

【０１５７】また、本発明によれば、予め、第１前段知
識作成手段（第１前段知識作成ステップ）により第１前
段知識（第１候補カテゴリに対する類似カテゴリの関
連）を、第２前段知識作成手段（第２前段知識作成ステ
ップ）により第２前段知識（特定順位の候補を”正解”
と確定し得る適合度（距離）の大きさ）を、第３前段知
識作成手段（第３前段知識作成ステップ）により第３前
段知識（第１候補を”正解”と確定し得る適合度（距
離）差の大きさ）を、第１後段知識作成手段（第１後段
知識作成ステップ）により第１後段知識（ＯＫクラスタ
の形状）を、第２後段知識作成手段（第２後段知識作成
ステップ）により第２後段知識（ＮＧクラスタの形状）
を、それぞれ作成しておき、第１前段判定手段（第１前
段判定ステップ）、第２前段判定手段（第２前段判定ス
テップ）および第３前段判定手段（第３前段判定ステッ
プ）では、これら第１、第２および第３前段知識を用い
て、入力画像の認識結果について、”正解”または”不
明”（どのカテゴリにも属さないデータ）の２つに認識
結果を分別して確定し、また、ここで”不明”とされた
入力画像の認識結果については、第１後段判定手段（第
１後段判定ステップ）および第２後段判定手段（第２後
段判定ステップ）により、第１および第２後段知識を用
いて、あるカテゴリのＯＫクラスタに入ったデータを該
カテゴリに属する”正解”データと決定付け、逆にどの
ＯＫクラスタにも入らない場合は”不明”（どのカテゴ
リにも属さない）データとした後、さらに、ＮＧクラス
タの少なくとも１つに入った場合は”不正解”データと
決定され、３つに認識結果を分別して確定するようにし
て、再チェックを行うこととしたので、受付け誤認識率
を低下させることができ、画像認識システムにおける省
力化をさらに図ることができ、また、帳票読み込みの自
動化を図ることができ、さらに、画像認識システムの認
識結果を否定して”正解”の認識結果に置き換えること
ができるので、認識性能を向上させ得る画像認識装置、
画像認識方法および記録媒体を提供することができる。Further, according to the present invention, the first pre-stage knowledge (association of similar categories with respect to the first candidate category) and the second pre-stage knowledge generation are prepared in advance by the first pre-stage knowledge generation means (first pre-stage knowledge generation step). Means (second pre-knowledge creating step), the second pre-knowledge (specified rank candidates are "correct")
Is determined by the third pre-stage knowledge generating means (third pre-stage knowledge generation step) to the third pre-stage knowledge (the degree of conformity at which the first candidate can be determined to be "correct" (distance) ) The magnitude of the difference), the first second-stage knowledge (the shape of the OK cluster) by the first second-stage knowledge creation means (first second-stage knowledge creation step), and the second second-stage knowledge creation means (second second-stage knowledge creation step). 2nd second stage knowledge (NG cluster shape)
Are respectively created, and the first pre-stage determining means (first pre-stage judging step), the second pre-stage judging means (second pre-stage judging step) and the third pre-stage judging means (third pre-stage judging step) Using the first, second, and third pre-knowledge, the recognition result of the input image is classified and fixed into two of “correct answer” or “unknown” (data that does not belong to any category). Here, the recognition result of the input image that is determined to be “unknown” is obtained by the first second-stage determining means (first second-stage determining step) and the second second-stage determining means (second second-stage determining step). Is used to determine data included in an OK cluster of a certain category as “correct answer” data belonging to the category. Conversely, if the data does not belong to any OK cluster, “unknown” (does not belong to any category) Then, if at least one of the NG clusters was entered, the data was determined to be "incorrect", and the recognition results were sorted into three and confirmed, and a recheck was performed. In addition, it is possible to reduce the false recognition rate, further save labor in the image recognition system, automate the reading of the form, and deny the recognition result of the image recognition system. An image recognition device that can improve the recognition performance because it can be replaced with the recognition result of "correct answer",
An image recognition method and a recording medium can be provided.

【０１５８】また、本発明によれば、第１後段判定手段
（第１後段判定ステップ）において、前記データがある
カテゴリに所属するＯＫクラスタに含まれる場合に、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を確信度算出手段（確信度算出ステップ）に
より計算するようにし、前記データが複数の別カテゴリ
に所属するＯＫクラスタに入った場合には、確信度に基
づいてどのカテゴリを”正解”とするかを決定すること
としたので、あるカテゴリのＯＫクラスタに入ったデー
タをそのカテゴリに属するデータと決定付け、逆にどの
ＯＫクラスタにも入らない場合は”不明”（どのカテゴ
リにも属さない）データとされるが、あるカテゴリに属
するデータと決定付けられた場合に、認識結果の確信度
合いを算出することにより、”不明”、”正解”、”正
解と確信できない”の３つに認識結果を分別して確定で
き、また、さらに、ＮＧクラスタの少なくとも１つに入
った場合は”不正解”データと決定され、”不明”、”
正解”、”不正解”、”正解と確信できない”の４つに
認識結果を分別して確定し得る画像認識装置、画像認識
方法および記録媒体を提供することができる。Further, according to the present invention, when the data is included in an OK cluster belonging to a certain category, the first second-stage determination means (first second-stage determination step) determines the correct answer Is calculated by the certainty calculating means (the certainty calculating step). When the data enters an OK cluster belonging to a plurality of different categories, the certainty is calculated. Since it is decided which category is the “correct answer” on the basis of the data, the data included in the OK cluster of a certain category is determined as the data belonging to the category. If the data is “unknown” (it does not belong to any category), but the data is determined to belong to a certain category, calculate the degree of certainty of the recognition result. As a result, the recognition result can be classified and determined into three types of “unknown”, “correct answer”, and “cannot be sure of the correct answer”, and when at least one of the NG clusters is included, it is determined as “incorrect” data. And "unknown", "
It is possible to provide an image recognition apparatus, an image recognition method, and a recording medium which can determine a recognition result by classifying the result into four, namely, "correct answer", "incorrect answer", and "cannot be sure of correct answer".

【０１５９】さらに、本発明によれば、第２後段判定手
段（第２後段判定ステップ）において、前記データがあ
るカテゴリに所属するＮＧクラスタに含まれる場合に
は、確信度更新手段（確信度更新ステップ）により、ど
の程度の確信を持って”正解”といえるかを示す指標で
ある確信度を低くすることとしたので、あるカテゴリの
ＯＫクラスタに入ったデータをそのカテゴリに属するデ
ータと決定付け、逆にどのＯＫクラスタにも入らない場
合は”不明”（どのカテゴリにも属さない）データとさ
れるが、あるカテゴリに属するデータと決定付けられた
場合に、認識結果の確信度合いを算出することによ
り、”不明”、”正解”、”正解と確信できない”の３
つに認識結果を分別して確定でき、さらに、ＮＧクラス
タの少なくとも１つに入った場合は”不正解”データと
決定されるか、算出された確信度を低くすることができ
る。Further, according to the present invention, when the data is included in an NG cluster belonging to a certain category in the second post-stage determination means (second post-stage determination step), Step) reduces the certainty factor, which is an index indicating the degree of certainty that the answer can be said to be “correct”, so that the data included in the OK cluster of a certain category is determined as data belonging to that category. Conversely, if the data does not belong to any of the OK clusters, the data is determined to be "unknown" (not belonging to any category). If it is determined that the data belongs to a certain category, the degree of certainty of the recognition result is calculated. As a result, "unknown", "correct", "can't be sure" 3
Finally, the recognition result can be sorted and determined, and if it falls in at least one of the NG clusters, it can be determined as "incorrect" data or the calculated certainty can be lowered.

【０１６０】以上を総括して、本発明によれば、複数の
カテゴリを持ち、入力データに対する各カテゴリの一致
度（若しくはそれに類する情報）を算出する画像認識に
おいて、一致度が高いとされた結果が正しいものか、間
違ったものか決定できない認識結果を、「認識結果が正
解と確信できるため受け付ける」，「認識結果が正解と
確信できないためリジェクトする」，「認識結果が未確
定のためリジェクトする」，並びに「認識結果が不正解
のためリジェクトする」という数種類の結果に確定し、
受付け正解率を向上させ、かつ誤認識率を０［％］に近
づけた画像認識装置、画像認識方法および記録媒体を提
供することができる。Summarizing the above, according to the present invention, in image recognition for calculating the degree of coincidence (or similar information) of each category with input data according to the present invention, it is determined that the degree of coincidence is high. If the recognition result cannot be determined whether it is correct or incorrect, accept the recognition result because the recognition result can be confident that it is correct, reject it because the recognition result cannot be convinced that the recognition result is correct, or reject the recognition result that is uncertain , And "Recognition results are rejected due to incorrect answers."
It is possible to provide an image recognition device, an image recognition method, and a recording medium in which the accepting correct answer rate is improved and the false recognition rate is close to 0 [%].

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の一実施形態に係る画像認識装置の構成
図である。FIG. 1 is a configuration diagram of an image recognition device according to an embodiment of the present invention.

【図２】実施形態の画像認識装置の全体的な処理の概略
を説明する説明図である。FIG. 2 is an explanatory diagram illustrating an outline of overall processing of the image recognition device of the embodiment.

【図３】実施形態の画像認識装置における知識作成処理
の概略フローチャートである。FIG. 3 is a schematic flowchart of a knowledge creation process in the image recognition device of the embodiment.

【図４】実施形態の画像認識装置におけるシステム運用
時の処理の概略フローチャート（その１）である。FIG. 4 is a schematic flowchart (part 1) of a process when the system is operated in the image recognition device of the embodiment.

【図５】実施形態の画像認識装置におけるシステム運用
時の処理の概略フローチャート（その２）である。FIG. 5 is a schematic flowchart (part 2) of a process at the time of system operation in the image recognition device of the embodiment.

【図６】ＯＣＲ認識処理部から出力される認識結果を例
示する説明図である。FIG. 6 is an explanatory diagram illustrating a recognition result output from an OCR recognition processing unit;

【図７】第１、第２および第３前段知識のパラメータを
例示する説明図である。FIG. 7 is an explanatory diagram illustrating parameters of first, second, and third preceding knowledge.

【図８】前段判定部による判定処理を説明するフローチ
ャートである。FIG. 8 is a flowchart illustrating a determination process performed by a preceding-stage determination unit.

【図９】クラスタ生成を例示する説明図である。FIG. 9 is an explanatory diagram illustrating cluster generation.

【図１０】超楕円体法クラスタリングの処理の概要を説
明するフローチャートである。FIG. 10 is a flowchart illustrating an outline of a process of the hyperellipsoid clustering.

【図１１】後段知識としてファイル出力される場合のデ
ータフォーマットを例示する説明図である。FIG. 11 is an explanatory diagram illustrating a data format when a file is output as subsequent knowledge.

【図１２】楕円体と固有値および固有ベクトルの関係を
説明する説明図である。FIG. 12 is an explanatory diagram illustrating a relationship between an ellipsoid, an eigenvalue, and an eigenvector.

【図１３】後段判定部における処理の流れを説明するフ
ローチャートである。FIG. 13 is a flowchart illustrating a flow of processing in a subsequent-stage determination unit.

【図１４】中心からの距離の確信度算出で用いるメンバ
シップ関数を例示する説明図である。FIG. 14 is an explanatory diagram illustrating a membership function used in calculating the certainty factor of the distance from the center.

【図１５】適合度と楕円の関係を示す説明図である。FIG. 15 is an explanatory diagram showing a relationship between a degree of conformity and an ellipse.

【図１６】楕円と確信度関数の関係を示す説明図であ
る。FIG. 16 is an explanatory diagram showing a relationship between an ellipse and a certainty factor function.

【図１７】あるカテゴリのＯＫクラスタとＮＧクラスタ
の位置関係を例示する説明図である。FIG. 17 is an explanatory diagram illustrating a positional relationship between an OK cluster and an NG cluster of a certain category;

【図１８】ルール判定部による処理のフローチャートで
ある。FIG. 18 is a flowchart of a process performed by a rule determination unit.

【符号の説明】[Explanation of symbols]

１０１ＯＣＲ認識処理部１０２認識結果確定部１０３知識生成部１１１前段知識の作成部（第１前段知識作成手段、
第２前段知識作成手段および第３前段知識作成手段）１１２前段知識ベース（第１前段知識、第２前段知
識および第３前段知識）１１３後段知識の作成部（第１後段知識作成手段お
よび第２後段知識作成手段）１１４後段知識ベース（第１後段知識および第２後
段知識）１１５前段判定部（第１前段判定手段、第２前段判
定手段および第３前段判定手段）１１６後段判定部（第１後段判定手段および第２後
段判定手段）１１７ルール判定部１１８認識確定部１１９リジェクト部１５１ａ，１５１ｂ認識結果１５２，１５３知識１５４不明データ１５５，１５６，１５８確定データ１５７未確定データ１５９正解と確信できないデータReference Signs List 101 OCR recognition processing unit 102 Recognition result determination unit 103 Knowledge generation unit 111 Pre-knowledge generation unit (first pre-knowledge generation unit,
Second pre-stage knowledge creation unit and third pre-stage knowledge creation unit) 112 Pre-stage knowledge base (first pre-stage knowledge, second pre-stage knowledge and third pre-stage knowledge) 113 Post-stage knowledge creation unit (first post-stage knowledge creation unit and second pre-stage knowledge creation unit) Second-stage knowledge creating unit) 114 Second-stage knowledge base (first second-stage knowledge and second second-stage knowledge) 115 First-stage determining unit (first first-stage determining unit, second previous-stage determining unit, and third first-stage determining unit) 116 Second-stage determining unit (first) (Second-stage determination unit and second second-stage determination unit) 117 Rule determination unit 118 Recognition determination unit 119 Rejection unit 151a, 151b Recognition result 152, 153 Knowledge 154 Unknown data 155, 156, 158 Fixed data 157 Undetermined data 159 Data that cannot be convinced to be correct

Claims

【特許請求の範囲】[Claims]

【請求項１】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識装置において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
類似カテゴリの出現率の高いものを抽出して、第１候補
カテゴリに対する類似カテゴリの関連を示す第１前段知
識として保持する第１前段知識作成手段と、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと前記第１前段知識
における関連との比較により、該第１候補を”正解”ま
たは”不明”と判定する第１前段判定手段と、を具備すること、を特徴とする画像認識装置。An image recognition apparatus that has recognition targets of a plurality of categories and performs recognition of an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and a data having a high appearance rate of the similar category in the specific candidate rank is extracted from the collected data, and the data corresponding to the first candidate category is extracted. A first pre-knowledge generating means for holding as a first pre-knowledge indicating the association of similar categories, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the category of the first candidate and the specific candidate rank The first candidate is determined to be “correct” or “unknown” by comparing the similar category of Be provided with a, a first front determination means determines that the image recognition apparatus according to claim.

【請求項２】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識装置において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
適合度（距離）から、該特定順位の候補を”正解”と確
定し得る適合度（距離）の大きさを算定し、第２前段知
識として保持する第２前段知識作成手段と、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定候補順位のカテ
ゴリまたは類似カテゴリにおける適合度（距離）の大き
さと前記第２前段知識における適合度（距離）の大きさ
との比較により、該特定順位の候補を”正解”または”
不明”と判定する第２前段判定手段と、を具備すること、を特徴とする画像認識装置。2. An image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same category of the first candidate as the recognition result is collected from the data including the accompanying information, and from the collected data, the candidate of the specific rank is determined as “correct” from the fitness (distance) in the specific candidate rank. A second pre-stage knowledge generating means for calculating a degree of adaptability (distance) that can be performed and holding the same as second pre-stage knowledge; and when a category of a specific candidate rank and a similar category are obtained as a recognition result of an input image, the identification is performed. Comparison between the magnitude of the fitness (distance) in the category of the candidate rank or the similar category and the magnitude of the fitness (distance) in the second preceding knowledge Ri, the candidate of the specific order "correct" or "
An image recognition apparatus, comprising: a second former-stage determination unit that determines “unknown”.

【請求項３】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識装置において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成手段と、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定の候補順位のカ
テゴリまたは類似カテゴリ間における適合度（距離）差
の大きさと前記第３前段知識における適合度（距離）差
の大きさとの比較により、該第１候補を”正解”また
は”不明”と判定する第３前段判定手段と、を具備すること、を特徴とする画像認識装置。3. An image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as a recognition result is collected from the data including the accompanying information, and the first candidate is determined from the fitness (distance) difference between a category of a specific candidate rank or a similar category from the collected data. (Distance) that can be determined as "correct"
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A pre-knowledge generating means, and when a category of a specific candidate rank and a similar category are obtained as a result of recognition of the input image, the magnitude of a difference in the degree of fitness (distance) between the category of the specific candidate rank or the similar category and the third pre-stage An image recognition apparatus, comprising: a third pre-stage determination unit that determines the first candidate as “correct” or “unknown” by comparing with the magnitude of the degree of matching (distance) difference in knowledge.

【請求項４】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識装置において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”正解”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第１軸抽出手段と、前記第１軸抽出手段で抽出された軸に基づき前記データ
の集合のクラスタリングを行い、該カテゴリのＯＫクラ
スタとして生成し、生成されたＯＫクラスタの形状を第
１後段知識として保持する第１後段知識作成手段と、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第１後段
知識のあるカテゴリに所属するＯＫクラスタの中に含ま
れるか否かをチェックして、該ＯＫクラスタに含まれる
場合には該ＯＫクラスタが所属するカテゴリを前記入力
画像の認識結果として”正解”と判定し、何れのＯＫク
ラスタにも含まれていない場合には”未確定”と判定す
る第１後段判定手段と、を具備すること、を特徴とする画像認識装置。4. An image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data including the accompanying information is collected as a sample, and an axis when clustering the set of data from data for which the recognition result is “correct” in a certain category is defined as a category having a high appearance rate as a top candidate as the axis. A category with a high appearance rate as the lower candidate is used as the axis, a category with a higher appearance rate as the upper candidate and a category with a higher appearance rate as the lower candidate are used as the axis, and a category with a higher appearance rate as the middle candidate is used as the axis. A first axis extraction method for extracting categories with high correlation coefficients as the axis or categories with low correlation coefficient as the axis Clustering the set of data based on the axis extracted by the first axis extracting means, generating an OK cluster of the category, and holding the shape of the generated OK cluster as first post-stage knowledge. Second-stage knowledge creating means; when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, whether or not the data is included in an OK cluster belonging to a certain category of the first second-stage knowledge; Is checked, and if it is included in the OK cluster, the category to which the OK cluster belongs is determined to be “correct” as the recognition result of the input image. If it is not included in any of the OK clusters, “ An image recognition apparatus comprising: a first subsequent-stage determination unit that determines “undetermined”.

【請求項５】前記第１後段判定手段は、前記データがあるカテゴリに所属するＯＫクラスタに含
まれる場合に、どの程度の確信を持って”正解”といえ
るかを示す指標である確信度を計算する確信度算出手段
を具備し、前記データが複数の別カテゴリに所属するＯＫクラスタ
に入った場合には、前記確信度に基づいてどのカテゴリ
を”正解”とするかを決定することを特徴とする請求項
４に記載の画像認識装置。5. The first post-stage determining means calculates a certainty factor, which is an index indicating how much certainty can be said to be “correct” when the data is included in an OK cluster belonging to a certain category. A certainty factor calculating means for calculating, and when the data falls into an OK cluster belonging to a plurality of different categories, which category is determined as "correct" based on the certainty factor is determined. The image recognition device according to claim 4, wherein

【請求項６】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識装置において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”誤認”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第２軸抽出手段と、前記第２軸抽出手段で抽出された軸に基づき前記データ
の集合のクラスタリングを行い、該カテゴリのＮＧクラ
スタとして生成し、生成されたＮＧクラスタの形状を第
２後段知識として保持する第２後段知識作成手段と、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第２後段
知識のあるカテゴリに所属するＮＧクラスタの中に含ま
れるか否かをチェックして、該ＮＧクラスタに含まれる
場合には該ＮＧクラスタが所属するカテゴリを前記入力
画像の認識結果として”不正解”と判定し、何れのＮＧ
クラスタにも含まれていない場合には”正解”と判定す
る第２後段判定手段と、を具備すること、を特徴とする画像認識装置。6. An image recognition apparatus having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Samples of data containing accompanying information are collected, and the axis for clustering the set of data from the data for which the recognition result is "misidentified" in a certain category is defined as a category with a high appearance rate as a top candidate. A category with a high appearance rate as the lower candidate is used as the axis, a category with a higher appearance rate as the upper candidate and a category with a higher appearance rate as the lower candidate are used as the axis, and a category with a higher appearance rate as the middle candidate is used as the axis. A second axis extraction method for extracting categories with high correlation coefficients as the axes or categories with low correlation coefficients as the axes Clustering the set of data based on the axis extracted by the second axis extracting means, generating an NG cluster of the category, and storing the shape of the generated NG cluster as second post-stage knowledge. Second-stage knowledge creating means, when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, whether or not the data is included in an NG cluster belonging to a category having the second second-stage knowledge; Is checked, and if it is included in the NG cluster, the category to which the NG cluster belongs is determined as “incorrect” as the recognition result of the input image.
An image recognition apparatus, comprising: a second subsequent-stage determination unit that determines “correct answer” when not included in a cluster.

【請求項７】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識方法において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
類似カテゴリの出現率の高いものを抽出して、第１候補
カテゴリに対する類似カテゴリの関連を示す第１前段知
識として保持する第１前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと前記第１前段知識
における関連との比較により、該第１候補を”正解”ま
たは”不明”と判定する第１前段判定ステップと、を具備すること、を特徴とする画像認識方法。7. An image recognition method which has recognition targets of a plurality of categories, and performs recognition of an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and a data having a high appearance rate of the similar category in the specific candidate rank is extracted from the collected data, and the data corresponding to the first candidate category is extracted. A first pre-knowledge generating step of retaining as a first pre-knowledge indicating the association of similar categories, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the category of the first candidate and the specific candidate rank By comparing the similar category of the first candidate with the association in the first preceding knowledge, the first candidate is determined to be “correct” or “correct”. By comprising a first pre-determination step of determining that bright ", the image recognition method comprising.

【請求項８】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識方法において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
適合度（距離）から、該特定順位の候補を”正解”と確
定し得る適合度（距離）の大きさを算定し、第２前段知
識として保持する第２前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定候補順位のカテ
ゴリまたは類似カテゴリにおける適合度（距離）の大き
さと前記第２前段知識における適合度（距離）の大きさ
との比較により、該特定順位の候補を”正解”または”
不明”と判定する第２前段判定ステップと、を具備すること、を特徴とする画像認識方法。8. An image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same category of the first candidate as the recognition result is collected from the data including the accompanying information, and from the collected data, the candidate of the specific rank is determined as “correct” from the fitness (distance) in the specific candidate rank. A second pre-knowledge generating step of calculating a degree of adaptability (distance) that can be performed and holding the same as second pre-knowledge; and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the identification is performed. Ratio of the degree of fitness (distance) in the category of the candidate rank or the similar category to the degree of fitness (distance) in the second preceding knowledge Accordingly, the candidate of the specific rank "correct" or "
A second preceding-stage determination step of determining “unknown”.

【請求項９】複数カテゴリの認識対象を持ち、登録画
像パタンと入力画像パタンとのパタンマッチングにおけ
る適合度（距離）に基づき入力画像の認識を行う画像認
識方法において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定の候補順位のカ
テゴリまたは類似カテゴリ間における適合度（距離）差
の大きさと前記第３前段知識における適合度（距離）差
の大きさとの比較により、該第１候補を”正解”また
は”不明”と判定する第３前段判定ステップと、を具備すること、を特徴とする画像認識方法。9. An image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a degree of conformity (distance) in pattern matching between a registered image pattern and an input image pattern, comprising the steps of: Data having the same first candidate category as a recognition result is collected from the data including the accompanying information, and the first candidate is determined from the fitness (distance) difference between a category of a specific candidate rank or a similar category from the collected data. (Distance) that can be determined as "correct"
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A precedent knowledge creating step, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the magnitude of the difference in the degree of fitness (distance) between the category of the specific candidate rank or the similar category and the third precedent A third pre-determination step of determining the first candidate as “correct” or “unknown” by comparing with the magnitude of the degree of fitness (distance) difference in knowledge.

【請求項１０】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
類似カテゴリの出現率の高いものを抽出して、第１候補
カテゴリに対する類似カテゴリの関連を示す第１前段知
識として保持する第１前段知識作成ステップと、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
適合度（距離）から、該特定順位の候補を”正解”と確
定し得る適合度（距離）の大きさを算定し、第２前段知
識として保持する第２前段知識作成ステップと、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと前記第１前段知識
における関連との比較により、該第１候補を”正解”ま
たは”不明”と判定する第１前段判定ステップと、前記第１前段判定ステップで”正解”と判定された入力
画像について、該特定候補順位のカテゴリまたは類似カ
テゴリにおける適合度（距離）の大きさと前記第２前段
知識における適合度（距離）の大きさとの比較により、
該第１候補を”正解”または”不明”と判定する第２前
段判定ステップと、前記第２前段判定ステップで”正解”と判定された入力
画像について、該特定の候補順位のカテゴリまたは類似
カテゴリ間における適合度（距離）差の大きさと前記第
３前段知識における適合度（距離）差の大きさとの比較
により、該第１候補を”正解”または”不明”と判定す
る第３前段判定ステップと、を具備すること、を特徴とする画像認識方法。10. An image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result is collected from the data including the accompanying information, and a data having a high appearance rate of the similar category in the specific candidate rank is extracted from the collected data, and the data corresponding to the first candidate category is extracted. First pre-knowledge creating step of retaining as first pre-knowledge indicating similar category association, and collecting data having the same first candidate category as the recognition result from the data including the recognition result of the image recognition and the accompanying information, From the collected data, the specific rank candidate is determined as “correct” from the fitness (distance) in the specific candidate rank. A second pre-knowledge creation step of calculating the magnitude of the degree of conformity (distance) to be stored as the second pre-knowledge; and a data including the recognition result of the image recognition and its accompanying information, and identifying the same first candidate as the recognition result. Data having categories are collected, and from the collected data, a fitness (distance) that can determine the first candidate as a “correct answer” from a fitness (distance) difference between a category having a specific candidate rank or a similar category.
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A precedent knowledge creating step, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, a comparison between the category of the first candidate and the similar category of the specific candidate rank and the association in the first precedent knowledge is performed. A first precedent determination step of determining the first candidate as “correct” or “unknown”; and a category or a similar category of the specific candidate rank with respect to the input image determined as “correct” in the first precedent determination step. Is compared with the magnitude of the fitness (distance) in the second preceding knowledge,
A second precedent determination step of determining the first candidate as “correct” or “unknown”; and a category or a similar category of the specific candidate rank with respect to the input image determined as “correct” in the second precedent determination step. A third pre-stage determining step of judging the first candidate as “correct” or “unknown” by comparing the magnitude of the degree of conformity (distance) difference between the two and the magnitude of the degree of conformity (distance) in the third pre-stage knowledge An image recognition method, comprising:

【請求項１１】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”正解”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第１軸抽出ステップと、前記第１軸抽出ステップで抽出された軸に基づき前記デ
ータの集合のクラスタリングを行い、該カテゴリのＯＫ
クラスタとして生成し、生成されたＯＫクラスタの形状
を第１後段知識として保持する第１後段知識作成ステッ
プと、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第１後段
知識のあるカテゴリに所属するＯＫクラスタの中に含ま
れるか否かをチェックして、該ＯＫクラスタに含まれる
場合には該ＯＫクラスタが所属するカテゴリを前記入力
画像の認識結果として”正解”と判定し、何れのＯＫク
ラスタにも含まれていない場合には”未確定”と判定す
る第１後段判定ステップと、を具備すること、を特徴とする画像認識方法。11. An image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data including the accompanying information is collected as a sample, and an axis when clustering the set of data from data for which the recognition result is “correct” in a certain category is defined as a category having a high appearance rate as a top candidate as the axis. A category with a high appearance rate as the lower candidate is used as the axis, a category with a higher appearance rate as the upper candidate and a category with a higher appearance rate as the lower candidate are used as the axis, and a category with a higher appearance rate as the middle candidate is used as the axis. First axis extraction for extracting categories with high correlation coefficients as the axis or categories with low correlation coefficient as the axis And step performs clustering of the set of data based on the axis extracted by said first shaft extraction step, OK of the categories
A first post-knowledge generating step of generating as a cluster and holding the shape of the generated OK cluster as the first post-knowledge; and when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, Is checked if it is included in the OK cluster belonging to the category having the first subsequent knowledge. If the OK cluster is included in the OK cluster, the category to which the OK cluster belongs is recognized as the recognition result of the input image. A first post-determination step of determining that the answer is “correct”, and determining that the answer is “unconfirmed” when the answer is not included in any of the OK clusters.

【請求項１２】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”誤認”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第２軸抽出ステップと、前記第２軸抽出ステップで抽出された軸に基づき前記デ
ータの集合のクラスタリングを行い、該カテゴリのＮＧ
クラスタとして生成し、生成されたＮＧクラスタの形状
を第２後段知識として保持する第２後段知識作成ステッ
プと、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第２後段
知識のあるカテゴリに所属するＮＧクラスタの中に含ま
れるか否かをチェックして、該ＮＧクラスタに含まれる
場合には該ＮＧクラスタが所属するカテゴリを前記入力
画像の認識結果として”不正解”と判定し、何れのＮＧ
クラスタにも含まれていない場合には”正解”と判定す
る第２後段判定ステップと、を具備すること、を特徴とする画像認識方法。12. An image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Samples of data containing accompanying information are collected, and the axis for clustering the set of data from the data for which the recognition result is "misidentified" in a certain category is defined as a category with a high appearance rate as a top candidate. A category with a high appearance rate as the lower candidate is used as the axis, a category with a higher appearance rate as the upper candidate and a category with a higher appearance rate as the lower candidate are used as the axis, and a category with a higher appearance rate as the middle candidate is used as the axis. Second axis extraction for extracting categories with high correlation coefficients as the axis or categories with low correlation coefficient as the axis Step and, based on the axis extracted by said second shaft extraction step performs clustering of said set of data, NG of the categories
A second post-knowledge generating step of generating as a cluster and retaining the shape of the generated NG cluster as the second post-knowledge; and when data including a recognition result and its accompanying information are obtained as a recognition result of the input image, Is included in the NG cluster belonging to the category having the second subsequent knowledge. If the NG cluster is included in the NG cluster, the category to which the NG cluster belongs is determined as the recognition result of the input image. Is determined to be "incorrect" and any NG
A second post-determination step of determining a “correct answer” when not included in the cluster.

【請求項１３】前記第１後段判定ステップは、前記データがあるカテゴリに所属するＯＫクラスタに含
まれる場合に、どの程度の確信を持って”正解”といえ
るかを示す指標である確信度を計算する確信度算出ステ
ップを具備し、前記データが複数の別カテゴリに所属するＯＫクラスタ
に入った場合には、前記確信度に基づいてどのカテゴリ
を”正解”とするかを決定することを特徴とする請求項
１１に記載の画像認識方法。13. The first post-determination step includes determining, when the data is included in an OK cluster belonging to a certain category, a certainty factor which is an index indicating how confident the user can be said to be a “correct answer”. A step of calculating a certainty factor for calculating, wherein when the data enters an OK cluster belonging to a plurality of different categories, it is determined which category is to be a "correct answer" based on the certainty factor. The image recognition method according to claim 11, wherein

【請求項１４】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
類似カテゴリの出現率の高いものを抽出して、第１候補
カテゴリに対する類似カテゴリの関連を示す第１前段知
識として保持する第１前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと前記第１前段知識
における関連との比較により、該第１候補を”正解”ま
たは”不明”と判定する第１前段判定ステップと、を具備すること、を特徴とする記録媒体。14. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and performing input image recognition based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result from the data including the recognition result of the image recognition and the accompanying information, and the appearance of a similar category in a specific candidate order from the collected data. A first pre-knowledge creating step of extracting a high rate and retaining it as first pre-knowledge indicating the association of the similar category with the first candidate category; and a specific candidate rank category and similar category as input image recognition results. When obtained, the category of the first candidate and the similar category of the specific candidate rank are Comparison with related in 1 preceding knowledge, by comprising a first pre-determination step determines that the first candidate "correct" or "unknown", the recording medium characterized.

【請求項１５】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
適合度（距離）から、該特定順位の候補を”正解”と確
定し得る適合度（距離）の大きさを算定し、第２前段知
識として保持する第２前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定候補順位のカテ
ゴリまたは類似カテゴリにおける適合度（距離）の大き
さと前記第２前段知識における適合度（距離）の大きさ
との比較により、該特定順位の候補を”正解”または”
不明”と判定する第２前段判定ステップと、を具備すること、を特徴とする記録媒体。15. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and performing input image recognition based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as the data, data having the same first candidate category as the recognition result is collected from the data including the recognition result of the image recognition and the accompanying information, and the degree of conformity (distance) in the specific candidate rank is determined from the collected data. ), The magnitude of the degree of fitness (distance) that can determine the candidate of the specific rank as the “correct answer” is calculated, and a second precedent knowledge generating step of retaining the same as the second precedent knowledge is specified. When the category of the candidate rank and the similar category are obtained, the matching degree in the category of the specific candidate rank or the similar category ( By comparison with the size of the goodness of fit (distance) in size and the second pre knowledge of the release), the candidate of the specific rank "correct" or "
A second preceding-stage determination step of determining “unknown”.

【請求項１６】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該特定の候補順位のカ
テゴリまたは類似カテゴリ間における適合度（距離）差
の大きさと前記第３前段知識における適合度（距離）差
の大きさとの比較により、該第１候補を”正解”また
は”不明”と判定する第３前段判定ステップと、を具備すること、を特徴とする記録媒体。16. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and performing input image recognition based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. A data having the same first candidate category as the recognition result from the data including the recognition result of the image recognition and the accompanying information, and a category of a specific candidate rank or a similar category from the collected data. The fitness (distance) at which the first candidate can be determined as “correct” from the fitness (distance) difference between
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A precedent knowledge creating step, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, the magnitude of the difference in the degree of fitness (distance) between the category of the specific candidate rank or the similar category and the third precedent A third pre-determination step of determining the first candidate as "correct" or "unknown" by comparing the degree of matching (distance) difference in knowledge with the magnitude of the difference.

【請求項１７】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
類似カテゴリの出現率の高いものを抽出して、第１候補
カテゴリに対する類似カテゴリの関連を示す第１前段知
識として保持する第１前段知識作成ステップと、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位における
適合度（距離）から、該特定順位の候補を”正解”と確
定し得る適合度（距離）の大きさを算定し、第２前段知
識として保持する第２前段知識作成ステップと、画像認識の認識結果およびその付随情報を含むデータか
ら、認識結果として同じ第１候補のカテゴリを持つデー
タを集め、該集めたデータから特定の候補順位のカテゴ
リまたは類似カテゴリ間における適合度（距離）差か
ら、該第１候補を”正解”と確定し得る適合度（距離）
差の大きさを算定し、第３前段知識として保持する第３
前段知識作成ステップと、入力画像の認識結果として特定候補順位のカテゴリおよ
び類似カテゴリが得られるとき、該第１候補のカテゴリ
および特定候補順位の類似カテゴリと前記第１前段知識
における関連との比較により、該第１候補を”正解”ま
たは”不明”と判定する第１前段判定ステップと、前記第１前段判定ステップで”正解”と判定された入力
画像について、該特定候補順位のカテゴリまたは類似カ
テゴリにおける適合度（距離）の大きさと前記第２前段
知識における適合度（距離）の大きさとの比較により、
該第１候補を”正解”または”不明”と判定する第２前
段判定ステップと、前記第２前段判定ステップで”正解”と判定された入力
画像について、該特定の候補順位のカテゴリまたは類似
カテゴリ間における適合度（距離）差の大きさと前記第
３前段知識における適合度（距離）差の大きさとの比較
により、該第１候補を”正解”または”不明”と判定す
る第３前段判定ステップと、を具備すること、を特徴とする記録媒体。17. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and performing recognition of an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. Data having the same first candidate category as the recognition result from the data including the recognition result of the image recognition and the accompanying information, and the appearance of a similar category in a specific candidate order from the collected data. A first pre-knowledge generating step of extracting a high rate and retaining it as first pre-knowledge indicating the association of the similar category with the first candidate category; and recognizing from the data including the recognition result of the image recognition and its accompanying information. As a result, data having the same category of the first candidate is collected, and a specific candidate is selected from the collected data. A second pre-knowledge generating step of calculating, from the relevance (distance) in the rank, the magnitude of the relevance (distance) that can determine the candidate of the specific rank as a “correct answer”, and holding the same as the second pre-knowledge; Data having the same first candidate category as the recognition result is collected from the data including the recognition result of the recognition and the accompanying information, and a fitness (distance) difference between a category of a specific candidate rank or a similar category from the collected data. From the fitness (distance) at which the first candidate can be determined to be “correct”
Calculate the magnitude of the difference and retain it as the third precedent knowledge.
A precedent knowledge creating step, and when a category of a specific candidate rank and a similar category are obtained as a recognition result of the input image, a comparison between the category of the first candidate and the similar category of the specific candidate rank and the association in the first precedent knowledge is performed. A first precedent determination step of determining the first candidate as “correct” or “unknown”; and a category or a similar category of the specific candidate rank with respect to the input image determined as “correct” in the first precedent determination step. Is compared with the magnitude of the fitness (distance) in the second preceding knowledge,
A second precedent determination step of determining the first candidate as “correct” or “unknown”; and a category or a similar category of the specific candidate rank with respect to the input image determined as “correct” in the second precedent determination step. A third pre-stage determining step of judging the first candidate as “correct” or “unknown” by comparing the magnitude of the degree of conformity (distance) difference between the two and the magnitude of the degree of conformity (distance) in the third pre-stage knowledge And a recording medium comprising:

【請求項１８】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”正解”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第１軸抽出ステップと、前記第１軸抽出ステップで抽出された軸に基づき前記デ
ータの集合のクラスタリングを行い、該カテゴリのＯＫ
クラスタとして生成し、生成されたＯＫクラスタの形状
を第１後段知識として保持する第１後段知識作成ステッ
プと、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第１後段
知識のあるカテゴリに所属するＯＫクラスタの中に含ま
れるか否かをチェックして、該ＯＫクラスタに含まれる
場合には該ＯＫクラスタが所属するカテゴリを前記入力
画像の認識結果として”正解”と判定し、何れのＯＫク
ラスタにも含まれていない場合には”未確定”と判定す
る第１後段判定ステップと、を具備すること、を特徴とする記録媒体。18. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and performing input image recognition based on the degree of matching (distance) in pattern matching between a registered image pattern and an input image pattern. In a recording medium recorded as a sample, data including a recognition result of image recognition and its accompanying information is sampled, and an axis for clustering the set of data from data for which the recognition result is “correct” in a certain category is collected. With the category with the higher appearance rate as the top candidate as the axis, the category with the higher appearance rate as the lower candidate as the axis, and the category with the higher appearance rate as the upper candidate and the category with the higher appearance rate as the lower candidate as the axis , Categories with high appearance rates as middle candidates are set as the axes, and categories with high correlation coefficients are set as the axes, There performs the shaft lower category between correlation coefficients, a first shaft extracting, clustering of said set of data based on the axis extracted by said first shaft extraction step, OK of the categories
A first post-knowledge generating step of generating as a cluster and holding the shape of the generated OK cluster as the first post-knowledge; and when data including a recognition result and its accompanying information is obtained as a recognition result of the input image, Is checked if it is included in the OK cluster belonging to the category having the first subsequent knowledge. If the OK cluster is included in the OK cluster, the category to which the OK cluster belongs is recognized as the recognition result of the input image. And a first subsequent-stage determination step of determining “correct” and determining “unconfirmed” when not included in any of the OK clusters.

【請求項１９】複数カテゴリの認識対象を持ち、登録
画像パタンと入力画像パタンとのパタンマッチングにお
ける適合度（距離）に基づき入力画像の認識を行う画像
認識方法を、コンピュータに実行させるためのプログラ
ムとして記録した記録媒体において、画像認識の認識結果およびその付随情報を含むデータを
サンプル収集し、あるカテゴリで認識結果が”誤認”と
されているデータから、前記データの集合をクラスタリ
ングするときの軸を、上位候補に出現率の高いカテゴリを該軸として、下位候補に出現率の高いカテゴリを該軸として、上位候補に出現率の高いカテゴリおよび下位候補に出現
率の高いカテゴリをあわせて軸として、中位候補に出現率の高いカテゴリを該軸として、相関係数の高いカテゴリ同士を該軸として、或いは、相関係数の低いカテゴリ同士を該軸として、抽
出する第２軸抽出ステップと、前記第２軸抽出ステップで抽出された軸に基づき前記デ
ータの集合のクラスタリングを行い、該カテゴリのＮＧ
クラスタとして生成し、生成されたＮＧクラスタの形状
を第２後段知識として保持する第２後段知識作成ステッ
プと、入力画像の認識結果として認識結果およびその付随情報
を含むデータが得られるとき、該データが前記第２後段
知識のあるカテゴリに所属するＮＧクラスタの中に含ま
れるか否かをチェックして、該ＮＧクラスタに含まれる
場合には該ＮＧクラスタが所属するカテゴリを前記入力
画像の認識結果として”不正解”と判定し、何れのＮＧ
クラスタにも含まれていない場合には”正解”と判定す
る第２後段判定ステップと、を具備すること、を特徴とする記録媒体。19. A program for causing a computer to execute an image recognition method having recognition targets of a plurality of categories and recognizing an input image based on a matching degree (distance) in pattern matching between a registered image pattern and an input image pattern. In the recording medium recorded as, the data including the recognition result of the image recognition and the accompanying information is sampled, and the axis for clustering the set of data from the data in which the recognition result is “misrecognized” in a certain category is collected. With the category with the higher appearance rate as the top candidate as the axis, the category with the higher appearance rate as the lower candidate as the axis, and the category with the higher appearance rate as the upper candidate and the category with the higher appearance rate as the lower candidate as the axis , Categories with high appearance rates as middle candidates are set as the axes, and categories with high correlation coefficients are set as the axes, There is, as a shaft of the lower category between correlation coefficients, and the second shaft extracting performs clustering of the set of data based on the axis extracted by said second shaft extraction step, NG of the categories
A second post-knowledge generating step of generating as a cluster and retaining the shape of the generated NG cluster as the second post-knowledge; and when data including a recognition result and its accompanying information are obtained as a recognition result of the input image, Is included in the NG cluster belonging to the category having the second subsequent knowledge. If the NG cluster is included in the NG cluster, the category to which the NG cluster belongs is determined as the recognition result of the input image. Is determined to be "incorrect" and any NG
A second post-determination step of determining “correct” if not included in the cluster.

【請求項２０】前記第１後段判定ステップは、前記データがあるカテゴリに所属するＯＫクラスタに含
まれる場合に、どの程度の確信を持って”正解”といえ
るかを示す指標である確信度を計算する確信度算出ステ
ップを具備し、前記データが複数の別カテゴリに所属するＯＫクラスタ
に入った場合には、前記確信度に基づいてどのカテゴリ
を”正解”とするかを決定することを特徴とする請求項
１８に記載の記録媒体。20. The first subsequent-stage determining step is to determine a certainty factor which is an index indicating how much certainty can be said to be “correct” when the data is included in an OK cluster belonging to a certain category. A step of calculating a certainty factor for calculating, wherein when the data enters an OK cluster belonging to a plurality of different categories, it is determined which category is to be a "correct answer" based on the certainty factor. The recording medium according to claim 18, wherein: