JP2853169B2

JP2853169B2 - Pattern recognition device

Info

Publication number: JP2853169B2
Application number: JP1138750A
Authority: JP
Inventors: 啓介後藤; 哲也安田; 仁古郡
Original assignee: Meidensha Corp
Current assignee: Meidensha Corp
Priority date: 1989-05-31
Filing date: 1989-05-31
Publication date: 1999-02-03
Anticipated expiration: 2014-02-03
Also published as: JPH033090A

Description

【発明の詳細な説明】 A.産業上の利用分野本発明は文字や図形等のパターンを認識する装置に関
するものである。The present invention relates to an apparatus for recognizing a pattern such as a character or a figure.

B.発明の概要本発明は、文字や図形等のパターンを、照合対象とな
る辞書を用いて認識する装置において、パターンの特徴点の位置的特徴及び方向的特徴をあい
まい集合（ファジイ集合）で表した辞書を用い、認識対
象のパターンの特徴点の位置及び方向に対応するメンバ
ーシップ値について演算を行い、演算結果にもとづいて
認識結果を得ることによって、パターンの認識を高い精度でかつ高速に実行できるよ
うにしたものである。B. Summary of the Invention The present invention relates to an apparatus for recognizing a pattern such as a character or a figure using a dictionary to be collated, wherein a positional feature and a directional feature of a feature point of the pattern are defined by an ambiguous set (fuzzy set). Using the expressed dictionary, calculate the membership value corresponding to the position and direction of the feature point of the pattern to be recognized, and obtain the recognition result based on the calculation result. It is something that can be executed.

C.従来の技術文字や図形等のパターン認識する場合、例えば、書類
や図面等をイメージスキャナ等の入力装置で操作して得
られる黒と白の２値画像データ（入力パターン）に基づ
いて、予め記憶されている文字や図形の標準パターン
（辞書パターン）と位置等を照合し、辞書パターンの中
から最も重なり合うものを最終的に認識結果として求め
ている。C. Prior Art When patterns such as characters and figures are recognized, for example, based on black and white binary image data (input pattern) obtained by operating a document or drawing with an input device such as an image scanner, The position and the like are compared with a standard pattern (dictionary pattern) of characters and figures stored in advance, and the most overlapping one of the dictionary patterns is finally obtained as a recognition result.

認識方法の具体例として、例えば、入力パターンと辞
書パターンをそれぞれ小さな正方格子状（メッシュ）に
区切り、格子点が文字や図形である場合には黒（１）、
そうでないときは（０）と表す場合において、入力と辞
書の２つのパターンの重なりの程度をハミング距離を用
いて求めることができる。これは、入力と辞書が黒と黒
または白と白の場合には０、黒と白または白と黒の場合
には１とおき、その総和を求めるものである。つまり、
黒と黒、白と白のように重なり合っていればいるほど、
その総和（ハミング距離）が小さくなる。この性質によ
り、多数の辞書パターンの中から入力パターンと最も重
なり合っているもの、すなわち、ハミング距離が最も小
さいものを認識結果とするものである。As a specific example of the recognition method, for example, the input pattern and the dictionary pattern are each divided into small square lattices (mesh), and black (1) when the lattice points are characters or figures,
Otherwise, in the case of expressing (0), the degree of overlap between the two patterns of the input and the dictionary can be obtained using the Hamming distance. In this case, 0 is set when the input and the dictionary are black and black or white and white, and 1 is set when the input and dictionary are black and white or white and black. That is,
The more they overlap like black and black, white and white,
The sum (Hamming distance) becomes smaller. Due to this property, a pattern that most overlaps with the input pattern among a large number of dictionary patterns, that is, a pattern with the smallest Hamming distance is used as a recognition result.

D.発明が解決しようとする課題この方法だと入力パターンの位置が辞書パターンの位
置とずれていたり、大きさが異なっていたり、傾きが異
なっていたりすると、重なりの程度が変わってしまい、
認識の誤りが生じやすい欠点がある。D. Problems to be solved by the invention With this method, if the position of the input pattern is shifted from the position of the dictionary pattern, the size is different, or the inclination is different, the degree of overlap will change,
There is a disadvantage that recognition errors easily occur.

また記憶容量の点でも問題がある。１文字あるいは１
図形パターンの必要空間は表示のための文字フォントの
ように８×８画素（英数字）や24×24画素（漢字）の空
間では済まず、線の切れやつぶれを極力避けるため少な
くともその３〜５倍の空間を必要とする。特に、漢字や
複雑な図形の場合128×128画素程度以上確保しないと安
定な処理が望めないことになる。There is also a problem in terms of storage capacity. 1 character or 1
The space required for graphic patterns is not limited to 8 x 8 pixels (alphanumeric characters) or 24 x 24 pixels (kanji) as in character fonts for display. Requires 5 times the space. In particular, in the case of kanji or complicated figures, stable processing cannot be expected unless about 128 × 128 pixels or more are secured.

文字や図形パターンの辞書種別は漢字の場合第二水準
を含めると6000字種以上に及び、図形の場合でも多いと
きには1000字種以上になる。The dictionary type of characters and graphic patterns is more than 6000 characters including the second level in the case of Chinese characters, and more than 1000 characters when there are many figures.

このような膨大な辞書種別に対して、１辞書毎に128
×128画素を確保するとすれば、メモリとして2Kバイト
が必要となる。つまり、先の漢字では12Mバイト以上、
図形で2Mバイト以上もの記憶領域が必要となる。For such a huge dictionary type, 128
Assuming that × 128 pixels are secured, 2K bytes are required as memory. In other words, the previous kanji is over 12MB,
A figure requires a storage area of 2 Mbytes or more.

ハードウェア構成上この容量は無視できないほど大き
く、また、辞書の探索や整合処理にも処理時間などで影
響を与えることになる。Due to the hardware configuration, this capacity is so large that it cannot be ignored, and also affects the search and matching processing of the dictionary due to processing time and the like.

更に、辞書種別の点からいえば、上述の膨大な種別に
ついてひとつひとつ辞書をマニュアルで登録する作業が
必要とされており、そのための作業時間も膨大となる問
題点がある。Further, in terms of dictionary types, it is necessary to manually register the dictionaries one by one for each of the enormous types described above, and there is a problem that the time required for the work is enormous.

以上まとめると、従来技術には以下の大きな３つの課
題が残されている。In summary, the prior art has the following three major problems.

認識精度の高い認識手法の確立記憶容量低減のための辞書構成方法の確立辞書の自動生成方法の確立本発明は、このような課題を解決することを目的とす
る。Establishment of a recognition method with high recognition accuracy Establishment of a dictionary construction method for reducing storage capacity Establishment of an automatic dictionary generation method The present invention aims to solve such a problem.

E.課題を解決するための手段及び作用文字や図形パターンは人間の歴史的または経験的規則
に従って記述された位置関係にあるが、パターンを特徴
づける点（特徴点）そのものはあいまい性を有してい
る。E. Means and Actions for Solving the Problem Characters and graphic patterns have a positional relationship described in accordance with human historical or empirical rules, but the points that characterize the patterns (feature points) themselves have ambiguity. ing.

例えば、第３図ａの○印で示す文字“A"の代表的な特
徴点の場合（これを頂点と呼ぶものとする）、頂点の位
置そのものは他の特徴点からみて左に振れたり（同図
ｂ）、右に振れたり（同図ｃ）する。また、頂点のとこ
ろで切れたり（同図ｄ）もする。For example, in the case of a representative feature point of the character “A” indicated by a circle in FIG. 3A (this is referred to as a vertex), the position of the vertex itself is shifted left as viewed from other feature points ( FIG. B), and swings to the right (FIG. C). In addition, it is cut off at the vertex (d in the figure).

しかし、同図ｅのように、頂点の部分だけが他の特徴
点より下の方に位置することはありえない。However, it is unlikely that only the vertex is located below other feature points as shown in FIG.

結局、文字“A"の頂点というものは、他の特徴点より
位置関係で言うと上の方でかつ真ん中付近にある、とい
う極めてあいまいな尺度で規則性を持っているというこ
とになる。As a result, the vertex of the letter “A” has regularity on a very ambiguous scale that it is located above and near the middle of other feature points in terms of positional relationship.

本発明は、このように文字や図形パターンの特徴点の
位置関係にあいまい性があることを考慮して、その位置
関係をあいまい集合（ファジイ集合）の概念を取り入れ
て表現することによりパターン毎にファジイ辞書を登録
し、認識対象のパターンと各パターンのファジイ辞書と
を比較してその比較結果にもとづいて認識結果を得よう
とするものである。The present invention considers that the positional relationship between the characteristic points of a character or a graphic pattern is ambiguous in this way, and expresses the positional relationship by introducing the concept of an ambiguous set (fuzzy set). A fuzzy dictionary is registered, a pattern to be recognized is compared with a fuzzy dictionary of each pattern, and a recognition result is obtained based on the comparison result.

第１図に本発明装置のうちファジイ辞書作成のフロー
を示すと、先ずあるパターンに対して多数のサンプルを
作成し、各サンプル毎に光学的走査によりパターンを入
力する。FIG. 1 shows the flow of fuzzy dictionary creation in the apparatus of the present invention. First, a large number of samples are created for a certain pattern, and the pattern is input by optical scanning for each sample.

次いでこの入力したパターン即ち画像情報に対してノ
イズ処理等の前処理を行う。続いてこの画像情報にもと
づいて例えば白画素に隣接する黒画素群を輪郭画素群と
して抜き出し、これらについて直線近似処理等の情報圧
縮化を行った後、各輪郭画素群を例えば右回りに結合し
て隣接画素同士P_n，P_n+1を結ぶベクトルを抽出し、この
ベクトルを画素P_nの位置とベクトルの方向とにより特定
して記憶する。ここでP_nの位置についてはX,Y座標によ
り表され、方向については、次のように量子化した方向
コードθとして表される。一般に座標点ａ（X_s，Y_s）を
始点とし、座標点ｂ（X_e，Y_e）を終点とする線分の方向
αは次式で表される。Next, preprocessing such as noise processing is performed on the input pattern, that is, image information. Subsequently, based on this image information, for example, a black pixel group adjacent to a white pixel is extracted as a contour pixel group, and after performing information compression such as a straight line approximation process on each of them, each contour pixel group is coupled clockwise, for example. Then, a vector connecting adjacent pixels P _n and P _{n + 1} is extracted, and this vector is specified and stored based on the position of the pixel P _n and the direction of the vector. Here, the position of _Pn is represented by X and Y coordinates, and the direction is represented by a direction code θ quantized as follows. Generally coordinate point a (X _{_s,} Y _s) and starting from the coordinate point b (X _{_e,} Y _e) the direction α of a line and ending a is expressed by the following equation.

α＝tan^-1｛（Y_e，Y_s）／（X_e，X_s）ここに方向コードθは例えば方向αを45度単位に量子
化したものであり、第２図（ａ）に示すようにθ＝0,1,
2……７としたとき、次の条件を満たすときのθを求め
ることにより得られる。α = tan ^-1 ｛(Y _e , Y _s ) / (X _e , X _s ) Here, the direction code θ is obtained by quantizing the direction α in units of 45 degrees, and is shown in FIG. 2A. Θ = 0,1,
2... 7, it is obtained by obtaining θ when the following condition is satisfied.

（２θ−１）π/8≦α≦（２θ＋１）π/8 なお向きを考慮しない場合には第２図（ｂ）に示すよ
うにθ＝0,1,2,3の４つのコードとして取り扱うことも
できる。(2θ-1) π / 8 ≦ α ≦ (2θ + 1) π / 8 If the direction is not considered, it is handled as four codes of θ = 0, 1, 2, 3 as shown in FIG. 2 (b). You can also.

一方、パターンが描かれる領域をａ×ｂ（a,bは整
数）のメッシュに区分して、このメッシュ群よりなる領
域（メッシュ領域）に対応する領域を方向コードの数だ
けメモリ内に確保し、確保したメッシュ領域に夫々方向
コードを割り当てる。例えば方向コードθが第２図
（ａ）に示すように０〜７までの８通りある場合、メモ
リ内にａ×ｂのメッシュ領域を８個確保し、そのうちの
１つをθ＝０に対応したメッシュ領域、他の１つをθ＝
１に対応したメッシュ領域、………といった具合に割り
当てる。On the other hand, the area in which the pattern is drawn is divided into a × b (a and b are integers) meshes, and areas corresponding to the areas (mesh areas) formed of this mesh group are secured in the memory by the number of direction codes. Then, a direction code is assigned to each of the secured mesh areas. For example, if there are eight direction codes θ from 0 to 7 as shown in FIG. 2 (a), eight a × b mesh areas are secured in the memory, and one of them corresponds to θ = 0. Mesh region, and the other one is θ =
The mesh area corresponding to 1,... Is assigned.

次に各サンプルにおけるパターンの特徴点例えば先述
した輪郭画素群を抽出し、メモリ内の８つのメッシュ領
域の中でその特徴点の位置及び方向コードに対応するメ
ッシュ位置（メッシュ領域内のメッシュの位置）に出現
頻度つまり出現回数を書き込む。例えばサンプルの数が
100個あり、そのうちの１個についてのみメッシュ位置
（2,3）にθ＝０の特徴点が現れた場合、θ＝０の方向
コードにおけるメッシュ領域のメッシュ位置に出現回数
（出現頻度）１が書き込まれる。出現頻度の計算は、特
徴点の位置を特定するＸ座標及びＹ座標と方向コードと
により規定される３次元空間つまりａ×ｂ×ｋ（ｋは方
向コードの数）のメッシュ空間において、あるサンプル
についてメッシュ位置（X₁，Y₁，θ₁）に特徴点が現れ
たとすると、当該メッシュ位置における今までの出現頻
度に１を加えることによって実行される。Next, the characteristic points of the pattern in each sample, for example, the above-described outline pixel group are extracted, and the positions of the characteristic points and the mesh positions corresponding to the direction codes (the positions of the meshes in the mesh areas) in the eight mesh areas in the memory. ) Is written with the appearance frequency, that is, the number of appearances. For example, if the number of samples
When there are 100 feature points and only one of them has a feature point of θ = 0 at the mesh position (2,3), the number of appearances (appearance frequency) 1 at the mesh position of the mesh region in the direction code of θ = 0 is Written. The calculation of the appearance frequency is performed in a three-dimensional space defined by the X and Y coordinates specifying the position of the feature point and the direction code, that is, in a mesh space of a × b × k (k is the number of direction codes). Assuming that a feature point appears at the mesh position (X ₁ , Y ₁ , θ ₁ ) for, this is executed by adding 1 to the frequency of occurrence so far at the mesh position.

例えばパターンが描かれる領域５×５のメッシュ群に
分割し、θを第２図（ａ）のように８つに規定した場
合、５×５×８のメッシュ空間をとる。そしてある特徴
点の位置（メッシュ領域のメッシュに対応する位置）の
Ｘ座標,Y座標が（5,3）、その方向コードがθ＝６であ
る場合には、メッシュ空間上の位置（5,3,6）に対応す
る頻度に１を加えたものとなる。特徴点の位置のＸ座
標,Y座標を（X,Y）、方向コードをθとすれば、メッシ
ュ空間上のメッシュ位置（X,Y,θ）における出現頻度Ｐ
（X,Y,θ）は次式のようにカウントアップされる。For example, when the area is divided into 5 × 5 mesh groups where patterns are drawn and θ is defined as eight as shown in FIG. 2A, a 5 × 5 × 8 mesh space is taken. When the X coordinate and the Y coordinate of the position of a certain feature point (the position corresponding to the mesh in the mesh area) are (5, 3) and the direction code is θ = 6, the position (5, This is obtained by adding 1 to the frequency corresponding to 3,6). If the X and Y coordinates of the position of the feature point are (X, Y) and the direction code is θ, the appearance frequency P at the mesh position (X, Y, θ) in the mesh space
(X, Y, θ) is counted up as in the following equation.

Ｐ（X,Y,θ）＝Ｐ（X,Y,θ）＋１ただしＰ（X,Y,θ）の初期値は０である。P (X, Y, θ) = P (X, Y, θ) +1 However, the initial value of P (X, Y, θ) is 0.

こうして前記メッシュ空間の各メッシュ位置毎に特徴
点の出現頻度を求め、出現頻度分布を作成する。第４図
は、作成された出現頻度分布の一例を示す分布図であ
る。In this way, the appearance frequency of the feature point is obtained for each mesh position in the mesh space, and an appearance frequency distribution is created. FIG. 4 is a distribution diagram showing an example of the created appearance frequency distribution.

こうして求めた特徴点出現頻度分布は、特徴点がどの
様な位置に出現し、どの様な方向性を持っているかとい
う傾向を示している。つまり、同一の文字、または、図
形に対して、複数のサンプルによる頻度分布を取ると、
似通った位置と似通った方向性を持って特徴点が出現し
やすく、その付近にピークが存在することが多いことを
示している。The feature point appearance frequency distribution obtained in this manner indicates a tendency of what position the feature point appears and what direction the feature point has. In other words, if you take the frequency distribution of multiple samples for the same character or figure,
This indicates that feature points are likely to appear with similar directionality to similar positions, and that peaks are often present in the vicinity thereof.

本発明は、この考え方をファジイ集合におけるメンバ
ーシップ値の設定に応用したものである。The present invention applies this idea to the setting of membership values in fuzzy sets.

ファジイ集合におけるメンバーシップ値は、あいまい
な尺度のものを［0,1］区間の実数領域における主観的
な量として表現したものである。The membership value in the fuzzy set expresses a vague scale as a subjective quantity in the real domain in the [0,1] interval.

メンバーシップ値の定義は、［0,1］区間の実数領域
をヒントに、また、特徴点が頻度分布のピーク値周辺に
現れやすいことをヒントにしている。The definition of the membership value is based on the hint that the real number region in the [0, 1] section is a hint, and that the feature points are likely to appear around the peak value of the frequency distribution.

つまり、頻度分布は出現しやすい位置と方向性を、特
徴点の評価に反映した指標となりうるため、０〜１の実
数区間で正規化すればそのままメンバーシップ値として
用いることができることになる。本発明ではこのように
して正規化した特徴点頻度分布をファジイ辞書としてパ
ターン毎に辞書記憶部に登録しておく。またこの辞書記
憶部には、パターン毎に特徴点の基準個数ｊを登録して
おく。この基準個数ｊとは、例えば各サンプルにおける
パターンの特徴点の平均個数である。In other words, since the frequency distribution can be an index that reflects the position and directionality that are likely to appear in the evaluation of the feature points, the frequency distribution can be used as it is as a membership value by normalizing it in a real section of 0 to 1. In the present invention, the feature point frequency distribution thus normalized is registered in the dictionary storage unit for each pattern as a fuzzy dictionary. In this dictionary storage unit, a reference number j of feature points is registered for each pattern. The reference number j is, for example, the average number of feature points of the pattern in each sample.

そして認識対象であるパターンについて特徴点を抽出
し、その特徴点の総数ｍと各特徴点のメッシュ領域にお
けるメッシュ位置（X_i，Y_i，θ_i）（１≦ｉ≦ｍ）とを
求め、辞書記憶部内にあるパターンPT_kを照合対象として、
これのファジイ辞書の前記メッシュ位置（X_i，Y_i，
θ_i）に対応するメンバーシップ値f_k（X_i，Y_i，θ_i）を
求め、各メンバーシップ値f_k（X_i，Y_i，θ_i）を確信度1.0か
ら差し引いた値の合計値と前記特徴点の基準個数ｊと前記特徴点の総数ｍとにも
とづいて、認識対象のパターンと照合対象のパターンPT
_kとの離れている度合いに相当する距離D_kを演算し、この距離D_kを辞書記憶部内の総てのパターンについて
計算し、そのうちの小さい値から順にパターンを適数選
んで認識結果とする。The feature points extracted for the pattern to be recognized, the mesh positions in total m and the mesh area of each feature point of the feature point seeking and _{_{(X i, Y i, θ}} i) (1 ≦ i ≦ m), With the pattern PT _k in the dictionary storage unit as the matching target,
The mesh positions (X _i , Y _i ,
membership value corresponding to _{_{_{θ i) f k (X i}}} , Y i, determine the theta _i), the sum of the values obtained by subtracting the membership value _{_{_{f k (X i, Y i}}} , the theta _i) from the certainty factor 1.0 value And the reference number j of the feature points and the total number m of the feature points, the pattern to be recognized and the pattern
It calculates the distance D _k corresponding to and degree to which apart of _k, the distance D _k calculated for all the patterns in the dictionary in the storage unit, to the smaller value ones As the recognition result by selecting the number of suitable patterns sequentially .

ここで前記距離D_kの意味について説明する。今認識対
象であるパターン（入力パターン）の特徴点の総数が
ｍ、各特徴点のメッシュ位置が（X₁，Y₁，θ₁），
（X₂，Y₂，θ₂）……（X_m，Y_m，θ_m）であったとする。
先ず辞書記憶部内に登録されているあるパターンPT_kを
照合対象として、その２次元ファジイ辞書について前記
メッシュ位置（X_i，Y_i，θ_i）（ただし１≦ｉ≦ｍ）に
対応するメンバーシップ値f_k（X_i，Y_i，θ_i）を拾い出
す。このf_k（X_i，Y_i，θ_i）は、（X_i，Y_i，θ_i）に位置
する入力パターンの特徴点がパターンPT_kのひとつの特
徴点であることの確信度を示すものである。そこで本発
明では、例えば確信度1.0からf_k（X_i，Y_i，θ_i）を差し
引いた値を入力パターンの全特徴点数ｍあるいはパター
ンPT_kに係る特徴点の基準個数ｊで割った値を、（X_i，Y
_i，θ_i）に位置する特徴点とパターンPT_kとの距離とい
う概念で捉え、そして入力パターンの全特徴点について
の前記距離の合計値を当該パターンと照合対象であるパ
ターンPT_kとの距離D_kとして捉える。このD_kの演算式の
一例を示すと、ｊとｍとが等しいときにはとなる。またｊとｍとが異なるときはとなる。これらの式をまとめると次式となる。Here, the meaning of the distance _Dk will be described. The total number of feature points of the pattern (input pattern) to be recognized now is m, the mesh position of each feature point is (X ₁ , Y ₁ , θ ₁ ),
(X ₂ , Y ₂ , θ ₂ )... (X _m , Y _m , θ _m ).
First, with respect to a certain pattern PT _k registered in the dictionary storage unit, a membership corresponding to the mesh position (X _i , Y _i , θ _i ) (1 ≦ i ≦ m) for the two-dimensional fuzzy dictionary. Pick up the value f _k (X _i , Y _i , θ _i ). This f _k (X _i , Y _i , θ _i ) indicates the degree of certainty that the feature point of the input pattern located at (X _i , Y _i , θ _i ) is one feature point of the pattern PT _k Things. Therefore, in the present invention, for example, a value obtained by subtracting f _k (X _i , Y _i , θ _i ) from the certainty factor 1.0 is divided by the total number m of feature points of the input pattern or the reference number j of feature points related to the pattern PT _k To (X _i , Y
_i, captured by the concept of distance between the feature points and the pattern PT _k located theta _i), and the distance between the pattern PT _k the sum of the distances is the pattern and the comparison target for all feature points in the input pattern _Caught as _Dk . As an example of the operation formula of _Dk , when j and m are equal, Becomes When j and m are different, Becomes These equations are summarized as follows.

ただしmax（m.j）はm.jの大きい方の値である。 However, max (mj) is the larger value of mj.

F.実施例第５図は辞書記憶部に登録すべきデータを作成するた
めの回路のブロック図である。この例における特徴点出
現頻度分の正規化の方法についてはじめに述べておく
と、全サンプルの特徴点数をＮとしたとき、次式に示す
ようにある一定の割合C_nを乗じたものを基準点までのピ
ーク抽出数ｎとして求める。F. Embodiment FIG. 5 is a block diagram of a circuit for creating data to be registered in the dictionary storage unit. First, the method of normalizing the appearance frequency of feature points in this example is as follows. When the number of feature points in all samples is N, a value obtained by multiplying a certain percentage C _n as shown in the following equation is used as a reference point. Is determined as the number of peaks extracted up to n.

ｎ＝C_n・Ｎ次に、特徴点出現頻度分布において、頻度の大きいも
のから順に捜していき、ｎ番目の頻度の値を取るものを
求め、これを基準点の頻度P_nとする。n = C _n · N Next, in the feature point appearance frequency distribution, search is performed in descending order of frequency, and the one that takes the value of the n-th frequency is obtained, and this is set as the frequency P _{n of the} reference point.

P_nをもとに、特徴点頻度分布を次式により正規化す
る。分布上の頻度をＰ、正規化後の頻度をＭとすれば、Ｍ＝1.0 ……Ｐ≧P_nのとき P/P_n ……Ｐ＜P_nのとき分布上の全ての頻度について行うことで、正規化した結
果が得られる。Based on P _n , the feature point frequency distribution is normalized by the following equation. If the frequency on the distribution is P and the frequency after the normalization is M, then M = 1.0 …… P ≧ P _n P / P _n …… P <P _n Perform all frequencies on the distribution Yields a normalized result.

第５図中１は文字や図形等のサンプルパターンであ
り、このサンプルパターン１に対して走査回路２により
走査が行われ、次いでその走査結果に対してノイズ処理
や大きさの正規化といった前処理が前処理回路３により
施される。なお走査回路２はコントローラ６の制御の下
に多数のサンプルについて走査を行う。前処理回路３か
ら出力されたデータは特徴点抽出回路４に入力され、特
徴点抽出回路４は、文字や図形の黒白境界に沿って輪郭
画素群を抽出し、これらを直線近似処理等により情報圧
縮化し、こうして得られた特徴点群を、文字，図形を常
に右側（または左側）に見るようにして輪郭ベクトルと
して結合する。第６図は文字「Ａ」について得られた輪
郭ベクトルを示す図であり、ベクトル群より構成される
閉じた線分を１つの輪郭ベクトル系列と呼ぶならば、こ
の例では２つの輪郭ベクトル系列A₁，A₂を有している。
更に特徴点抽出回路４は、特徴点の位置を正規化すると
共に当該特徴点を始点とする輪郭ベクトルの方向コード
を求め、その結果を特徴点出現頻度分布作成器５に出力
すると共に、特徴点数を特徴点数カウンタ回路７に渡
す。特徴点出現頻度分布作成器５では、特徴点抽出回路
４によって得られる特徴点出現位置及びその方向コード
にもとづき、一時記憶メモリ5₁に記憶された頻度をカウ
ントアップして第７図に示すような特徴点出現頻度分布
を作成する。コントローラ６は、辞書のサンプル数によ
って走査回路２への入力を繰り返し、特徴点出現頻度分
布の作成処理をコントロールすると共に、作成処理終了
後基準頻度検出回路８に処理を進める。基準頻度検出回
路８では、特徴点数カウンタ回路７から得られる全特徴
点数（出現頻度の合計値）とコントローラ６から与えら
れる設定値により、基準頻度を求めるためのピーク数を
決定し、そのピーク数をもとに一時記憶メモリ5₁に記憶
されている特徴点出現頻度分布からピーク頻度を順次大
きいものから読み出すとともに、与えられたピーク数に
達した頻度を基準頻度として検出する。基準頻度検出回
路８における処理が終了したのち、特徴点出現頻度分布
正規化回路は、一時記憶メモリ5₁に記憶されている特徴
点出現頻度分布を基準頻度検出回路８で検出された基準
頻度を用いて第８図に示すように正規化された頻度分布
をメンバーシップ値としてファジイ辞書10に記憶する。In FIG. 5, reference numeral 1 denotes a sample pattern such as a character or a figure, which is scanned by the scanning circuit 2 and then subjected to preprocessing such as noise processing or size normalization. Is performed by the preprocessing circuit 3. The scanning circuit 2 scans a large number of samples under the control of the controller 6. The data output from the pre-processing circuit 3 is input to a feature point extraction circuit 4, which extracts a group of contour pixels along the black and white boundary of a character or a figure, and outputs these information by a linear approximation process or the like. The feature points obtained by compression are combined as contour vectors so that characters and figures are always viewed on the right (or left). FIG. 6 is a diagram showing contour vectors obtained for the character "A". If a closed line segment composed of a vector group is called one contour vector sequence, in this example, two contour vector sequences A are used. and a _1, a _2.
Further, the feature point extraction circuit 4 normalizes the position of the feature point, obtains the direction code of the contour vector starting from the feature point, outputs the result to the feature point appearance frequency distribution creator 5, and Is passed to the feature point counter circuit 7. In the feature point frequency distribution generator 5, based on the feature point occurrence position and direction code obtained by the feature point extraction circuit 4, seventh as shown in FIG counts up the frequency stored in the temporary storage memory 5 ₁ A characteristic point appearance frequency distribution. The controller 6 repeats the input to the scanning circuit 2 according to the number of samples in the dictionary, controls the generation processing of the feature point appearance frequency distribution, and advances the processing to the reference frequency detection circuit 8 after the completion of the generation processing. The reference frequency detection circuit 8 determines the number of peaks for obtaining the reference frequency based on the total number of feature points (total value of appearance frequency) obtained from the feature point number counter circuit 7 and the set value given from the controller 6. reads from those sequentially larger peak frequency from the original to the temporary storage memory 5 ₁ to the stored feature point occurrence frequency distribution, detects the frequency reaches a given number of peaks as the standard frequency. After the process of the reference frequency detecting circuit 8 is completed, feature points appearance frequency distribution normalization circuit, a reference frequency detected feature point frequency distribution stored in the temporary memory 5 ₁ at the standard frequency detecting circuit 8 As shown in FIG. 8, the normalized frequency distribution is stored in the fuzzy dictionary 10 as a membership value.

以上において特徴点数カウンタ回路７にて求める全特
徴点数としては、方向コードθ＝０〜８毎に特徴点数を
合計した値であってもよいし、あるいは各合計値を合計
した値であってもよく、前者の場合全特徴点数が方向コ
ード毎に独立した値となる。In the above description, the total number of feature points obtained by the feature point number counter circuit 7 may be a value obtained by summing the number of feature points for each of the direction codes θ = 0 to 8, or a value obtained by summing the respective sum values. Often, in the former case, the total number of feature points is an independent value for each direction code.

次に認識対象であるパターンを入力して、辞書記憶部
内のパターン群と照合する手段の一例について述べる。
第９図中辞書記憶部11内にはパターン毎にファジイ辞書
と特徴点の基準個数ｊとが登録されている。特徴点の基
準個数ｊは、例えばファジイ辞書を作成するときに用い
たパターンのサンプルの特徴点の平均個数である。Next, an example of means for inputting a pattern to be recognized and collating with a pattern group in the dictionary storage unit will be described.
9, a fuzzy dictionary and a reference number j of feature points are registered in the dictionary storage unit 11 for each pattern. The reference number j of feature points is, for example, the average number of feature points of pattern samples used when creating a fuzzy dictionary.

先ずパターン入力部12にてパターンを入力し、特徴点
個数加算部13により当該パターンの特徴点の総数（全特
徴点数）ｍを求める。続いてメンバーシップ値加算部14
により辞書記憶部１内のあるパターンPT_kについて、パ
ターン入力部12に入力したパターン（認識対象のパター
ン）の特徴点のメッシュ位置（X_i，Y_i，θ_i）（１≦ｉ
≦ｍ）に対応するメンバーシップ値f_k（X_i，Y_i，θ_i）
を求め、これらf_k（X_i，Y_i，θ_i）の総てを加算し、そ
の加算値を距離演算部15に出力する。そして最大値検出部16によ
りj,mのうちの大きい方max（j.m）を出力し、距離演算
部15にて先述した（１）式の演算を実行して距離D_kを求
める。こうした試行を登録されているすべてのパターン
について行い、近距離判定部17にてD_kの小さい順にｌ個
のパターンを取り出し、これを認識結果として認識結果
出力部18より出力する。First, a pattern is input by the pattern input unit 12, and the total number of feature points (the total number of feature points) m of the pattern is obtained by the feature point number adding unit 13. Next, the membership value adder 14
For a certain pattern PT _k in the dictionary storage unit 1, the mesh position (X _i , Y _i , θ _i ) of the feature point of the pattern (pattern to be recognized) input to the pattern input unit 12 (1 ≦ i)
≦ m) corresponding membership value f _k (X _i , Y _i , θ _i )
, And all of these f _k (X _i , Y _i , θ _i ) are added, and the added value Is output to the distance calculation unit 15. Then, the maximum value j (m) of the maximum value j (m) is output by the maximum value detection unit 16 and the distance calculation unit 15 executes the calculation of the above-described equation (1) to obtain the distance _Dk . Such trials are performed for all registered patterns, and the short distance determination unit 17 extracts l patterns in ascending order of _Dk , and outputs these patterns from the recognition result output unit 18 as recognition results.

G.発明の効果本発明によれば、文字や図形等のパターンの特徴点の
出現頻度分布を、多数のサンプルを用意することにより
作成し、この出現頻度分布の各頻度を正規化して、その
値をメンバーシップ値として登録するようにしているた
め、辞書を自動的に作成することができると共に辞書登
録のためのマニュアル作業が大幅に低減し、また入力パ
ターンの位置や傾きにばらつきがあっても認識の誤りが
生じにくい。G. Effects of the Invention According to the present invention, an appearance frequency distribution of feature points of a pattern such as a character or a graphic is created by preparing a large number of samples, and each frequency of the appearance frequency distribution is normalized. Because values are registered as membership values, dictionaries can be created automatically, manual work for dictionary registration is greatly reduced, and there are variations in the position and inclination of input patterns. In addition, recognition errors are less likely to occur.

しかも特徴点の位置的特徴に加えて方向特徴の考え方
を導入したものであるため、ａ×ｂ×ｋのメッシュ空間
で辞書を構成することからメッシュのとり方によっては
大きな辞書構成となるが、実際には方向的特徴を加えた
ことで、a,bのメッシュのとり方は可成り粗くとっても
問題なく、英数字で５×５×８メッシュ，漢字でも15×
15×８メッシュ程度で十分である。この場合漢字１文字
当たり15×15×８×４ビット＝900バイトとなり、従来
の技術における2Kバイトと比較して1/2程度以下であ
る。従って辞書の記憶容量の低減化を図ることができ
る。Moreover, since the concept of directional features is introduced in addition to the positional features of feature points, a dictionary is constructed in an a × b × k mesh space. With the addition of directional features, the meshing of a and b is fairly coarse, and there is no problem. 5x5x8 mesh for alphanumeric characters and 15x for Chinese characters
About 15 × 8 mesh is enough. In this case, 15 × 15 × 8 × 4 bits per kanji character = 900 bytes, which is about 1/2 or less as compared with 2 K bytes in the conventional technology. Therefore, the storage capacity of the dictionary can be reduced.

また入力パターンを認識するにあたっては、入力パタ
ーンの特徴点のメッシュ位置に対応するメンバーシップ
値を辞書より求めるだけで確信度が得られ、その確信度
を確信度1.0から差し引いた値を特徴点個々の照合対象
であるパターンとの距離として捉え、この距離を用いて
例えば（１）式により２つのパターンの距離D_kを演算
し、その値が小さい程パターンのマッチングの程度が高
いとして認識しているため、演算が簡単であって、高速
なパターン認識を実現できる。そして辞書そのものにあ
いまい性があるので、位置ずれに極めて強い特性があ
る。In recognizing an input pattern, a certainty factor can be obtained simply by obtaining a membership value corresponding to a mesh position of a feature point of the input pattern from a dictionary, and a value obtained by subtracting the certainty factor from the certainty factor 1.0 is calculated for each feature point. The distance _Dk of the two patterns is calculated using, for example, equation (1) using this distance, and the smaller the value is, the higher the degree of pattern matching is. Therefore, the calculation is simple and high-speed pattern recognition can be realized. Since the dictionary itself has ambiguity, it has a characteristic that is extremely resistant to positional deviation.

【図面の簡単な説明】[Brief description of the drawings]

第１図は本発明装置のフローを示すフローチャート図、
第２図は方向コードを示す説明図、第３図は特徴点のあ
いまい性を示す説明図、第４図は特徴点出現頻度分布
図、第５図は本発明の実施例の一部に係る回路のブロッ
ク図、第６図は輪郭ベクトル系列を示す説明図、第７
図，第８図は夫々正規化前後の特徴点出現頻度分布図、
第９図は本発明の実施例の一部に係る回路のブロック図
である。１…パターン、２…走査回路、４…特徴点抽出回路、５
…特徴点出現頻度分布作成器、８…基準頻度検出回路、
９…特徴点出現頻度分布正規化回路、11…辞書記憶部、
13…特徴点個数加算部、14…メンバーシップ値加算部、
15…距離演算部。FIG. 1 is a flowchart showing the flow of the apparatus of the present invention,
FIG. 2 is an explanatory diagram showing the direction code, FIG. 3 is an explanatory diagram showing the ambiguity of the feature points, FIG. 4 is a feature point appearance frequency distribution diagram, and FIG. 5 relates to a part of the embodiment of the present invention. FIG. 6 is a block diagram of a circuit, FIG.
Fig. 8 and Fig. 8 are the characteristic point appearance frequency distribution diagrams before and after normalization, respectively.
FIG. 9 is a block diagram of a circuit according to a part of the embodiment of the present invention. DESCRIPTION OF SYMBOLS 1 ... pattern, 2 ... scanning circuit, 4 ... feature point extraction circuit, 5
... Feature point appearance frequency distribution generator, 8 ... Reference frequency detection circuit,
9: feature point appearance frequency distribution normalization circuit, 11: dictionary storage unit,
13 ... Feature point number adder, 14 ... Membership value adder,
15 Distance calculation unit.

フロントページの続き (56)参考文献特開平３−3091（ＪＰ，Ａ) 特開平３−3089（ＪＰ，Ａ) 特開平１−290090（ＪＰ，Ａ) 特開昭63−108480（ＪＰ，Ａ) 特開昭62−192886（ＪＰ，Ａ) 特開昭64−15885（ＪＰ，Ａ) 特許2643293（ＪＰ，Ｂ２) 特許2643294（ＪＰ，Ｂ２) 特公平７−104938（ＪＰ，Ｂ２) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/68 G06K 9/62 特許ファイル（ＰＡＴＯＬＩＳ) ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of the front page (56) References JP-A-3-3091 (JP, A) JP-A-3-3089 (JP, A) JP-A-1-290090 (JP, A) JP-A-63-108480 (JP) JP-A-62-192886 (JP, A) JP-A-64-15885 (JP, A) Patent 2643293 (JP, B2) Patent 2643294 (JP, B2) JP-B-7-104938 (JP, B2) (58) Fields investigated (Int. Cl. ⁶ , DB name) G06K 9/68 G06K 9/62 Patent file (PATOLIS) JICST file (JOIS)

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】文字や図形等のパターンを照合の対象とな
る辞書を用いて認識する装置において、あるパターンに対する多数のサンプルを作成し、各サン
プルにおけるパターンについて特徴点群P₁，P₂…P_N（Ｎ
は整数）を抽出するとともに、隣接する特徴点P_i，P_i+i
（ｉ≧１）を結ぶ線分の方向を量子化してその値を特徴
点P_iの方向コードとして求める特徴点抽出回路と、パターンが描かれる平面領域に横縦ａ×ｂ（a,bは整
数）個のメッシュ領域を割り当てるとともに、メッシュ
領域の横方向、縦方向の座標をメッシュに対応させて
（Ｘ（１≦Ｘ≦ａ）,Y（１≦Ｙ≦ｂ））とし、また方向
コードの数及び値をそれぞれk,θとし、メモリにａ×ｂ×ｋのメッシュ空間に対応する領域を設
定するとともに、当該メッシュ空間におけるメッシュ位
置を（X,Y,θ）とし、前記特徴点抽出回路から出力され
た特徴点をX,Y,θにより特定してその特徴点の出現をメ
ッシュ位置（X,Y,θ）における出現とし、各メッシュ位置毎に特徴点の出現回数である出現頻度
を、前記特徴点抽出回路の出力から求めて出現頻度分布
を作成するとともに、作成された出現頻度分布を前記メ
モリに記憶させる特徴点出現頻度分布作成器と、前記メモリに記憶されている出現頻度分布を、前記特徴
点抽出回路から得られた特徴点と設定値とにより求めた
基準頻度から正規化する特徴点出現頻度分布正規化回路
と、この特徴点出現頻度分布正規化回路により正規化された
出現頻度分布をメンバーシップ値として自動的に登録す
るファジイ辞書と、このファジイ辞書として特徴点の基準個数ｊと共に予め
パターン毎に登録され、パターンが描かれる平面領域に
メッシュ領域を割り当てるとともに、これに対応してメ
ッシュ領域が設定された辞書記憶部と、パターン入力部に入力された認識対象パターンの特徴点
の個数を加算して総数ｍを求める特徴点個数加算部と、前記辞書記憶部内の、あるパターンについて、前記パタ
ーン入力部に入力されたパターンの特徴点のメッシュ位
置に対応するメンバーシップ値f_Kを求め、これらf_Kを総
てを加算した加算値を求めるメンバーシップ値加算部
と、メンバーシップ値加算部で求められた加算値と、前記基
準個数ｊと総数ｍのうち最大値検出部で得た大きい方の
出力とが供給され、両者から下記式の演算を実行して認
識対象パターンと照合対象パターンとの離れている度合
いに相当する距離D_Kを求める距離演算部と、この距離演算部で演算された距離D_Kを前記辞書記憶部内
に登録されている総てのパターンについて演算し、その
うちの小さい値から順にパターンを適数選んで認識結果
として取り出す近距離判定部とを備えたことを特徴とす
るパターン認識装置。ただしmax（m.j）はm,jの大きい方の値An apparatus for recognizing a pattern such as a character or a figure using a dictionary to be compared is provided. A large number of samples for a certain pattern are created, and a feature point group P ₁ , P ₂ . P _N (N
Are integers) and adjacent feature points P _i , P _{i + i}
(I ≧ 1) by quantizing the direction of a line connecting the feature point extraction circuit for obtaining the value as the direction code of the feature point P _i, horizontal and vertical a × b (a to the plane region where the pattern is drawn, b is Integer) mesh areas are allocated, and the horizontal and vertical coordinates of the mesh area are set to (X (1 ≦ X ≦ a), Y (1 ≦ Y ≦ b)) corresponding to the mesh, and the direction code The number and the value of are respectively set to k and θ, an area corresponding to the a × b × k mesh space is set in the memory, and the mesh position in the mesh space is set to (X, Y, θ). The feature point output from the circuit is specified by X, Y, θ, and the appearance of the feature point is defined as the appearance at the mesh position (X, Y, θ), and the appearance frequency is the number of appearances of the feature point for each mesh position Is calculated from the output of the feature point extraction circuit to create an appearance frequency distribution. In addition, a feature point appearance frequency distribution creator that stores the created appearance frequency distribution in the memory, a feature point obtained from the feature point extraction circuit, and a set value obtained by the appearance frequency distribution stored in the memory And a fuzzy dictionary for automatically registering the appearance frequency distribution normalized by the feature point appearance frequency distribution normalization circuit as a membership value. A dictionary storage unit which is registered in advance for each pattern together with the reference number j of feature points as this fuzzy dictionary, allocates a mesh area to a plane area on which the pattern is drawn, and sets a mesh area corresponding thereto; A feature point number adding unit for adding the number of feature points of the recognition target pattern input to the unit to obtain a total number m; For pattern, determine the membership value f _K corresponding to the mesh position of the characteristic point of the pattern input to the pattern input unit, and a membership value adding unit for obtaining an added value obtained by adding all these f _K, member The addition value obtained by the ship value addition unit and the larger output of the reference number j and the total number m obtained by the maximum value detection unit are supplied. and a distance calculation unit for determining the distance D _K corresponding to the degree that separation of the collation object pattern, for all patterns the computed distance D _K in the distance calculator are registered in the dictionary storage portion operation And a short distance determining unit for selecting an appropriate number of patterns in order from the smallest value and extracting the selected pattern as a recognition result. Where max (mj) is the larger of m and j