JPH01233587A

JPH01233587A - Character recognition method

Info

Publication number: JPH01233587A
Application number: JP63060060A
Authority: JP
Inventors: Sueji Miyahara; 末治宮原
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1988-03-14
Filing date: 1988-03-14
Publication date: 1989-09-19

Abstract

PURPOSE:To detect a category to which an input pattern belongs as a candidate category with high accuracy even when a noise is superimposed on the input pattern and to separate and identify similar characters from each other with high accuracy by providing a coincidence degree counter part, etc., at an identification part. CONSTITUTION:A content formed on the pattern memory 210 of a pattern memory part 200 is divided into blocks by the control of a control part 600, and they are transferred at every block to the identification part 500 sequentially. At the identification part 500, firstly, it is counted whether or not a code which coincides with the reference feature of a reference feature storage part 510 exists at every block at the coincidence degree counter part 250, and the candidate category is extracted from the one with high coincidence degree, and the candidate category and the coincidence degree are recorded on an internal memory. Next, at an effective block detecting part 530, it is checked whether or not the coincidence degree detected at every block is valid, and at a candidate category decision part 540, the category having the high coincidence degree with the reference feature is decided as the category to which the input pattern belongs by using only an effective block. In such a way, it is possible to decide the category even when the noise exists, and to identify the similar characters from each other.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は文字認識方法に係り、詳しくは、位相幾何的に
みて複雑な構造を持つ漢字等の文字パタンから文字線の
「つぶれ」や「かすれ」などに起因する非所望の雑音部
分を検出し、その部分を無視することによって、識別に
有効な部分の特徴のみを用いて入力パタンの属するカテ
ゴリを精度よく検出したり、類似文字対において差分の
大きい部分の特徴のみを採用して類似度文字を区別する
のに好適な文字読取方法に関する。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a character recognition method, and more specifically, it is possible to identify character lines such as ``collapsed'' or By detecting undesired noise parts caused by "fading" and ignoring these parts, it is possible to accurately detect the category to which the input pattern belongs using only the features of the part that are effective for identification, and to detect the category to which the input pattern belongs with high accuracy. The present invention relates to a character reading method suitable for distinguishing similarity characters by employing only features of portions with large differences.

（従来の技術〕従来、黒、白２値パタン１個を含むパタン領域の各点か
ら予め定められた方向に走査線を出して検出される位相
幾何的特徴を各点に集積し、この結果から得られる特徴
を用いて識別する文字認識方式として特願昭５１−８５
７０８号があり、この方式を拡張した方式で、黒点に関
する符号化手段を示すものとして特願昭５４−１６４２
ａ９号。(Prior Art) Conventionally, a scanning line is emitted in a predetermined direction from each point of a pattern area including one black and one white binary pattern, and the detected topological features are accumulated at each point. Patent application filed in 1985-85 as a character recognition method for identifying characters using features obtained from
No. 708, which is an expanded method of this method, was published in Japanese Patent Application No. 1642-1983 as a means of encoding sunspots.
No. a9.

白点に関する符号化手段を示すものとして特願昭５５−
１２２０８号などがある。Japanese Patent Application No. 1983-1985 as a means of encoding white spots
12208 etc.

第３図乃至第５図により、従来方式の原理を説明する。The principle of the conventional system will be explained with reference to FIGS. 3 to 5.

第３図の文字部分１０を含むパタン領域１１において、
任意の各点から第４図で示す８方向に走査線を出し、黒
点の連続数（ランレングス）を調べる。各点に対して９
ビツトの特徴ベクトル１２が用意され、走査の方向に応
じた所定のビット位置に、黒点の連続数が予め定められ
た値より大きければ「１」、小さければ「０」をセット
する。すなわち、黒点に対しては、第３図（ａ）の特徴
ベクトル１２が示すように、黒点であることを示すビッ
ト位［２０に「１」をセットするとＮもに、第４図に示
す方向の定義に従って５１から５８までの方向成分と、
第５図に示す傾きの定義に従って６１から６４までの傾
き成分との両者の特徴が文字部分１０上に存在するか否
かを確認し、それぞれの方向成分に対応するビット位置
に、存在すれば「１」、存在しなければ「０」をセット
する。それぞれの方向成分に対応するビット位置とは、
方向５１と傾き６１の成分が存在すれば（方向５１の連
続数が方向検出閾値αより大きく、傾き６１が傾き検出
閾値βより大きければ）ビット位置２１に、方向５５と
傾き６１の成分が存在すればビット位置２２に、方向５
７と傾き６３の成分が存在すればビット位置２３に、方
向５３と傾き６３の成分が存在すればビット位置２４に
、方向５８と傾き６４の成分が存在すればビット位置２
５に、方向５４と傾き６４の成分が存在すればビット位
置２６に、方向５６と傾き６２の成分が存在すればビッ
ト位置２７に、方向５２と傾き６２の成分が存在すれば
ビット位置２８に、それぞれ「１」をセットして、その
他のビット位置は「０」とする。In the pattern area 11 including the character portion 10 in FIG.
A scanning line is drawn in eight directions shown in FIG. 4 from each arbitrary point, and the number of consecutive black dots (run length) is checked. 9 for each point
A bit feature vector 12 is prepared, and if the number of consecutive black dots is larger than a predetermined value, "1" is set at a predetermined bit position depending on the scanning direction, and "0" is set if it is smaller. In other words, for a sunspot, as shown by the feature vector 12 in FIG. directional components from 51 to 58 according to the definition of
According to the definition of the slope shown in FIG. Set to "1", and set to "0" if it does not exist. The bit position corresponding to each direction component is
If components of direction 51 and slope 61 exist (if the number of consecutive directions 51 is greater than direction detection threshold α and slope 61 is greater than slope detection threshold β), components of direction 55 and slope 61 exist at bit position 21. then bit position 22, direction 5
If a component with direction 53 and slope 63 exists, it will be placed in bit position 23, if a component with direction 53 and slope 63 exists, it will be placed in bit position 24, and if a component with direction 58 and slope 64 exists, it will be placed in bit position 2.
5, if a component of direction 54 and slope 64 exists, it is placed in bit position 26, if a component of direction 56 and slope 62 exists, it is placed in bit position 27, and if a component of direction 52 and slope 62 exists, it is placed in bit position 28. , are set to "1", and the other bit positions are set to "0".

以上の処理によって、パタン領域１１の文字部分１０に
は位相幾何的情報と方向情報を保存した特徴ベクトルが
形成される。第３図（ａ）における特徴ベクトル１２の
内容は、左傾斜線分と右傾斜線分とが交差した点１の内
容を示しており、パタン領域１１の文字部分１０上に１
文字線の傾きの状態によって、矢印の記号で示す種々の
特徴ベクトルが抽出される。Through the above processing, a feature vector storing topological information and direction information is formed in the character portion 10 of the pattern area 11. The content of the feature vector 12 in FIG.
Various feature vectors indicated by arrow symbols are extracted depending on the state of the inclination of the character line.

一方、白点に対しては、走査方向に存在する文字部分１
０の特徴を方向情報を保存して予め定められた一定の規
則に従い、特徴ベクトル１２の所定の位置にセットする
。たとえば第３図（ｂ）において、白点２が図中の位置
にあるとき、この点の特徴ベクトル１２は白点であるこ
とを示すビット位置２０が「０」、右方向走査線５１に
対応するビット２１．２２が右傾斜の文字線が存在する
ことを示すｒｏ、ＩＪ　、上方向走査線５３に対応する
ビット２３．２４が水平の文字線が存在することを示す
ｒｌ、ＩＪ　、左方向走査線５５に対応するビット２５
．２６が左傾斜の文字線が存在することを示すｒｌ、０
」、下方向走査線５７に対応するビット２７．２８が文
字線の傾きが一意に定まらないことを示すｒｏ、ＯＪで
表わされる。On the other hand, for the white point, the character part 1 existing in the scanning direction
The feature 0 is set at a predetermined position of the feature vector 12 according to a certain predetermined rule while preserving direction information. For example, in FIG. 3(b), when the white point 2 is located at the position in the figure, the feature vector 12 of this point indicates that the bit position 20 indicating that it is a white point is "0", which corresponds to the rightward scanning line 51. Bits 21 and 22 indicate the presence of a right-slanted character line, ro, IJ, and bits 23 and 24 corresponding to the upward scanning line 53 indicate the presence of a horizontal character line, rl, IJ, and the left direction. Bit 25 corresponding to scan line 55
．． 26 indicates the existence of a left-slanted character line rl, 0
'', bits 27 and 28 corresponding to the lower scanning line 57 are represented by ro and OJ indicating that the slope of the character line is not uniquely determined.

このような処理によってパタンの白点領域は、各白点が
如何なる特徴ベクトルを持つかによっていくつかに分割
される。第３図（ｂ）のパタン領域１１の白点部分１３
上には文字線の存在と傾きの状態によって折れ線で示す
特徴ベクトルが抽出される。Through such processing, the white point area of the pattern is divided into several parts depending on what kind of feature vector each white point has. White dot portion 13 of pattern area 11 in FIG. 3(b)
At the top, feature vectors shown as polygonal lines are extracted depending on the presence of character lines and the state of the inclination.

入力パタンの識別は、上記特徴の存在状況とカテゴリ毎
に用意されている標準特徴のリストを比較して行うか、
あるいは入力パタン領域１１をブロック分割し、ブロッ
ク毎に上記特徴の存在状況とカテゴリのブロック毎に用
意されている標準特徴のリストを比較し、ブロック毎に
設定した条件を全て満足するカテゴリを候補カテゴリと
して選出したり、パタン全体にわたって設定した条件を
満足するカテゴリを候補カテゴリとして選出することで
行う。Input patterns are identified by comparing the presence status of the above features with a list of standard features prepared for each category, or
Alternatively, divide the input pattern area 11 into blocks, compare the existence status of the above features for each block with a list of standard features prepared for each category block, and select a category that satisfies all the conditions set for each block as a candidate category. This is done by selecting a category that satisfies conditions set over the entire pattern as a candidate category.

〔発明が解決しようとするｉｉｌｌＭ）上記従来の文字
認識方式では、入力パタンの識別をパタン全体あるいは
比較しようとするブロック全ての条件を満足しなければ
ならない構成となっているため、パタン領域の一部に雑
音が重畳したり、文字線につぶれやかすれが生じた場合
、その部分の特徴が変化し、入力パタンの属する標準特
徴の条件を満足できなくなり、入力パタンの属するカテ
ゴリを候補カテゴリとして検出できない問題があった。[IillM to be Solved by the Invention] In the conventional character recognition method described above, in order to identify an input pattern, the conditions for the entire pattern or all the blocks to be compared must be satisfied. If noise is superimposed on a part, or if a character line is distorted or blurred, the characteristics of that part will change, and the conditions of the standard feature to which the input pattern belongs will no longer be satisfied, and the category to which the input pattern belongs will be detected as a candidate category. There was a problem that I couldn't do it.

また、類似文字に対しては文字パタン全体で照合を行っ
ていた＼め、その差分が明確に呪われば、その区別が国
難であるという問題があった。In addition, because similar characters were compared using the entire character pattern, there was a problem that if the difference was clearly cursed, the distinction would be a national disaster.

本発明の目的は、入力パタンに雑音が重畳した場合でも
精度よく入力パタンの属するカテゴリを候補カテゴリと
して検出することができ、また、類似文字を精度よく区
別して識別することができる文字認識方法を提供するこ
とにある。An object of the present invention is to provide a character recognition method that can accurately detect the category to which an input pattern belongs as a candidate category even when noise is superimposed on the input pattern, and can also accurately distinguish and identify similar characters. It is about providing.

〔課題を解決するための手段〕[Means to solve the problem]

本発明の文字認識方法においては、まず、入力パタン領
域を複数のブロックに分割し、ブロック毎に入力パタン
の特徴と標準パタンの特徴との一致の度合を比較して複
数の候補カテゴリを選択し、次に、前記複数の候補カテ
ゴリの標準パタン間でブロック単位の一致の度合いを比
較し、一致の度合いの少ないブロックを用いて該ブロッ
ク毎に入力パタンの特徴と標準パタンの特徴とを比較し
て入力パタンの属するカテゴリを決定する。In the character recognition method of the present invention, first, an input pattern area is divided into a plurality of blocks, and a plurality of candidate categories are selected by comparing the degree of coincidence between the features of the input pattern and the features of a standard pattern for each block. Next, the degree of agreement in each block is compared between the standard patterns of the plurality of candidate categories, and the characteristics of the input pattern and the characteristics of the standard pattern are compared for each block using blocks with a low degree of agreement. to determine the category to which the input pattern belongs.

〔作　用〕[For production]

本発明は、入力パタンのパタン領域をブロック分割し、
ブロック毎に標準特徴との一致を採り、一致度の高いも
のを候補カテゴリとし、さらに一致度の高いブロックの
みを用いて入力パタンの属する最終の候補カテゴリを決
定するものである。The present invention divides a pattern area of an input pattern into blocks,
Matching with standard features is determined for each block, those with a high matching degree are set as candidate categories, and only the blocks with a high matching degree are used to determine the final candidate category to which the input pattern belongs.

これにより、雑音等が混入した領域を無視して入力パタ
ンの属するカテゴリを決定できる。Thereby, it is possible to determine the category to which the input pattern belongs while ignoring areas where noise or the like is mixed.

〔実施例〕〔Example〕

以下、本発明の一実施例について図面により説明する。 An embodiment of the present invention will be described below with reference to the drawings.

第１図は本発明の一実施例の構成を示す。図中。FIG. 1 shows the configuration of an embodiment of the present invention. In the figure.

１００は走査部、２００はパタンメモリ部、３００は特
徴抽出部、４００は特徴集積部、５００は識別部、６０
０は制御部、７００は出力端子であり、識別部５００は
標準特徴格納部５１０、一致度計数部５２０、有効ブロ
ック検出部５８０．入力パタンの候補カテゴリ決定部５
４０よりなる。100 is a scanning section, 200 is a pattern memory section, 300 is a feature extraction section, 400 is a feature accumulation section, 500 is an identification section, 60
0 is a control unit, 700 is an output terminal, and the identification unit 500 includes a standard feature storage unit 510, a matching degree counting unit 520, a valid block detection unit 580. Input pattern candidate category determining unit 5
Consists of 40.

制御部６００の指令により、走査部１００が紙面を走査
することにより１紙面上の１文字を含む領域（第３図の
パタン領域１１）の黒および白の２値信号が、パタンメ
モリ部２００のパタンメモリ２１０の所定のアドレスに
１点１語として順次格納される。パタン領域１１内の信
号がパタンメモリ２１０に格納され終ると、パタンメモ
リ部２００は制御信号６００の指令を受け、パタンメモ
リ２１０の内容を第４図の定義に従ってラスク走査し、
順次、黒白の情報を特徴抽出部３００に転送する。特徴
抽出部３００は、パタンメモリ２１０から転送されてく
る各点の信号が黒を表わす信号が白を表わす信号かを判
断して、黒点が連続する場合にその数を計数し、黒点の
特徴抽出用の情報とする。この情報を８方向について求
め、それぞれの計数値を予め定められた定数値と比較す
ることによって文字部分１０の特徴を抽出し、パタンメ
モリ２１０のアドレスの所定の位置に格納する。In response to a command from the control unit 600, the scanning unit 100 scans the paper, and the black and white binary signals of the area (pattern area 11 in FIG. 3) containing one character on one paper are stored in the pattern memory unit 200. Each point is sequentially stored at a predetermined address in the pattern memory 210, one word at a time. When the signals in the pattern area 11 have been stored in the pattern memory 210, the pattern memory section 200 receives a command from the control signal 600 and scans the contents of the pattern memory 210 according to the definition in FIG.
The black and white information is sequentially transferred to the feature extraction unit 300. The feature extraction unit 300 determines whether the signal of each point transferred from the pattern memory 210 is a signal representing black or a signal representing white, counts the number of consecutive black points, and extracts the characteristics of the black points. This information is for use only. This information is obtained in eight directions, and the characteristics of the character portion 10 are extracted by comparing each count value with a predetermined constant value, and stored in a predetermined address position of the pattern memory 210.

文字部分１０の特徴が抽出され終ると、パタンメモリ部
２００は制御部６００の指令を受け、パタンメモリ２１
０の内容を方向５１、方向５５、方向５７の４方向にラ
スク走査し、先に生成された黒点の特徴を順次特徴集積
部４００に転送する。When the features of the character portion 10 have been extracted, the pattern memory section 200 receives a command from the control section 600 and transfers the pattern memory 21
The content of 0 is rask-scanned in four directions: direction 51, direction 55, and direction 57, and the features of the previously generated black dots are sequentially transferred to the feature accumulating unit 400.

特徴集積部４００は、パタンメモリ２１０から転送され
てくる各点の信号が黒を表わす信号か、白を表わす信号
かを判断して、白点である場合にはラスク走査ごとに以
前に出現した黒点の特徴を集積し、白点部分に特徴を形
成し、パタンメモリ２１０のアドレスの所定の位置に格
納する。The feature accumulation unit 400 determines whether the signal of each point transferred from the pattern memory 210 is a signal representing black or a signal representing white, and if it is a white point, the feature accumulation unit 400 determines whether the signal of each point transferred from the pattern memory 210 is a signal representing black or a signal representing white. The features of the black dots are accumulated, the features are formed in the white dots, and the features are stored at a predetermined address position in the pattern memory 210.

このようにしてパタン領域１１の特徴抽出が終了すると
、パタンメモリ部２００のパタンメモリ２１０に形成さ
れた内容は、制御部６００の制御の基にパタンメモリ２
１０をブロック分割し、ブロック毎に識別部５００に順
次転送される。識別部５００の動作目的は、パタンメモ
リ２１０上の特徴から入力パタンの属するカテゴリが何
であるかを決定することにある。When the feature extraction of the pattern area 11 is completed in this way, the contents formed in the pattern memory 210 of the pattern memory section 200 are transferred to the pattern memory 210 under the control of the control section 600.
10 is divided into blocks, and each block is sequentially transferred to the identification unit 500. The purpose of the operation of the identification unit 500 is to determine the category to which the input pattern belongs based on the characteristics on the pattern memory 210.

識別部５００では、まず一致度計数部５１０でパタンメ
モリ２１０中の客語についてブロック毎に標準特徴格納
部５１０の標準特徴と一致するコードが存在するか否か
を計数し、一致度の高いものから候補カテゴリを抽出し
、候補カテゴリと一致度とを後述のブロック毎の一致度
算出部５２６内の格納メモリに記録する。次に有効ブロ
ック検出部５３０では、ブロック毎に検出された一致度
が有効であるか否かのチエツクを行ない、候補カテゴリ
決定部５４０では有効なブロックのみを用いて標準特徴
と一致度の高いカテゴリを入力パタンの属するカテゴリ
と決定し、その結果を出力端子７００に出力する。In the identification unit 500, first, a matching degree counting unit 510 counts whether or not there is a code that matches the standard feature in the standard feature storage unit 510 for each block of the guest words in the pattern memory 210, and selects a code with a high matching degree. A candidate category is extracted from the block, and the candidate category and the matching degree are recorded in a storage memory in a matching degree calculation unit 526 for each block, which will be described later. Next, the valid block detection unit 530 checks whether the degree of coincidence detected for each block is valid or not, and the candidate category determination unit 540 uses only valid blocks to determine categories that have a high degree of coincidence with the standard features. is determined as the category to which the input pattern belongs, and the result is output to the output terminal 700.

第２図は識別部５００の詳細構成である。パタンメモリ
部２００から転送される入力パタンの１語９ビツトから
成る特徴は、４１準特徴格納部５１０の内容と一致がと
れるように変換された後、ブロック毎の特徴格納メモリ
５２１に記録される。FIG. 2 shows the detailed configuration of the identification section 500. The feature consisting of 9 bits per word of the input pattern transferred from the pattern memory section 200 is converted to match the contents of the 41 quasi-feature storage section 510, and then recorded in the feature storage memory 521 for each block. .

標準特徴の構成は第６に示すように、カテゴリＣｋのブ
ロックＢ１毎に存在しなければならない特徴Ｐ０とその
基準値Ｔａｎ（Ｐ特徴）と、存在してはいけない特徴Ｎ
ａｐ（Ｎ特徴）とが記述されたものであり、計数回路５
２５では特徴格納メモリ５２１の内容と、標準特徴とを
ＡＮＤ回路５２２゜ＮＡＮＤ回路５２３を通した後、両
信号をＯＲ回路５２４を通し、これから上って来る信号
を計数回路５２５によって計数する。計数結果は標準特
徴格納部５１０内に予め用意されたカテグリＣｋのブロ
ックＢ、の標準値や標準閾値と照合して、ブロック毎の
一致度算出部５２６にその値を送出する。一致度算出部
５２６では、第７図に示すようなブロックごとの一致度
（ここではＰ特徴の一致度とＮ特徴との一致度とを加算
した標準特徴からの距離値Ａｉｍ）と、その合計値Ａｋ
とを算出する。The configuration of the standard features, as shown in the sixth section, is a feature P0 that must exist for each block B1 of category Ck, its reference value Tan (P feature), and a feature N that must not exist.
ap (N features) is described, and the counting circuit 5
25, the contents of the feature storage memory 521 and the standard feature are passed through an AND circuit 522 and a NAND circuit 523, and then both signals are passed through an OR circuit 524, and the signals coming up from this are counted by a counting circuit 525. The counting result is compared with the standard value and standard threshold value of block B of category Ck prepared in advance in the standard feature storage unit 510, and the value is sent to the matching degree calculating unit 526 for each block. The matching degree calculation unit 526 calculates the matching degree for each block as shown in FIG. value Ak
Calculate.

次に有効ブロック検出部５３０では、まず、係数算出回
路５３１において全ブロックの候補カテゴリの一致から
有効ブロックを検出するための閾値設定用の係数ａ求め
、閾値設定部５３３へ送出する。すなわち閾値設定用の
係数αは注目カテゴリＣｋのブロックの一致度の平均値
あるいは一致度の分布の一定の割合を求めた値、あるい
はカテゴリの決定に使用するブロック数を満足する値と
することができる。Next, in the valid block detecting section 530, first, a coefficient a for setting a threshold for detecting a valid block is calculated from the matching of the candidate categories of all blocks in a coefficient calculating circuit 531, and the coefficient a is sent to the threshold setting section 533. In other words, the coefficient α for setting the threshold value may be a value obtained by calculating the average value of the degree of coincidence of the blocks of the category of interest Ck or a certain percentage of the distribution of the degree of coincidence, or a value that satisfies the number of blocks used to determine the category. can.

閾値設定部５３３では送られてきた係数を基に。The threshold value setting unit 533 uses the sent coefficients.

一致度に対する重ねをみつけたり、カテゴリ間の閾値設
定用係数のバラツキを補正して、カテゴリ毎のブロック
の採否を決める閾値ＴＨを設定する。A threshold TH is set for determining whether a block is accepted or rejected for each category by finding an overlap in the degree of coincidence or by correcting variations in threshold setting coefficients between categories.

次に、この閾値ＴＨと各ブロック毎に抽出された候補カ
テゴリの一致度とを比較回路５３２によって比較し、一
致度が閾値より高い値を示すブロックを有効ブロックと
みなして、候補カテゴリごとに整理し、その一致度を候
補カテゴリ決定部５４０へ送出する。候補カテゴリ決定
部５４０では候補カテゴリ決定回路５４２において、カ
テゴリ毎に一致度を集計し、他のカテゴリの一致度と比
較することによって入力文字の属するカテゴリを決定す
る。Next, the comparison circuit 532 compares this threshold TH with the degree of matching of the candidate category extracted for each block, and blocks whose degree of matching is higher than the threshold are regarded as valid blocks and organized into candidate categories. Then, the matching degree is sent to candidate category determining section 540. In the candidate category determining unit 540, the candidate category determining circuit 542 totals the degree of matching for each category, and determines the category to which the input character belongs by comparing the degree of matching with the degree of matching of other categories.

本文字認識方法を用いれば、従来識別が困難であった入
カバターンを識別することができる。たとえば、第８図
（ａ）に示す漢字については、破線の丸で囲った部分５
５１が文字線切れによって消滅した場合でも、第８図（
ｂ）に示すように。By using this character recognition method, it is possible to identify inset cover patterns, which have been difficult to identify in the past. For example, for the kanji shown in Figure 8(a), the part 5 surrounded by the dashed line
Even if 51 disappears due to character line breakage, the figure 8 (
As shown in b).

ブロック５５２を無視することによって、残った部分の
ブロックで入力文字を精度よく識別することができる。By ignoring block 552, input characters can be accurately identified using the remaining blocks.

なお１本方式を差分の大きい部分に注目すれば類似文字
を区別することができる。すなわち、入力パタンに対し
一致度の高いカテゴリが複数個存在した場合、そのカテ
ゴリ対において一方の一致度の低いブロックのみに注目
して互いの一致度を検出し、それらの一致度の高いカテ
ゴリを求めればよい。Note that similar characters can be distinguished in the one-line method by focusing on parts with large differences. In other words, when there are multiple categories with a high degree of agreement with the input pattern, the degree of agreement with each other is detected by focusing only on one of the blocks with a low degree of agreement in the category pair, and the categories with a high degree of agreement are selected. All you have to do is ask.

〔発明の効果〕〔Effect of the invention〕

以上説明したように１本発明によれば、入力パタンのパ
タン領域をブロック分割して、ブロック毎に計数した特
徴と、予めカテゴリのブロック毎に用意された特徴とを
照合し、カテゴリごとにどのブロックが有効なのか、無
効なのかを判定し、有効なブロックのみを用いて再度判
定することにより、雑音が混入した領域を無視して、入
力パタンの属するカテゴリを決定することができる。そ
のため、文字線切れや、インクのにじみなどが混入した
文字でもＬＩＰ識することができる利点がある。As explained above, according to the present invention, the pattern area of the input pattern is divided into blocks, and the features counted for each block are compared with the features prepared in advance for each category block. By determining whether a block is valid or invalid and performing the determination again using only valid blocks, the category to which the input pattern belongs can be determined while ignoring areas where noise is mixed. Therefore, there is an advantage that even characters with broken lines or ink smears can be recognized by LIP.

また、文字品質のよい文字パターンに対しては差分の大
きいブロックに注目して、類似文字を区別することがで
きる。Further, for character patterns with good character quality, similar characters can be distinguished by focusing on blocks with large differences.

【図面の簡単な説明】[Brief explanation of the drawing]

第１図は本発明の一実施例の構成図、第２図は入力パタ
ンの属するカテゴリを決定するための識別部の一実施例
の詳細構成図、第３図は従来の文字読取方法を概念的に
説明する図、第４図は方向の定義の説明図、第５図は傾
斜の定義の説明図。巣６図はパタン照合用の標準特徴の構成図、第７図は各
カテゴリの一致度とブロック毎の一致とを示す構成図、
第８図は本発明によって識別が容易になった文字パタン
例の説明図である。１００・・・走査部、　２００・・・パタンメモリ部。２１０・・・パタンメモリ、　３００・・・特徴抽出部
。４００・・・特徴集積部、　５００・・・識別部。５１０・・・標準特徴格納部、５２０・・・一致度計数部、５３０・・・有効ブロック検出部、５４０・・・候補カテゴリ決定部、６００・・・制御部、　　７００・・・出力端子。Ｗｈｓ因（α）（ｂ）第４図第６図第７図Fig. 1 is a block diagram of an embodiment of the present invention, Fig. 2 is a detailed block diagram of an embodiment of an identification section for determining the category to which an input pattern belongs, and Fig. 3 is a conceptual diagram of a conventional character reading method. FIG. 4 is an explanatory diagram of the definition of direction, and FIG. 5 is an explanatory diagram of the definition of inclination. Figure 6 is a configuration diagram of standard features for pattern matching, and Figure 7 is a configuration diagram showing the degree of matching for each category and the matching for each block.
FIG. 8 is an explanatory diagram of an example of a character pattern that is easy to identify according to the present invention. 100... Scanning section, 200... Pattern memory section. 210...Pattern memory, 300...Feature extraction unit. 400...Feature accumulation unit, 500...Identification unit. 510... Standard feature storage section, 520... Matching degree counting section, 530... Valid block detection section, 540... Candidate category determining section, 600... Control section, 700... Output terminal. Whs factor (α) (b) Figure 4 Figure 6 Figure 7

Claims

【特許請求の範囲】[Claims]

（１）黒、白の２値パタンを含む入力パタンの各点に、
文字の位相幾何的特徴を集積して入力パタンの特徴の抽
出を行い、該抽出した入力パタンの特徴と標準パタンと
の一致をとって入力パタンの属するカテゴリを決定する
文字認識方法において、まず、入力パタン領域を複数のブロックに分割し、ブロ
ック毎に入力パタンの特徴と標準パタンの特徴との一致
の度合を比較して複数の候補カテゴリを選択し、次に、前記複数の候補カテゴリの標準パタン間でブロッ
ク単位の一致の度合いを比較し、一致の度合いの少ない
ブロックを用いて該ブロック毎に入力パタンの特徴と標
準パタンの特徴とを比較して入力パタンの属するカテゴ
リを決定することを特徴とする文字認識方法。(1) At each point of the input pattern including the binary pattern of black and white,
In a character recognition method that extracts features of an input pattern by accumulating topological features of characters, and matches the extracted features of the input pattern with a standard pattern to determine the category to which the input pattern belongs, first, Divide the input pattern area into multiple blocks, select multiple candidate categories by comparing the degree of matching between the input pattern features and the standard pattern features for each block, and then select the standard of the multiple candidate categories. The degree of agreement in each block between patterns is compared, and the category to which the input pattern belongs is determined by comparing the characteristics of the input pattern and the characteristics of the standard pattern for each block using blocks with a low degree of agreement. Characteristic character recognition method.