JP2856409B2

JP2856409B2 - Character recognition apparatus and method

Info

Publication number: JP2856409B2
Application number: JP1042944A
Authority: JP
Inventors: 由明黒沢
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1989-02-27
Filing date: 1989-02-27
Publication date: 1999-02-10
Anticipated expiration: 2014-02-10
Also published as: JPH02224085A

Description

【発明の詳細な説明】〔発明の目的〕（産業上の利用分野）本発明は文字認識結果に対して誤読している部分を修
正し正しい認識結果を得る文字認識装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Industrial application field) The present invention relates to a character recognition device that corrects a misread portion of a character recognition result and obtains a correct recognition result.

（従来の技術）文字認識装置は画像入力部，文字検出部，文字認識
部，後処理部より構成され、文字認識結果のあいまい性
を文脈や単語の知識に基づいて修正することにより最終
的な認識率の向上をはかっている。(Prior Art) A character recognition device is composed of an image input unit, a character detection unit, a character recognition unit, and a post-processing unit, and corrects the ambiguity of the character recognition result based on context and knowledge of words to obtain a final result. The aim is to improve the recognition rate.

しかしながら、文脈や単語の知識にない文脈や単語が
入力された時や、低品質文字が入力された時には最終的
な誤読文字を救済する手段はこれまで開発されていなか
った。However, no means has been developed to rescue the final misread characters when a context or word that is unknown to the context or word is input, or when a low-quality character is input.

（発明が解決しようとする課題）このように従来の文字認識装置では、未知語や低品質
文字に対し救済手段が無いという欠点が有った。(Problems to be Solved by the Invention) As described above, the conventional character recognition device has a disadvantage that there is no remedy for unknown words or low-quality characters.

本発明は文脈や単語の知識にない文脈や単語が入力さ
れた時や、低品質文字が入力された時でも誤読すること
なく、全体として高い認識率を得ることのできる文字認
識装置を提供することを目的としたものである。The present invention provides a character recognition device that can obtain a high recognition rate as a whole without misreading even when a context or word that is not known to the context or word is input, or when a low-quality character is input. It is intended for that purpose.

〔発明の構成〕[Configuration of the invention]

（課題を解決するための手段）本発明は、文字パターンを含む入力画像から文字を検
出し切り出す文字検出手段と、この文字検出手段によっ
て切り出された文字を第１の文字認識用辞書を用いて認
識する第１の文字認識手段と、この第１の文字認識手段
の認識結果に対し文脈に関する知識を用いて誤読部分を
修正する後処理手段と、この後処理手段による後処理結
果と前記第１の文字認識手段の認識結果とを比較するこ
とにより誤読文字を推定する推定手段と、この推定手段
による推定結果に基づいて前記入力画像の文字パターン
から第２の文字認識用辞書を作成する辞書作成手段と、
この辞書作成手段によって作成された前記第２の文字認
識用辞書を用いて、後処理が不成功に終わった個所の前
記入力画像の文字パターンを再認識する第２の文字認識
手段を有することを特徴とする。(Means for Solving the Problems) The present invention uses a character detecting means for detecting and cutting out characters from an input image including a character pattern, and a character cut out by the character detecting means using a first character recognition dictionary. First character recognizing means for recognizing, post-processing means for correcting an erroneously read portion using knowledge about context with respect to the recognition result of the first character recognizing means; Estimating means for estimating a misread character by comparing the recognition result of the character recognizing means with the character recognition means, and dictionary creation for creating a second character recognition dictionary from the character pattern of the input image based on the estimation result by the estimating means Means,
A second character recognizing means for re-recognizing a character pattern of the input image at a place where the post-processing is unsuccessfully performed, using the second character recognition dictionary created by the dictionary creating means. Features.

（作用）本発明では、画像入力・文字検出・第１の文字認識・
後処理の実行後、その結果の信頼性が低いと判断される
部分について、入力帳票上の文字パターンを用いて再
度、文字の検出・切り出し、及び文字認識を行なうこと
が可能となる。(Operation) In the present invention, image input, character detection, first character recognition,
After the execution of the post-processing, it is possible to perform character detection / cutout and character recognition again using the character pattern on the input form for the portion of the result determined to be unreliable.

（実施例）第１図は本発明を実施例するための概念ブロック図で
ある。文字が印刷または記入された帳票の画像データは
電気信号に変えられて１の画像入力部より入力される。
次に、２の文字検出部において、この画像データから文
字行が抽出され、さらにその中から文字が取り出され
る。この取り出された文字パターンは３の第１の文字認
識部で各々認識され、その認識結果として、たとえば複
数の候補文字が決められる。各文字ごとに得られた複数
の候補文字は４の後処理部に送られる。４ではこの文脈
に関する知識、たとえば単語データからなる知識辞書等
によって正しいと推定される文字列を入力された文字認
識結果より推定して出力する。この結果はさらに５の再
認識部へ送られて、次に述べる処理が実行される。最終
的な結果が５から出力され、これが最終的な文字認識結
果となる。(Embodiment) FIG. 1 is a conceptual block diagram for embodying the present invention. Image data of a form on which characters are printed or written are converted into electric signals and input from one image input unit.
Next, in the second character detection unit, a character line is extracted from the image data, and a character is extracted therefrom. The extracted character patterns are respectively recognized by the three first character recognition units, and as a result of the recognition, for example, a plurality of candidate characters are determined. The plurality of candidate characters obtained for each character are sent to four post-processing units. In step 4, a character string that is presumed to be correct based on knowledge about the context, for example, a knowledge dictionary composed of word data, is estimated from the input character recognition result and output. The result is further sent to the re-recognition unit 5 to execute the processing described below. The final result is output from 5, which is the final character recognition result.

次に５における動作を説明する。第２図は再認識部を
詳細なブロック図で記述したものである。第３図は本実
施例におけるデータの流れを示したものである。20は入
力画像で、これを文字認識した結果が21である。この例
では文字認識結果の上位３候補を図示した。図中の○印
が正解候補である。文脈に関する知識を後処理として使
うことにより、この結果は22のように修正される。ここ
では「人機」なる単語が「入機」と認識されているが、
「人機」が単語辞書に登録されていないために、文脈に
関する知識が使用できず、修正されていない。21と22の
データが第２図10の正解文字推定部に入力される。Next, the operation in 5 will be described. FIG. 2 illustrates the re-recognition unit in a detailed block diagram. FIG. 3 shows a data flow in this embodiment. Reference numeral 20 denotes an input image, and the result of character recognition of the input image is 21. In this example, the top three candidates of the character recognition result are shown. The circles in the figure are correct answer candidates. By using contextual knowledge as post-processing, this result is modified as in 22. Here, the word "human machine" is recognized as "entrance",
Since "human machine" is not registered in the word dictionary, knowledge about the context cannot be used and is not corrected. The data of 21 and 22 are input to the correct character estimating section of FIG.

正解文字推定部では、第１の認識結果から得られた第
１位の候補文字のうち後処理の結果修正が生じた文字は
誤読文字で、後処理結果の文字が正解文字であると判断
し、第３図の例では22の○印の部分をリストアップし、
これを第２認識辞書作成部11に送る。第２辞書作成部で
はリストアップされた文字の入力文字パターンに基づい
て新たにその文字の文字認識用辞書を作成する。この辞
書は第３図23に示されるものであり、これが12の第２認
識辞書記憶部に格納される。The correct character estimating unit determines that the character that has been corrected as a result of the post-processing among the first-ranked candidate characters obtained from the first recognition result is a misread character and the character of the post-processing result is a correct character. In the example of FIG. 3, a list of 22 circles is listed,
This is sent to the second recognition dictionary creating section 11. The second dictionary creation unit creates a new character recognition dictionary for the character based on the input character pattern of the listed character. This dictionary is shown in FIG. 23, and is stored in the 12 second recognition dictionary storage units.

一方、21の認識結果、及び22の後処理結果は13の読取
結果蓄積部に蓄積されている。帳票の認識が終了した後
か、または途中において、13に蓄積されている認識結果
を第２認識実行判定部14に送り、ここで各文字につい
て、第２認識が必要かどうかを判断する。第３図の例で
は後処理が不成功に終った「人機」の部分が第２認識の
実行部分として選択される。そこで、２の文字検出部よ
り再度「人機」の部分の画像パターンが取り出されて、
第２認識部へ送られる。第２認識部では、すでに登録さ
れている23の辞書を使用して「人機」の部分を正しく認
識し、これを出力部に送出する。出力部では第２認識部
に送られなかった部分の読取り結果と第２認識部分の認
識結果を統合して、最終出力結果を出力する。On the other hand, the recognition result of 21 and the post-processing result of 22 are stored in 13 reading result storage units. After or halfway through the recognition of the form, the recognition result stored in 13 is sent to the second recognition execution determination unit 14, where it is determined whether the second recognition is necessary for each character. In the example of FIG. 3, the part of the “human machine” for which the post-processing has been unsuccessful is selected as the execution part of the second recognition. Then, the image pattern of the "human machine" portion is again taken out from the character detection unit 2 and
It is sent to the second recognition unit. The second recognition unit correctly recognizes the “human machine” portion using the 23 registered dictionaries, and sends it to the output unit. The output unit integrates the read result of the part not sent to the second recognition unit and the recognition result of the second recognition unit, and outputs the final output result.

ここで、第２認識部に入力された文字が読取不能であ
るならばまたは、第２の読取結果が最初の読取結果に比
べて信頼性に劣ると判定されれば、第１の文字認識部の
読取結果を最終結果として出力する。Here, if the character input to the second recognition unit cannot be read, or if it is determined that the second reading result is less reliable than the first reading result, the first character recognition unit Is output as the final result.

第２の認識方式としては次のような簡便な方式で認識
を実行する。As a second recognition method, recognition is performed by the following simple method.

まず、辞書を作成する元となるパターン31を太らせて
パターン32を作成し、これを辞書とする。認識は、入力
パターン33を太らせてパターン34を作成し、これと辞書
パターン32との画素ごとにおける排他的論理和を計算
し、パターン38を作り、32と34の不一致部すなわち、35
の部分の面積を計算し、これと、一致部分すなわち37の
面積の比を算出し、不一致部分の小さい辞書に対応する
文字を認識結果とする。この方式では太らせる操作や画
素ごとの排他論理和はランレングス表現のパターンにお
いても直接行なうことが可能なので、パターンや辞書の
表現はランレングス表現も可能である。First, a pattern 32, which is a basis for creating a dictionary, is fattened to create a pattern 32, which is used as a dictionary. For recognition, a pattern 34 is created by fattening the input pattern 33, an exclusive OR of this pattern with the dictionary pattern 32 is calculated for each pixel, and a pattern 38 is created.
Is calculated, and the ratio of the area of the matching part, that is, the area of 37, is calculated, and the character corresponding to the dictionary having the small mismatching part is determined as the recognition result. In this method, the fattening operation and the exclusive OR for each pixel can be directly performed even in a run-length expression pattern, so that the pattern and dictionary expressions can also be run-length expressions.

第５図は別の実施例を説明するための図である。ここ
で42,43,44はそれぞれ第２認識辞書記憶部に格納された
パターンの一部である。41は、３個の文字が接触してい
るために個々の文字が切り出せず、そのままペンディン
グ、または強制的に読取った為に誤読しているパターン
である。再認識部はこのパターンを入力し左側の部分に
ついて各辞書パターンを重ねあわせて、その一致度を調
査し、その中で一致度の高い辞書パターン42に対して重
なる部分45をその文字としてパターン41から切り出す。
同様にして順次パターン43に一致する46を切り出し、パ
ターン44に一致する47を切り出すことにより、パターン
41から各文字を切り出す。このような方式によって複雑
に入組んだ接触文字も確実に切り出すことが可能とな
る。ここで、切り出したパターンについてあらためて、
確認の意味で第１の文字認識方式を適用して認識結果を
チェックしても良い。FIG. 5 is a diagram for explaining another embodiment. Here, 42, 43, and 44 are a part of the patterns stored in the second recognition dictionary storage unit, respectively. Reference numeral 41 denotes a pattern in which individual characters cannot be cut out because three characters are in contact with each other, and are erroneously read because they are pending or forcibly read. The re-recognition unit inputs this pattern, superimposes the respective dictionary patterns on the left part, examines the degree of coincidence, and sets a part 45 overlapping the dictionary pattern 42 having a high degree of coincidence as a character in the pattern 41. Cut out from
Similarly, by sequentially cutting out 46 that matches the pattern 43 and cutting out 47 that matches the pattern 44, the pattern
Cut out each character from 41. With such a method, it is possible to reliably cut out complicated contact characters. Here, once again about the cut out pattern,
For the purpose of confirmation, the recognition result may be checked by applying the first character recognition method.

第１の実施例では第１の文字認識結果と後処理に基づ
いて正解文字を推定したが、第２の実施例では後処理部
を使わずに第１の文字認識結果における認識の信頼性に
関する評価値、たとえば類似度を用いて推定を行なうよ
うにしても良い。In the first embodiment, the correct character is estimated based on the first character recognition result and the post-processing. However, in the second embodiment, the reliability of the recognition in the first character recognition result is reduced without using the post-processing unit. The estimation may be performed using an evaluation value, for example, the similarity.

なお、本発明は上記実施例に限定されるものではな
い。第１の認識部の認識方式や第２の認識部の認識はど
のようなものであっても良く、上記実施例に限定されな
い。第１の認識方式と第２認識方式が同一であっても良
い。また、後処理の方式もどのようなもので良い。再認
識を実行するタイミングもいつでも良い。また、どの部
分を再認識すべきかという判断も後処理結果のみを使う
のではなく、第１の文字認識結果の信頼性も評価値の一
部として使用することができる。The present invention is not limited to the above embodiment. The recognition method of the first recognition unit or the recognition of the second recognition unit may be any method, and is not limited to the above embodiment. The first recognition method and the second recognition method may be the same. In addition, any post-processing method may be used. The timing of executing the re-recognition may be anytime. Further, the determination of which part should be re-recognized does not use only the post-processing result, but also the reliability of the first character recognition result can be used as a part of the evaluation value.

また、すべての文字を再認識の対象としても良い。認
識に使う辞書は太らせることなく、原パターンそのもの
であっても良い。第２の認識用の辞書の蓄積は複数の帳
票にまたがっても良い。In addition, all characters may be set as targets for re-recognition. The dictionary used for recognition may be the original pattern itself without being fat. The storage of the second recognition dictionary may extend over a plurality of forms.

〔発明の効果〕〔The invention's effect〕

本発明によれば入力文字の品質が低い時や、後処理で
使用する単語辞書内に存在しない単語が入力された時な
どに起る誤読に対して、帳票内の文字パターンそのもの
を使用して認識を行なうので、誤読の修正が可能とな
り、結果として高い読取精度を確保することができ、実
用上絶大なる効果を発揮することができる。According to the present invention, when the quality of input characters is low, or when a word that does not exist in the word dictionary used in post-processing is input, for example, when misreading occurs, the character pattern itself in the form is used. Since recognition is performed, erroneous reading can be corrected. As a result, high reading accuracy can be ensured, and a practically tremendous effect can be exhibited.

【図面の簡単な説明】[Brief description of the drawings]

第１図は本発明の一実施例の全体構成図、第２図は本発
明における再認識部の構成図、第３図は本発明のデータ
の流れを表わした図、第４図は元パターンと辞書パター
ンとのマッチングの様子を示す図、第５図は接触文字パ
ターンに対する切出しの様子を示す図である。１……画像入力部、２……文字検出部３……第１の文字認識部、４……後処理部５……再認識部FIG. 1 is an overall configuration diagram of an embodiment of the present invention, FIG. 2 is a configuration diagram of a re-recognition unit in the present invention, FIG. 3 is a diagram showing a data flow of the present invention, and FIG. FIG. 5 is a view showing a state of matching between a character pattern and a dictionary pattern, and FIG. 5 is a view showing a state of cutting out a contact character pattern. DESCRIPTION OF SYMBOLS 1 ... Image input part 2 ... Character detection part 3 ... 1st character recognition part 4 ... Post-processing part 5 ... Re-recognition part

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/62 G06K 9/03 G06K 9/68 G06K 9/72──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁶ , DB name) G06K 9/62 G06K 9/03 G06K 9/68 G06K 9/72

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】文字パターンを含む入力画像から文字を検
出し切り出す文字検出手段と、この文字検出手段によって切り出された文字を第１の文
字認識用辞書を用いて認識する第１の文字認識手段と、この第１の文字認識手段の認識結果に対し文脈に関する
知識を用いて誤読部分を修正する後処理手段と、この後処理手段による後処理結果と前記第１の文字認識
手段の認識結果とを比較することにより誤読文字を推定
する推定手段と、この推定手段による推定結果に基づいて前記入力画像の
文字パターンから第２の文字認識用辞書を作成する辞書
作成手段と、この辞書作成手段によって作成された前記第２の文字認
識用辞書を用いて、後処理が不成功に終わった個所の前
記入力画像の文字パターンを再認識する第２の文字認識
手段を有することを特徴とする文字認識装置。1. A character detecting means for detecting and cutting out a character from an input image including a character pattern, and a first character recognizing means for recognizing a character cut out by the character detecting means using a first character recognition dictionary. A post-processing unit that corrects a misread portion using knowledge about a context with respect to the recognition result of the first character recognition unit; a post-processing result of the post-processing unit and a recognition result of the first character recognition unit. Estimating means for estimating misread characters by comparing the following; dictionary creating means for creating a second character recognition dictionary from the character pattern of the input image based on the estimation result by the estimating means; A second character recognizing means for re-recognizing a character pattern of the input image at a place where post-processing is unsuccessful, using the created second character recognition dictionary; Character recognition apparatus according to claim and.

【請求項２】文字パターンを含む入力画像から文字を検
出して切り出し、切り出された前記文字を第１の文字認識用辞書を用いて
第１の文字認識を行い、この第１の文字認識の認識結果に対し文脈に関する知識
を用いて誤読部分を修正する後処理を行い、後処理された結果と前記第１の文字認識の認識結果とを
比較することにより誤読文字を推定し、推定された結果に基づいて前記入力画像の文字パターン
から第２の文字認識用辞書を作成し、作成された前記第２の文字認識用辞書を用いて、後処理
が不成功に終わった個所の前記入力画像の文字パターン
を再認識する第２の文字認識を行うことを特徴とする文
字認識方法。2. A method according to claim 1, wherein a character is detected from an input image including a character pattern and cut out, and the cut out character is subjected to first character recognition using a first character recognition dictionary. The recognition result is subjected to post-processing for correcting the misread portion using the knowledge about the context, and the post-processing result is compared with the recognition result of the first character recognition to estimate the misread character. Based on the result, a second dictionary for character recognition is created from the character pattern of the input image. Using the created second dictionary for character recognition, the input image at a location where post-processing is unsuccessful A second character recognition for re-recognizing said character pattern.