JPH0816724A

JPH0816724A - Character recognition device

Info

Publication number: JPH0816724A
Application number: JP6144245A
Authority: JP
Inventors: Yasuhiro Ura; 康裕浦
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1994-06-27
Filing date: 1994-06-27
Publication date: 1996-01-19
Anticipated expiration: 2017-06-17
Also published as: JP3292595B2

Abstract

PURPOSE:To properly recognize a handwritten character without being affected by the habit of a person who writes the character by recognizing the character by extracting any typical character from characters to be inputted, registering that typical character and collating the characters to be inputted with the registered typical character. CONSTITUTION:Concerning the character recognition device for recognizing the character by segmenting the character to be inputted and collating it with a dictionary 1, this device is provided with a dictionary registering means 18 for extracting and registering the typical character among characters, second dictionary means 19 on which the typical character is registered by the dictionary registering means 18, and second recognition means 20 for recognizing the character by collating the character to be inputted with the typical character. Then, the typical character such as a character in the shortest distance from the dictionary 17 is extracted among the characters to be inputted, for example, and registered on the second dictionary 19 and the character recognition is performed by collating the registered typical character with the characters to be inputted. Thus, the handwritten character can be properly recognized without being affected by the habit of the person who writes the character.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字を入力して文字認
識結果を出力する文字認識装置に関する。文字認識装置
として、例えば保険契約書，売上伝票，払込取扱書など
の帳票上に印刷または記入された文字を読み取り、認識
した結果を画面上やプリンタに出力、またはメモリ上に
保存する装置がある。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for inputting characters and outputting a character recognition result. As a character recognition device, for example, there is a device that reads characters printed or entered on forms such as insurance contracts, sales slips, payment handling manuals, etc., and outputs the recognized results on a screen or a printer or in a memory. .

【０００２】手書き文字を認識する場合には、手書き文
字の字形がその文字を書いた人の癖に左右されやすく、
正確に識別するのは困難であった。したがって、手書き
文字の認識を正確に行うことができる文字認識装置の開
発が要望されていた。When recognizing a handwritten character, the shape of the handwritten character is easily influenced by the habit of the person who wrote the character,
It was difficult to identify accurately. Therefore, it has been desired to develop a character recognition device that can accurately recognize handwritten characters.

【０００３】[0003]

【従来の技術】従来の文字認識装置としては、例えば図
７に示すようなものがある。図７において、１は帳票の
入力部であり、入力部１はイメージスキャナよりなり、
帳票の入力を行う。２はフィールド検出部であり、フィ
ールド検出部２は、入力部１より入力された帳票のフィ
ールドを検出する。フィールドは、例えば郵便番号のよ
うなひとまとまりの記入単位を指す。2. Description of the Related Art As a conventional character recognition device, for example, there is one shown in FIG. In FIG. 7, reference numeral 1 denotes a form input unit, and the input unit 1 is an image scanner,
Enter the form. Reference numeral 2 denotes a field detection unit, and the field detection unit 2 detects a field of the form input from the input unit 1. A field refers to a unit of entry, such as a postal code.

【０００４】３は文字切出し部であり、文字切出し部３
はフィールド検出部２で検出したフィールド内から１文
字ごとのイメージを切り出す。４は認識部であり、認識
部４はフィールド上の切り出された文字と辞書５とをテ
ンプレートを用いて照合し、文字認識を行う。認識部４
で文字認識した認識結果は、文字コードとして出力部６
に送られる。出力部６は表示部またはプリンタよりな
り、文字コードに対応する認識文字を表示または印字す
る。Reference numeral 3 is a character cutout portion, and the character cutout portion 3
Cuts out an image for each character from the field detected by the field detector 2. Reference numeral 4 is a recognition unit, and the recognition unit 4 collates the cut out characters on the field with the dictionary 5 using a template to perform character recognition. Recognition unit 4
The recognition result obtained by recognizing the characters is output as a character code by the output unit 6.
Sent to The output unit 6 includes a display unit or a printer, and displays or prints a recognized character corresponding to the character code.

【０００５】ここで、前記辞書５は、文字認識装置の作
成者によって用意されており、辞書５の内容は作成者の
判断によって作成されていた。したがって、文字を書く
人の字形の癖は考慮されることがなかった。Here, the dictionary 5 was prepared by the creator of the character recognition device, and the contents of the dictionary 5 were created by the creator's judgment. Therefore, the glyphistic habit of the person who writes the character was not considered.

【０００６】[0006]

【発明が解決しようとする課題】このような従来の文字
認識装置にあっては、辞書の内容は、作成者の判断によ
って作成され、文字を書く人の字形の癖には一切関与し
ないようになっているため、文字を書く人の癖によって
左右されやすい手書き文字を、自動的に正しく識別する
ことは困難であった。In such a conventional character recognizing device, the contents of the dictionary are created by the creator's judgment so as not to be involved in the character shape habit of the person who writes the character. Therefore, it is difficult to automatically and correctly identify handwritten characters that are easily influenced by the habit of the person who writes the characters.

【０００７】本発明は、このような従来の問題点に鑑み
てなされたものであって、文字を書く人の癖に左右され
ず、手書き文字を正しく認識することができる文字認識
装置を提供することを目的とする。The present invention has been made in view of such conventional problems, and provides a character recognition device which can recognize handwritten characters correctly without being influenced by the habit of the person who writes the characters. The purpose is to

【０００８】[0008]

【課題を解決するための手段】図１は本発明の原理説明
図である。本発明は、入力する文字を切り出して辞書１
７と照合して文字認識を行う文字認識装置において、前
記文字のうち典型的な文字を取り出して登録する辞書登
録手段１８と、該辞書登録手段１８によって前記典型的
な文字が登録される第２の辞書１９と、前記入力する文
字と前記典型的な文字とを照合し文字認識を行う第２の
認識手段２０を備えたことを特徴とする。FIG. 1 is a diagram illustrating the principle of the present invention. The present invention cuts out characters to be input and extracts the dictionary 1
In the character recognition device for character recognition by collating with 7, the dictionary registration means 18 for extracting and registering a typical character among the characters, and the dictionary registration means 18 for registering the typical character And a second recognition means 20 for performing character recognition by collating the input character with the typical character.

【０００９】また、本発明は、帳票上の特定の領域をフ
ィールドとして検出するフィールド検出手段を設け、該
フィールド検出手段により検出されたフィールド上の文
字と前記辞書１７とを照合して文字認識を行った後、前
記第２の辞書１９には前記フィールドごとに前記典型的
な文字を登録することを特徴とする。また、本発明は、
前記典型的な文字が、前記辞書１７と距離が最も小さい
文字であることを特徴とする。Further, according to the present invention, field detecting means for detecting a specific area on a form as a field is provided, and the character on the field detected by the field detecting means is collated with the dictionary 17 for character recognition. After the execution, the typical character is registered in the second dictionary 19 for each field. Also, the present invention
The typical character is a character having the smallest distance from the dictionary 17.

【００１０】[0010]

【作用】このような構成を備えた本発明の文字認識装置
によれば、入力する文字のうち典型的な文字、例えば辞
書１７との距離が最も小さい文字を取り出して、第２の
辞書１９に登録し、登録した典型的な文字と入力する文
字とを照合して文字認識を行うようにしたため、文字を
書く人の癖に左右されることなく手書き文字を正しく認
識することができる。According to the character recognition apparatus of the present invention having such a configuration, of the characters to be input, a typical character, for example, a character having the smallest distance from the dictionary 17, is taken out and is stored in the second dictionary 19. Since the character recognition is performed by registering and registering the registered typical character with the input character, the handwritten character can be correctly recognized without being influenced by the habit of the person who writes the character.

【００１１】また、帳票上の特定の領域をフィールドと
して検出し、フィールドごとに第２の辞書１９を生成す
るため、例えば一枚の帳票上で異なるフィールドを別の
人が書いているような場合にも、手書き文字を正しく認
識することができる。Further, since a specific area on the form is detected as a field and the second dictionary 19 is generated for each field, for example, when another person writes different fields on one form. Also, handwritten characters can be correctly recognized.

【００１２】[0012]

【実施例】以下、本発明の実施例を図面に基づいて説明
する。図２〜図６は本発明の一実施例を示す図である。
図２は本発明の一実施例に係る文字認識装置のブロック
図である。図２において、１１はイメージスキャナより
なる入力部であり、入力部１１は帳票の入力を行う。１
２はフィールド検出手段としてのフィールド検出部であ
り、フィールド検出部１２は、帳票内の任意のフィール
ドの検出を行う。すなわち、フィールド検出部１２は、
帳票のフォーマットを、フィールドの位置や大きさなど
の既知情報として与えておくことにより、入力された帳
票のフィールドを検出する。ここでいうフィールドと
は、住所，名前，金額欄といった、ひとまとまりの記入
単位を指す。Embodiments of the present invention will be described below with reference to the drawings. 2 to 6 are views showing an embodiment of the present invention.
FIG. 2 is a block diagram of a character recognition device according to an embodiment of the present invention. In FIG. 2, reference numeral 11 denotes an input unit including an image scanner, and the input unit 11 inputs a form. 1
Reference numeral 2 denotes a field detecting unit as a field detecting unit, and the field detecting unit 12 detects an arbitrary field in the form. That is, the field detector 12
By inputting the format of the form as known information such as the position and size of the field, the field of the input form is detected. The field here refers to a unit of entry such as address, name, and amount column.

【００１３】図３に帳票の例を示す。図３の帳票１３は
保険契約申込書の例であり、郵便番号，都道府県，市郡
区，町村字，丁番号などのひとまとまりの記入単位１４
Ａ〜１４Ｑがそれぞれフィールド１４を構成する。郵便
番号は２つのフィールド１４Ａ，１４Ｂよりなり、住所
は５つのフィールド１４Ｃ〜１４Ｇよりなり、保険契約
申込者は、氏，名の２つのフィールド１４Ｈ，１４Ｉよ
りなり、被保険者は、氏，名の２つのフィールド１４
Ｊ，１４Ｋよりなり、電話番号は、市外，市内，番号の
３つのフィールド１４Ｌ〜１４Ｎよりなり、申込年月日
は、年，月，日の３つのフィールド１４Ｏ〜１４Ｑより
なるが、郵便番号，住所，保険契約申込者，被保険者，
電話番号，申込年月日を１つのフィールドとして取り扱
っても良い。FIG. 3 shows an example of a form. The form 13 shown in FIG. 3 is an example of an insurance contract application form, and is a unit of a unit 14 such as a postal code, a prefecture, a city / ward, a town / village, and a number.
Each of A to 14Q constitutes the field 14. The postal code consists of two fields 14A and 14B, the address consists of five fields 14C to 14G, the insurance contract applicant consists of two fields 14H and 14I of name and name, and the insured person consists of name and name. Two fields of 14
J, 14K, the telephone number consists of three fields 14L to 14N for out-of-city, local, and number, and the application date consists of three fields 14O to 14Q for year, month, and day. Number, address, insurance contract applicant, insured,
The telephone number and the date of application may be treated as one field.

【００１４】さらに、また、帳票１３全体を一つのフィ
ールドとして取り扱うようにしても良い。図２におい
て、１５は文字切出し部であり、文字切出し部１５はフ
ィールド検出部１２により検出したフィールド１４内か
ら１文字ごとのイメージを切り出す。Furthermore, the entire form 13 may be treated as one field. In FIG. 2, reference numeral 15 is a character cutout unit, and the character cutout unit 15 cuts out an image for each character from the field 14 detected by the field detection unit 12.

【００１５】１６は第１の認識部であり、第１の認識部
１６はあらかじめ組み込まれている大分類用の第１の辞
書１７を用いて、対象となるフィールド１４上の文字の
認識を行う。第１の認識部１６は、認識結果と、テンプ
レートとの距離を出力結果として辞書登録手段としての
辞書登録部１８に出力する。辞書登録部１８は、第１の
辞書１７との距離が最も小さい文字を典型的な文字とし
てフィールド１４ごとに第２の辞書１９に登録する。ま
た、第２の辞書１９内にはあらかじめ第１の辞書１７と
同じものを登録しておく。Reference numeral 16 is a first recognition unit, and the first recognition unit 16 recognizes the character on the target field 14 by using a first-classified dictionary 17 for large classification. . The first recognition unit 16 outputs the recognition result and the distance between the template and the template to the dictionary registration unit 18 as a dictionary registration unit as an output result. The dictionary registration unit 18 registers the character having the smallest distance from the first dictionary 17 as a typical character in the second dictionary 19 for each field 14. In addition, the same dictionary as the first dictionary 17 is registered in the second dictionary 19 in advance.

【００１６】図４はフィールドと認識結果と距離の説明
図である。図４において、Ａ，Ａ，Ｂ，Ｃ，Ａ，Ｃは、
フィールド１４内の文字の認識結果を示す。ａ₁，
ａ₂，ｂ₁，ｃ₁，ａ_m，ｃ_nは第１の辞書１７との各
距離を示す。したがって、カテゴリＡと認識された文字
がｍ個あり、また、カテゴリＣと認識された文字がｎ個
あり、また、カテゴリＢと認識された文字が１個あるこ
とを示す。FIG. 4 is an explanatory diagram of fields, recognition results, and distances. In FIG. 4, A, A, B, C, A and C are
The recognition result of the character in the field 14 is shown. a ₁ ,
_{_{_{a 2, b 1, c 1}}} , a m, is c _n indicating the respective distances between the first dictionary 17. Therefore, it indicates that there are m characters recognized as category A, n characters recognized as category C, and 1 character recognized as category B.

【００１７】カテゴリＡと認識された文字がｍ個ある場
合、その中の距離ａ₁，ａ₂，ａ_mのうち最も距離が小
さい文字、例えば距離ａ₁に対応する文字パターンＡを
第２の辞書１９のカテゴリの部分に登録する。また、カ
テゴリＣと認識された文字がｎ個ある場合、その中の距
離ｃ₁，ｃ_nのうち最も距離が小さい文字、例えば距離
ｃ₁に対応する文字パターンＣを第２の辞書１９のカテ
ゴリＣの部分に登録する。The character is recognized category A may of m is, the distance a ₁ therein, a _2, a whose distance is small characters of _m, for example, the distance a ₁ character pattern A of the second corresponding to It is registered in the category portion of the dictionary 19. Further, when a character is recognized category C there are n, whose distance is small character among distances c _1, c _n therein, a character pattern C corresponding to the example, the distance c ₁ of the second dictionary 19 categories Register in part C.

【００１８】また、カテゴリＢと認識された文字が１個
しかない場合、その距離ｂ₁に対応する文字パターンＢ
を第２の辞書１９のカテゴリＢの部分に登録する。ま
た、図５はフィールドと認識結果と距離の他の説明図で
ある。図５において、１４はフィールドであり、このフ
ィールド１４は６つの枠より構成され、フィールド１４
内には手書き文字が書かれる。When there is only one character recognized as category B, the character pattern B corresponding to the distance b ₁
Is registered in the category B portion of the second dictionary 19. FIG. 5 is another explanatory diagram of fields, recognition results, and distances. In FIG. 5, 14 is a field, and this field 14 is composed of 6 frames.
Handwritten characters are written inside.

【００１９】０，０，１，２，０，２はフィールド１４
内の文字の認識結果である。ａ₁，ａ₂，ｂ₁，ｃ₁，
ａ₃，ｃ₂は第１の辞書１７との距離を示す。したがっ
て、カテゴリ０と認識された文字が３個あり、カテゴリ
１と認識された文字が１個あり、カテゴリ２と認識され
た文字が２個あることを示す。カテゴリ０と認識された
文字が３個ある場合、その中の距離ａ₁，ａ₂，ａ₃の
うち最も距離が小さい文字、例えば距離ａ₁に対応する
文字が第２の辞書１９のカテゴリ０の部分に登録され
る。0, 0, 1, 2, 0, 2 is field 14
It is the recognition result of the character inside. a ₁ , a ₂ , b ₁ , c ₁ ,
a ₃ and c ₂ indicate the distance from the first dictionary 17. Therefore, it indicates that there are three characters recognized as category 0, one character recognized as category 1, and two characters recognized as category 2. When there are three characters recognized as category 0, the character having the smallest distance among the distances a ₁ , a ₂ , and a ₃ , among them, for example, the character corresponding to the distance a ₁ is the category 0 of the second dictionary 19. Will be registered in the part.

【００２０】また、カテゴリ２と認識された文字が２個
ある場合、その中の距離ｃ₁，ｃ₂のうち距離が小さい
方の文字、例えば距離ｃ₁に対応する文字が第２の辞書
１９のカテゴリ２の部分に登録される。また、カテゴリ
１と認識された文字は１個しかないので、その文字（距
離ｂ₁に対応する文字）が第２の辞書１９のカテゴリ１
の部分に登録される。When there are two characters recognized as category 2, the character having the smaller distance out of the distances c ₁ and c ₂ , for example, the character corresponding to the distance c ₁ is the second dictionary 19 Is registered in the category 2 part of the. Further, since there is only one character recognized as category 1, that character (character corresponding to the distance b ₁ ) is the category 1 of the second dictionary 19.
Will be registered in the part.

【００２１】図２に戻り、２０は第２の認識手段として
の第２の認識部であり、第２の認識部２０はフィールド
１４のすべての文字に対して第２の辞書１９を用いて文
字認識を行う。すなわち、第２の認識部２０は、第２の
辞書１９にフィールド１４ごとに登録された、第１の辞
書１７との距離が最も小さい文字、すなわち典型的な文
字と再度入力したフィールド１４内の文字とを照合し、
その認識結果を出力部２１に与える。出力部２１は表示
部またはプリンタよりなり、認識結果を表示または印刷
する。Returning to FIG. 2, reference numeral 20 denotes a second recognition unit as a second recognition unit, and the second recognition unit 20 uses the second dictionary 19 for all the characters in the field 14. To recognize. That is, the second recognizing unit 20 registers the character registered in the second dictionary 19 for each field 14 and having the smallest distance from the first dictionary 17, that is, the typical character in the field 14 that has been re-input. Match the character,
The recognition result is given to the output unit 21. The output unit 21 includes a display unit or a printer, and displays or prints the recognition result.

【００２２】次に、動作を説明する。図６は動作を説明
するためのフローチャートである。図６において、ま
ず、ステップＳ１で入力部１１により帳票１３の入力を
行う。帳票１３としては、例えば図３に示すような保険
契約申込書がある。この帳票１３のフィールド１４には
郵便番号などが手書きされる。Next, the operation will be described. FIG. 6 is a flowchart for explaining the operation. In FIG. 6, first, in step S1, the input unit 11 inputs the form 13. As the form 13, for example, there is an insurance contract application form as shown in FIG. A postal code or the like is handwritten in the field 14 of the form 13.

【００２３】次に、ステップＳ２でフィールド検出部１
２により入力された帳票１３のフィールド１４の検出を
行う。フィールド１４の検出は、帳票１３上の規定の領
域である、郵便番号，都道府県などのように、ひとまと
まりの記入単位１４Ａ〜１４Ｑごとに行うが、住所，保
険契約申込書，被保険者，電話番号などのように複数の
フィールド１４をまとめてひとつのフィールドとしても
良い。Next, in step S2, the field detector 1
The field 14 of the form 13 input by 2 is detected. The field 14 is detected for each unit of the entry unit 14A to 14Q, such as a postal code or a prefecture, which is a prescribed area on the form 13, but the address, insurance contract application form, insured person, A plurality of fields 14 such as a telephone number may be combined into one field.

【００２４】さらに、帳票１３全体を一つのフィールド
として取り扱うようにしても良い。この場合には、フィ
ールド検出部１２を省略することができる。次に、ステ
ップＳ３で文字切出し部１５によりフィールド検出部１
２で検出したフィールド１４内から一文字ごとのイメー
ジを切り出す。次に、ステップＳ４で第１の認識部１６
により第１の辞書１７を用いて対象となるフィールド１
４内の手書き文字の認識を行う。そして、ステップＳ５
で第１の認識部１６により認識した結果とテンプレート
との距離を辞書登録部１８に出力する。すなわち、認識
対象文字と第１の辞書１７を照合し、テンプレートから
はみ出る部分の距離が最も小さい文字パターンを認識結
果として距離とともに出力する。Further, the entire form 13 may be treated as one field. In this case, the field detector 12 can be omitted. Next, in step S3, the field detecting unit 1 is operated by the character extracting unit 15.
An image for each character is cut out from the field 14 detected in 2. Next, in step S4, the first recognition unit 16
By using the first dictionary 17 the target field 1
The handwritten characters in 4 are recognized. Then, step S5
Then, the distance between the result recognized by the first recognition unit 16 and the template is output to the dictionary registration unit 18. That is, the recognition target character is collated with the first dictionary 17, and the character pattern having the smallest distance of the portion protruding from the template is output together with the distance as the recognition result.

【００２５】次に、ステップＳ６で１つのフィールド１
４内のすべての文字の認識が終了したか否かを判別し、
終了していない場合には、ステップＳ３に戻り、ステッ
プＳ３で次の文字を切り出し、終了した場合にはステッ
プＳ７に進む。ステップＳ７ではカテゴリ数ｉを設定
し、順次取り込む。カテゴリ数ｉとしては、例えば図５
に示すようにカテゴリが数字の場合には、１，２，３，
４，５，６，７，８，９，０がカテゴリ数ｉとなる。Next, in step S6, one field 1
Determine whether all the characters in 4 have been recognized,
If not completed, the process returns to step S3, the next character is cut out in step S3, and if completed, the process proceeds to step S7. In step S7, the number of categories i is set and sequentially fetched. The number of categories i is, for example, as shown in FIG.
When the category is a number as shown in, 1, 2, 3,
The number of categories i is 4, 5, 6, 7, 8, 9, and 0.

【００２６】次に、ステップＳ８ではフィールド内文字
数ｊを設定し、順次取り込む。フィールド内文字数ｊと
しては、例えば図５に示すように、フィールド１４のブ
ロックが６個の場合には１，２，３，４，５，６がフィ
ールド内文字数ｊとなる。次に、ステップＳ９でカテゴ
リ数Ｃｉと認識結果数Ａｊが一致するか判別する。カテ
ゴリ数Ｃｉとしては、図５の場合を例にとると、Ｃ₁は
１、Ｃ₂は２、Ｃ₃は３、Ｃ₄は４、Ｃ₅は５、Ｃ₆は
６、Ｃ₇は７、Ｃ₈は８、Ｃ₉は９、Ｃ₀は０となる。Next, in step S8, the number j of characters in the field is set and sequentially fetched. For example, as shown in FIG. 5, when the number of blocks in the field 14 is 6, the number of characters in the field j is 1, 2, 3, 4, 5, 6 as the number of characters in the field j. Next, in step S9, it is determined whether the number of categories Ci and the number of recognition results Aj match. As an example of the number of categories Ci in the case of FIG. 5, C ₁ is 1, C ₂ is 2, C ₃ is 3, C ₄ is 4, C ₅ is 5, C ₆ is 6, and C ₇ is 7. , C ₈ is 8, C ₉ is 9, and C ₀ is 0.

【００２７】また、認識結果数Ａｊとしては、図５の場
合を例にとると、Ａ₁は０、Ａ₂は０、Ａ₃は１、Ａ₄
は２、Ａ₅は０、Ａ₆は２となる。したがって、Ｃ₀＝
Ａ₁、Ｃ₀＝Ａ₂、Ｃ₀＝Ａ₅、Ｃ₁＝Ａ₃、Ｃ₂＝Ａ
₄、Ｃ₂＝Ａ₆のとき、Ｃｉ＝Ａｊとなる。次に、ステ
ップＳ１０でどのＡｊの距離がＣｉの中で最小であるか
否かを判別し、ステップＳ１１でＡｉの距離が最小のも
のをｍi とする。As the recognition result number Aj, taking the case of FIG. 5 as an example, A ₁ is 0, A ₂ is 0, A ₃ is 1, and A _{4 is}
Is 2, A ₅ is 0, and A ₆ is 2. Therefore, C ₀ =
A ₁ , C ₀ = A ₂ , C ₀ = A ₅ , C ₁ = A ₃ , C ₂ = A
_When C ₄ and C ₂ = A ₆ , Ci = Aj. Next, in step S10, it is determined which Aj distance is the smallest in Ci, and in step S11, the Ai distance is set to mi.

【００２８】Ａ₁の距離はａ₁、Ａ₂の距離はａ₂、Ａ
₅の距離はａ₃であり、ａ₁，ａ₂，ａ₃のうち、例え
ばａ₁が最小であると判別された場合には、Ａ₁をｍ₁
とする。また、Ａ₄の距離はｃ₁、Ａ₆の距離はｃ₂で
あり、例えばｃ₁＜ｃ₂のときはＡ₄をｍ₄とする。ま
た、Ｃ₁は１個しかなく、Ａ₃の距離はｂ₁であるの
で、Ａ₃をｍ₃とする。The distance A ₁ is a ₁ , the distance A ₂ is a ₂ , A
The distance of ₅ is a ₃ , and when it is determined that, for example, a ₁ is the smallest among a ₁ , a ₂ , and a ₃ , A _{1 is set} to m ₁
And The distance A ₄ is c ₁ and the distance A ₆ is c _2. For example, when c ₁ <c ₂ , A ₄ is m ₄ . Since there is only one C _{1 and} the distance of A ₃ is b ₁ , A ₃ is m ₃ .

【００２９】次に、ステップＳ１２でフィールド１４内
の典型的な文字の取出しが終了したら、ステップＳ１３
でｍi に対応するカテゴリを第２の辞書１９に登録す
る。すなわち、距離ａ₁に対応する文字０、距離ｂ₁に
対応する文字１、距離Ｃ₁に対応する文字２をそれぞれ
第２の辞書１９の各カテゴリの部分に登録する。次に、
ステップＳ１４で再度帳票１３の入力を行い、フィール
ド１４を検出し、一文字の切り出しを行う。Next, when the extraction of typical characters in the field 14 is completed in step S12, step S13
The category corresponding to mi is registered in the second dictionary 19. That is, the character 0 corresponding to the distance a ₁ , the character ₁ corresponding to the distance b ₁ , and the character 2 corresponding to the distance C ₁ are registered in the respective category portions of the second dictionary 19. next,
In step S14, the form 13 is input again, the field 14 is detected, and one character is cut out.

【００３０】次に、ステップＳ１５で再入力した文字と
第２の辞書１９とを照合する。すなわち、再入力した文
字と第２の辞書１９に登録された、第１の辞書１７との
距離が最も小さい文字、すなわち、典型的な文字とを照
合する。そして、ステップＳ１６で全てのフィールド１
４について第１の認識部１６および第２の認識部２０に
よる文字認識が終了したら、ステップＳ１７で認識結果
を出力部２１に与え、ステップＳ１８で出力部２１は認
識結果を表示または印刷する。Next, the characters re-entered in step S15 are collated with the second dictionary 19. That is, the re-entered character and the character registered in the second dictionary 19 and having the smallest distance from the first dictionary 17, that is, a typical character is collated. Then, in step S16, all fields 1
When the character recognition by the first recognition unit 16 and the second recognition unit 20 for 4 is completed, the recognition result is given to the output unit 21 in step S17, and the output unit 21 displays or prints the recognition result in step S18.

【００３１】このように、手書き文字の詳細識別のため
の第２の辞書１９を、フィールド１４内の文字から第１
の辞書１７との距離が最も小さい文字である典型的な文
字を取り出して生成するようにしたため、すなわち、フ
ィールド１４ごとに典型的な文字が登録された第２の辞
書１９を生成するようにしたため、文字を書いた人の癖
がそのまま第２の辞書１９に反映されることになり、文
字を書く人の癖に左右されることなく、手書き文字を正
しく認識することができる。As described above, the second dictionary 19 for the detailed identification of the handwritten character is firstly selected from the characters in the field 14.
Since a typical character that is the character with the smallest distance from the dictionary 17 is extracted and generated, that is, the second dictionary 19 in which the typical character is registered for each field 14 is generated. , The habit of the person who wrote the character is reflected in the second dictionary 19 as it is, and the handwritten character can be correctly recognized without being influenced by the habit of the person who writes the character.

【００３２】また、フィールド１４ごとに詳細識別用の
第２の辞書１９を生成するため、例えば一枚の帳票１３
上に異なるフィールド１４を別の人が書いているような
場合でも対応することができる。なお、本実施例におい
ては、フィールド検出部１２によって帳票１３上の特定
の領域をフィールド１４として検出するようにしたが、
帳票全体を一つのフィールドとみなすことによりフィー
ルド検出部１２を省略しても良い。帳票全体の文字数が
少ない場合には、こうした方が能率的である。Since the second dictionary 19 for detailed identification is generated for each field 14, for example, one form 13 is used.
Even if another field 14 is written by another person, it can be dealt with. In this embodiment, the field detector 12 detects a specific area on the form 13 as the field 14.
The field detection unit 12 may be omitted by regarding the entire form as one field. This is more efficient when the total number of characters in the form is small.

【００３３】[0033]

【発明の効果】以上説明してきたように、本発明によれ
ば、入力する文字のうち典型的な文字を取り出して第２
の辞書に登録し、入力する文字と登録した典型的な文字
とを照合して文字認識を行うようにしたため、文字を書
く人の癖に左右されず、手書き文字を正しく識別するこ
とができる。As described above, according to the present invention, a typical character is extracted from the characters to be input and the second character is extracted.
Since the character recognition is performed by registering in the dictionary and collating the input character with the registered typical character, the handwritten character can be correctly identified regardless of the habit of the person who writes the character.

【００３４】また、フィールドごとに詳細識別用の第２
の辞書を生成するため、例えば一枚の帳票上で異なるフ
ィールドを別の人が書いているような場合でも、手書き
文字を正しく識別することができる。The second field for detailed identification is provided for each field.
Since the dictionary is generated, handwritten characters can be correctly identified even when another person writes different fields on one form.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の原理説明図FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明の一実施例を示すブロック図FIG. 2 is a block diagram showing an embodiment of the present invention.

【図３】帳票の例を示す図FIG. 3 is a diagram showing an example of a form.

【図４】フィールド、認識結果、距離の説明図FIG. 4 is an explanatory diagram of fields, recognition results, and distances.

【図５】フィールド、認識結果、距離の他の説明図FIG. 5 is another explanatory diagram of fields, recognition results, and distances.

【図６】動作を説明するフローチャートFIG. 6 is a flowchart illustrating the operation.

【図７】従来例を示す図FIG. 7 shows a conventional example.

【符号の説明】[Explanation of symbols]

１１：入力部１２：フィールド検出部（フィールド検出手段）１３：帳票１４：フィールド１４Ａ〜１４Ｑ：記入単位１５：文字切出し部１６：第１の認識部１７：第１の辞書１８：辞書登録部（辞書登録手段）１９：第２の辞書２０：第２の認識部（第２の認識手段）２１：出力部 11: input unit 12: field detection unit (field detection means) 13: form 14: field 14A to 14Q: entry unit 15: character cutout unit 16: first recognition unit 17: first dictionary 18: dictionary registration unit ( Dictionary registration unit) 19: Second dictionary 20: Second recognition unit (second recognition unit) 21: Output unit

Claims

【特許請求の範囲】[Claims]

【請求項１】入力する文字を切り出して辞書（１７）と
照合して文字認識を行う文字認識装置において、前記文字のうち典型的な文字を取り出して登録する辞書
登録手段（１８）と、該辞書登録手段（１８）によって前記典型的な文字が登
録される第２の辞書（１９）と、前記入力する文字と前記典型的な文字とを照合し文字認
識を行う第２の認識手段（２０）を備えたことを特徴と
する文字認識装置。1. A character recognition device for recognizing characters by cutting out input characters and collating them with a dictionary (17), and dictionary registration means (18) for extracting and registering typical characters of the characters, A second dictionary (19) in which the typical character is registered by the dictionary registration means (18), and a second recognition means (20) for performing character recognition by collating the input character with the typical character. ) Is provided with a character recognition device.

【請求項２】帳票上の特定の領域をフィールドとして検
出するフィールド検出手段を設け、該フィールド検出手
段により検出されたフィールド上の文字と前記辞書（１
７）とを照合して文字認識を行った後、前記第２の辞書
（１９）には前記フィールドごとに前記典型的な文字を
登録することを特徴とする請求項１記載の文字認識装
置。2. A field detecting means for detecting a specific region on a form as a field is provided, and the character on the field detected by the field detecting means and the dictionary (1
The character recognition device according to claim 1, wherein after the character recognition is performed by collating with 7), the typical character is registered for each field in the second dictionary (19).

【請求項３】前記典型的な文字が、前記辞書（１７）と
距離が最も小さい文字であることを特徴とする請求項
１，２記載の文字認識装置。3. The character recognition device according to claim 1, wherein the typical character is a character having a smallest distance from the dictionary (17).