JP2000057265A

JP2000057265A - Character reader

Info

Publication number: JP2000057265A
Application number: JP10227910A
Authority: JP
Inventors: Yasuhiko Shimizu; 保彦清水
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1998-08-12
Filing date: 1998-08-12
Publication date: 2000-02-25

Abstract

PROBLEM TO BE SOLVED: To provide a character reader which can shorten the process time and improve the recognition rate. SOLUTION: When image data of a 1st character frame recognized by a character recognition part 15 are, for example, 'TAISHO', a select signal SEL for selecting a character set 16a for recognition of a 2nd character frame is outputted from a character set selection part 17 to a character dictionary 16A. Then the image data of the 2nd character frame are segmented by a character segmentation part 16 and supplied to the character recognition part 15, which performs character recognition according to the character set 16a for recognizing only numbers '0' and '1'. When the 1st character frame is 'SHOWA', the 2nd character frame has its characters recognized according to a character set 16c for recognizing numbers '0' to '6'. Thus, the recognition is performed by limiting character candidates according to the recognition result of a previous character frame, so the processing time is shortened and the recognition rate is improved.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、予め様式の定めら
れた帳票に記載された文字または記号等を読み取る文字
読取装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character reading apparatus for reading characters or symbols written on a form in which a format is determined in advance.

【０００２】[0002]

【従来の技術】図２は、従来の文字読取システムの構成
図である。この文字読取システムは、光学式文字読取装
置（以下、「ＯＣＲ」という）１０とホストコンピュー
タ３０とで構成されている。ＯＣＲ１０は、予め様式の
定められた帳票１のイメージ情報を光学的に読み取るイ
メージスキャナ１１、このイメージスキャナ１１で読み
取られたイメージ情報を格納するイメージメモリ１２、
及び帳票１における文字等の記入位置や文字種別が予め
登録されたフォーマット登録部１３を有している。更
に、このＯＣＲ１０は、フォーマット登録部１３の情報
に基づいてイメージメモリ１２から読み取り対象の文字
やマーク等のイメージデータを切り出す文字切出部１
４、切り出されたイメージデータから文字やマーク等を
認識してコードを出力する文字認識部１５、及び文字の
認識に必要な文字パターンの特徴が登録された文字辞書
１６を備えている。そして、文字認識部１５から出力さ
れた文字コードが、ホストコンピュータ３０へ与えられ
るようになっている。2. Description of the Related Art FIG. 2 is a block diagram of a conventional character reading system. This character reading system includes an optical character reading device (hereinafter, referred to as “OCR”) 10 and a host computer 30. The OCR 10 includes an image scanner 11 that optically reads image information of the form 1 in a predetermined format, an image memory 12 that stores image information read by the image scanner 11,
And a format registration unit 13 in which entry positions and character types of characters and the like in the form 1 are registered in advance. Further, the OCR 10 is a character extracting unit 1 for extracting image data such as characters and marks to be read from the image memory 12 based on the information of the format registering unit 13.
4. A character recognizing unit 15 for recognizing characters and marks from the cut-out image data and outputting a code, and a character dictionary 16 in which characteristics of character patterns required for character recognition are registered. The character code output from the character recognition unit 15 is provided to the host computer 30.

【０００３】図３は、読み取り対象の帳票１の一部を示
す図である。この帳票１には、生年月日を記入するため
の記入欄２が定められている。記入欄２には、明治、大
正、昭和、平成の中から年号を選択してマークを記すマ
ーク枠２ａ、年の十位と一位を数字で記入する文字枠２
ｂ，２ｃ、月の十位と一位を記入する文字枠２ｄ，２
ｅ、及び日の十位と一位を記入する文字枠２ｆ，２ｇが
設けられている。FIG. 3 is a diagram showing a part of a form 1 to be read. The form 1 has an entry field 2 for entering a date of birth. In the entry column 2, select the year from the Meiji, Taisho, Showa, and Heisei periods and mark it with a mark. 2a, and fill in the tens and first places of the year with numbers.
b, 2c, character boxes 2d, 2 for entering the tenth and first places of the month
e and character frames 2f and 2g for entering the tens and first places of the day are provided.

【０００４】次に動作を説明する。図３に示すように帳
票１の記入欄２の所定の位置にマーク及び数字を記入
し、図２のＯＣＲ１０のイメージスキャナ１１に入力す
ると、この帳票１のイメージ情報が読み取られてイメー
ジメモリ１２に格納される。帳票１のイメージ情報がイ
メージメモリ１２に格納されると、今度は文字切出部１
４が動作し、フォーマット登録部１３の情報に基づいて
イメージメモリ１２からマーク枠２ａのイメージデータ
が切り出され、文字認識部１５に与えられる。文字認識
部１５では、フォーマット登録部１３の情報に基づいて
マークの位置が検出され、年号が「大正」であると認識
され、例えば「大正」に対応するコード“２”が出力さ
れる。同様に、文字切出部１４から文字枠２ｂ〜２ｇの
イメージデータが順次切り出され、文字認識部１５に与
えられる。文字認識部１５では、フォーマット登録部１
３の情報に基づいて各文字枠２ｂ〜２ｇには数字が記載
されているとして、文字識別が行われる。そして、文字
認識部１５から、例えば年月日に対応するコード“１２
１０２４”が、順次出力される。Next, the operation will be described. As shown in FIG. 3, a mark and a number are entered in a predetermined position of the entry column 2 of the form 1 and input to the image scanner 11 of the OCR 10 in FIG. 2. The image information of the form 1 is read and stored in the image memory 12. Is stored. When the image information of the form 1 is stored in the image memory 12, the character extracting unit 1
4 operates, the image data of the mark frame 2 a is cut out from the image memory 12 based on the information of the format registration unit 13, and given to the character recognition unit 15. The character recognition unit 15 detects the position of the mark based on the information of the format registration unit 13, recognizes that the year is "Taisho", and outputs a code "2" corresponding to "Taisho", for example. Similarly, the image data of the character frames 2b to 2g is sequentially cut out from the character cutout unit 14 and is provided to the character recognition unit 15. In the character recognition unit 15, the format registration unit 1
Based on the information of No. 3, character identification is performed on the assumption that a number is described in each of the character frames 2b to 2g. Then, from the character recognition unit 15, for example, the code “12” corresponding to the date
1024 "are sequentially output.

【０００５】ＯＣＲ１０の文字認識部１５から出力され
た「大正１２年１０月２４日」というデータは、ホスト
コンピュータ３０に与えられ、このホストコンピュータ
３０によって、そのデータの妥当性のチェックが行われ
る。そして、例えば「大正５０年・・・」のように、有
り得ないデータの場合には、読み取り誤り、或いは記入
誤りとして処理されるようになっている。[0005] The data "October 24, 1912" output from the character recognition unit 15 of the OCR 10 is given to a host computer 30, and the host computer 30 checks the validity of the data. Then, in the case of impossible data, for example, "Taisho 50 ...", the data is processed as a reading error or an entry error.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、従来の
文字読取システムでは、文字認識部１５において文字等
を認識するときに、文字枠２ｂ〜２ｇ中に記載された文
字種別等の情報を、フォーマット登録部１３のみから得
て識別するだけであった。そして、読み取った年号や年
の各データ間の妥当性のチェックは、ホストコンピュー
タ３０で行うようになっていた。このため、次の（１）
〜（３）のような課題があった。（１）例えば、帳票１のマーク枠２ａから、年号が
「大正」と認識された場合でも、年の十位の数字を記入
する文字枠２ｂ中の数字を「０」〜「９」の１０個の文
字候補から認識する必要があり、認識処理に長時間を要
していた。（２）同様に、年号が「大正」の場合、文字枠２ｂ中
の数字は「０」または「１」に限定されるにもかかわら
ず、「２」〜「９」の不必要な文字候補が存在するた
め、誤認識が発生して認識率が低下することがあった。（３）妥当性の誤りが存在して、本来ならそれ以降の
文字を認識する必要がない場合でも、ＯＣＲ１０で帳票
１中のすべての読み取り対象の文字を認識してコードに
変換してホストコンピュータ３０に出力する必要がある
ので、認識処理時間の短縮が困難であった。本発明は、前記従来技術が持っていた前記（１）〜
（３）の課題を解決し、処理時間の短縮と認識率の向上
が可能な文字読取装置を提供するものである。However, in the conventional character reading system, when the character recognizing unit 15 recognizes a character or the like, information such as the character type described in the character frames 2b to 2g is registered in a format. It was only obtained and obtained from the part 13 alone. The host computer 30 checks the validity of the read year and year data. Therefore, the following (1)
There were problems such as (3). (1) For example, even if the year is recognized as “Taisho” from the mark frame 2a of the form 1, the numbers in the character frame 2b for entering the tens digit of the year are changed from “0” to “9”. It was necessary to recognize from ten character candidates, and it took a long time for the recognition process. (2) Similarly, when the year is “Taisho”, unnecessary characters of “2” to “9” are displayed although the number in the character frame 2b is limited to “0” or “1”. Since the candidate exists, erroneous recognition may occur and the recognition rate may decrease. (3) Even if there is an error in validity and it is not necessary to recognize characters after that, all characters to be read in the form 1 are recognized by the OCR 10 and converted into codes, and the host computer Therefore, it is difficult to reduce the time required for the recognition processing. The present invention provides the above (1) to
It is an object of the present invention to provide a character reading device that solves the problem (3) and can shorten the processing time and improve the recognition rate.

【０００７】[0007]

【課題を解決するための手段】前記課題を解決するため
に、本発明の内の第１の発明は、文字または記号を記入
するための第１の記入枠と、該第１の記入枠に記入され
た文字または記号に対応する文字セット中の文字を記入
するための第２の記入枠とを有する帳票を読み取り、該
帳票に記入された文字または記号を認識する文字読取装
置において、前記文字セット毎に読み取りの対象となる
文字または記号を認識するための特徴データが登録され
た文字辞書と、前記帳票のイメージ情報を格納する格納
手段と、前記格納手段に格納された前記イメージ情報か
ら前記第１及び第２の記入枠のイメージデータを切り出
す切出手段とを備えている。更に、この文字読取装置
は、選択信号に基づいて選択された前記文字辞書中の特
定の文字セットを参照し、前記切出手段で切り出された
イメージデータに対応する文字または記号を認識する文
字認識手段と、前記文字認識手段で認識された前記第１
の記入枠中の文字または記号に基づいて、前記第２の記
入枠のイメージデータ認識用の前記選択信号を出力する
選択手段とを有している。Means for Solving the Problems In order to solve the above problems, a first aspect of the present invention is to provide a first entry box for entering characters or symbols, and a first entry box for entering characters or symbols. A character reading device for reading a form having a second entry frame for entering a character in a character set corresponding to the entered character or symbol and recognizing the character or symbol entered in the form; A character dictionary in which characteristic data for recognizing a character or a symbol to be read for each set is registered, storage means for storing image information of the form, and the image information stored in the storage means. And a cutout means for cutting out image data of the first and second entry frames. Further, the character reading device refers to a specific character set in the character dictionary selected based on the selection signal, and recognizes a character or a symbol corresponding to the image data cut out by the cutout means. Means, the first character recognized by the character recognition means.
Selecting means for outputting the selection signal for recognizing the image data of the second entry frame based on the characters or symbols in the entry frame.

【０００８】第１の発明によれば、以上のように文字読
取装置を構成したので、次のような作用が行われる。ま
ず、格納手段に格納されたイメージ情報の中から、帳票
の第１の記入枠に対応するイメージデータが切出手段に
よって切り出され、文字認識手段に与えられる。文字認
識手段では、選択信号によって選択された文字辞書中の
文字セットの特徴データが参照され、第１の記入枠のイ
メージデータに対応する文字または記号が認識される。
文字認識手段の認識結果は選択手段に与えられ、この選
択手段によって、第２の記入枠のイメージデータ認識用
の選択信号が出力される。次に、格納手段に格納された
イメージ情報の中から、帳票の第２の記入枠に対応する
イメージデータが切り出され、文字認識手段に与えられ
る。文字認識手段では、第２の記入枠のイメージデータ
認識用の選択信号によって選択された文字辞書中の文字
セットの特徴データが参照され、その第２の記入枠のイ
メージデータに対応する文字または記号が認識される。According to the first aspect of the present invention, since the character reading device is configured as described above, the following operation is performed. First, from the image information stored in the storage unit, image data corresponding to the first entry frame of the form is cut out by the cutout unit and provided to the character recognition unit. The character recognition means refers to the characteristic data of the character set in the character dictionary selected by the selection signal, and recognizes a character or a symbol corresponding to the image data of the first entry frame.
The recognition result of the character recognition means is given to the selection means, and the selection means outputs a selection signal for recognizing image data of the second entry frame. Next, image data corresponding to the second entry frame of the form is cut out from the image information stored in the storage means and provided to the character recognition means. The character recognizing means refers to the characteristic data of the character set in the character dictionary selected by the selection signal for recognizing the image data of the second entry frame, and a character or symbol corresponding to the image data of the second entry frame. Is recognized.

【０００９】第２の発明は、文字または記号を記入する
ための第１の記入枠と、該第１の記入枠に記入された文
字または記号に対応する単語セット中の単語を記入する
ための第２の記入枠とを有する帳票を読み取り、該帳票
に記入された文字、記号、または単語を認識する文字読
取装置において、読み取りの対象となる文字または記号
を識別するための特徴データが登録された文字辞書と、
前記単語セット毎に読み取りの対象となる複数の単語が
登録された単語辞書と、前記帳票のイメージ情報を格納
する格納手段と、前記格納手段に格納された前記イメー
ジ情報から前記第１及び第２の記入枠のイメージデータ
を切り出す切出手段とを備えている。更に、この文字読
取装置は、前記文字辞書を参照して前記切出手段で切り
出されたイメージデータに対応する文字または記号を認
識する文字認識手段と、前記文字認識手段によって認識
された前記第１の記入枠中の文字または記号に基づい
て、前記第２の記入枠の単語識別用の選択信号を出力す
る選択手段と、前記選択信号に基づいて前記単語辞書中
の特定の単語セットを参照し、前記文字認識手段で認識
された前記第２の記入枠中の単語を該単語セットの中か
ら識別する単語識別手段とを有している。According to a second aspect of the present invention, there is provided a first entry box for entering characters or symbols, and a word in a word set corresponding to the characters or symbols entered in the first entry box. In a character reading device that reads a form having a second entry frame and recognizes a character, symbol, or word entered in the form, characteristic data for identifying a character or a symbol to be read is registered. Character dictionary,
A word dictionary in which a plurality of words to be read are registered for each word set, storage means for storing image information of the form, and the first and second words based on the image information stored in the storage means. And a cutout means for cutting out the image data of the entry frame. Further, the character reading device includes a character recognition unit that recognizes a character or a symbol corresponding to the image data extracted by the extraction unit with reference to the character dictionary, and the first character recognition unit that is recognized by the character recognition unit. Selecting means for outputting a selection signal for identifying a word in the second entry frame based on the characters or symbols in the entry frame, and referring to a specific word set in the word dictionary based on the selection signal. And word identification means for identifying a word in the second entry frame recognized by the character recognition means from the word set.

【００１０】第２の発明によれば、次のような作用が行
われる。まず、格納手段に格納されたイメージ情報の中
から、帳票の第１の記入枠に対応するイメージデータが
切出手段によって切り出されて、文字認識手段に与えら
れる。文字認識手段では、文字辞書が参照されて、その
第１の記入枠のイメージデータに対応する文字または記
号が認識される。文字認識手段の認識結果は選択手段に
与えられ、この選択手段によって第２の記入枠中の単語
選択用の選択信号が出力される。次に、格納手段に格納
されたイメージ情報の中から、帳票の第２の記入枠に対
応するイメージデータが切り出されて、文字認識手段に
与えられる。文字認識手段では、文字辞書が参照され
て、その第２の記入枠のイメージデータに対応する文字
または記号が認識され、この認識結果が単語識別手段に
与えられる。単語識別手段では、第２の記入枠の単語選
択用の選択信号によって選択された単語辞書中の単語セ
ットの中から、その第２の記入枠のイメージデータに対
応する単語が識別される。According to the second invention, the following operation is performed. First, from the image information stored in the storage means, image data corresponding to the first entry frame of the form is cut out by the cutout means and provided to the character recognition means. The character recognition means refers to the character dictionary and recognizes a character or a symbol corresponding to the image data of the first entry frame. The recognition result of the character recognition means is given to the selection means, and the selection means outputs a selection signal for selecting a word in the second entry frame. Next, image data corresponding to the second entry frame of the form is cut out from the image information stored in the storage means and provided to the character recognition means. The character recognition means refers to the character dictionary, recognizes a character or a symbol corresponding to the image data of the second entry frame, and gives the recognition result to the word identification means. The word identifying means identifies a word corresponding to the image data of the second entry frame from the word set in the word dictionary selected by the selection signal for selecting a word of the second entry frame.

【００１１】[0011]

【発明の実施の形態】第１の実施形態図１は、本発明の第１の実施形態を示す文字読取装置の
構成図であり、図２中の要素と共通の要素には共通の符
号が付されている。この文字読取装置は、例えば図３に
示すように、予め様式の定められた帳票１のイメージ情
報を光学的に読み取るイメージスキャナ１１、及びこの
イメージスキャナ１１で読み取られた帳票１のイメージ
情報を格納する格納手段（例えば、イメージメモリ）１
２を有している。また、この文字読取装置は、読み取り
対象の文字等を記入するための帳票１のマーク枠２ａ及
び文字枠２ｂ〜２ｇの位置や、そこに記入される文字種
別を予め登録したフォーマット登録部１３を有してい
る。イメージメモリ１２には、フォーマット登録部１３
の情報に基づいて帳票１のマーク枠２ａ及び文字枠２ｂ
〜２ｇのイメージデータを切り出す切出手段（例えば、
文字切出部）１４が接続されている。文字切出部１４の
出力側には、文字認識手段（例えば、文字認識部）１５
が接続されている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment FIG. 1 is a block diagram of a character reading apparatus according to a first embodiment of the present invention. Elements common to those in FIG. Is attached. As shown in FIG. 3, for example, this character reading device stores an image scanner 11 that optically reads image information of a form 1 in a predetermined format, and stores image information of the form 1 read by the image scanner 11. Storage means (for example, image memory) 1
Two. In addition, the character reading device includes a format registration unit 13 in which the positions of the mark frame 2a and the character frames 2b to 2g of the form 1 for writing characters to be read and the character types to be written therein are previously registered. Have. The image memory 12 has a format registration unit 13
Mark 2a and character frame 2b of the form 1 based on the information of
Cutting means for cutting out image data of ~ 2 g (for example,
A character cutout section 14 is connected. On the output side of the character extracting unit 14, a character recognizing unit (for example, a character recognizing unit) 15
Is connected.

【００１２】文字認識部１５は、文字辞書１６Ａ中の特
定の文字セットを参照して、文字切出部１４から与えら
れたイメージデータに対応する文字または記号を認識す
るものである。文字辞書１６Ａは、それぞれ特定のグル
ープの文字を区別して認識するための複数の文字セット
１６ａ，１６ｂ，１６ｃ，…，１６ｚを有している。例
えば文字セット１６ａは、数字「０」，「１」を区別し
て認識するための特徴データを登録したものであり、文
字セット１６ｂは、数字「０」〜「５」を区別して認識
するための特徴データを登録したものである。また、文
字セット１６ｃは、数字「０」〜「６」を区別して認識
するための特徴データを登録したものであり、文字セッ
ト１６ｚは、すべての数字「０」〜「９」を区別して認
識するための特徴データを登録したものである。The character recognition section 15 refers to a specific character set in the character dictionary 16A and recognizes a character or a symbol corresponding to the image data provided from the character extraction section 14. The character dictionary 16A has a plurality of character sets 16a, 16b, 16c,..., 16z for distinguishing and recognizing characters of a specific group. For example, the character set 16a is for registering characteristic data for distinguishing and recognizing the numbers "0" and "1", and the character set 16b is for registering and distinguishing the numbers "0" to "5". This is the registered feature data. The character set 16c registers feature data for distinguishing and recognizing the numbers “0” to “6”. The character set 16z recognizes and distinguishes all the numbers “0” to “9”. Is registered.

【００１３】文字認識部１５の出力側には、認識結果の
文字コードが出力されると共に、選択手段（例えば文字
セット選択部）１７が接続されている。文字セット選択
部１７は、フォーマット登録部１３からの情報と文字認
識部１５から出力された第１の記入枠（例えば、マーク
枠）２ａ中の記号の認識結果とに基づいて、第２の記入
枠（例えば、文字枠）２ｂのイメージデータ認識用の選
択信号ＳＥＬを出力するものである。文字セット選択部
１７から出力された選択信号ＳＥＬは、文字辞書１６Ａ
に与えられ、この文字辞書１６Ａから文字認識部１５
に、選択された文字セット１６ａ等が与えられるように
なっている。On the output side of the character recognizing unit 15, a character code as a result of recognition is output, and a selecting means (for example, a character set selecting unit) 17 is connected. The character set selection unit 17 performs the second entry based on the information from the format registration unit 13 and the recognition result of the symbol in the first entry box (for example, mark box) 2 a output from the character recognition unit 15. It outputs a selection signal SEL for recognizing image data of a frame (for example, a character frame) 2b. The selection signal SEL output from the character set selection unit 17 is transmitted to the character dictionary 16A.
To the character recognition unit 15 from the character dictionary 16A.
Is provided with the selected character set 16a and the like.

【００１４】図４は、図１の動作を示すフローチャート
である。次に、この図４を参照しつつ、図１の文字読取
装置の動作を、図３の帳票１を読み取る場合について説
明する。図３に示すように帳票１の記入欄２の所定の位
置にマーク及び文字を記入し、図１のイメージスキャナ
１１に入力する。図４のステップＳ１において、帳票１
のイメージ情報が読み取られてイメージメモリ１２に格
納される。ステップＳ２において、文字切出部１４が動
作し、フォーマット登録部１３の情報に基づいて、イメ
ージメモリ１２からマーク枠２ａのイメージデータが切
り出され、文字認識部１５に与えられる。また、フォー
マット登録部１３から文字セット選択部１７に対して、
マーク枠２ａには、明治、大正、昭和、平成に対応した
４箇所のマーク位置が有る旨の情報が与えられる。FIG. 4 is a flowchart showing the operation of FIG. Next, the operation of the character reading apparatus of FIG. 1 will be described with reference to FIG. 4 for the case of reading the form 1 of FIG. As shown in FIG. 3, a mark and a character are entered at a predetermined position in the entry column 2 of the form 1, and are inputted to the image scanner 11 of FIG. In step S1 of FIG.
Is read and stored in the image memory 12. In step S <b> 2, the character extracting unit 14 operates, and the image data of the mark frame 2 a is extracted from the image memory 12 based on the information of the format registration unit 13 and provided to the character recognition unit 15. Also, the format registration unit 13 sends a character set selection unit 17 a
Information indicating that there are four mark positions corresponding to Meiji, Taisho, Showa and Heisei is given to the mark frame 2a.

【００１５】ステップＳ３において、切り出されたイメ
ージデータがマークか文字かが判断される。この場合、
マークであるので、ステップＳ４へ進む。ステップＳ４
において、マーク検出用の選択信号ＳＥＬが文字辞書１
６Ａに与えられ、この文字辞書１６Ａから文字認識部１
５に対してマーク識別用の特徴データが出力される。文
字認識部１５では、文字辞書１６Ａから与えられた特徴
データに基づいてマークの位置が検出される。マーク位
置の検出後、ステップＳ５へ進む。ステップＳ５におい
て、マーク位置に基づいてマークが認識される。この場
合、マークに対応する年号が「大正」であると認識さ
れ、例えば「大正」に対応するコード“２”が出力され
る。更に、文字認識部１５から出力されたマーク枠２ａ
の認識結果は、文字セット選択部１７に与えられ、ステ
ップＳ６へ進む。In step S3, it is determined whether the extracted image data is a mark or a character. in this case,
Since it is a mark, the process proceeds to step S4. Step S4
, The mark detection selection signal SEL is
6A, the character recognition unit 1
5, the characteristic data for mark identification is output. The character recognition unit 15 detects the position of the mark based on the feature data provided from the character dictionary 16A. After the detection of the mark position, the process proceeds to step S5. In step S5, a mark is recognized based on the mark position. In this case, the year corresponding to the mark is recognized as “Taisho”, and for example, a code “2” corresponding to “Taisho” is output. Furthermore, the mark frame 2a output from the character recognition unit 15
Is given to the character set selection unit 17, and the process proceeds to step S6.

【００１６】ステップＳ６において、マーク枠２ａの認
識結果が、例えば図示しないホストコンピュータへ出力
され、このマーク枠２ａの認識処理は終了する。ステッ
プＳ６の後、ステップＳ７へ進み、すべての文字認識処
理が完了したか否かが判断される。文字認識処理が残っ
ていれば、ステップＳ２へ戻り、すべての処理が終了し
ていれば、この帳票１の文字読み取り処理は終了する。
この場合、文字枠２ｂ〜２ｇの処理が残っているので、
ステップＳ２へ戻り、文字切出部１４により、フォーマ
ット登録部１３の情報に基づいて、イメージメモリ１２
から文字枠２ｂのイメージデータが切り出され、文字認
識部１５Ａに与えられる。ステップＳ２の後、ステップ
Ｓ３へ進む。ステップＳ３において、切り出されたイメ
ージデータがマークか文字かが判断される。この場合、
文字であるので、ステップＳ８へ進む。In step S6, the recognition result of the mark frame 2a is output to, for example, a host computer (not shown), and the recognition process of the mark frame 2a ends. After step S6, the process proceeds to step S7, where it is determined whether all character recognition processes have been completed. If the character recognition process remains, the process returns to step S2, and if all the processes have been completed, the character reading process of the form 1 ends.
In this case, since the processing of the character frames 2b to 2g remains,
Returning to step S2, the character extracting unit 14 stores the image memory 12 based on the information in the format registering unit 13.
, The image data of the character frame 2b is cut out and provided to the character recognition unit 15A. After step S2, the process proceeds to step S3. In step S3, it is determined whether the extracted image data is a mark or a character. in this case,
Since it is a character, the process proceeds to step S8.

【００１７】ステップＳ８において、文字セット選択部
１７から文字辞書１６Ａに対して、文字枠２ｂのイメー
ジデータ認識用の選択信号ＳＥＬが出力される。この場
合、マーク枠２ａの認識結果は「大正」であるので、文
字セット１６ａを選択するための選択信号ＳＥＬが文字
辞書１６Ａに与えられる。これにより、文字辞書１６Ａ
から文字認識部１５に対して、文字枠２ｂ認識用に、数
字「０」，「１」を区別して認識するための特徴データ
で構成された文字セット１６ａが与えられる。ステップ
Ｓ８の後、ステップＳ９へ進む。ステップＳ９におい
て、文字認識部１５では、文字セット１６ａに基づいて
文字枠２ｂの文字認識が行われる。そして、文字枠２ｂ
のイメージデータは、例えば数字「１」と認識され、こ
の数字「１」に対応するコード“１”が出力される。文
字認識部１５から出力された文字枠２ｂの認識結果は、
文字セット選択部１７に与えられる。ステップＳ９の
後、ステップＳ６へ進む。ステップＳ６において、マー
ク枠２ａの認識結果がホストコンピュータへ出力され、
このマーク枠２ａの認識処理は終了する。In step S8, a selection signal SEL for recognizing image data of the character frame 2b is output from the character set selection unit 17 to the character dictionary 16A. In this case, since the recognition result of the mark frame 2a is "Taisho", the selection signal SEL for selecting the character set 16a is given to the character dictionary 16A. Thereby, the character dictionary 16A
The character set 16a composed of feature data for distinguishing and recognizing the numbers "0" and "1" is provided to the character recognition unit 15 for character frame 2b recognition. After step S8, the process proceeds to step S9. In step S9, the character recognition unit 15 performs character recognition of the character frame 2b based on the character set 16a. And the character frame 2b
Is recognized as, for example, a numeral "1", and a code "1" corresponding to the numeral "1" is output. The recognition result of the character frame 2b output from the character recognition unit 15 is
It is provided to the character set selection unit 17. After step S9, the process proceeds to step S6. In step S6, the recognition result of the mark frame 2a is output to the host computer,
The recognition process of the mark frame 2a ends.

【００１８】文字枠２ｂの認識処理が終了すると、再び
ステップＳ２へ戻り、文字切出部１４により、フォーマ
ット登録部１３の情報に基づいて、イメージメモリ１２
から文字枠２ｃのイメージデータが切り出され、文字認
識部１５に与えられる。ステップＳ８では、文字セット
選択部１７において、文字枠２ｂの認識結果といて数字
「１」が与えられているので、文字枠２ｃのイメージデ
ータ認識用の選択信号ＳＥＬとして、文字セット１６ｂ
を選択するための信号が文字辞書１６Ａに与えられる。
これにより、文字辞書１６Ａから文字認識部１５に対し
て、文字枠２ｃ認識用に、数字「０」〜「５」を区別し
て認識するための特徴データで構成される文字セット１
６ｂが与えられる。これにより、ステップＳ９では、文
字認識部１５では、文字セット１６ｂに基づいて文字枠
２ｂの文字認識が行われる。そして、文字枠２ｃは、例
えば数字「２」と認識され、この数字「２」に対応する
コード“２”が出力される。文字認識部１５Ａから出力
された文字枠２ｃの認識結果は、文字セット選択部１７
に与えられる。When the recognition process of the character frame 2b is completed, the process returns to step S2, and the character extracting unit 14 stores the image memory 12 based on the information of the format registering unit 13.
, The image data of the character frame 2c is cut out and given to the character recognition unit 15. In step S8, since the character set selection unit 17 has given the numeral "1" as the recognition result of the character frame 2b, the character set 16b is used as the selection signal SEL for image data recognition of the character frame 2c.
Is supplied to the character dictionary 16A.
As a result, the character set 16 composed of the feature data for distinguishing and recognizing the numbers “0” to “5” for the character frame 2 c from the character dictionary 16 A to the character recognition unit 15.
6b is provided. Thus, in step S9, the character recognition unit 15 performs character recognition of the character frame 2b based on the character set 16b. Then, the character frame 2c is recognized as, for example, a numeral "2", and a code "2" corresponding to the numeral "2" is output. The recognition result of the character frame 2c output from the character recognition unit 15A is output to the character set selection unit 17
Given to.

【００１９】以下同様に、文字枠２ｄ〜２ｇのイメージ
データは、それぞれ前の文字枠２ｃ〜２ｆの認識結果に
応じて与えられる選択信号ＳＥＬで選択された文字セッ
ト１６ａ等に従って順次文字認識される。ここで、もし
もマーク枠２ａの認識結果が「昭和」であれば、ステッ
プＳ５において、文字認識部１５から「昭和」に対応す
るコード“３”が出力され、文字セット選択部１７に与
えられる。そして、ステップＳ８において、文字セット
選択部１７から文字枠２ｂのイメージデータ認識用の選
択信号ＳＥＬとして、文字セット１６ｃを選択するため
の信号が文字辞書１６Ａに与えられる。これにより、文
字辞書１６Ａから文字認識部１５に対して、文字枠２ｂ
認識用に、数字「０」〜「６」を区別して認識するため
の特徴データで構成される文字セット１６ｃが与えられ
る。Similarly, the image data of the character frames 2d to 2g are sequentially recognized in accordance with the character set 16a and the like selected by the selection signal SEL given according to the recognition result of the previous character frames 2c to 2f. . Here, if the recognition result of the mark frame 2a is “Showa”, the code “3” corresponding to “Showa” is output from the character recognition unit 15 in step S5, and given to the character set selection unit 17. Then, in step S8, a signal for selecting the character set 16c is given from the character set selection unit 17 to the character dictionary 16A as a selection signal SEL for recognizing image data of the character frame 2b. As a result, the character dictionary 16A sends the character
For recognition, a character set 16c composed of feature data for distinguishing and recognizing numbers "0" to "6" is provided.

【００２０】以上のように、この第１の実施形態では、
各文字枠２ｂ〜２ｇのイメージデータを認識するために
文字辞書１６Ａを参照するときに、マーク枠２ａや文字
枠２ｂ〜２ｆの認識結果に従って選択された文字セット
１６ａ〜１６ｚを用いるようにしている。これにより、
認識対象の文字候補を限定することができるので、認識
処理の時間を短縮することができるという利点が有る。
更に、誤って妥当性のない文字が選択されることないの
で、認識率を向上させることができるという利点が有
る。As described above, in the first embodiment,
When referring to the character dictionary 16A for recognizing the image data of each of the character frames 2b to 2g, the character sets 16a to 16z selected according to the recognition results of the mark frames 2a and the character frames 2b to 2f are used. . This allows
Since the character candidates to be recognized can be limited, there is an advantage that the time for the recognition process can be reduced.
Furthermore, since an invalid character is not erroneously selected, there is an advantage that the recognition rate can be improved.

【００２１】第２の実施形態図５は、本発明の第２の実施形態を示す文字読取装置の
構成図であり、図１中の要素と共通の要素には共通の符
号が付されている。この図５の文字読取装置は、図１の
文字読取装置と同様のイメージスキャナ１１、イメージ
メモリ１２、フォーマット登録部１３、文字切出部１
４、及び文字認識部１５を有している。一方、この文字
読取装置は、従来と同様の文字辞書１６を有している。
即ち、文字辞書１６は、文字の認識に必要な文字パター
ンの特徴が登録されたものであり、フォーマット登録部
１３から与えられた情報に従って必要な文字種別の特徴
データを文字認識部１５へ出力するものである。文字認
識部１５の出力側には、選択手段（例えば、単語セット
選択部）１８が接続されている。単語セット選択部１８
は、文字認識部１５から出力された第１の記入枠の認識
結果に基づいて、第２の記入枠の単語認識用の選択信号
ＳＥＬを出力するものである。単語セット選択部１７か
ら出力された選択信号ＳＥＬは、単語辞書１９に与えら
れるようになっている。 Second Embodiment FIG. 5 is a block diagram of a character reading apparatus according to a second embodiment of the present invention. Elements common to those in FIG. 1 are denoted by the same reference numerals. . The character reading device shown in FIG. 5 includes an image scanner 11, an image memory 12, a format registration unit 13, and a character extraction unit 1 similar to the character reading device shown in FIG.
4 and a character recognition unit 15. On the other hand, this character reading device has a character dictionary 16 similar to the conventional one.
That is, the character dictionary 16 has registered therein the characteristics of the character patterns necessary for character recognition, and outputs the characteristic data of the necessary character type to the character recognition unit 15 according to the information provided from the format registration unit 13. Things. A selection means (for example, a word set selection unit) 18 is connected to the output side of the character recognition unit 15. Word set selector 18
Outputs a selection signal SEL for word recognition of the second entry frame based on the recognition result of the first entry frame output from the character recognition unit 15. The selection signal SEL output from the word set selection unit 17 is provided to the word dictionary 19.

【００２２】単語辞書１９は、それぞれ特定のグループ
の単語を区別して認識するための複数の単語セット１９
ａ，１９ｂ，…を有している。例えば単語セット１９ａ
は、複数の銀行を区別して認識するために、「ＡＢ
Ｃ」，「ＢＣＤ」，「ＣＤＥ」，…の銀行名を登録した
ものであり、単語セット１９ｂは、複数の信用金庫を区
別して認識するために、「ＸＹＺ」，「ＹＺＡ」，「Ｚ
ＡＸ」，…の信用金庫名を登録したものである。そし
て、単語辞書は、単語セット選択部１７から与えられた
選択信号ＳＥＬに従って、指定された単語セット１９ａ
等を、単語識別手段（例えば、単語識別部）２０へ出力
するようになっている。単語識別部２０は、文字識別部
１５で認識された文字列で構成される単語を、単語辞書
１９から与えられた単語セット１９ａ等の中から識別し
て出力するものである。The word dictionary 19 includes a plurality of word sets 19 for distinguishing and recognizing words of a specific group.
a, 19b,... For example, the word set 19a
, "AB" to distinguish and recognize multiple banks
., "BCD", "CDE",..., And the word set 19b contains "XYZ", "YZA", "Z
AX ",... Are registered. Then, according to the selection signal SEL given from the word set selection unit 17, the word dictionary stores the designated word set 19a.
Are output to the word identification means (for example, a word identification unit) 20. The word identification unit 20 identifies a word composed of a character string recognized by the character identification unit 15 from a word set 19 a or the like provided from the word dictionary 19 and outputs the word.

【００２３】図６は、読み取り対象の帳票１の他の一部
を示す図である。この帳票１には、金融機関等の名称を
記入するための文字枠３ａ〜３ｆと、その金融機関の種
類（例えば、１：銀行、２：信用金庫）のコードを記入
するための文字枠４が設けられている。図７は、図５の
文字読取装置における単語セット選択部１８、単語辞書
１９、及び単語認識部２０の動作の説明図である。次
に、この図７を参照しつつ、図５の文字読取装置の動作
を、図６の帳票１を読み取る場合について説明する。FIG. 6 is a diagram showing another part of the form 1 to be read. This form 1 has character frames 3a to 3f for entering names of financial institutions and the like, and character frames 4 for entering codes of the types of the financial institutions (for example, 1: bank, 2: credit union). Is provided. FIG. 7 is an explanatory diagram of the operations of the word set selection unit 18, the word dictionary 19, and the word recognition unit 20 in the character reading device of FIG. Next, with reference to FIG. 7, the operation of the character reading apparatus of FIG. 5 will be described for the case of reading the form 1 of FIG.

【００２４】帳票１の文字枠３ａ〜３ｆのイメージデー
タは、文字認識部１５において、文字辞書１６が参照さ
れて認識される。そして、例えば、文字枠３ａの認識結
果として、確からしさの順に、第１候補「Ａ」、第２候
補「Ｐ」、第３候補「Ｄ」が出力される。また、文字枠
３ｂの認識結果として、第１候補「Ｂ」、第２候補
「Ｅ」が出力される。更に、文字枠３ｃの認識結果とし
て、第１候補「Ｏ」、第２候補「Ｃ」、第３候補「Ｄ」
が出力される。文字枠３ｄ〜３ｆは、無記入と認識され
る。一方、文字枠４の認識結果は「１」、即ち「銀行」
と認識される。これらの認識結果は、単語セット選択部
１８と単語認識部２０に与えられる。単語セット選択部
１８では、与えられた文字枠４の認識結果に基づいて、
銀行名が登録された単語セット１９ａを選択するための
選択信号ＳＥＬが、単語辞書１９に出力される。これに
より、単語辞書１９から単語認識部２０へ、複数の銀行
名が登録された単語セット１９ａが与えられる。The image data of the character frames 3a to 3f of the form 1 is recognized in the character recognition unit 15 by referring to the character dictionary 16. Then, for example, as a recognition result of the character frame 3a, the first candidate “A”, the second candidate “P”, and the third candidate “D” are output in order of likelihood. In addition, a first candidate “B” and a second candidate “E” are output as a recognition result of the character frame 3b. Furthermore, as a recognition result of the character frame 3c, the first candidate “O”, the second candidate “C”, and the third candidate “D”
Is output. Character frames 3d to 3f are recognized as blank. On the other hand, the recognition result of the character frame 4 is “1”, that is, “bank”.
Is recognized. These recognition results are provided to the word set selection unit 18 and the word recognition unit 20. In the word set selection unit 18, based on the given recognition result of the character frame 4,
A selection signal SEL for selecting the word set 19 a in which the bank name is registered is output to the word dictionary 19. As a result, a word set 19a in which a plurality of bank names are registered is provided from the word dictionary 19 to the word recognition unit 20.

【００２５】単語認識部２０において、文字が認識され
た文字枠３ａ〜３ｃのそれぞれの認識結果が確からしさ
の高い順に組み合わされて複数の単語が生成され、生成
された複数の単語の中から、確からしさの高い順に、単
語セット１９ａに登録されているか否かが判定される。
そして、最初に登録されていることが確認された単語
が、正しく読み取られたものと識別されて単語認識部２
０から出力される。以上のように、この第２の実施形態
の文字読取装置は、予め読み取り候補の複数の単語を単
語セットとして登録した単語辞書１９と、帳票１の文字
枠４の認識結果に基づいて、文字枠３ａ〜３ｆ中の単語
が含まれる単語セット１９ａ等を選択する単語セット選
択部１８と、認識された文字の確からしさの程度に基づ
いて、選択された単語セット１９ａ等に登録されている
か否かを判定する単語識別部２０とを有している。この
ため、有り得ない単語を認識結果として出力するという
ような読み取り誤りを防止することができるという利点
がある。In the word recognition unit 20, a plurality of words are generated by combining the recognition results of the character frames 3a to 3c in which the characters have been recognized in descending order of certainty, and a plurality of words are generated. It is determined whether or not the words are registered in the word set 19a in descending order of probability.
Then, the word that has been confirmed to be registered first is identified as being correctly read, and the word is recognized by the word recognition unit 2.
Output from 0. As described above, the character reading apparatus according to the second embodiment is based on the word dictionary 19 in which a plurality of reading candidate words are registered in advance as a word set, and the character frame 4 based on the recognition result of the character frame 4 of the form 1. A word set selection unit 18 for selecting a word set 19a or the like including the words in 3a to 3f, and whether or not the word is registered in the selected word set 19a or the like based on the likelihood of the recognized character And a word identification unit 20 for determining For this reason, there is an advantage that a reading error such as outputting an impossible word as a recognition result can be prevented.

【００２６】なお、本発明は、上記実施形態に限定され
ず、種々の変形が可能である。この変形例としては、例
えば、次の（ａ）〜（ｅ）のようなものがある。（ａ）図３における帳票１のマーク枠２ａに代えて、
例えば年号をコードで記入する文字枠を設けても良い。
また、図６における帳票１の文字枠４に代えて、例えば
種類をマークによって指定するマーク枠を設けても良
い。（ｂ）図１の文字読取装置では、図３の帳票１の各文
字枠２ｂ，２ｃ，…の読み取りに対して、それぞれ文字
セット１６ａ等を使用しているが、例えば、年に対応す
る２つの文字枠２ｂ，２ｃを共通にして、２桁の数字
「０１」〜「１５」、「０１」〜「６４」等の文字セッ
トを使用するようにしても良い。The present invention is not limited to the above embodiment, and various modifications are possible. For example, there are the following modifications (a) to (e). (A) Instead of the mark frame 2a of the form 1 in FIG.
For example, a character frame for entering the year by code may be provided.
Further, instead of the character frame 4 of the form 1 in FIG. 6, for example, a mark frame for specifying the type by a mark may be provided. (B) In the character reading apparatus of FIG. 1, the character sets 16a and the like are used for reading the character frames 2b, 2c,... Of the form 1 in FIG. One character frame 2b, 2c may be used in common, and a character set such as two-digit numbers “01” to “15”, “01” to “64” may be used.

【００２７】（ｃ）図１の文字読取装置における文字
セット選択部１７は、選択信号ＳＥＬを１つのマーク枠
２ａや文字枠２ｂ等の認識結果に基づいて出力するよう
になっているが、２つ以上のマーク枠２ａや文字枠２ｂ
等の認識結果に基づいて、この選択信号ＳＥＬを出力す
るようにしても良い。これにより、より詳細に文字セッ
トを選択することが可能になり、更に認識処理の時間の
短縮と、認識率の向上が可能になる。（ｄ）図１の文字認識部１５の出力側に、図５の単語
セット選択部１８、単語辞書１９、及び単語認識部２０
を付加しても良い。これにより、第１及び第２の実施形
態を合わせた利点が得られる。（ｅ）図１及び図５の文字読取装置は、文字切出部１
４、文字認識部文字セット選択部１７、単語セット選択
部１８、及び単語識別部２０等を個別の処理部で構成し
ているが、コンピュータを用いてプログラム制御によっ
てこれらの処理を行うようにしても良い。(C) The character set selection section 17 in the character reading apparatus of FIG. 1 outputs the selection signal SEL based on the recognition result of one mark frame 2a, one character frame 2b, etc. One or more mark frames 2a and character frames 2b
The selection signal SEL may be output based on a recognition result such as the above. As a result, it is possible to select a character set in more detail, and it is possible to further reduce the time required for the recognition process and improve the recognition rate. (D) On the output side of the character recognition unit 15 in FIG. 1, the word set selection unit 18, the word dictionary 19, and the word recognition unit 20 in FIG.
May be added. Thereby, the combined advantages of the first and second embodiments can be obtained. (E) The character reading device shown in FIGS.
4. Character Recognition Unit The character set selection unit 17, the word set selection unit 18, the word identification unit 20, and the like are configured as individual processing units, but these processes are performed under program control using a computer. Is also good.

【００２８】[0028]

【発明の効果】以上詳細に説明したように、第１の発明
によれば、第１の記入枠の認識結果に基づいて、第２の
記入枠の認識に用いる文字セットを指定するための選択
手段と、この選択手段からの選択信号によって特定の文
字セットを出力する文字辞書を有している。これによ
り、認識対象の文字数を減少することが可能になり、認
識処理時間の短縮と、認識率の向上が可能になる。第２
の発明によれば、第１の記入枠の認識結果に基づいて、
第２の記入枠に記入された単語の識別に用いる単語セッ
トを指定するための選択手段と、この選択手段からの選
択信号によって特定の単語セットを出力する単語辞書を
有している。これにより、無意味な文字の組み合わせを
認識結果として出力することがなくなり、認識率の向上
が可能になる。As described above in detail, according to the first aspect, the selection for designating the character set to be used for recognizing the second entry box based on the recognition result of the first entry box. Means, and a character dictionary for outputting a specific character set in response to a selection signal from the selection means. This makes it possible to reduce the number of characters to be recognized, thereby shortening the recognition processing time and improving the recognition rate. Second
According to the invention of the above, based on the recognition result of the first entry frame,
It has a selecting means for designating a word set used for identifying a word entered in the second entry box, and a word dictionary for outputting a specific word set in response to a selection signal from the selecting means. As a result, a meaningless combination of characters is not output as a recognition result, and the recognition rate can be improved.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の第１の実施形態を示す文字読取装置の
構成図である。FIG. 1 is a configuration diagram of a character reading device according to a first embodiment of the present invention.

【図２】従来の文字読取システムの構成図である。FIG. 2 is a configuration diagram of a conventional character reading system.

【図３】読み取り対象の帳票１の一部を示す図である。FIG. 3 is a diagram showing a part of a form 1 to be read;

【図４】図１の動作を示すフローチャートである。FIG. 4 is a flowchart showing the operation of FIG.

【図５】本発明の第２の実施形態を示す文字読取装置の
構成図である。FIG. 5 is a configuration diagram of a character reading device according to a second embodiment of the present invention.

【図６】読み取り対象の帳票１の他の一部を示す図であ
る。FIG. 6 is a diagram showing another part of the form 1 to be read.

【図７】図５の文字読取装置における単語セット選択部
１８、単語辞書１９、及び単語認識部２０の動作の説明
図である。FIG. 7 is an explanatory diagram of operations of a word set selecting unit, a word dictionary, and a word recognizing unit in the character reading apparatus of FIG.

【符号の説明】[Explanation of symbols]

１帳票２ａマーク枠２ｂ〜２ｇ，３ａ〜３ｆ，４文字枠１１イメージスキャナ１２イメージメモリ１３フォーマット登録部１４文字切出部１５文字認識部１６，１６Ａ文字辞書１６ａ〜１６ｚ文字セット１７文字セット選択部１８単語セット選択部１９単語辞書１９ａ，１９ｂ単語セット２０単語識別部 1 Form 2a Mark frame 2b-2g, 3a-3f, 4 character frame 11 Image scanner 12 Image memory 13 Format registration unit 14 Character extraction unit 15 Character recognition unit 16, 16A Character dictionary 16a-16z Character set 17 Character set selection unit 18 Word set selection unit 19 Word dictionary 19a, 19b Word set 20 Word identification unit

Claims

【特許請求の範囲】[Claims]

【請求項１】文字または記号を記入するための第１の
記入枠と、該第１の記入枠に記入された文字または記号
に対応する文字セット中の文字を記入するための第２の
記入枠とを有する帳票を読み取り、該帳票に記入された
文字または記号を認識する文字読取装置において、前記文字セット毎に読み取りの対象となる文字または記
号を認識するための特徴データが登録された文字辞書
と、前記帳票のイメージ情報を格納する格納手段と、前記格納手段に格納された前記イメージ情報から前記第
１及び第２の記入枠のイメージデータを切り出す切出手
段と、選択信号に基づいて選択された前記文字辞書中の特定の
文字セットを参照し、前記切出手段で切り出されたイメ
ージデータに対応する文字または記号を認識する文字認
識手段と、前記文字認識手段で認識された前記第１の記入枠中の文
字または記号に基づいて、前記第２の記入枠のイメージ
データ認識用の前記選択信号を出力する選択手段とを、備えたことを特徴とする文字読取装置。1. A first entry box for entering a character or symbol, and a second entry for entering a character in a character set corresponding to the character or symbol entered in the first entry box. In a character reading device that reads a form having a frame and recognizes a character or a symbol written on the form, a character in which characteristic data for recognizing a character or a symbol to be read is registered for each character set. Dictionary, storage means for storing the image information of the form, cutout means for cutting out the image data of the first and second entry frames from the image information stored in the storage means, based on a selection signal A character recognition unit that refers to a specific character set in the selected character dictionary and recognizes a character or a symbol corresponding to the image data extracted by the extraction unit; Selecting means for outputting the selection signal for recognizing image data of the second entry frame based on characters or symbols in the first entry frame recognized by the recognition means. Character reading device.

【請求項２】文字または記号を記入するための第１の
記入枠と、該第１の記入枠に記入された文字または記号
に対応する単語セット中の単語を記入するための第２の
記入枠とを有する帳票を読み取り、該帳票に記入された
文字、記号、または単語を認識する文字読取装置におい
て、読み取りの対象となる文字または記号を識別するための
特徴データが登録された文字辞書と、前記単語セット毎に読み取りの対象となる複数の単語が
登録された単語辞書と、前記帳票のイメージ情報を格納する格納手段と、前記格納手段に格納された前記イメージ情報から前記第
１及び第２の記入枠のイメージデータを切り出す切出手
段と、前記文字辞書を参照して前記切出手段で切り出されたイ
メージデータに対応する文字または記号を認識する文字
認識手段と、前記文字認識手段によって認識された前記第１の記入枠
中の文字または記号に基づいて、前記第２の記入枠の単
語識別用の選択信号を出力する選択手段と、前記選択信号に基づいて前記単語辞書中の特定の単語セ
ットを参照し、前記文字認識手段で認識された前記第２
の記入枠中の単語を該単語セットの中から識別する単語
識別手段とを、備えたことを特徴とする文字読取装置。2. A first entry box for entering a character or symbol, and a second entry for entering a word in a word set corresponding to the character or symbol entered in the first entry box. A character reading device that reads a form having a frame and recognizes a character, symbol, or word written on the form includes a character dictionary in which feature data for identifying a character or a symbol to be read is registered. A word dictionary in which a plurality of words to be read are registered for each word set; storage means for storing the image information of the form; and the first and second images from the image information stored in the storage means. And a character recognition unit that recognizes a character or a symbol corresponding to the image data extracted by the extraction unit with reference to the character dictionary. A step, a selection unit that outputs a selection signal for word identification of the second entry box based on a character or a symbol in the first entry box recognized by the character recognition unit; And referring to a specific word set in the word dictionary on the basis of the second
And a word identifying means for identifying a word in the entry frame from the word set.