JP2001297302A

JP2001297302A - Character reader

Info

Publication number: JP2001297302A
Application number: JP2000110112A
Authority: JP
Inventors: Hiroichi Iwashita; 博一岩下; Kazuhiro Ishikawa; 和弘石川
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2000-04-12
Filing date: 2000-04-12
Publication date: 2001-10-26
Anticipated expiration: 2020-04-12
Also published as: JP4544691B2

Abstract

PROBLEM TO BE SOLVED: To determine whether a character may have been erroneously recognized. SOLUTION: A recognizing part 5 recognizes characters from image data acquired by an image input part 1. For the character coordinates of the rectangle surrounding each recognized character, a size deciding part 6 evaluates conditions using size decision data stored in a decision data storage memory 24. The size decision data contain conditional expressions and, for each conditional expression, a processing method such as non-read processing, no processing, or deletion. The size deciding part 6 evaluates these conditional expressions. When a conditional expression evaluates to true, the character is judged to be possibly misread, and the post-processing associated with that conditional expression is performed.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character reading device that determines whether a read character or character string may have been erroneously recognized and, when it has, performs knowledge processing or post-processing on the read character string.

[0002]

2. Description of the Related Art Character reading devices that read characters recorded on a form image are conventionally known.

[0003] In a conventional character reading device, a designated area of the image is scanned, line segmentation extracts the coordinates of each line, and character segmentation detects the coordinates of each character within the line coordinates. Character recognition is then performed on the image at each character's coordinates.

[0004] In a character reading device, character recognition is generally performed after line segmentation and character segmentation. To segment lines, the image is scanned horizontally, a histogram of the number of black pixels is created, and the lines are separated at the points where the histogram value becomes 0.

[0005] To segment characters, the image is scanned vertically, a histogram of the number of black pixels is created, and the image is split into individual characters at the points where the histogram value becomes 0. When a character is erroneously recognized, processing called knowledge processing or post-processing, which uses a word collation dictionary or the like, replaces misread or unread characters and thereby improves the recognition rate.
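The histogram-based line and character segmentation described above can be sketched as follows. This is only a minimal illustration under stated assumptions, not the device's implementation: the binary-image encoding (1 for a black pixel, 0 for white) and the names `zero_separated_runs`, `cut_lines`, and `cut_characters` are hypothetical and do not come from the source.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed encoding: 1 = black pixel, 0 = white pixel


def zero_separated_runs(histogram: List[int]) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index pairs of runs where histogram > 0."""
    runs = []
    start = None
    for i, count in enumerate(histogram):
        if count > 0 and start is None:
            start = i                      # a new run begins
        elif count == 0 and start is not None:
            runs.append((start, i - 1))    # histogram fell to 0: close the run
            start = None
    if start is not None:
        runs.append((start, len(histogram) - 1))
    return runs


def cut_lines(image: Image) -> List[Tuple[int, int]]:
    """Horizontal scan: count black pixels per row, split where the count is 0."""
    row_histogram = [sum(row) for row in image]
    return zero_separated_runs(row_histogram)


def cut_characters(image: Image, top: int, bottom: int) -> List[Tuple[int, int]]:
    """Vertical scan inside one line: count black pixels per column."""
    width = len(image[0])
    col_histogram = [
        sum(image[y][x] for y in range(top, bottom + 1)) for x in range(width)
    ]
    return zero_separated_runs(col_histogram)


sample = [
    [0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0],
    [1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
]
lines = cut_lines(sample)                             # row ranges of each line
chars_in_first_line = cut_characters(sample, *lines[0])  # column ranges
```

Each returned pair is an inclusive pixel range, so a line range and a character range together give a rectangle for one character.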

[0006]

Problems to Be Solved by the Invention: Because such a conventional character reading device segments lines and characters by creating histograms, line segmentation and character segmentation cannot always be performed correctly.

[0007] FIG. 2 is an explanatory diagram of the conventional art. For example, FIG. 2(a) shows an example in which a strike-through line has been drawn through a plurality of characters. In the figure, the rectangles drawn with broken lines indicate the correct division into individual characters. In this case, the strike-through line makes it impossible to identify the gaps between characters correctly. As a result, all the characters are erroneously recognized as a single character, and line segmentation cannot be performed correctly.
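The strike-through failure can be reproduced with a toy column histogram. The sketch below is illustrative only: the pixel patterns and the helper name `column_runs` are assumptions, and a real form image would be far larger.

```python
from typing import List, Tuple


def column_runs(rows: List[List[int]]) -> List[Tuple[int, int]]:
    """Split columns into runs wherever the black-pixel count per column is 0."""
    histogram = [sum(column) for column in zip(*rows)]
    runs, start = [], None
    for x, count in enumerate(histogram):
        if count > 0 and start is None:
            start = x
        elif count == 0 and start is not None:
            runs.append((start, x - 1))
            start = None
    if start is not None:
        runs.append((start, len(histogram) - 1))
    return runs


# Two 2-pixel-wide "characters" separated by one blank column.
clean = [
    [1, 1, 0, 1, 1],
    [1, 1, 0, 1, 1],
    [1, 1, 0, 1, 1],
]
# The same characters with a horizontal strike-through on the middle row.
struck = [row[:] for row in clean]
struck[1] = [1, 1, 1, 1, 1]

clean_runs = column_runs(clean)    # the blank column separates two spans
struck_runs = column_runs(struck)  # the histogram never reaches 0
```

With the strike-through, no column has a zero count, so the two characters come out as one wide rectangle.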

[0008] FIG. 2(b) shows an example in which a plurality of characters are circled. In this case, because of the encircling line, two lines of characters are recognized as a single vertically long character, and line segmentation cannot be performed correctly.

[0009] FIG. 2(c) shows an example in which the printing has shifted. In this case, the gap between the lines disappears, the lines cannot be identified correctly, and two lines of characters are cut out as a single vertically long character.

[0010] FIG. 2(d) shows an example in which a digit separator line is included in a line. In this case, the digit line, which is unnecessary for character recognition, is cut out as a character. When the line segmentation or character segmentation result contains such an error, character recognition is still performed on each detected rectangle as if it were a character, even though the character size or position is obviously wrong. Misreading occurs or unnecessary characters are output, which lowers the reliability of the character recognition device.

[0011] Moreover, such recognition results must be corrected manually by an operator, which increases the operator's burden. It is therefore necessary to judge correctly whether a character may have been erroneously recognized and, when it has, to perform post-processing automatically.

[0012]

Means for Solving the Problems: The present invention adopts the following structures to solve the above problems. <Structure 1> The character reading device according to the invention of claim 1 comprises: image input means for acquiring characters printed on a predetermined sheet as image data; recognition means for specifying the position and size of each character by character coordinates from the image data acquired by the image input means, recognizing each character, and converting it into a character code; and misreading determination processing means for setting predetermined misreading determination conditions, expressed in terms of the character coordinates, on the position and size of each character recognized by the recognition means, determining that a character may have been erroneously recognized when its position and size meet a misreading determination condition, and performing post-processing on the recognition result for that character.

[0013] <Structure 2> In the character reading device according to the invention of claim 2, predetermined reading areas, each being an area to which the same misreadability determination condition is applied, are set in the image data, and the misreadability determination means is configured to determine, for each reading area, whether a read character meets the misreading determination condition.

[0014] <Structure 3> In the character reading device according to the invention of claim 3, the misreadability determination means is configured to calculate the width, height, and position of the character to be recognized from the character coordinates of each character, set misreading determination conditions on the calculated width, height, and position, and determine that the character may have been erroneously recognized when any one of the width, height, and position meets a misreading determination condition.

[0015] <Structure 4> In the character reading device according to the invention of claim 4, the misreadability determination means is configured to calculate the width, height, and position of the character to be recognized from the character coordinates of each character, set misreading determination conditions on the calculated width, height, and position, and determine that the character may have been erroneously recognized when at least one of the width, height, and position meets a misreading determination condition.

[0016] <Structure 5> In the character reading device according to the invention of claim 5, the misreadability determination means is configured to calculate, from the character coordinates of each character, the relationship between the character to be recognized and the characters before and after it, set a misreading determination condition on the calculated relationship, and determine that the character may have been erroneously recognized when that relationship meets the misreading determination condition.

[0017] <Structure 6> In the character reading device according to the invention of claim 6, the misreadability determination means is configured to create, from the character coordinates of each character, a line containing all the characters of that line, set misreading determination conditions on the positional relationship of the line and on the positions and number of the characters contained in the line, and determine that a character may have been erroneously recognized when any one of the positional relationship of the line, the positions of the characters contained in the line, and the number of characters meets a misreading determination condition.

[0018] <Structure 7> In the character reading device according to the invention of claim 7, the misreadability determination means is configured to create, from the character coordinates of each character, a line containing all the characters of that line, set misreading determination conditions on the positional relationship of the line and on the positions and number of the characters contained in the line, and determine that a character may have been erroneously recognized when at least one of the positional relationship of the line, the positions of the characters contained in the line, and the number of characters meets a misreading determination condition.

[0019] <Structure 8> In the character reading device according to the invention of claim 8, the misreading determination processing means is configured to, when characters having the same condition are consecutive in the same line according to the character coordinates of each character, group those consecutive characters into a block, set misreading determination conditions on the positional relationship of the block and on the positions and number of the characters contained in the block, and determine that a character may have been erroneously recognized when any one of the positional relationship of the block to be recognized, the positions of the characters contained in the block, and the number of characters meets a misreading determination condition.

[0020] <Structure 9> In the character reading device according to the invention of claim 9, the misreading determination processing means is configured to, when characters having the same condition are consecutive in the same line according to the character coordinates of each character, group those consecutive characters into a block, set misreading determination conditions on the positional relationship of the block and on the positions and number of the characters contained in the block, and determine that a character may have been erroneously recognized when at least one of the positional relationship of the block to be recognized, the positions of the characters contained in the block, and the number of characters meets a misreading determination condition.

[0021] <Structure 10> In the character reading device according to the invention of claim 10, the misreading determination processing means describes the misreading determination conditions in a script.

[0022] <Structure 11> In the character reading device according to the invention of claim 11, the misreading determination processing means is configured to select and perform, according to the possibility that a character has been erroneously read, one of the following: replacing the character with another character, no processing, or deletion.

[0023]

DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below using specific examples. <Specific Example 1> In Specific Example 1, characters are recognized from image data; size determination is performed on the position and size of each recognized character, with size determination data suited to the reading area designated as the misreading determination conditions; and when misreading is possible, post-processing such as unread processing is performed.

[0024] FIG. 1 is a block diagram showing the configuration of Specific Example 1. The character reading device of Specific Example 1 comprises an image input unit 1, a display unit 2, an input unit 3, a control unit 4, a recognition unit 5, a size determination unit 6, an image memory 21, a reading area information storage memory 22, a recognition result storage memory 23, a determination data storage memory 24, a determination result storage memory 25, and a reference coordinate storage memory 26.

[0025] The image input unit 1 is image input means, such as an image scanner or a facsimile machine, that has the function of inputting characters and figures written on a form as image data. The display unit 2, such as a display, has the function of presenting information to the operator. The input unit 3, such as a keyboard or mouse, has the function of accepting input from the operator.

[0026] The recognition unit 5 is recognition means that has the functions of scanning the reading area with reference to the image data stored in the image memory 21, performing line segmentation and character segmentation to specify the position and size of each character by character coordinates, recognizing each character, and converting the recognized character into a character code.

[0027] FIG. 3 is an explanatory diagram of character coordinates. As shown in FIG. 3, the character "あ" is the character to be recognized, and its position and size are specified by the rectangle surrounding it, drawn with a broken line in the figure. The rectangle takes a predetermined position as the origin; with the upper-left coordinates in the figure being (l, t) and the lower-right coordinates being (r, b), it is expressed as the coordinates (l, t)-(r, b), and these coordinates are the character coordinates.
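As a minimal sketch of how character coordinates might be held in software, the rectangle can be modeled as a small value type. The class name `CharRect` and the example values are assumptions for illustration; the inclusive `+ 1` in the width and height follows the form of the size expressions used later in this description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharRect:
    """Character coordinates (l, t)-(r, b) of the rectangle around one character."""
    l: int  # x coordinate of the left edge
    t: int  # y coordinate of the top edge
    r: int  # x coordinate of the right edge
    b: int  # y coordinate of the bottom edge

    @property
    def width(self) -> int:
        # Inclusive pixel count in the x direction.
        return self.r - self.l + 1

    @property
    def height(self) -> int:
        # Inclusive pixel count in the y direction.
        return self.b - self.t + 1


rect = CharRect(l=100, t=1250, r=129, b=1289)
size = (rect.width, rect.height)
```

Keeping the four coordinates rather than a precomputed size lets both size conditions and position conditions be evaluated from the same record.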

[0028] The size determination unit 6 is misreading determination processing means that has the function of performing size determination on the position and size of each character in the result recognized by the recognition unit 5, and thereby determining the possibility of misreading.

[0029] FIG. 4 is an explanatory diagram showing an example of the size determination data used for size determination in Specific Example 1. As shown in FIG. 4, the size determination data contain a plurality of conditional expressions and, for each, the processing method to apply when that conditional expression is met.

[0030] Here, the processing method "unread" replaces the recognized and converted character code with a character that should not appear in a recognition result, such as "?", to make it easier for the operator to correct the recognition result.

[0031] The processing method "unprocessed" excludes from processing such as unread or deletion those characters for which a designated condition has been set in advance so that no post-processing is performed. "Deletion" deletes characters considered unnecessary. The processing method is set as appropriate for each conditional expression.
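One way to picture the size determination data is as an ordered list of pairs, each holding a conditional expression and its processing method. The sketch below is hypothetical: FIG. 4 is not reproduced in this text, so the three conditions and their thresholds are placeholders, and the names `Action`, `Rule`, and `size_determination_data` are assumptions.

```python
from enum import Enum
from typing import Callable, List, NamedTuple, Tuple

Rect = Tuple[int, int, int, int]  # character coordinates (l, t, r, b)


class Action(Enum):
    UNREAD = "unread"            # replace the character code with a mark such as "?"
    UNPROCESSED = "unprocessed"  # leave the recognition result untouched
    DELETE = "delete"            # drop the character from the result


class Rule(NamedTuple):
    condition: Callable[[Rect], bool]  # conditional expression on the coordinates
    action: Action                     # processing method when the condition is met


# Hypothetical stand-in for the contents of FIG. 4; thresholds are placeholders.
size_determination_data: List[Rule] = [
    Rule(lambda rc: (rc[3] - rc[1] + 1) > 40, Action.UNREAD),
    Rule(lambda rc: (rc[2] - rc[0] + 1) > 40, Action.UNPROCESSED),
    Rule(lambda rc: rc[1] < 1200, Action.DELETE),
]

actions_in_order = [rule.action.value for rule in size_determination_data]
```

Storing the condition together with its processing method keeps the table self-describing, so a different reading area can simply point to a different list.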

[0032] The control unit 4 has the function of controlling each block of the character reading device. The image memory 21 is a memory for storing the image data input by the image input unit 1.

[0033] The reading area information storage memory 22 is a memory for storing the area information used for reading processing. FIG. 5 is an explanatory diagram of the reading areas of Specific Example 1, using a form as an example. Reading areas A to F, in which characters are printed, are set on this form. The size, typeface, and so on of the characters to be recognized differ from area to area, and size determination is performed for each of the reading areas A to F. The area information consists of area designation information, which specifies the coordinates of the area, and size determination data designation information, which specifies the conditional expressions and so on for size determination applied to the area. It is stored for each of the reading areas A to F.
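The per-area information can be sketched as a lookup table keyed by reading area. Everything concrete below is an assumption made for illustration: the coordinates, the identifiers `small_print` and `large_print`, and the names `AreaInfo` and `size_data_for` do not appear in the source.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class AreaInfo:
    region: Tuple[int, int, int, int]  # area designation information: (l, t, r, b)
    size_data_id: str                  # size determination data designation information


# Hypothetical entries for two of the reading areas A to F.
area_info: Dict[str, AreaInfo] = {
    "A": AreaInfo(region=(50, 100, 800, 160), size_data_id="small_print"),
    "B": AreaInfo(region=(50, 200, 800, 300), size_data_id="large_print"),
}


def size_data_for(area_name: str) -> str:
    """Look up which size determination data applies to a reading area."""
    return area_info[area_name].size_data_id


selected = size_data_for("A")
```

Because each area names its own size determination data, areas with different character sizes can be judged with different thresholds.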

[0034] The recognition result storage memory 23 stores the coordinates of each area together with the character coordinates and character codes recognized by the recognition unit 5. The determination data storage memory 24 is a memory for storing size determination data such as those shown in FIG. 4.

[0035] The determination result storage memory 25 is a memory for storing the character codes and character coordinates finally obtained as a result of size determination. These character codes and character coordinates are stored in the same format as the data in the recognition result storage memory 23.

[0036] The reference coordinate storage memory 26 is a memory for storing the reference coordinates and character position that the size determination unit 6 uses to refer to the recognition result storage memory 23. In Specific Example 1, the character coordinates of a character recognized by the recognition unit 5 are stored as the reference coordinates. The character position is data indicating the position, among the characters stored in the recognition unit 5, of the character being referenced; for example, it is 1 when the referenced character is the first character.

[0037] <Operation> Next, the operation of Specific Example 1 will be described. The control unit 4 controls each block to carry out character reading.

[0038] FIG. 6 is a flowchart showing the operation of Specific Example 1. In step 1 (steps are denoted "S" in the figure), the image input unit 1 reads image data from a form or the like. The image data are stored in the image memory 21, and the corresponding area information is stored in the reading area information storage memory 22.

[0039] The control unit 4 determines whether area information is stored in the reading area information storage memory 22 (step 2). At first, reading area A is designated, as shown in FIG. 5. The recognition unit 5 refers to the image data stored in the image memory 21, performs line segmentation and character segmentation on the image within reading area A based on the area designation information of reading area A, and detects the coordinates of each character. It then recognizes each character. Each recognized character is converted into a character code, and the character coordinates and the converted character code are stored in the recognition result storage memory 23 (step 3).

[0040] The size determination unit 6 performs size determination on this recognition result (step 4). FIG. 7 is a flowchart showing the size determination processing performed by the size determination unit 6 of Specific Example 1. In step 11, the character coordinates of the character to be judged and its character position are acquired from the recognition result storage memory 23 and stored in the reference coordinate storage memory 26. The first character position is 1. If there are no character coordinates and character position for a next character to be judged, "no rectangle" information, for example with the character coordinates and character position all set to 0, is stored in the reference coordinate storage memory 26.

[0041] In step 12, the character coordinates stored in the reference coordinate storage memory 26 are referenced to determine whether there are next character coordinates to be judged. If "no rectangle" information is not stored in the reference coordinate storage memory 26, the process proceeds to step 13.

[0042] In step 13, a condition calculation is performed using the size determination data. For convenience, the size determination conditions are written as mathematical expressions. Expressions (1) to (3) below are examples of conditional expressions used for size determination.

\[ (b - t + 1) > W_{\mathrm{th}} \tag{1} \]
\[ (r - l + 1) > H_{\mathrm{th}} \tag{2} \]
\[ t < t_{\min} \tag{3} \]

where
l, r: x coordinates of the rectangle
t, b: y coordinates of the rectangle
W_th: lower limit of the rectangle in the width (x) direction (for example, 40)
H_th: lower limit of the rectangle in the height (y) direction (for example, 40)
t_min: minimum value of the coordinate t (for example, 1200)
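Expressions (1) to (3) can be evaluated directly from the four character coordinates. The sketch below uses the example threshold values given in the text (40, 40, and 1200); the function names and the sample rectangles are assumptions for illustration.

```python
W_TH = 40     # threshold compared with (b - t + 1) in expression (1)
H_TH = 40     # threshold compared with (r - l + 1) in expression (2)
T_MIN = 1200  # minimum value of the coordinate t in expression (3)


def expression_1(l: int, t: int, r: int, b: int) -> bool:
    """Expression (1): (b - t + 1) > W_th."""
    return (b - t + 1) > W_TH


def expression_2(l: int, t: int, r: int, b: int) -> bool:
    """Expression (2): (r - l + 1) > H_th."""
    return (r - l + 1) > H_TH


def expression_3(l: int, t: int, r: int, b: int) -> bool:
    """Expression (3): t < t_min."""
    return t < T_MIN


# Hypothetical rectangles given as (l, t, r, b).
normal = (100, 1250, 129, 1279)   # 30 x 30, t above the minimum
tall = (100, 1250, 129, 1349)     # 30 wide, 100 high
shifted = (100, 1100, 129, 1129)  # 30 x 30, t below the minimum

normal_flags = [f(*normal) for f in (expression_1, expression_2, expression_3)]
tall_flags = [f(*tall) for f in (expression_1, expression_2, expression_3)]
shifted_flags = [f(*shifted) for f in (expression_1, expression_2, expression_3)]
```

A rectangle for which every expression is false is treated as having no possibility of misreading.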

[0043] Expression (1) is a misreading determination condition based on the height (y direction) of the rectangle obtained from the character coordinates, expression (2) is a misreading determination condition based on the width (x direction) of the rectangle obtained from the character coordinates, and expression (3) uses a character coordinate itself as the misreading determination condition.

[0044] As shown in expression (4) below, two or more conditional expressions may be combined by logical AND or logical OR.

\[ (b - t + 1) > 40 \ \text{AND} \ (r - l + 1) > 40 \tag{4} \]
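Compound conditions of this kind can be sketched with two small combinators. The helper names `all_of` and `any_of` are hypothetical; `expression_4` mirrors expression (4), while the OR variant is only an illustrative counterpart.

```python
from typing import Callable, Tuple

Rect = Tuple[int, int, int, int]  # character coordinates (l, t, r, b)
Condition = Callable[[Rect], bool]


def all_of(*conditions: Condition) -> Condition:
    """Logical AND of several conditional expressions."""
    return lambda rect: all(condition(rect) for condition in conditions)


def any_of(*conditions: Condition) -> Condition:
    """Logical OR of several conditional expressions."""
    return lambda rect: any(condition(rect) for condition in conditions)


def height_over_40(rect: Rect) -> bool:
    l, t, r, b = rect
    return (b - t + 1) > 40


def width_over_40(rect: Rect) -> bool:
    l, t, r, b = rect
    return (r - l + 1) > 40


expression_4 = all_of(height_over_40, width_over_40)    # as in expression (4)
either_over_40 = any_of(height_over_40, width_over_40)  # illustrative OR variant

big_square = (0, 0, 49, 49)  # 50 x 50
tall_thin = (0, 0, 9, 49)    # 10 wide, 50 high
```

Here `expression_4` holds only for the 50 x 50 rectangle, whereas `either_over_40` also holds for the tall, thin one.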

[0045] The size determination data are stored in the determination data storage memory 24. The size determination data designation information stored in the reading area information storage memory 22 is referenced and used to designate the size determination data suited to reading area A. The character coordinates are read from the reference coordinate storage memory 26, and the size determination data for reading area A are applied to them to perform the condition calculation for misreading determination.

[0046] The device stores a control program that evaluates whether these conditional expressions are true or false. The condition calculation starts by substituting the character coordinates into the first conditional expression shown in FIG. 4. If the result is false, that is, if the conditional expression is not satisfied, the character coordinates are substituted into the next conditional expression. The character coordinates are thus substituted into the conditional expressions one after another, and the calculation ends as soon as one of them becomes true.
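The sequential evaluation, which stops at the first conditional expression that becomes true, can be sketched as a simple loop. The rule contents are placeholders because FIG. 4 is not reproduced here, and the names `first_true_rule` and `traced` are assumptions; the tracing wrapper exists only to show which expressions were actually evaluated.

```python
from typing import Callable, List, Optional, Tuple

Rect = Tuple[int, int, int, int]           # character coordinates (l, t, r, b)
Rule = Tuple[Callable[[Rect], bool], str]  # (conditional expression, processing method)


def first_true_rule(rect: Rect, rules: List[Rule]) -> Optional[int]:
    """Return the 0-based index of the first true condition, or None if all are false."""
    for index, (condition, _method) in enumerate(rules):
        if condition(rect):
            return index  # stop here: later expressions are not evaluated
    return None


evaluated: List[str] = []


def traced(name: str, condition: Callable[[Rect], bool]) -> Callable[[Rect], bool]:
    """Wrap a condition so that each evaluation is recorded by name."""
    def wrapper(rect: Rect) -> bool:
        evaluated.append(name)
        return condition(rect)
    return wrapper


# Hypothetical rule list; thresholds and methods are placeholders.
rules: List[Rule] = [
    (traced("expr1", lambda rc: (rc[3] - rc[1] + 1) > 40), "unread"),
    (traced("expr2", lambda rc: (rc[2] - rc[0] + 1) > 40), "unprocessed"),
    (traced("expr3", lambda rc: rc[1] < 1200), "delete"),
]

wide = (100, 1250, 199, 1279)  # 100 wide, 30 high
matched = first_true_rule(wide, rules)
```

Because evaluation stops at the first match, the order of the expressions in the size determination data decides which processing method applies when several conditions would hold.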

[0047] For example, when a strike-through line has been drawn through a plurality of characters or when a plurality of characters are circled, as shown in FIGS. 2(a) and 2(b), expressions (1) and (2) are satisfied. When the printing has shifted or a digit separator line is included in a line, as shown in FIGS. 2(c) and 2(d), expression (3) is satisfied. In such cases, the conditional expression evaluates to true. When a given conditional expression evaluates to true, it is determined that misreading is possible, and no further condition calculation is performed.

[0048] On the other hand, when all the conditional expressions evaluate to false, the condition calculation based on the size determination data is false. In this case, it is determined that there is no possibility of misreading. The calculation result is stored in the determination result storage memory 25.

[0049] In step 14, the truth or falsity of the calculation result is first determined. When the calculation result is false, the process proceeds to step 15. In step 15, the character position of the character whose result was false is acquired from the reference coordinate storage memory 26; because it has been determined that there is no possibility of misreading, the character coordinates and character code at that character position, stored in the recognition result storage memory 23, are copied unchanged to the determination result storage memory 25. When the result of the condition calculation in step 14 is true, the process proceeds to step 16.

[0050] In step 16, the processing method is determined. For example, in FIG. 4, when the character coordinates of the character to be recognized satisfy conditional expression 2 and the calculation result therefore becomes true, the processing method is "unprocessed".

【００５１】処理方法が未処理のときは、ステップ１５
に進み、予め設定された指定条件を満足する文字を不読
や削除などの処理から除外するために認識結果格納メモ
リ２３に格納されているその文字位置の文字座標及び文
字コードをそのまま判定結果格納メモリ２５へコピーす
る。When the processing method is unprocessed, the process proceeds to step 15. So that characters satisfying a preset designated condition are excluded from processing such as unread processing or deletion, the character coordinates and character code at that character position, which are stored in the recognition result storage memory 23, are copied unchanged to the determination result storage memory 25.

【００５２】また、条件式１を満足することにより計算
結果が偽になったとき、処理方法は不読となる。処理方
法が不読のときは、ステップ１７に進み、その文字の文
字位置を参照座標格納メモリ２６から取得し、認識結果
格納メモリ２３に格納されているその文字位置の文字座
標を判定結果格納メモリ２５へコピーし、その文字の文
字コードを、例えば「？」などの認識結果として含まれ
るべきでない文字に置換して判定結果格納メモリ２５に
格納する。従って、オペレータは、この文字を視認する
ことにより誤読の可能性を一目で識別できる。また、処
理方法が削除のときは、例えばゴミ等によってイメージ
化され、誤読されたと考えられる不要な文字あるいは記
号を削除する。When the calculation result becomes false by satisfying conditional expression 1, the processing method is "unread". When the processing method is unread, the process proceeds to step 17, where the character position of the character is acquired from the reference coordinate storage memory 26, the character coordinates at that character position stored in the recognition result storage memory 23 are copied to the determination result storage memory 25, and the character code of the character is replaced with a character that should not appear in a recognition result, such as “?”, and stored in the determination result storage memory 25. The operator can therefore identify the possibility of misreading at a glance by seeing this character. When the processing method is deletion, unnecessary characters or symbols that are considered to have been misread, for example because dust was captured in the image, are deleted.
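The post-processing of steps 15 to 17 can be sketched as follows. This is only an illustrative Python sketch, not the device's actual implementation: the names `Recognized`, `post_process`, and `UNREAD_MARK`, and the string labels used for the processing methods, are assumptions introduced here.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

UNREAD_MARK = "?"  # assumed substitute character for an "unread" result


@dataclass(frozen=True)
class Recognized:
    coords: Tuple[int, int, int, int]  # (left, top, right, bottom)
    code: str                          # recognized character code


def post_process(ch: Recognized, result_is_true: bool, method: str) -> Optional[Recognized]:
    """Return the entry for the determination result, or None if the character is deleted."""
    # Step 15: no possibility of misreading, or method "unprocessed" -> copy unchanged.
    if not result_is_true or method == "unprocessed":
        return ch
    # Step 17: keep the coordinates but replace the character code.
    if method == "unread":
        return Recognized(ch.coords, UNREAD_MARK)
    # Deletion: the character is dropped from the result.
    if method == "delete":
        return None
    raise ValueError(f"unknown processing method: {method}")
```

Keeping the coordinates for an unread character, as the text describes, lets a later correction screen still point at the place on the form where the doubtful character was found.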

【００５３】最初の文字についてのこのような処理が終
了した後、ステップ１１に戻り、次の文字座標及び文字
位置を取得して同じようにステップ１２〜１７を実行す
る。そして、参照座標格納メモリ２６に「矩形なし」の
情報が格納されたときは、ステップ１２において、読み
取り領域Ａにおいて認識された全ての文字について、サ
イズ判定が行われたと判定し、ステップ２に戻る。この
ような処理は、読み取り領域Ｂ〜Ｆについても行われ、
全ての読み取り領域についてこのような処理が行われた
とき、処理が完了する。After this processing has been completed for the first character, the process returns to step 11, the next character coordinates and character position are acquired, and steps 12 to 17 are executed in the same way. When the information “no rectangle” has been stored in the reference coordinate storage memory 26, it is determined in step 12 that size determination has been performed for all characters recognized in reading area A, and the process returns to step 2. The same processing is performed for reading areas B to F, and the processing is complete when it has been performed for all reading areas.

【００５４】〈具体例１の効果〉以上、説明したように
具体例１によれば、文字認識対象の文字の読み取りを行
うときに、各文字の座標および座標から求められる高さ
や幅等に対してサイズ判定データを設定し、サイズ判定
を行うようにしたので、その文字座標から認識結果の各
文字の誤読の可能性についての評価を適切に行うことが
できる。<Effects of Specific Example 1> As described above, according to Specific Example 1, when the characters to be recognized are read, size determination data is set for the coordinates of each character and for the height, width, and the like obtained from those coordinates, and size determination is performed. The possibility that each character in the recognition result has been misread can therefore be evaluated appropriately from its character coordinates.

【００５５】また、処理方法が未処理のときは、予め不
読処理を行わないように設定された指定条件を満足する
ような文字に対しては、未処理とすることにより、この
文字を不読処理、削除処理から除外することができる。When the processing method is unprocessed, a character that satisfies a designated condition set in advance so that unread processing is not performed is left unprocessed, which excludes it from unread processing and deletion processing.

【００５６】また、認識結果に誤読の可能性が高いと考
えられる文字に対しては、不読処理を行うことにより、
その対象となった文字が「？」のような含まれるべきで
ない文字に置換されるので、オペレータは一目で視認で
き、オペレータによる認識結果の修正作業が容易とな
る。For characters in the recognition result that are considered likely to have been misread, unread processing replaces the character with one that should not appear in a recognition result, such as “?”. The operator can therefore spot it at a glance, which makes correcting the recognition result easier.

【００５７】また、例えばゴミ等によってイメージ化さ
れて誤読されたと考えられる明らかに不要な文字、記号
に対しては、この文字を削除することにより、オペレー
タによる修正作業の負荷を軽減できる。For clearly unnecessary characters and symbols that are considered to have been misread, for example because dust was captured in the image, deleting them reduces the operator's correction workload.

【００５８】さらに、このようなサイズ判定を読み取り
領域毎に行うようにしたので、読み取り領域毎に字体、
文字種、大きさ異なっているような帳票においても各領
域毎に適切な文字認識、サイズ判定を行うことができ、
文字認識精度が向上する。Furthermore, because this size determination is performed for each reading area, appropriate character recognition and size determination can be performed for each area even on a form whose reading areas differ in font, character type, and character size, which improves character recognition accuracy.

【００５９】〈具体例２〉具体例２は、文字座標に基づ
いて３つの文字の前後関係を算出し、この前後関係に対
して誤読判定条件を設定し、サイズ判定を行うようにし
たものである。具体例２の文字読取装置は、具体例１と
同様に、画像入力部１と、表示部２と、入力部３と、制
御部４と、認識部５と、サイズ判定部６と、画像メモリ
２１と、読み取り領域情報格納メモリ２２と、認識結果
格納メモリ２３と、判定データ格納メモリ２４と、判定
結果格納メモリ２５と、参照座標格納メモリ２６と、を
備えて構成されている。<Specific Example 2> In Specific Example 2, the relationship among three consecutive characters is calculated based on their character coordinates, misreading determination conditions are set for this relationship, and size determination is performed. As in Specific Example 1, the character reading device of Specific Example 2 includes an image input unit 1, a display unit 2, an input unit 3, a control unit 4, a recognition unit 5, a size determination unit 6, an image memory 21, a read area information storage memory 22, a recognition result storage memory 23, a determination data storage memory 24, a determination result storage memory 25, and a reference coordinate storage memory 26.

【００６０】但し、参照座標格納メモリ２６には３つの
格納エリアが備えられている。図８は、その参照座標格
納メモリ２６の説明図である。格納エリアｂは、認識対
象である現在文字の文字座標及び文字位置を格納するた
めのエリアであり、格納エリアａ，ｃは、それぞれ現在
文字の１つ前の文字座標及び文字位置、その次の文字の
文字座標及び文字位置を格納するためのエリアである。
尚、具体例１と同一要素については同一符号を付して説
明を省略する。However, the reference coordinate storage memory 26 has three storage areas. FIG. 8 is an explanatory diagram of the reference coordinate storage memory 26. Storage area b stores the character coordinates and character position of the current character, which is the recognition target; storage areas a and c store the character coordinates and character position of the character immediately before the current character and of the character immediately after it, respectively. Elements identical to those of Specific Example 1 are denoted by the same reference numerals, and their description is omitted.

【００６１】〈動作〉次に具体例２の動作を説明する。
具体例２においても、具体例１と同様に、図６のフロー
チャートを実行し、ステップ４においてサイズ判定処理
を実施する。<Operation> Next, the operation of Specific Example 2 is described. As in Specific Example 1, the flowchart of FIG. 6 is executed, and the size determination processing is performed in step 4.

【００６２】図９は具体例２のサイズ判定処理を示すフ
ローチャートである。ステップ２１では、先頭文字の文
字座標及び文字位置を取得する。取得した文字座標及び
文字位置は参照座標格納メモリ２６の格納エリアｃに格
納され、格納エリアａ、ｂにはともに「矩形なし」の情
報が格納される。FIG. 9 is a flowchart showing the size determination processing of Specific Example 2. In step 21, the character coordinates and character position of the first character are acquired. They are stored in storage area c of the reference coordinate storage memory 26, and the information “no rectangle” is stored in both storage areas a and b.

【００６３】ステップ２２では、次の文字の文字座標及
び文字位置を取得する。次の文字座標及び文字位置が取
得されたとき、参照座標格納メモリ２６の格納エリアｃ
に格納されていた先頭文字の文字座標及び文字位置は格
納エリアｂに格納され、取得した次の文字座標及び文字
位置が格納エリアｃに格納される。In step 22, the character coordinates and character position of the next character are acquired. When they have been acquired, the character coordinates and character position of the first character, which were stored in storage area c of the reference coordinate storage memory 26, are moved to storage area b, and the newly acquired character coordinates and character position are stored in storage area c.

【００６４】以後、文字座標及び文字位置を取得する毎
に、格納エリアｂ，ｃに格納されている文字座標及び文
字位置をそれぞれ格納エリアａ，ｂに格納し、取得した
文字座標及び文字位置を格納エリアｃに格納する。Thereafter, each time character coordinates and a character position are acquired, the contents of storage areas b and c are moved to storage areas a and b, respectively, and the newly acquired character coordinates and character position are stored in storage area c.

【００６５】尚、次にサイズ判定を行う文字の文字座標
及び文字位置がなければ、例えば文字座標及び文字位置
をすべて０にした「矩形なし」の情報を格納エリアｃに
格納する。If there is no further character whose size is to be determined, the information “no rectangle”, for example with the character coordinates and character position all set to 0, is stored in storage area c.
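The way storage areas a, b, and c are shifted in steps 21 to 23 can be sketched as a three-slot sliding window. The Python below is a minimal illustration under assumptions: only the rectangle is tracked (the character position is omitted), `NO_RECT` stands in for the “no rectangle” information, and the function name `windows` is hypothetical.

```python
from typing import Iterable, Iterator, Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom)
NO_RECT: Rect = (0, 0, 0, 0)      # assumed encoding of "no rectangle" (all zeros)


def windows(rects: Iterable[Rect]) -> Iterator[Tuple[Rect, Rect, Rect]]:
    """Yield (area a, area b, area c) = (previous, current, next) for each character."""
    a, b, c = NO_RECT, NO_RECT, NO_RECT
    # Append one sentinel so that the last real character also reaches area b.
    for new in list(rects) + [NO_RECT]:
        a, b, c = b, c, new   # shift b -> a, c -> b, store the new entry in c
        if b != NO_RECT:      # step 23: area b holds a character to be determined
            yield a, b, c
```

For three characters the generator yields three triples; the first has `NO_RECT` in area a and the last has `NO_RECT` in area c.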

【００６６】ステップ２３では、判定対象の現在の文字
の有無を判定する。格納エリアｂに「矩形なし」の情報
が格納されていないときは、判定対象の現在の文字があ
ると判定してステップ２４に進む。In step 23, it is determined whether there is a current character to be determined. When the information of “no rectangle” is not stored in the storage area b, it is determined that there is a current character to be determined, and the process proceeds to step 24.

【００６７】ステップ２４では、３つの文字の文字座標
に対し、サイズ判定データを用いて条件計算を行う。図
１０は具体例２の説明図である。３つの文字座標を図１
０（ａ）に示すように設定する。この３つの文字座標か
ら文字間の間隔を算出し、この間隔に誤読判定条件とし
てのサイズ判定データを適用してサイズ判定を行う。式
（５）、（６）は、サイズ判定データとしての条件式の
一例である。
ｌ−ｐｒ−１＝０ …（５）
ｎｌ−ｒ−１＝０ …（６）
In step 24, condition calculation is performed on the character coordinates of the three characters using the size determination data. FIG. 10 is an explanatory diagram of Specific Example 2. The three sets of character coordinates are defined as shown in FIG. 10(a). The spacing between characters is calculated from these three sets of character coordinates, and size determination is performed by applying the size determination data, which serves as the misreading determination condition, to this spacing. Expressions (5) and (6) are examples of conditional expressions used as size determination data.
l - pr - 1 = 0 … (5)
nl - r - 1 = 0 … (6)

【００６８】式（５）は、図１０（ｂ）に示すように、
現在の文字と前の文字の間隔が０となる条件式であり、
式（６）は、図１０（ｃ）に示すように、現在の文字と
次の文字の間隔が０となる条件式である。Expression (5) is the condition that the spacing between the current character and the previous character is 0, as shown in FIG. 10(b), and expression (6) is the condition that the spacing between the current character and the next character is 0, as shown in FIG. 10(c).
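As a rough illustration of expressions (5) and (6), the Python sketch below evaluates the spacing between adjacent rectangles. It assumes inclusive pixel coordinates in the order (left, top, right, bottom); the function names `gap`, `expr5`, and `expr6` are hypothetical.

```python
from typing import Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixels


def gap(left_rect: Rect, right_rect: Rect) -> int:
    """Number of empty pixel columns between two horizontally adjacent rectangles."""
    return right_rect[0] - left_rect[2] - 1


def expr5(prev: Rect, cur: Rect) -> bool:
    """Expression (5): l - pr - 1 = 0 (no spacing to the previous character)."""
    return gap(prev, cur) == 0


def expr6(cur: Rect, nxt: Rect) -> bool:
    """Expression (6): nl - r - 1 = 0 (no spacing to the next character)."""
    return gap(cur, nxt) == 0
```

With `prev = (0, 0, 9, 9)` and `cur = (10, 0, 19, 9)`, the previous character ends at column 9 and the current one starts at column 10, so the spacing is 0 and expression (5) holds.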

【００６９】尚、式（５）及び（６）を具体例１と同様
に論理積（ＡＮＤ）あるいは論理和（ＯＲ）で複合化し
てもよい。また、３つの文字の前後関係は各文字の間隔
に限られるものではなく、３つの文字の大きさの関係等
を条件にしてもよい。As in Specific Example 1, expressions (5) and (6) may be combined by logical product (AND) or logical sum (OR). The relationship among the three characters is not limited to the spacing between characters; a relationship among the sizes of the three characters, for example, may also be used as a condition.

【００７０】このサイズ判定データは判定データ格納メ
モリ２４に格納されており、具体例１と同じように、読
み取り領域情報格納メモリ２２に格納されているサイズ
判定データ指定情報を参照し、このサイズ判定データ指
定情報を用いて図５に示す読み取り領域Ａに適応したサ
イズ判定データを指定し、参照座標格納メモリ２６から
３つの文字座標を取り出して、この読み取り領域Ａのサ
イズ判定データを適用して誤読判定のための条件計算を
行う。This size determination data is stored in the determination data storage memory 24. As in Specific Example 1, the size determination data designation information stored in the read area information storage memory 22 is referenced and used to designate the size determination data suited to reading area A shown in FIG. 5. The three sets of character coordinates are then taken from the reference coordinate storage memory 26, and the size determination data for reading area A is applied to them to perform the condition calculation for misreading determination.

【００７１】条件計算の方法は具体例１と同様であり、
計算結果は判定結果格納メモリ２５に格納される。但
し、参照座標格納メモリ２６の格納エリアａ、または格
納エリアｃに「矩形なし」の情報が格納されているとき
は、その条件式の計算結果は偽となる。The condition calculation method is the same as in Specific Example 1, and the calculation result is stored in the determination result storage memory 25. However, when the information “no rectangle” is stored in storage area a or storage area c of the reference coordinate storage memory 26, the calculation result of the conditional expression is false.

【００７２】ステップ２５では、計算結果の真偽を判別
する。計算結果が偽のときは、どの条件式にも該当しな
いので誤読の可能性はないと判定し、ステップ２６に進
んで参照座標格納メモリ２６の格納エリアｂに格納され
ている文字位置を取得し、具体例１と同様に認識結果格
納メモリ２３に格納されているその文字位置の文字座標
及び文字コードを判定結果格納メモリ２５にそのままコ
ピーする。In step 25, the truth value of the calculation result is determined. When the calculation result is false, no conditional expression applies, so it is determined that there is no possibility of misreading. The process proceeds to step 26, where the character position stored in storage area b of the reference coordinate storage memory 26 is acquired and, as in Specific Example 1, the character coordinates and character code at that character position stored in the recognition result storage memory 23 are copied unchanged to the determination result storage memory 25.

【００７３】また、計算結果が真のときは、誤読の可能
性があると判定し、ステップ２７に進んで条件式に対応
する処理方法を判別し、処理方法が未処理のときは、格
納エリアｂに格納されている現在の文字の文字座標及び
文字コードをそのまま判定結果格納メモリ２５へコピー
する（ステップ２６）。When the calculation result is true, it is determined that there is a possibility of misreading, and the process proceeds to step 27, where the processing method corresponding to the conditional expression is determined. When the processing method is unprocessed, the character coordinates and character code of the current character stored in storage area b are copied unchanged to the determination result storage memory 25 (step 26).

【００７４】処理方法が不読のときは、ステップ２８に
進み、その文字の文字位置を参照座標格納メモリ２６か
ら取得し、認識結果格納メモリ２３に格納されているそ
の文字位置の文字座標を判定結果格納メモリ２５へコピ
ーし、その文字の文字コードを、例えば「？」などのよ
うに認識結果として含まれるべきでない文字に置換して
判定結果格納メモリ２５に格納する。そして、処理方法
が削除のときは、不要と考えられる文字あるいは記号を
削除する。When the processing method is unread, the process proceeds to step 28, where the character position of the character is acquired from the reference coordinate storage memory 26, the character coordinates at that character position stored in the recognition result storage memory 23 are copied to the determination result storage memory 25, and the character code of the character is replaced with a character that should not appear in a recognition result, such as “?”, and stored in the determination result storage memory 25. When the processing method is deletion, the characters or symbols considered unnecessary are deleted.

【００７５】このような処理を全ての文字について行
い、全ての文字についてサイズ判別が行われたとき（ス
テップ２３）、このサイズ判定処理を終了させ、全ての
読み取り領域についてこのような処理が行われたとき
（ステップ２）、処理が完了する。This processing is performed for every character. When size determination has been performed for all characters (step 23), the size determination processing ends, and when this processing has been performed for all reading areas (step 2), the processing is complete.

【００７６】〈具体例２の効果〉以上、説明したように
具体例２によれば、３つの文字の前後関係に対してサイ
ズ判定を行うようにしたので、具体例１と同様の効果が
得られるだけでなく、誤読の可能性を、より的確に判別
することができる。<Effects of Specific Example 2> As described above, according to Specific Example 2, size determination is performed on the relationship among three consecutive characters, so that, in addition to the effects of Specific Example 1, the possibility of misreading can be determined more accurately.

【００７７】〈具体例３〉具体例３は、文字座標に基づ
いて算出された現在文字の行位置、文字位置、行先頭か
らの文字位置、その行の文字数に誤読判定条件を設定
し、サイズ判定を行うようにしたものである。<Specific Example 3> In Specific Example 3, misreading determination conditions are set for the line position of the current character, its character position, its character position from the head of the line, and the number of characters in its line, all calculated based on the character coordinates, and size determination is performed.

【００７８】図１１は、具体例３の構成を示すブロック
図である。具体例３の文字読取装置は、画像入力部１
と、表示部２と、入力部３と、制御部４と、認識部５
と、サイズ判定部６と、画像メモリ２１と、読み取り領
域情報格納メモリ２２と、認識結果格納メモリ２３と、
判定データ格納メモリ２４と、判定結果格納メモリ２５
と、参照座標格納メモリ２６と、関連情報格納メモリ２
７と、を備えて構成されている。FIG. 11 is a block diagram showing the configuration of Specific Example 3. The character reading device of Specific Example 3 includes an image input unit 1, a display unit 2, an input unit 3, a control unit 4, a recognition unit 5, a size determination unit 6, an image memory 21, a read area information storage memory 22, a recognition result storage memory 23, a determination data storage memory 24, a determination result storage memory 25, a reference coordinate storage memory 26, and a related information storage memory 27.

【００７９】この関連情報格納メモリ２７は、現在文字
に関する情報として、現在文字の行位置、文字位置、行
先頭からの文字位置およびその行の文字数などの関連情
報を格納するメモリである。The related information storage memory 27 is a memory for storing related information such as the line position of the current character, the character position, the character position from the head of the line, and the number of characters in the line as information on the current character.

【００８０】図１２は具体例３の関連情報の説明図であ
る。この行位置Ｌ、文字位置Ｉ、行先頭からの文字位置
ＬＩ、その行の文字数ＬＮは１以上の値とする。尚、具
体例１及び具体例２と同一要素については同一符号を付
して説明を省略する。FIG. 12 is an explanatory diagram of the related information of Specific Example 3. The line position L, the character position I, the character position LI from the head of the line, and the number of characters LN in the line each take a value of 1 or more. Elements identical to those of Specific Examples 1 and 2 are denoted by the same reference numerals, and their description is omitted.

【００８１】〈動作〉次に具体例３の動作を説明する。
具体例３においても、具体例１と同様に、図６のフロー
チャートを実行し、ステップ４においてサイズ判定処理
を実施する。<Operation> Next, the operation of Specific Example 3 is described. As in Specific Example 1, the flowchart of FIG. 6 is executed, and the size determination processing is performed in step 4.

【００８２】図１３は具体例３のサイズ判定処理を示す
フローチャートである。ステップ３１〜３３では、具体
例２のステップ２１〜２３と同様に先頭文字及び次の文
字の文字座標及び文字位置を取得し、それぞれ参照座標
格納メモリ２６の格納エリアｂ，ｃに格納し、ステップ
３４に進む。FIG. 13 is a flowchart showing the size determination processing of Specific Example 3. In steps 31 to 33, as in steps 21 to 23 of Specific Example 2, the character coordinates and character positions of the first character and the next character are acquired and stored in storage areas b and c of the reference coordinate storage memory 26, respectively, and the process proceeds to step 34.

【００８３】ステップ３４では、判定対象である現在文
字の関連情報を設定する。即ち、認識結果格納メモリ２
３を参照し、図１２に示すように、判定対象の文字につ
いての行位置Ｌ、文字位置Ｌ、行先頭からの文字位置Ｌ
Ｉ、その行の文字数ＬＮ等を取得する。そして、これら
の関連情報を関連情報メモリＳ１３に格納する。In step 34, the related information of the current character, which is the determination target, is set. That is, the recognition result storage memory 23 is referenced and, as shown in FIG. 12, the line position L, the character position I, the character position LI from the head of the line, the number of characters LN in the line, and so on are acquired for the character to be determined. This related information is then stored in the related information storage memory 27.

【００８４】ステップ３５では、判定対象である現在の
文字の文字座標及びその関連情報に対し、サイズ判定デ
ータを用いて条件計算を行う。式（７）、（８）は、サ
イズ判定データとしての条件式の一例である。
Ｉ＝２ …（７）
ＬＮ＝３ …（８）
式（７）は２文字目の場合の条件式であり、式（８）は行文字数が３の場合の条件式である。In step 35, condition calculation is performed on the character coordinates of the current character, which is the determination target, and on its related information using the size determination data. Expressions (7) and (8) are examples of conditional expressions used as size determination data.
I = 2 … (7)
LN = 3 … (8)
Expression (7) is the condition for the second character, and expression (8) is the condition for a line containing three characters.
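The related information and expressions (7) and (8) can be sketched in Python as below. This is an illustration under assumptions rather than the patented procedure: lines are described only by their character counts, the character position I is assumed to count characters across the whole reading area, and the names `RelatedInfo`, `related_info`, `expr7`, and `expr8` are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterator, List


@dataclass(frozen=True)
class RelatedInfo:
    L: int   # line position (1 or more)
    I: int   # character position within the reading area (assumed running count)
    LI: int  # character position from the head of the line
    LN: int  # number of characters in the line


def related_info(line_lengths: List[int]) -> Iterator[RelatedInfo]:
    """Yield the related information of every character, line by line."""
    position = 0
    for line_no, n_chars in enumerate(line_lengths, start=1):
        for offset in range(1, n_chars + 1):
            position += 1
            yield RelatedInfo(L=line_no, I=position, LI=offset, LN=n_chars)


def expr7(info: RelatedInfo) -> bool:
    """Expression (7): I = 2, i.e. the second character."""
    return info.I == 2


def expr8(info: RelatedInfo) -> bool:
    """Expression (8): LN = 3, i.e. the line holds three characters."""
    return info.LN == 3
```

For a reading area with a two-character line followed by a three-character line, the fourth character has L = 2, I = 4, LI = 2, and LN = 3, so expression (8) holds for it while expression (7) does not.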

【００８５】尚、具体例１、２と同様に、これら２つの
条件式を論理積（ＡＮＤ）あるいは論理和（ＯＲ）で複
合化してもよい。このサイズ判定データは判定データ格
納メモリ２４に格納されており、具体例１、２と同じよ
うに、読み取り領域情報格納メモリ２２に格納されてい
るサイズ判定データ指定情報を参照し、このサイズ判定
データ指定情報を用いて読み取り領域に適応したサイズ
判定データを指定し、参照座標格納メモリ２６から３つ
の文字座標を取り出して、この読み取り領域のサイズ判
定データを適用して誤読判定のための条件計算を行う。As in Specific Examples 1 and 2, these two conditional expressions may be combined by logical product (AND) or logical sum (OR). The size determination data is stored in the determination data storage memory 24. As in Specific Examples 1 and 2, the size determination data designation information stored in the read area information storage memory 22 is referenced and used to designate the size determination data suited to the reading area. The three sets of character coordinates are then taken from the reference coordinate storage memory 26, and the size determination data for the reading area is applied to them to perform the condition calculation for misreading determination.

【００８６】条件計算の方法は具体例１と同様であり、
計算結果は判定結果格納メモリ２５に格納される。但
し、具体例２と同様に、参照座標格納メモリ２６の格納
エリアａ、または格納エリアｃに「矩形なし」の情報が
格納されているときは、その条件式の計算結果は偽とな
る。そして、ステップ３６〜３９では、具体例１，２と
同様に後処理を行う。The condition calculation method is the same as in Specific Example 1, and the calculation result is stored in the determination result storage memory 25. However, as in Specific Example 2, when the information “no rectangle” is stored in storage area a or storage area c of the reference coordinate storage memory 26, the calculation result of the conditional expression is false. In steps 36 to 39, post-processing is performed as in Specific Examples 1 and 2.

【００８７】このような処理を全ての文字について行
い、全ての文字についてサイズ判別が行われたとき（ス
テップ３３）、このサイズ判定処理を終了させ、全ての
読み取り領域についてこのような処理が行われたとき
（ステップ２）、処理が完了する。This processing is performed for every character. When size determination has been performed for all characters (step 33), the size determination processing ends, and when this processing has been performed for all reading areas (step 2), the processing is complete.

【００８８】〈具体例３の効果〉以上、説明したように
具体例３によれば、現在文字と前後の文字との位置関係
だけでなく、現在文字の関連情報として行位置Ｌ、文字
位置Ｌ、行先頭からの文字位置ＬＩ、その行の文字数Ｌ
Ｎに対してサイズ判定を行うようにしたので、具体例
１，２の効果が得られるとともに、特定の行や文字につ
いて処理条件を設定でき、行切り出し処理や文字切り出
し処理の誤りによる誤読や不要な文字への、より的確な
処理を適用することができる。<Effects of Specific Example 3> As described above, according to Specific Example 3, size determination is performed not only on the positional relationship between the current character and the characters before and after it but also on the related information of the current character: the line position L, the character position I, the character position LI from the head of the line, and the number of characters LN in the line. In addition to the effects of Specific Examples 1 and 2, processing conditions can therefore be set for specific lines and characters, and more appropriate processing can be applied to unnecessary characters and to misreadings caused by errors in line segmentation or character segmentation.

【００８９】〈具体例４〉具体例４は、文字座標に基づ
いて３つの文字の行座標を算出し、この行座標に誤読判
定条件を設定し、サイズ判定を行うようにしたものであ
る。<Specific Example 4> In Specific Example 4, the line coordinates of three characters are calculated based on the character coordinates, misreading determination conditions are set for these line coordinates, and size determination is performed.

【００９０】具体例４の文字読取装置は、具体例３と同
様に、画像入力部１と、表示部２と、入力部３と、制御
部４と、認識部５と、サイズ判定部６と、画像メモリ２
１と、読み取り領域情報格納メモリ２２と、認識結果格
納メモリ２３と、判定データ格納メモリ２４と、判定結
果格納メモリ２５と、参照座標格納メモリ２６と、関連
情報格納メモリ２７と、を備えて構成されている。尚、
具体例１〜３と同一要素については同一符号を付して説
明を省略する。As in Specific Example 3, the character reading device of Specific Example 4 includes an image input unit 1, a display unit 2, an input unit 3, a control unit 4, a recognition unit 5, a size determination unit 6, an image memory 21, a read area information storage memory 22, a recognition result storage memory 23, a determination data storage memory 24, a determination result storage memory 25, a reference coordinate storage memory 26, and a related information storage memory 27. Elements identical to those of Specific Examples 1 to 3 are denoted by the same reference numerals, and their description is omitted.

【００９１】〈動作〉次に具体例４の動作を説明する。
具体例４においても、具体例１と同様に、図６のフロー
チャートを実行し、ステップ４においてサイズ判定処理
を実施する。<Operation> Next, the operation of Specific Example 4 is described. As in Specific Example 1, the flowchart of FIG. 6 is executed, and the size determination processing is performed in step 4.

【００９２】図１４は具体例４のサイズ判定処理を示す
フローチャートである。ステップ４１では、先頭行の矩
形領域を作成する。図１５は具体例４の行座標の作成方
法を示す説明図である。この図１５に示すように、破線
で示す矩形領域、、はそれぞれ１つの文字を囲む
矩形領域を示す。FIG. 14 is a flowchart showing the size determination processing of Specific Example 4. In step 41, the rectangular area of the first line is created. FIG. 15 is an explanatory diagram showing how line coordinates are created in Specific Example 4. In FIG. 15, the three rectangular areas drawn with broken lines each enclose one character.

【００９３】矩形領域〜は例えば、以下の文字座標
によって表す。
矩形領域の文字座標：（pl,pt）−（pr,pb）
矩形領域の文字座標：（l,t）−（r,b）
矩形領域の文字座標：（nl,nt）−（nr,nb）
The rectangular areas are represented, for example, by the following character coordinates.
Character coordinates of a rectangular area: (pl, pt) - (pr, pb)
Character coordinates of a rectangular area: (l, t) - (r, b)
Character coordinates of a rectangular area: (nl, nt) - (nr, nb)

【００９４】先頭行の行座標を作成するには、この全文
字を含むようにして最小の矩形領域を設定する。この
矩形領域〜の文字座標を認識結果格納メモリ２３か
ら取り出して、行座標（nl,pt）−（nr,b）が作成され
る。To create the line coordinates of the first line, the smallest rectangular area that contains all of its characters is set. The character coordinates of these rectangular areas are taken from the recognition result storage memory 23, and the line coordinates (nl, pt) - (nr, b) are created.
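Creating line coordinates as the smallest rectangle that contains every character rectangle can be sketched as follows. The text gives (nl, pt) - (nr, b) for the particular layout of FIG. 15; the Python below computes the general bounding rectangle instead. It assumes coordinates in the order (left, top, right, bottom) with the origin at the top left, and the function name `line_rect` is hypothetical.

```python
from typing import Iterable, Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom), origin at top left


def line_rect(char_rects: Iterable[Rect]) -> Rect:
    """Return the smallest rectangle enclosing all character rectangles of a line."""
    rects = list(char_rects)
    if not rects:
        raise ValueError("a line needs at least one character rectangle")
    lefts, tops, rights, bottoms = zip(*rects)
    # Leftmost and topmost edges start the line rectangle,
    # rightmost and bottommost edges end it.
    return (min(lefts), min(tops), max(rights), max(bottoms))
```

The result does not depend on the order in which the character rectangles are supplied, so the characters need not be sorted by position first.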

【００９５】また、この行の最終文字の文字位置をこの
行の文字位置として、作成された先頭行の行座標（nl,p
t）−（nr,b）及びこの文字位置を参照座標格納メモリ
２６の格納エリアｃに格納し、格納エリアａ，ｂには
「矩形なし」の情報を格納する。The character position of the last character of this line is used as the character position of the line. The line coordinates (nl, pt) - (nr, b) of the first line and this character position are stored in storage area c of the reference coordinate storage memory 26, and the information “no rectangle” is stored in storage areas a and b.

【００９６】ステップ４２では、ステップ４１と同様
に、次の行の矩形領域を作成する。次の行の矩形領域が
作成されたとき、参照座標格納メモリ２６の格納エリア
ｃに格納されていた先頭行の行座標及びその行の最終文
字の文字位置は格納エリアｂに格納され、作成された次
の行座標及びその行の最終文字の文字位置が格納エリア
ｃに格納される。In step 42, as in step 41, the rectangular area of the next line is created. When it has been created, the line coordinates of the first line and the character position of its last character, which were stored in storage area c of the reference coordinate storage memory 26, are moved to storage area b, and the newly created line coordinates and the character position of the last character of that line are stored in storage area c.

【００９７】以後、行座標が作成される毎に、格納エリ
アｂ，ｃに格納されているデータをそれぞれ格納エリア
ａ，ｂに格納し、作成した文字座標及び文字位置を格納
エリアｃに格納する。Thereafter, each time line coordinates are created, the data stored in storage areas b and c is moved to storage areas a and b, respectively, and the newly created coordinates and character position are stored in storage area c.

【００９８】尚、次にサイズ判定を行うべき行の行座標
がなければ、例えば行座標及びその行の最終文字の文字
位置をすべて０にした「矩形なし」の情報を格納エリア
ｃに格納する。If there are no line coordinates for a further line whose size is to be determined, the information “no rectangle”, for example with the line coordinates and the character position of the last character all set to 0, is stored in storage area c.

【００９９】ステップ４３では、判定対象である現在の
行の有無を判定する。格納エリアｂに「矩形なし」の情
報が格納されていないときは、判定対象の現在の行があ
ると判定してステップ４４に進む。In step 43, it is determined whether there is a current line to be determined. When the information of “no rectangle” is not stored in the storage area b, it is determined that there is a current line to be determined, and the process proceeds to step 44.

【０１００】ステップ４４では、判定対象の現在行の関
連情報を設定する。即ち、現在行の最終文字を参照座標
格納メモリ２６の格納エリアｂから取得し、その文字位
置の文字に関して認識結果格納メモリ２３を参照し、そ
の行位置Ｌ、文字位置Ｌ、行先頭からの文字位置ＬＩ、
その行の文字数ＬＮ等を取得する。そして、これらの関
連情報を関連情報メモリＳ１３に格納する。In step 44, the related information of the current line, which is the determination target, is set. That is, the last character of the current line is acquired from storage area b of the reference coordinate storage memory 26, the recognition result storage memory 23 is referenced for the character at that character position, and its line position L, character position I, character position LI from the head of the line, number of characters LN in the line, and so on are acquired. This related information is then stored in the related information storage memory 27.

【０１０１】ステップ４５では、現在行及びその関連情
報に対し、サイズ判定データを用いて条件計算を行う。
条件式については、現在行の位置関係、現在行の大きさ
等について設定することができる。また、具体例１〜３
と同様に、２つの条件式を論理積（ＡＮＤ）あるいは論
理和（ＯＲ）で複合化してもよい。In step 45, condition calculation is performed on the current line and its related information using the size determination data. Conditional expressions can be set for, for example, the positional relationship of the current line and the size of the current line. As in Specific Examples 1 to 3, two conditional expressions may be combined by logical product (AND) or logical sum (OR).

【０１０２】サイズ判定データは判定データ格納メモリ
２４に格納されており、具体例１〜３と同じように、読
み取り領域情報格納メモリ２２に格納されているサイズ
判定データ指定情報を参照し、このサイズ判定データ指
定情報を用いて読み取り領域に適応したサイズ判定デー
タを指定し、参照座標格納メモリ２６から３つの文字座
標を取り出して、この読み取り領域のサイズ判定データ
を適用して誤読判定のための条件計算を行う。The size determination data is stored in the determination data storage memory 24. As in Specific Examples 1 to 3, the size determination data designation information stored in the read area information storage memory 22 is referenced and used to designate the size determination data suited to the reading area. The three sets of character coordinates are then taken from the reference coordinate storage memory 26, and the size determination data for the reading area is applied to them to perform the condition calculation for misreading determination.

【０１０３】サイズ判定データには、具体例１（図４）
と同じような条件式とそれに対応した処理方法が含まれ
ている。条件計算の方法は具体例１と同様であり、計算
結果は判定結果格納メモリ２５に格納される。The size determination data contains conditional expressions like those of Specific Example 1 (FIG. 4) and the processing methods corresponding to them. The condition calculation method is the same as in Specific Example 1, and the calculation result is stored in the determination result storage memory 25.

【０１０４】但し、具体例２と同様に、参照座標格納メ
モリ２６の格納エリアａ、または格納エリアｃに「矩形
なし」の情報が格納されているときは、その条件式の計
算結果は偽となる。However, as in Specific Example 2, when the information “no rectangle” is stored in storage area a or storage area c of the reference coordinate storage memory 26, the calculation result of the conditional expression is false.

【０１０５】ステップ４６では、計算結果を判別し、計
算結果が偽のときは、ステップ４７に進む。ステップ４
７では、関連情報格納メモリ２７から現在行の文字位置
Ｉ、即ち、現在行の最終文字位置とその行の文字数ＬＮ
を取得し、この行の開始文字位置（Ｉ−ＬＮ＋１）から
最終文字位置Ｉまでの文字座標及び文字コードを認識結
果格納メモリ２３から判定結果格納メモリ２５へそのま
まコピーする。In step 46, the calculation result is examined, and when it is false the process proceeds to step 47. In step 47, the character position I of the current line, that is, the position of the last character of the current line, and the number of characters LN in the line are acquired from the related information storage memory 27, and the character coordinates and character codes from the start character position (I - LN + 1) of the line through the last character position I are copied unchanged from the recognition result storage memory 23 to the determination result storage memory 25.
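The range of character positions handled in step 47 follows directly from the last character position I and the character count LN. The Python sketch below illustrates it under assumptions: the recognition result is modelled as a dictionary keyed by character position, and the names `copy_line`, `Rect`, and `Entry` are hypothetical.

```python
from typing import Dict, Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom)
Entry = Tuple[Rect, str]          # (character coordinates, character code)


def copy_line(recognized: Dict[int, Entry], last_pos: int, n_chars: int) -> Dict[int, Entry]:
    """Copy the entries of one line, from position I - LN + 1 through position I."""
    start_pos = last_pos - n_chars + 1  # start character position (I - LN + 1)
    return {pos: recognized[pos] for pos in range(start_pos, last_pos + 1)}
```

For a line whose last character is at position 5 and which holds 3 characters, the start position is 5 - 3 + 1 = 3, so positions 3, 4, and 5 are copied.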

【０１０６】また、計算結果が真のときは、誤読の可能
性があると判定してステップ４８に進んで条件式に対応
する処理方法を判別する。処理方法が未処理のときは、
ステップ４７に進み、判定結果が偽のときと同じ処理を
行う。処理方法が不読のときは、ステップ４９に進む。When the calculation result is true, it is determined that there is a possibility of misreading, and the process proceeds to step 48, where the processing method corresponding to the conditional expression is determined. When the processing method is unprocessed, the process proceeds to step 47 and the same processing as for a false result is performed. When the processing method is unread, the process proceeds to step 49.

【０１０７】ステップ４９では、関連情報格納メモリ２
７から現在行の文字位置Ｉ、即ち、最終文字位置とその
行の文字数ＬＮを取得し、この行の開始文字位置（Ｉ−
ＬＮ＋１）から最終文字位置Ｉまでの文字座標を認識結
果格納メモリ２３から判定結果格納メモリ２５へコピー
し、文字コードを、例えば「？」などのように認識結果
として含まれるべきでない文字に置換して判定結果格納
メモリ２５に格納する。処理方法が削除のときは、その
文字を削除する。In step 49, the character position I of the current line, that is, the last character position, and the number of characters LN in the line are acquired from the related information storage memory 27. The character coordinates from the start character position (I - LN + 1) of the line through the last character position I are copied from the recognition result storage memory 23 to the determination result storage memory 25, and the character codes are replaced with a character that should not appear in a recognition result, such as “?”, and stored in the determination result storage memory 25. When the processing method is deletion, the characters are deleted.

【０１０８】このような処理を全ての行について行い、
全ての行についてサイズ判別が行われたとき（ステップ
４３）、このサイズ判定処理を終了させ、全ての読み取
り領域についてこのような処理が行われたとき（ステッ
プ２）、処理が完了する。This processing is performed for every line. When size determination has been performed on all lines (step 43), the size determination processing ends, and when this processing has been performed on all reading areas (step 2), the processing is complete.

【０１０９】〈具体例４の効果〉以上、説明したように
具体例４によれば、現在行前後の位置関係を算出し、こ
の位置関係に対してサイズ判定を行うようにしたので、
行単位で行の切り出し処理や文字の切り出し処理の誤り
を判別し、後処理を行うことができる。<Effects of Specific Example 4> As described above, Specific Example 4 calculates the positional relationship between the current line and the lines before and after it and performs size determination on that relationship, so errors in line segmentation and character segmentation can be detected line by line and post-processed.

【０１１０】〈具体例５〉具体例５は、同一行で同じ位
置条件の文字が連続したとき、これらの文字をブロック
にまとめ、ブロック単位でサイズ判定を行うようにした
ものである。<Example 5> In Example 5, when characters having the same position condition are consecutive on the same line, these characters are combined into blocks, and the size is determined for each block.

【０１１１】具体例５の関連情報格納メモリ２７には、
前後の文字間隔に基づいてブロックにまとめるための条
件式が格納されている。例えば、文字位置ｉ，ｉ＋１の
文字座標をそれぞれ（Ｌ(i)，Ｔ(i)）−（Ｒ(i)，Ｂ
(i)）、文字座標（Ｌ(i+1)，Ｔ(i+1)）−（Ｒ(i+1)，Ｂ
(i＋1)）とすると、間隔Ｄは以下の式（９）によって計
算される。Ｄ＝Ｌ(i+1)−Ｒ(i)−１ …（９）The related information storage memory 27 of Specific Example 5 stores conditional expressions for grouping characters into blocks based on the spacing between adjacent characters. For example, if the character coordinates at character positions i and i+1 are (L(i), T(i))-(R(i), B(i)) and (L(i+1), T(i+1))-(R(i+1), B(i+1)), respectively, the interval D is calculated by the following equation (9):
D = L(i+1) - R(i) - 1 … (9)
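As a rough illustration of equation (9), the following sketch computes the interval D between two horizontally adjacent character rectangles. The `CharBox` type and the `interval` function are names assumed for this example only; the text itself defines just the coordinates L, T, R, and B.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharBox:
    """Character rectangle (L, T)-(R, B); an illustrative type, not from the patent."""
    left: int
    top: int
    right: int
    bottom: int


def interval(cur: CharBox, nxt: CharBox) -> int:
    """Equation (9): D = L(i+1) - R(i) - 1."""
    return nxt.left - cur.right - 1


p = CharBox(left=10, top=5, right=19, bottom=24)
q = CharBox(left=23, top=5, right=32, bottom=24)
print(interval(p, q))  # 23 - 19 - 1 = 3
```

If the coordinates are inclusive pixel indices (an assumption), D is the number of blank columns between the two rectangles, so two touching rectangles give D = 0.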

【０１１２】式（１０）〜（１５）は、間隔Ｄに基づい
てブロックを作成する条件を示す式である。Ｄ＝Ｄthl …（１０）Ｄ≠Ｄthl …（１１）Ｄ＜Ｄthl …（１２）Ｄ≦Ｄthl …（１３）Ｄ＞Ｄthl …（１４）Ｄ≧Ｄthl …（１５）但し、Ｄthl：所定値Expressions (10) to (15) give the conditions for creating a block based on the interval D:
D = Dthl … (10)
D ≠ Dthl … (11)
D < Dthl … (12)
D ≦ Dthl … (13)
D > Dthl … (14)
D ≧ Dthl … (15)
where Dthl is a predetermined value.
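Conditions (10) to (15) are the six ordinary comparisons of D against the threshold Dthl. A minimal sketch, assuming the conditions are held in a lookup table keyed by expression number (the table and function names are hypothetical):

```python
import operator

# Expressions (10)-(15): each compares the interval D with the threshold Dthl.
BLOCK_CONDITIONS = {
    10: operator.eq,  # D =  Dthl
    11: operator.ne,  # D != Dthl
    12: operator.lt,  # D <  Dthl
    13: operator.le,  # D <= Dthl
    14: operator.gt,  # D >  Dthl
    15: operator.ge,  # D >= Dthl
}


def matching_conditions(d: int, d_thl: int) -> list[int]:
    """Return the expression numbers that hold for interval d and threshold d_thl."""
    return [number for number, compare in BLOCK_CONDITIONS.items() if compare(d, d_thl)]


print(matching_conditions(3, 5))  # [11, 12, 13]
```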

【０１１３】これらの式（１０）〜（１５）が関連情報
格納メモリ２７に格納されている。具体例５の判定デー
タ格納メモリ２４には、このブロックに対して適用され
るサイズ判定データが格納されている。The equations (10) to (15) are stored in the related information storage memory 27. The determination data storage memory 24 of the specific example 5 stores the size determination data applied to this block.

【０１１４】図１６は具体例５のサイズ判定データの一
例を示す説明図である。具体例５の参照座標格納メモリ
２６は、具体例２と同様に３つの格納エリアａ〜ｃを有
している。尚、具体例１〜４と同一要素については同一
符号を付して説明を省略する。FIG. 16 is an explanatory diagram showing an example of the size determination data of the specific example 5. The reference coordinate storage memory 26 of the specific example 5 has three storage areas a to c as in the specific example 2. In addition, the same reference numerals are given to the same elements as those in the specific examples 1 to 4, and the description is omitted.

【０１１５】〈動作〉次に具体例５の動作を説明する。
具体例５においても、具体例１と同様に、図６のフロー
チャートを実行し、ステップ４においてサイズ判定処理
を実施する。<Operation> Next, the operation of the embodiment 5 will be described.
In the specific example 5, as in the specific example 1, the flowchart of FIG. 6 is executed, and the size determination process is performed in the step 4.

【０１１６】図１７は具体例５のサイズ判定処理を示す
フローチャートである。ステップ５１では、認識結果格
納メモリ２３から取得したその領域の文字を先頭から参
照して、その間隔Ｄを計算し、条件式（１０）〜（１
５）を評価して、いずれかの条件が同一行で連続して該
当するときは、これらの文字を含む最小の矩形領域を１
つのブロックとする。FIG. 17 is a flowchart showing the size determination processing of Specific Example 5. In step 51, the characters of the area obtained from the recognition result storage memory 23 are examined in order from the first character, the interval D is calculated, and conditional expressions (10) to (15) are evaluated. When any of the conditions holds for consecutive characters on the same line, the smallest rectangular area containing those characters is treated as one block.

【０１１７】図１８はこのブロックの説明図である。こ
の図１８に示すように、同一行に文字Ｐ，Ｑ，Ｒが並ん
でいる場合、文字Ｐ，Ｑの間隔Ｄは、前述のように式
（９）によって表される。FIG. 18 is an explanatory diagram of this block. As shown in FIG. 18, when characters P, Q, and R are arranged on the same line, the interval D between the characters P and Q is expressed by Expression (9) as described above.

【０１１８】例えば、文字Ｐ，Ｑの間隔Ｄが式（１０）
〜（１５）のいずれか１つに該当しているときは文字
Ｐ，Ｑが１つのブロックにまとめられる。図１８の破線
で示す領域がこのようにして作成された１つのブロック
を示す。For example, when the interval D between characters P and Q satisfies any one of expressions (10) to (15), characters P and Q are combined into one block. The area indicated by the broken line in FIG. 18 is a block created in this way.

【０１１９】尚、文字が、図１５に示すように領域、
、に印字されているときは、実線で示す領域が最
小の矩形領域となり、これが１つのブロックになる。こ
のブロックはブロック座標（Ｌ(i)，Ｔ(i)）−（Ｒ(i+
1)，Ｂ(i+1)）によって特定される。Note that when the characters are printed in areas such as those shown in FIG. 15, the area indicated by the solid line is the minimum rectangular area, and this becomes one block. The block is specified by the block coordinates (L(i), T(i))-(R(i+1), B(i+1)).
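The block creation described above can be sketched as follows. This is only an illustration under stated assumptions: rectangles are `(L, T, R, B)` tuples, a single predicate `joins` stands in for whichever of conditions (10) to (15) is configured, and the block rectangle is taken as the bounding box of its characters (min L, min T, max R, max B). The function name `group_blocks` is hypothetical.

```python
from typing import Callable

Box = tuple[int, int, int, int]  # (L, T, R, B)


def group_blocks(boxes: list[Box], joins: Callable[[int], bool]) -> list[tuple[Box, int]]:
    """Group the characters of one line into blocks.

    Returns (block rectangle, number of characters BN) pairs, left to right.
    """
    blocks: list[tuple[Box, int]] = []
    if not boxes:
        return blocks
    l, t, r, b = boxes[0]
    count = 1
    for prev, cur in zip(boxes, boxes[1:]):
        d = cur[0] - prev[2] - 1  # equation (9)
        if joins(d):
            # Extend the current block to the smallest rectangle containing cur.
            l, t = min(l, cur[0]), min(t, cur[1])
            r, b = max(r, cur[2]), max(b, cur[3])
            count += 1
        else:
            blocks.append(((l, t, r, b), count))
            l, t, r, b = cur
            count = 1
    blocks.append(((l, t, r, b), count))
    return blocks


line = [(10, 5, 19, 24), (22, 6, 31, 25), (50, 5, 59, 24)]
print(group_blocks(line, lambda d: d <= 4))
```

With the sample line, the first two characters are 2 apart and are merged, while the third is 18 away and starts a new block.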

【０１２０】まず、最初、参照座標格納メモリ２６の格
納エリアａ、ｂには、「矩形なし」の情報を格納し、格
納エリアｃにこの先頭ブロックのブロック座標をそのブ
ロックの最終文字位置とともに格納する。Initially, "no rectangle" information is stored in storage areas a and b of the reference coordinate storage memory 26, and the block coordinates of the first block are stored in storage area c together with the last character position of that block.

【０１２１】ステップ５２では、次のブロックをステッ
プ５１と同じように作成する。参照座標格納メモリ２６
の格納エリアａ，ｂ，ｃに格納されている参照情報を１
つずつ移動させ、次のブロックの参照座標を認識結果格
納メモリ２３から取得し、このブロックの参照座標を格
納エリアｃにそのブロックの最終文字位置とともに格納
する。もし次の行がないときは、「矩形なし」の情報を
格納する。In step 52, the next block is created in the same way as in step 51. The reference information stored in storage areas a, b, and c of the reference coordinate storage memory 26 is shifted by one position each, the reference coordinates of the next block are obtained from the recognition result storage memory 23, and these coordinates are stored in storage area c together with the last character position of that block. If there is no next line, "no rectangle" information is stored.
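The three storage areas behave like a sliding window over the blocks. The sketch below is an assumption-laden illustration: the class name `ReferenceAreas` is hypothetical, `None` stands in for the "no rectangle" marker, and the last character position stored alongside each block is omitted for brevity.

```python
from typing import Optional

Rect = tuple[int, int, int, int]  # (L, T, R, B)
NO_RECT = None  # stands in for the "no rectangle" information


class ReferenceAreas:
    """Storage areas a, b, c: previous, current, and next block."""

    def __init__(self, first_block: Rect) -> None:
        # Initial state: a and b hold "no rectangle", c holds the first block.
        self.a: Optional[Rect] = NO_RECT
        self.b: Optional[Rect] = NO_RECT
        self.c: Optional[Rect] = first_block

    def shift(self, next_block: Optional[Rect]) -> None:
        """Step 52: move each entry one area along and store the next block in c."""
        self.a, self.b, self.c = self.b, self.c, next_block


areas = ReferenceAreas((10, 5, 31, 25))
areas.shift((50, 5, 59, 24))  # the first block is now the current block (area b)
print(areas.a, areas.b, areas.c)
areas.shift(NO_RECT)          # no further block: c receives "no rectangle"
print(areas.a, areas.b, areas.c)
```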

【０１２２】ステップ５３では、サイズ判定を行うべき
ブロックの有無を判定する。サイズ判定を行うべきブロ
ックがあるときは、ステップ５４に進む。ステップ５４
では、判定対象である現在ブロックの関連情報を設定す
る。この関連情報を設定するには、現在ブロックの最終
文字の文字位置を、参照座標格納メモリ２６に格納され
ている現在ブロックの文字位置から取得し、その文字位
置の文字について認識結果格納メモリ２３を参照し、行
位置Ｌ、文字位置Ｉ、行先頭からの文字位置ＬＩ，その
行の文字数ＬＮおよびブロック文字数ＢＮを取得し、関
連情報格納メモリ２７に格納する。尚、Ｌ，Ｉ，ＬＩ，
ＬＮ、ＢＮは１以上の値とする。In step 53, it is determined whether there is a block on which size determination should be performed. If there is, the process proceeds to step 54. In step 54, the related information of the current block, which is the determination target, is set. To set it, the character position of the last character of the current block is obtained from the character position of the current block stored in the reference coordinate storage memory 26; the recognition result storage memory 23 is then consulted for the character at that position to obtain the line position L, the character position I, the character position LI from the head of the line, the number of characters LN of the line, and the number of characters BN of the block, which are stored in the related information storage memory 27. L, I, LI, LN, and BN all take values of 1 or greater.

【０１２３】ステップ５５では、参照座標格納メモリ２
６のエリアａ，ｂ，ｃに格納されている現在ブロックの
１つ前のブロック、現在ブロック、その次のブロックの
ブロック座標、及び関連情報格納メモリ２７に格納され
ているブロック関連情報を参照し、読み取り領域情報格
納メモリ２２に格納されているその領域の領域情報に従
って、判定データ格納メモリ２４に格納されているサイ
ズ判定データを参照し、このサイズ判定データの条件式
の真偽を計算する。尚、条件式は、具体例１〜４と同じ
ような条件式であってもよいし、論理積(ＡＮＤ)や論理
和(ＯＲ)によって複合化させたものでもよい。In step 55, the block coordinates of the block immediately before the current block, the current block, and the block after it, stored in areas a, b, and c of the reference coordinate storage memory 26, are referenced together with the block-related information stored in the related information storage memory 27. According to the area information for that area stored in the reading area information storage memory 22, the size determination data stored in the determination data storage memory 24 is consulted, and the truth value of its conditional expression is calculated. The conditional expressions may be of the same kind as in Specific Examples 1 to 4, or may be combined by logical AND or logical OR.

【０１２４】サイズ判定データには、具体例１（図４）
と同じような条件式とそれに対応した処理方法が含まれ
ている。条件計算の方法は具体例１と同様であり、計算
結果は判定結果格納メモリ２５に格納される。The size determination data includes conditional expressions similar to those of Specific Example 1 (FIG. 4) and the processing methods corresponding to them. The condition is calculated in the same way as in Specific Example 1, and the calculation result is stored in the determination result storage memory 25.

【０１２５】但し、具体例２と同様に、参照座標格納メ
モリ２６の格納エリアａ、または格納エリアｃに「矩形
なし」の情報が格納されているときは、その条件式の計
算結果は偽となる。However, as in Specific Example 2, when "no rectangle" information is stored in storage area a or storage area c of the reference coordinate storage memory 26, the result of the conditional expression is false.
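The rule that a missing neighbor forces a false result can be sketched as a guard around the condition calculation. The names `evaluate` and `much_shorter` are hypothetical, and the example condition (the current block is less than half as tall as both neighbors) is invented purely for illustration; it is not one of the patent's conditional expressions.

```python
from typing import Callable, Optional

Rect = tuple[int, int, int, int]  # (L, T, R, B)


def height(rect: Rect) -> int:
    """Height of a rectangle, assuming inclusive coordinates."""
    return rect[3] - rect[1] + 1


def evaluate(condition: Callable[[Rect, Rect, Rect], bool],
             prev: Optional[Rect], cur: Rect, nxt: Optional[Rect]) -> bool:
    """Return False when area a or area c holds "no rectangle" (None here)."""
    if prev is None or nxt is None:
        return False
    return condition(prev, cur, nxt)


def much_shorter(prev: Rect, cur: Rect, nxt: Rect) -> bool:
    """Illustrative condition only: current block under half the height of both neighbors."""
    return height(cur) * 2 < min(height(prev), height(nxt))


print(evaluate(much_shorter, (0, 0, 9, 19), (12, 12, 15, 19), (18, 0, 27, 19)))  # True
print(evaluate(much_shorter, None, (12, 12, 15, 19), (18, 0, 27, 19)))           # False
```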

【０１２６】ステップ５６では、計算結果の真偽を判別
し、計算結果が偽のときは、ステップ５７に進む。ステ
ップ５７では、関連情報格納メモリ２７から現在ブロッ
クの文字位置Ｉ、即ち、現在ブロックの最終文字位置と
そのブロックの文字数ＢＮを取得し、このブロックの開
始文字位置（Ｉ−ＢＮ＋１）から最終文字位置Ｉまでの
文字座標及び文字コードを認識結果格納メモリ２３から
判定結果格納メモリ２５へそのままコピーする。In step 56, the truth of the calculation result is determined. If the result is false, the process proceeds to step 57. In step 57, the character position I of the current block (that is, the last character position of the current block) and the number of characters BN of the block are obtained from the related information storage memory 27, and the character coordinates and character codes from the start character position (I-BN+1) to the last character position I of the block are copied unchanged from the recognition result storage memory 23 to the determination result storage memory 25.

【０１２７】また、計算結果が真のときは、誤読の可能
性があると判定してステップ５８に進んで条件式に対応
する処理方法を判別する。処理方法が未処理のときは、
ステップ５７に進み、判定結果が偽のときと同じ処理を
行う。処理方法が不読のときは、ステップ５９に進む。If the calculation result is true, it is determined that the character may have been misread, and the process proceeds to step 58, where the processing method corresponding to the conditional expression is identified. If the processing method is "no processing", the process proceeds to step 57 and the same processing as for a false result is performed. If the processing method is "non-read", the process proceeds to step 59.

【０１２８】ステップ５９では、関連情報格納メモリ２
７から現在ブロックの文字位置Ｉ、即ち、最終文字位置
とそのブロックの文字数ＢＮを取得し、このブロックの
開始文字位置（Ｉ−ＢＮ＋１）から最終文字位置Ｉまで
の文字座標を認識結果格納メモリ２３から判定結果格納
メモリ２５へコピーし、その文字コードを、例えば
「？」などのように認識結果として含まれるべきでない
文字に置換して判定結果格納メモリ２５に格納する。処
理方法が削除のときは、その文字を削除する。In step 59, the character position I of the current block (that is, its last character position) and the number of characters BN of the block are obtained from the related information storage memory 27. The character coordinates from the start character position (I-BN+1) to the last character position I of the block are copied from the recognition result storage memory 23 to the determination result storage memory 25, and each character code is replaced with a character that should not appear in a recognition result, such as "?", before being stored in the determination result storage memory 25. If the processing method is "delete", the corresponding characters are deleted.
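The three post-processing methods applied to the character range (I-BN+1) to I can be sketched as below. This is a simplified model under assumptions: the recognition result is a list of `(box, code)` pairs, positions are 1-based as stated in the text, and the method labels `"none"`, `"non-read"`, and `"delete"` as well as the function name are chosen for this example.

```python
def post_process(chars, last_pos, count, method):
    """Return the entries to store in the determination result for one block.

    chars:    recognition result as a list of (box, code) pairs
    last_pos: character position I of the block's last character (1-based)
    count:    number of characters BN in the block
    method:   "none", "non-read", or "delete" (labels assumed for this sketch)
    """
    start = last_pos - count + 1              # start position I - BN + 1
    segment = chars[start - 1:last_pos]       # convert 1-based positions to a slice
    if method == "none":
        return list(segment)                  # copy coordinates and codes unchanged
    if method == "non-read":
        return [(box, "?") for box, _ in segment]  # keep coordinates, replace codes
    if method == "delete":
        return []                             # drop the characters
    raise ValueError(f"unknown processing method: {method!r}")


chars = [((0, 0, 9, 9), "A"), ((10, 0, 19, 9), "B"),
         ((20, 0, 29, 9), "C"), ((30, 0, 39, 9), "D")]
print(post_process(chars, last_pos=3, count=2, method="non-read"))
```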

【０１２９】このような処理を全てのブロックについて
行い、全てのブロックについてのサイズ判定が終了した
とき（ステップ５３）、ステップ２に戻り、全ての領域
情報について認識処理（ステップ３）、サイズ判定処理
（ステップ４）が行われたとき（ステップ２）、すべて
の処理を終了させる。This processing is performed for every block. When size determination has finished for all blocks (step 53), the process returns to step 2, and when the recognition processing (step 3) and the size determination processing (step 4) have been performed for all area information (step 2), all processing ends.

【０１３０】〈具体例５の効果〉以上、説明したように
具体例５によれば、同一行で同じ条件の文字が連続した
とき、これらの文字をブロックにまとめ、このブロック
に対してサイズ判定を行うようにしたので、ブロック単
位で行の切り出し処理や文字の切り出し処理の誤りを判
別し、後処理を行うことができる。<Effects of Specific Example 5> As described above, according to Specific Example 5, when characters satisfying the same condition are consecutive on the same line, they are grouped into a block and size determination is performed on that block, so errors in line segmentation and character segmentation can be detected block by block and post-processed.

【０１３１】〈具体例６〉具体例６は、サイズ判定デー
タをスクリプトデータで記述するようにしたものであ
る。<Example 6> In Example 6, the size determination data is described in script data.

【０１３２】図１９は、具体例６の構成を示すブロック
図である。具体例６の文字読取装置は、画像入力部１
と、表示部２と、入力部３と、制御部４と、認識部５
と、サイズ判定部６と、スクリプトデータ解析部７と、
画像メモリ２１と、読み取り領域情報格納メモリ２２
と、認識結果格納メモリ２３と、判定データ格納メモリ
２４と、判定結果格納メモリ２５と、参照座標格納メモ
リ２６と、関連情報格納メモリ２７と、スクリプトデー
タ格納メモリ２８と、を備えて構成されている。FIG. 19 is a block diagram showing the configuration of Specific Example 6. The character reading device of Specific Example 6 comprises an image input unit 1, a display unit 2, an input unit 3, a control unit 4, a recognition unit 5, a size determination unit 6, a script data analysis unit 7, an image memory 21, a reading area information storage memory 22, a recognition result storage memory 23, a determination data storage memory 24, a determination result storage memory 25, a reference coordinate storage memory 26, a related information storage memory 27, and a script data storage memory 28.

【０１３３】スクリプトデータ格納メモリ２８は、スク
リプトで記述されたサイズ判定データを格納するメモリ
であり、このスクリプトはテキストで記述されている。
スクリプトデータ解析部７は、スクリプトデータ格納メ
モリ２８に格納されているサイズ判定データを参照し、
構文解析を行い、サイズ判定部６が使用できる内部的な
サイズ判定データに変換する機能を有する解析部であ
る。尚、具体例１〜５と同一要素については同一符号を
付して説明を省略する。The script data storage memory 28 is a memory that stores size determination data described in a script, and the script is written as text. The script data analysis unit 7 is an analysis unit that refers to the size determination data stored in the script data storage memory 28, parses it, and converts it into internal size determination data that the size determination unit 6 can use. Elements identical to those of Specific Examples 1 to 5 are given the same reference numerals, and their description is omitted.

【０１３４】〈動作〉次に具体例６の動作を説明する。
具体例２においても、具体例１と同様に、図６のフロー
チャートを実行し、ステップ４においてサイズ判定処理
を実施する。<Operation> Next, the operation of the embodiment 6 will be described.
In Specific Example 6 as well (the source text reads "Specific Example 2" here, apparently in error), the flowchart of FIG. 6 is executed as in Specific Example 1, and the size determination processing is performed in step 4.

【０１３５】図２０は具体例６のサイズ判定処理を示す
フローチャートである。ステップ６１では、スクリプト
で記述されたサイズ判定データを解析する。サイズ判定
データを解析するには、読み取り領域情報格納メモリ２
２に格納されているその領域の情報に従ってスクリプト
データ格納メモリ２８からスクリプトを取得する。FIG. 20 is a flowchart showing the size determination processing of Specific Example 6. In step 61, the size determination data described in the script is analyzed. To do so, the script is obtained from the script data storage memory 28 according to the information on that area stored in the reading area information storage memory 22.

【０１３６】スクリプトデータ解析部７はこのスクリプ
トを構文解析し、サイズ判定部６が使用できる内部的な
サイズ判定データに変換し、変換されたサイズ判定デー
タを判定データ格納メモリ２４に格納する。The script data analysis unit 7 parses this script, converts it into internal size determination data that the size determination unit 6 can use, and stores the converted size determination data in the determination data storage memory 24.

【０１３７】式（１６）は、このスクリプトで記述され
たサイズ判定データの一例を示す式である。処理単位，（条件１）処理１｜（条件２）処理２｜…｜（条件ｎ）処理ｎ …（１６）Expression (16) shows an example of size determination data described in this script:
processing unit, (condition 1) process 1 | (condition 2) process 2 | … | (condition n) process n … (16)

【０１３８】処理単位には、文字単位、行単位等の処理
単位が記述され、条件１〜ｎには、例えば、条件式（１
０）〜（１５）が記述される。そして、その条件１〜ｎ
に対応した処理１〜ｎを列挙する。The processing unit field specifies the unit of processing, such as character unit or line unit, and conditions 1 to n contain, for example, conditional expressions (10) to (15). The processes 1 to n corresponding to conditions 1 to n are then listed.
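To make the layout of expression (16) concrete, the sketch below parses one script line into a processing unit and a list of (condition, process) pairs. The concrete syntax is an assumption: the text gives only the general form, so the ASCII comma and bar, the sample condition text, and the process names are all hypothetical, and conditions containing nested parentheses are not handled.

```python
import re

# One rule has the form "(condition) process", following expression (16).
RULE = re.compile(r"\(\s*(?P<cond>[^)]*?)\s*\)\s*(?P<proc>\S+)")


def parse_script(line: str) -> tuple[str, list[tuple[str, str]]]:
    """Split 'unit, (cond 1) proc 1 | (cond 2) proc 2 | ...' into its parts."""
    unit, _, rest = line.partition(",")
    rules = []
    for part in rest.split("|"):
        match = RULE.fullmatch(part.strip())
        if match is None:
            raise ValueError(f"malformed rule: {part!r}")
        rules.append((match.group("cond"), match.group("proc")))
    return unit.strip(), rules


print(parse_script("block, (D <= 3) non-read | (D > 10) delete"))
```

A real analysis unit would go on to convert each condition string into an evaluable form for the size determination unit; that step is left out here.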

【０１３９】ステップ６２〜６６では、具体例５のステ
ップ５２〜５５と同様に現在行の関連情報を設定し、判
定データ格納メモリ２４に格納されているサイズ判定デ
ータを用いて条件計算を行う。そして、ステップ６７〜
７０では、具体例５と同じような後処理を行う。In steps 62 to 66, as in steps 52 to 55 of Specific Example 5, the related information of the current line is set and the condition is calculated using the size determination data stored in the determination data storage memory 24. In steps 67 to 70, the same kind of post-processing as in Specific Example 5 is performed.

【０１４０】このような処理を全てのブロックについて
行い、全てのブロックについてのサイズ判定が終了した
とき（ステップ６４）、ステップ２に戻り、全ての領域
情報について認識処理（ステップ３）、サイズ判定処理
（ステップ４）が行われたとき（ステップ２）、すべて
の処理を終了させる。This processing is performed for every block. When size determination has finished for all blocks (step 64), the process returns to step 2, and when the recognition processing (step 3) and the size determination processing (step 4) have been performed for all area information (step 2), all processing ends.

【０１４１】〈具体例６の効果〉以上、説明したように
具体例６によれば、サイズ判定データをスクリプトで記
述することにより、具体例１〜５と同様の効果を得るこ
とができるとともに、条件式を容易に定義できる。この
ため、サイズ判定データの誤り等による変更に容易に対
応することができる。<Effects of Specific Example 6> As described above, according to Specific Example 6, describing the size determination data in a script provides the same effects as Specific Examples 1 to 5 and also makes conditional expressions easy to define. Changes needed because of errors or the like in the size determination data can therefore be handled easily.

【図面の簡単な説明】[Brief description of the drawings]

【図１】具体例１の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a specific example 1.

【図２】従来の説明図である。FIG. 2 is an explanatory diagram of the prior art.

【図３】具体例１の文字座標の説明図である。FIG. 3 is an explanatory diagram of character coordinates in a specific example 1.

【図４】具体例１のサイズ判定データの一例を示す説明
図である。FIG. 4 is an explanatory diagram illustrating an example of size determination data of a specific example 1.

【図５】具体例１の読み取り領域の説明図である。FIG. 5 is an explanatory diagram of a reading area according to a specific example 1.

【図６】具体例１の動作を示すフローチャートである。FIG. 6 is a flowchart illustrating an operation of a specific example 1.

【図７】具体例１のサイズ判定処理を示すフローチャー
トである。FIG. 7 is a flowchart illustrating a size determination process of a specific example 1.

【図８】具体例２の参照座標格納メモリの説明図であ
る。FIG. 8 is an explanatory diagram of a reference coordinate storage memory of a specific example 2.

【図９】具体例２のサイズ判定処理を示すフローチャー
トである。FIG. 9 is a flowchart illustrating a size determination process of a specific example 2.

【図１０】具体例２の説明図である。FIG. 10 is an explanatory diagram of a specific example 2.

【図１１】具体例３の構成を示すブロック図である。FIG. 11 is a block diagram illustrating a configuration of a specific example 3.

【図１２】具体例３の関連情報の説明図である。FIG. 12 is an explanatory diagram of related information in a specific example 3.

【図１３】具体例３のサイズ判定処理を示すフローチャ
ートである。FIG. 13 is a flowchart illustrating a size determination process according to a third example.

【図１４】具体例４のサイズ判定処理を示すフローチャ
ートである。FIG. 14 is a flowchart illustrating a size determination process of a specific example 4.

【図１５】具体例４の行座標の作成方法を示す説明図で
ある。FIG. 15 is an explanatory diagram illustrating a method of creating row coordinates in a specific example 4.

【図１６】具体例５のサイズ判定データの一例を示す説
明図である。FIG. 16 is an explanatory diagram showing an example of size determination data of a specific example 5.

【図１７】具体例５のサイズ判定処理を示すフローチャ
ートである。FIG. 17 is a flowchart illustrating a size determination process of a specific example 5.

【図１８】具体例５のブロックの説明図である。FIG. 18 is an explanatory diagram of a block of a specific example 5.

【図１９】具体例６の構成を示す説明図である。FIG. 19 is an explanatory diagram showing a configuration of a specific example 6.

【図２０】具体例６のサイズ判定処理を示すフローチャ
ートである。FIG. 20 is a flowchart illustrating a size determination process of a specific example 6.

【符号の説明】１画像入力部４制御部５認識部６サイズ判定部２２読み取り領域情報格納メモリ２３認識結果格納メモリ２４判定データ格納メモリ２５判定結果格納メモリ２６参照座標格納メモリ[Description of Reference Numerals]
1 Image input unit
4 Control unit
5 Recognition unit
6 Size determination unit
22 Reading area information storage memory
23 Recognition result storage memory
24 Determination data storage memory
25 Determination result storage memory
26 Reference coordinate storage memory

Claims

【特許請求の範囲】[Claims]

【請求項１】所定の用紙に印字された文字を画像デー
タとして取得する画像入力手段と、該画像入力手段により取得された画像データから各文字
の位置及び大きさを文字座標で特定し、各文字を認識し
て文字コードに変換する認識手段と、該認識手段により認識された各文字の位置及び大きさに
対し、文字座標を用いて所定の誤読判定条件を設定し、
当該各文字の位置及び大きさが誤読判定条件に該当する
ときは、誤って文字を認識した可能性があると判定し、
当該文字については、認識結果に対する後処理を行う誤
読判定処理手段と、を備えたことを特徴とする文字読取
装置。1. A character reading device comprising: image input means for acquiring characters printed on a predetermined sheet as image data; recognition means for specifying the position and size of each character by character coordinates from the image data acquired by the image input means, recognizing each character, and converting it into a character code; and misreading determination processing means for setting predetermined misreading determination conditions, using the character coordinates, on the position and size of each character recognized by the recognition means, determining that a character may have been erroneously recognized when its position and size satisfy a misreading determination condition, and performing post-processing on the recognition result for that character.

【請求項２】画像データの中で同じ誤読可能性判定条
件が適用される所定の読み取り領域を設定し、前記誤読
可能性判定手段は、読み取った文字が誤読判定条件に該
当するか否かを読み取り領域毎に判定するように構成さ
れたことを特徴とする請求項１に記載の文字読取装置。2. The character reading device according to claim 1, wherein a predetermined reading area to which the same misreading possibility determination condition is applied is set in the image data, and the misreading possibility determination means is configured to determine, for each reading area, whether a read character satisfies the misreading determination condition.

【請求項３】前記誤読判定処理手段は、各文字の文字
座標に基づいて認識対象の文字の幅、高さ及び位置を算
出し、算出された認識対象の文字の幅、高さ及び位置に
誤読判定条件を設定し、これらの文字の幅、高さ及び位
置のうちいずれか１つが誤読判定条件に該当したときに
誤って文字を認識した可能性があると判定するように構
成されたことを特徴とする請求項１又は請求項２に記載
の文字読取装置。3. The character reading device according to claim 1 or claim 2, wherein the misreading determination processing means is configured to calculate the width, height, and position of a character to be recognized based on the character coordinates of each character, set misreading determination conditions on the calculated width, height, and position, and determine that the character may have been erroneously recognized when any one of the width, height, and position satisfies a misreading determination condition.

【請求項４】前記誤読判定処理手段は、各文字の文字
座標に基づいて認識対象の文字の幅、高さ及び位置を算
出し、算出された認識対象の文字の幅、高さ及び位置に
誤読判定条件を設定し、これらの文字の幅、高さ及び位
置のうち少なくとも１つが誤読判定条件に該当したとき
に誤って文字を認識した可能性があると判定するように
構成されたことを特徴とする請求項１又は請求項２に記
載の文字読取装置。4. The misreading determination processing means calculates the width, height, and position of the character to be recognized based on the character coordinates of each character, and calculates the width, height, and position of the calculated character to be recognized. A misreading determination condition is set, and when at least one of the width, height, and position of the character corresponds to the misreading determination condition, it is configured to determine that there is a possibility that the character is erroneously recognized. The character reading device according to claim 1 or 2, wherein

【請求項５】前記誤読判定処理手段は、各文字の文字
座標に基づいて認識対象の文字の前後関係を算出し、算
出された認識対象の文字の前後関係に誤読判定条件を設
定し、当該前後関係が誤読判定条件に該当したときに誤
って文字を認識した可能性があると判定するように構成
されたことを特徴とする請求項１又は請求項２に記載の
文字読取装置。5. The misreading determination processing means calculates the context of the character to be recognized based on the character coordinates of each character, and sets a misreading determination condition in the calculated context of the character to be recognized. The character reading device according to claim 1, wherein the character reading device is configured to determine that there is a possibility that a character has been erroneously recognized when the context corresponds to an erroneous reading determination condition.

【請求項６】前記誤読判定処理手段は、各文字の文字
座標に基づいてその行の全ての文字を含む行を作成し、
行の位置関係及びその行に含まれている文字の位置、文
字数に誤読判定条件を設定し、行の位置関係及びその行
に含まれている文字の位置、文字数のうちのいずれか１
つが誤読判定条件に該当したときに誤って文字を認識し
た可能性があると判定するように構成されたことを特徴
とする請求項１又は請求項２に記載の文字読取装置。6. The character reading device according to claim 1 or claim 2, wherein the misreading determination processing means is configured to create, based on the character coordinates of each character, a line containing all the characters of that line, set misreading determination conditions on the positional relationship of lines and on the positions and number of characters contained in the line, and determine that a character may have been erroneously recognized when any one of the positional relationship of the line and the positions and number of characters contained in the line satisfies a misreading determination condition.

【請求項７】前記誤読判定処理手段は、各文字の文字
座標に基づいてその行の全ての文字を含む行を作成し、
行の位置関係及びその行に含まれている文字の位置、文
字数に誤読判定条件を設定し、行の位置関係及びその行
に含まれている文字の位置、文字数のうちの少なくとも
１つが誤読判定条件に該当したときに誤って文字を認識
した可能性があると判定するように構成されたことを特
徴とする請求項１又は請求項２に記載の文字読取装置。7. The character reading device according to claim 1 or claim 2, wherein the misreading determination processing means is configured to create, based on the character coordinates of each character, a line containing all the characters of that line, set misreading determination conditions on the positional relationship of lines and on the positions and number of characters contained in the line, and determine that a character may have been erroneously recognized when at least one of the positional relationship of the line and the positions and number of characters contained in the line satisfies a misreading determination condition.

【請求項８】前記誤読判定処理手段は、各文字の文字
座標に基づいて同一行で同じ条件を有する文字が連続し
ているとき、当該連続した複数の文字をブロックにまと
め、ブロックの位置関係及びそのブロックに含まれてい
る文字の位置、文字数に誤読判定条件を設定し、認識対
象のブロックの位置関係及びそのブロックに含まれてい
る文字の位置、文字数のうちいずれか１つが誤読判定条
件に該当したときに誤って文字を認識した可能性がある
と判定するように構成されたことを特徴とする請求項１
又は請求項２に記載の文字読取装置。8. The character reading device according to claim 1 or claim 2, wherein the misreading determination processing means is configured to: when characters satisfying the same condition are consecutive on the same line based on the character coordinates of each character, group those consecutive characters into a block; set misreading determination conditions on the positional relationship of blocks and on the positions and number of characters contained in the block; and determine that a character may have been erroneously recognized when any one of the positional relationship of the block to be recognized and the positions and number of characters contained in that block satisfies a misreading determination condition.

【請求項９】前記誤読判定処理手段は、各文字の文字
座標に基づいて同一行で同じ条件を有する文字が連続し
ているとき、当該連続した複数の文字をブロックにまと
め、ブロックの位置関係及びそのブロックに含まれてい
る文字の位置、文字数に誤読判定条件を設定し、認識対
象のブロックの位置関係及びそのブロックに含まれてい
る文字の位置、文字数のうち少なくとも１つが誤読判定
条件に該当したときに誤って文字を認識した可能性があ
ると判定するように構成されたことを特徴とする請求項
１又は請求項２に記載の文字読取装置。9. The character reading device according to claim 1 or claim 2, wherein the misreading determination processing means is configured to: when characters satisfying the same condition are consecutive on the same line based on the character coordinates of each character, group those consecutive characters into a block; set misreading determination conditions on the positional relationship of blocks and on the positions and number of characters contained in the block; and determine that a character may have been erroneously recognized when at least one of the positional relationship of the block to be recognized and the positions and number of characters contained in that block satisfies a misreading determination condition.

【請求項１０】前記誤読判定処理手段は、誤読判定条
件をスクリプトで記述したことを特徴とする請求項１〜
９のいずれか１つに記載の文字読取装置。10. The character reading device according to any one of claims 1 to 9, wherein the misreading determination conditions used by the misreading determination processing means are described in a script.

【請求項１１】前記誤読判定処理手段は、誤って文字
を読み取った可能性に応じて、当該文字を別の文字に置
換する処理、未処理、削除処理のうち、いずれか１つを
選択処理するように構成されたことを特徴とする請求項
１〜１０のいずれか１つに記載の文字読取装置。11. The character reading device according to any one of claims 1 to 10, wherein the misreading determination processing means is configured to select and perform, according to the possibility that a character has been erroneously read, one of: replacing the character with another character, no processing, and deletion.