JPH0394393A - Character recognizing device

Info

Publication number: JPH0394393A
Application number: JP1230415A
Authority: JP
Inventors: Mitsumasa Sugiyama; 杉山　光正
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1989-09-07
Filing date: 1989-09-07
Publication date: 1991-04-19

Abstract

PURPOSE: To shorten the time required for character recognition processing as a whole by rapidly executing the segmentation of the characters to be recognized. CONSTITUTION: The character recognizing device consists of an image database 1 storing original images, an image decompression circuit 2, a low-resolution image storing part 3, a high-resolution image storing part 4, a character segmenting processing part 5, and a character recognizing processing part 6. Since the character segmenting position is detected from the low-resolution data, the processing time required for that detection can be shortened. The processing time required for character recognition as a whole can therefore also be shortened.

Description

[Detailed Description of the Invention] [Field of Industrial Application] The present invention relates to a character recognition device, and more particularly to a character recognition device that cuts out the image data of one character and recognizes the character from the cut-out image data.

[Prior Art] Devices that optically read a document image and recognize the characters in it are already known.

A document image to be recognized normally contains many characters, so character recognition must begin by cutting out the individual characters, that is, by detecting the position of the region occupied by each character.

Character recognition is then performed on the cut-out image data of one character.

To improve the recognition rate, the more dots that make up a cut-out character, the better. In other words, when a document image is read with a scanner or similar device and then recognized, a higher scanner resolution is preferable.

[Problems to be Solved by the Invention] On the other hand, the higher the resolution, the longer the character cutout processing and the character recognition processing take.

The present invention has been made in view of this problem. Its object is to provide a character recognition device that speeds up at least the character cutout processing and thereby shortens the overall processing time of character recognition.

[Means for Solving the Problem] and [Operation] To solve this problem, the character recognition device of the present invention has the following configuration. Namely, a character recognition device that cuts out the image data of one character and recognizes the character from the cut-out image data comprises: first storage means for storing low-resolution data of the image data of a group of characters to be recognized; second storage means for storing high-resolution data of the image data of the group of characters to be recognized; and character cutout means for cutting out the image data of the corresponding character stored in the second storage means, based on the position of the image data of each individual character stored in the first storage means. Character recognition processing is performed on the image data of the character cut out by the character cutout means.

[Embodiments] An embodiment of the present invention is described in detail below with reference to the accompanying drawings.

FIG. 1 is a block diagram of the character recognition device of this embodiment.

In the figure, 1 is an image database that stores document images (horizontally written document images in this embodiment), 2 is an image decompression circuit, 3 is a low-resolution image storage section, 4 is a high-resolution image storage section, 5 is a character cutout processing section, and 6 is a character recognition processing section. The low-resolution image storage section 3 and the high-resolution image storage section 4 both consist of RAM, and each has enough capacity to store one page of a document image at its respective resolution.

The image database 1 stores hierarchically compressed images, which the image decompression circuit 2 can decompress in stages from a low-resolution image up to a high-resolution image. In this embodiment, image data can be obtained at resolutions of 50 dpi, 100 dpi, 200 dpi, and 400 dpi. The image decompression circuit 2 reads image data from the image database 1, decompresses it, stores the 50 dpi image data in the low-resolution image storage section 3, and stores the 400 dpi image data in the high-resolution image storage section 4.

Once the image data has been expanded into the low-resolution image storage section 3 and the high-resolution image storage section 4, the character cutout processing section 5 detects the cutout position of each individual character.

Specifically, the document image expanded in the low-resolution image storage section 3 is scanned in the horizontal direction and the black dots on each scan line are counted. Taking the distribution of these black-dot counts along the vertical (Y) direction gives the position (vertical coordinates) of each line of text. The coordinates detected here are those of the top edge and the bottom edge of each line in which characters are arranged horizontally.
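
The line detection described above amounts to a horizontal projection profile. The following is a minimal Python sketch, assuming the low-resolution page is held as a list of rows of 0 (white) and 1 (black) values. The function names `find_runs` and `detect_text_lines`, and the rule that any nonzero count belongs to a text line, are illustrative assumptions and are not taken from the patent.

```python
def find_runs(profile):
    """Return inclusive (start, end) index pairs for each run of nonzero counts."""
    runs = []
    start = None
    for pos, count in enumerate(profile):
        if count > 0 and start is None:
            start = pos                      # a run of black dots begins
        elif count == 0 and start is not None:
            runs.append((start, pos - 1))    # the run ended on the previous index
            start = None
    if start is not None:                    # run reaches the end of the profile
        runs.append((start, len(profile) - 1))
    return runs


def detect_text_lines(image):
    """Return (top, bottom) row indices of each text line in a binary image.

    image: list of rows, each a list of 0 (white) / 1 (black) pixels.
    """
    # Count black dots on every horizontal scan line (distribution along Y).
    row_profile = [sum(row) for row in image]
    return find_runs(row_profile)
```

Each returned pair corresponds to the top-edge and bottom-edge coordinates of one text line in the low-resolution image.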

Next, the horizontal coordinate position of each character in the line of interest is detected, in the same way as the line detection above. That is, the black dots in the line of interest are counted in the vertical direction, and the distribution of these counts along the horizontal (X) direction is taken. The span from the position where black dots are first detected to the position where they disappear (this span is the character width) is taken as the horizontal coordinate position of one character.
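
Character detection within one text line can be sketched with a vertical projection profile over the rows of that line. This is a minimal sketch under the same assumed image representation (rows of 0/1 values). The function name `detect_characters_in_line`, the inclusive (left, right) convention, and the zero threshold are assumptions made for illustration.

```python
def detect_characters_in_line(image, top, bottom):
    """Return inclusive (left, right) column pairs for each character.

    image: list of rows of 0 (white) / 1 (black) pixels.
    top, bottom: inclusive row range of the text line of interest.
    """
    width = len(image[0])
    # Count black dots in each column of the line (distribution along X).
    column_profile = [
        sum(image[y][x] for y in range(top, bottom + 1)) for x in range(width)
    ]
    spans = []
    left = None
    for x, count in enumerate(column_profile):
        if count > 0 and left is None:
            left = x                       # black dots appear: character starts
        elif count == 0 and left is not None:
            spans.append((left, x - 1))    # black dots vanish: character ended
            left = None
    if left is not None:                   # character touches the right edge
        spans.append((left, width - 1))
    return spans
```

Combining a line's (top, bottom) pair with one of its (left, right) spans gives the upper-left and lower-right corners of a character in the low-resolution image.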

In this way, the coordinates of the upper-left point and the lower-right point of each character in the low-resolution image storage section 3 are obtained.

In the example of FIG. 3, the upper-left point of the character "の" is at the i-th pixel to the right and the j-th line down, so its coordinates are (i, j); the lower-right point is at the m-th pixel and the n-th line, so its coordinates are (m, n).

After detecting the coordinates of the upper-left and lower-right points of the character image of interest stored in the low-resolution image storage section 3, the character cutout processing section 5 calculates the coordinates of the upper-left and lower-right points of the corresponding character image in the high-resolution image storage section 4.

As explained above, in this embodiment the low-resolution image storage section 3 holds a 50 dpi image and the high-resolution image storage section 4 holds a 400 dpi image, eight times that resolution. The image in the high-resolution image storage section 4 is therefore eight times as large as the image in the low-resolution image storage section 3 both vertically and horizontally.

The upper-left point of the corresponding character image in the high-resolution image storage section 4 is therefore at (8i, 8j) and the lower-right point at (8m, 8n). However, to ensure that the region cut from the high-resolution image data reliably contains the cutout position found in the low-resolution image data, this embodiment uses (8i-k, 8j-k) for the upper-left point and (8m+k, 8n+k) for the lower-right point in the high-resolution image storage section 4, where k is a suitable value from 0 to 7.
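
The coordinate conversion with margin k can be written as a small function. This is a sketch only: the name `map_box_to_high_res`, the default k = 4, and the optional clamping to the image bounds are assumptions added for illustration, and the patent itself only gives the formulas (8i-k, 8j-k) and (8m+k, 8n+k).

```python
def map_box_to_high_res(box, scale=8, k=4, width=None, height=None):
    """Map a low-resolution character box to high-resolution coordinates.

    box: (i, j, m, n) = upper-left (i, j) and lower-right (m, n), low-res.
    scale: resolution ratio (8 for 50 dpi -> 400 dpi).
    k: margin in high-res pixels; the patent uses a value from 0 to 7.
    width, height: optional high-res image size used for clamping
                   (clamping is an assumption, not described in the patent).
    """
    i, j, m, n = box
    left, top = scale * i - k, scale * j - k
    right, bottom = scale * m + k, scale * n + k
    # Keep the rectangle inside the image when the margin crosses an edge.
    left, top = max(left, 0), max(top, 0)
    if width is not None:
        right = min(right, width - 1)
    if height is not None:
        bottom = min(bottom, height - 1)
    return left, top, right, bottom
```

For example, a low-resolution box with corners (2, 3) and (5, 6) maps to (12, 20) and (44, 52) when the scale is 8 and k is 4.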

The character cutout processing section 5 cuts out the image of the rectangular region defined by these coordinates in the high-resolution image storage section 4 and passes it to the character recognition processing section 6. The character recognition processing section 6 then performs recognition on the character image cut out of the high-resolution image storage section 4 and generates a character code as the recognition result.
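
Cutting the rectangle out of the high-resolution image is a plain two-dimensional slice. The sketch below assumes the high-resolution page is a list of pixel rows and that both corners are inclusive. The name `cut_out_rectangle` and the bounds check are hypothetical, and the recognition step that consumes the result is not shown.

```python
def cut_out_rectangle(image, left, top, right, bottom):
    """Return the sub-image bounded by inclusive corners (left, top), (right, bottom).

    image: list of rows of pixel values (the high-resolution page).
    """
    if left < 0 or top < 0:
        # Negative indices would silently wrap around in Python slicing.
        raise ValueError("rectangle must lie inside the image")
    return [row[left:right + 1] for row in image[top:bottom + 1]]
```

The returned rows are what would be handed to the character recognition processing section.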

FIG. 2 is a flowchart of the character cutout position detection, which forms part of the processing of the character cutout processing section 5.

First, in step S1, the upper-left point of one character image in the low-resolution image data (stored in the low-resolution image storage section 3) is taken as pixel i, line j, and the lower-right point as pixel m, line n. Because the low-resolution image storage section 3 holds image data at 50 dpi and the high-resolution image storage section 4 holds image data at 400 dpi, eight times that resolution, step S2 sets the upper-left point in the high-resolution image data to pixel (8i-k), line (8j-k), and the lower-right point to pixel (8m+k), line (8n+k).

Then, in step S3, the image inside the rectangle defined by the coordinates (8i-k, 8j-k) and (8m+k, 8n+k) is cut out of (read from) the high-resolution image storage section 4 and output to the character recognition processing section 6.

As described above, in this embodiment the character cutout positions are detected from low-resolution data, so the processing time required for that detection can be shortened.

As a result, the overall processing time required for character recognition can also be shortened.
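
The size of this saving can be estimated with a pixel count. The estimate assumes that the cost of the projection-based position detection grows in proportion to the number of pixels scanned, which the patent does not state explicitly. W and H (page width and height in inches), r (resolution in dpi), and N (pixel count) are symbols introduced here for illustration.

```latex
N(r) = (W r)(H r) = W H r^{2},
\qquad
\frac{N(400)}{N(50)} = \left(\frac{400}{50}\right)^{2} = 64
```

Under that assumption, detecting cutout positions at 50 dpi examines about 1/64 of the pixels that the same detection would examine at 400 dpi.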

The embodiment above stores 50 dpi image data in the low-resolution image storage section and 400 dpi image data in the high-resolution image storage section, but the resolutions need not be 50 dpi and 400 dpi. For example, 100 dpi image data may be stored in the low-resolution image storage section. In that case the resolutions differ by a factor of four, so the coordinates of the character cutout position become (4i-k, 4j-k) and (4m+k, 4n+k), where k is a suitable value from 0 to 3.
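
Both examples in the text (factor 8 with k from 0 to 7, factor 4 with k from 0 to 3) fit the pattern that k ranges from 0 to one less than the resolution ratio. The sketch below generalizes on that observation, which is an inference rather than a statement from the patent. The function name `scale_and_margins` and the requirement that the ratio be an integer are likewise assumptions.

```python
def scale_and_margins(low_dpi, high_dpi):
    """Return (scale, allowed_k) for a pair of resolutions.

    scale: integer ratio high_dpi / low_dpi used to multiply coordinates.
    allowed_k: candidate margin values 0 .. scale - 1 (inferred pattern).
    """
    if low_dpi <= 0 or high_dpi % low_dpi != 0:
        # Non-integer ratios are outside what this sketch handles.
        raise ValueError("high_dpi must be a positive integer multiple of low_dpi")
    scale = high_dpi // low_dpi
    return scale, list(range(scale))
```

With 100 dpi and 400 dpi this gives a scale of 4 and margins 0 to 3, matching the coordinates (4i-k, 4j-k) and (4m+k, 4n+k).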

Also, although image data at 50 dpi, 100 dpi, 200 dpi, and 400 dpi is obtained from the image database 1, other resolutions, for example 75 dpi, 150 dpi, and 300 dpi, may of course be used depending on the image data.

Furthermore, the embodiment takes the character images to be recognized from the image database 1, but if an image input device such as a scanner or a video camera is connected to the device, processing may instead be based on an image input from that device (or on an image first stored in an external storage device). To improve the recognition rate, however, the image input means must be a high-resolution one.

For example, if the resolution of the image input device is 400 dpi, the image it reads is expanded as is into the high-resolution image storage section 4. A 50 dpi image is then generated in the low-resolution image storage section 3 by producing one pixel of data for every 8 pixels both vertically and horizontally (by taking every eighth pixel, or by computing the logical OR or the average of all 8 pixels).
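
The three reduction options mentioned above (taking every eighth pixel, logical OR, average) can be sketched as follows for a binary image. Several details are assumptions made for illustration: the function name `downsample`, reading "all 8 pixels" as a factor-by-factor block, thresholding the average at one half, and dropping partial blocks at the right and bottom edges.

```python
def downsample(image, factor=8, mode="or"):
    """Reduce a binary image by `factor` in both directions.

    image: list of rows of 0 (white) / 1 (black) pixels.
    mode: "subsample" keeps the top-left pixel of each block,
          "or" yields 1 if any pixel in the block is black,
          "average" yields 1 if at least half the block is black.
    """
    if mode not in ("subsample", "or", "average"):
        raise ValueError("unknown mode: " + mode)
    height, width = len(image), len(image[0])
    result = []
    # Only complete blocks are processed; partial edge blocks are dropped.
    for top in range(0, height - factor + 1, factor):
        out_row = []
        for left in range(0, width - factor + 1, factor):
            if mode == "subsample":
                value = image[top][left]
            else:
                block = [
                    image[y][x]
                    for y in range(top, top + factor)
                    for x in range(left, left + factor)
                ]
                if mode == "or":
                    value = 1 if any(block) else 0
                else:  # "average", thresholded at one half
                    value = 1 if 2 * sum(block) >= len(block) else 0
            out_row.append(value)
        result.append(out_row)
    return result
```

The OR mode preserves thin strokes that plain subsampling can miss, which may matter because the low-resolution image is used only to locate characters.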

Also, in the embodiment the image data is taken from the image database 1, but if a device that receives image data over a telephone line or the like, such as a modem or a facsimile machine, is connected to the device, hierarchically compressed image data may be received, saved in a temporary storage circuit, and then read out of the temporary storage circuit.

Furthermore, the preceding embodiment assumes that the image database 1 stores hierarchically compressed image data.

However, if the image database 1 stores a mixture of hierarchically compressed image data and uncompressed image data, the hierarchically compressed image data may be processed as in the embodiment above, while the uncompressed image data is processed by the conventional method, which uses only high-resolution image data.

[Effects of the Invention] As described above, according to the present invention the cutout processing of the characters to be recognized is performed at high speed, so the time required for the entire character recognition processing can be shortened.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of the character recognition device of this embodiment, FIG. 2 is a flowchart showing the operation of the character cutout processing section, and FIG. 3 is a diagram showing the relationship between a character and its cutout position. In the figures: 1, image database; 2, image decompression circuit; 3, low-resolution image storage section; 4, high-resolution image storage section; 5, character cutout processing section; 6, character recognition processing section.

Claims

[Claims] A character recognition device that cuts out the image data of one character and recognizes the character from the cut-out image data, comprising: first storage means for storing low-resolution data of the image data of a group of characters to be recognized; second storage means for storing high-resolution data of the image data of the group of characters to be recognized; and character cutout means for cutting out the image data of the corresponding character stored in the second storage means, based on the position of the image data of each individual character stored in the first storage means, wherein character recognition processing is performed on the basis of the image data of the character cut out by the character cutout means.