JPH04268989A

JPH04268989A - Method and device for recognizing character

Info

Publication number: JPH04268989A
Application number: JP3053350A
Authority: JP
Inventors: Yukio Sato; 幸雄佐藤; Takemoto Aso; 麻生　健資
Original assignee: Nippon Steel Corp
Current assignee: Nippon Steel Corp
Priority date: 1991-02-25
Filing date: 1991-02-25
Publication date: 1992-09-24

Abstract

PURPOSE:To set a character image segmented at an appropriate segmenting position as a recognition target by setting the binary threshold value of an input image at the optimum value. CONSTITUTION:After the background of a character part is eliminated from the input image, the character part is extracted(procedure A). Thence, an extracted character part is binarized, and a binary threshold value is optimized based on projection distribution data to orthogonal two axes(procedure B). Thirdly, a character frame divided into plural areas is set based on the projection distribution data to the orthogonal two axes of an optimized binary image(procedure C). Fourthly, the input image is binarized at every divided area in the character frame, and respective independent binary threshold value is optimized (procedure D). Finally, the character frame is re-set based on the projection distribution data to the orthogonal two axes of the binary image optimized at the procedure D(procedure E).

Description

【発明の詳細な説明】[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は画像処理技術を用いて文
字（数字）を認識する文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device that recognizes characters (numbers) using image processing technology.

【０００２】0002

【従来の技術】撮像装置（ビデオカメラやイメージリー
ダ）によって得た文字画像データに基いて文字認識を行
う文字認識装置が従来より知られている。認識の一手法
として、文字の２値化画像と予め記憶した辞書データと
のパターンマッチングをとって文字を特定する方式が知
られている。2. Description of the Related Art Character recognition devices have been known that perform character recognition based on character image data obtained by an imaging device (video camera or image reader). As one method of recognition, a method is known in which a character is identified by performing pattern matching between a binary image of the character and pre-stored dictionary data.

【０００３】0003

【発明が解決しようとする課題】このような画像処理に
基く文字認識の精度は、画像の２値化のレベルの適正さ
に大きく依存する。一般には、２値化のしきいレベルは
固定であるので、文字の濃度が低い場合には、辞書デー
タより細目の文字或いは欠けがある文字になり易い。ま
た２値化のしきい値を下げると、文字は太くなるが、背
景の明るい部分が残ってしまう。逆に、文字の濃度が高
い場合には、辞書データより太めの２値化が行われる。太めの文字に対しては、凝縮、細線化のデータ処理を施
すことが可能であるが、文字に欠けが生じ易くなる。従
って文字の濃度が低いものから高いもの全てにわたって
適正な２値化を行うことは困難である。２値化が適正で
なく、辞書より細目又は太目であったり欠けが多いと、
マッチング精度が著しく低下する。また文字列から個々
の文字を切取る文字枠の位置精度が悪いとマッチング精
度が低下する。本発明は、上述の問題にかんがみ、適正
な２値化及び正確な文字切取枠の設定によりマッチング
精度を高めることを目的とする。The accuracy of character recognition based on such image processing largely depends on the appropriateness of the level of image binarization. Generally, the threshold level for binarization is fixed, so when the density of a character is low, the character is likely to be smaller than the dictionary data or have a missing character. Furthermore, if the threshold value for binarization is lowered, the characters become thicker, but the bright parts of the background remain. Conversely, if the density of the characters is high, the binarization is performed to be thicker than the dictionary data. Although thick characters can be subjected to data processing such as condensation and thinning, the characters are more likely to be missing. Therefore, it is difficult to perform appropriate binarization for all characters, from low to high density. If the binarization is not appropriate and the data is thinner or thicker than the dictionary, or there are many gaps,
Matching accuracy decreases significantly. Furthermore, if the positional accuracy of the character frame for cutting out individual characters from the character string is poor, matching accuracy will decrease. In view of the above-mentioned problems, the present invention aims to improve matching accuracy through appropriate binarization and accurate setting of character cutting frames.

【０００４】0004

【課題を解決するための手段】本発明の文字認識方法は
、図１に示すように、入力画像から文字部の背景を除去
して文字部を抽出する抽出手段Ａと、抽出した文字部を
２値化し、その直交２軸への投影分布データに基いて２
値化のしきい値を最適化する手順Ｂと、最適化された２
値化画像の直交２軸への投影分布データに基いて複数の
領域に分割された文字枠を設定する手段Ｃと、上記文字
枠内の各分割領域ごとに上記入力画像を２値化し、夫々
の独立した２値化しきい値を最適化する手順Ｄと、上記
手順Ｄで最適化された２値化画像の直交２軸への投影分
布データに基いて文字枠を再設定する手順Ｅと、上記手
順Ｅの文字枠内の２値化画像と辞書データとを照合して
文字を特定する手順Ｆとから成る。また本発明の文字認
識装置は、図２３に示すように、入力画像を２値化する
２値化手段１と、認識すべき文字の座標系を決定するた
めに文字の近傍に付された基準文字又は基準図形の２値
化画像と辞書データとを照合する照合手段２と、照合結
果に基き、上記基準文字又は基準図形の２値化画像から
基準座標を算出する座標算出手段３と、上記基準座標に
基き、認識すべき文字を切出すための文字枠を設定する
文字枠設定手段４と、入力画像から上記文字枠に基いて
切出された個々の文字を認識する認識手段５とを備える
。[Means for Solving the Problems] As shown in FIG. 1, the character recognition method of the present invention includes an extraction means A for removing the background of a character part from an input image and extracting the character part; Based on the binarized and projected distribution data on two orthogonal axes,
Procedure B for optimizing the valuation threshold and optimized 2
means C for setting a character frame divided into a plurality of regions based on projection distribution data of the digitized image on two orthogonal axes; and means C for binarizing the input image for each divided region within the character frame, respectively A step D of optimizing the independent binarization threshold value of , and a step E of resetting the character frame based on the projection distribution data on two orthogonal axes of the binarized image optimized in the above step D. This step consists of step F in which characters are identified by comparing the binary image in the character frame of step E with dictionary data. Further, as shown in FIG. 23, the character recognition device of the present invention includes a binarization means 1 for binarizing an input image, and a reference attached near the character to determine the coordinate system of the character to be recognized. a collation means 2 for collating a binary image of a character or reference figure with dictionary data; a coordinate calculation means 3 for calculating reference coordinates from the binary image of the reference character or reference figure based on the collation result; A character frame setting means 4 for setting a character frame for cutting out characters to be recognized based on reference coordinates, and a recognition means 5 for recognizing each character cut out from an input image based on the character frame. Be prepared.

【０００５】[0005]

【作用】まず画像全体について最適２値化を行い、次に
１文字内の分割領域ごとに最適２値化を行う。従って全
体的な濃度むら及び局所的な濃度むらの双方に対応した
最適２値化が行われる。またノイズや欠損の多い画像に
対しては、基準文字又は基準図形を用いて文字切取枠の
座標を定めることにより、高精度の文字切取りを行うこ
とができ、マッチング精度が大巾に向上する。[Operation] First, the entire image is optimally binarized, and then each divided area within one character is optimally binarized. Therefore, optimal binarization is performed that deals with both overall density unevenness and local density unevenness. Furthermore, for images with many noises or defects, by determining the coordinates of a character cutting frame using a reference character or a reference figure, highly accurate character cutting can be performed, and matching accuracy is greatly improved.

【０００６】[0006]

【実施例】図２は本発明の文字認識装置の一実施を示す
鋼材ナンバリングシステムのブロック図であり、図３は
図２のコード番号読取装置の詳細を示すブロック図であ
る。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 2 is a block diagram of a steel numbering system showing one implementation of the character recognition device of the present invention, and FIG. 3 is a block diagram showing details of the code number reading device of FIG. 2.

【０００７】図２において、板材或いはパイプなどの鋼
材１０上には、品種、等級、製造ロット、製造者コード
等を示す複数桁の数字又はアルファベットから成る製造
コード番号１２が、刻印又はペイントにより予め付され
ている。図２のシステムは、製品の生産管理のために製
造コード番号を読取って、管理用コンピュータに出力す
る。In FIG. 2, a manufacturing code number 12 consisting of multi-digit numbers or alphabets indicating the type, grade, manufacturing lot, manufacturer code, etc. is pre-engraved or painted on a steel material 10 such as a plate or pipe. It is attached. The system shown in FIG. 2 reads a manufacturing code number for product production management and outputs it to a management computer.

【０００８】鋼材１０はコンベアで矢印Ａ方向に搬送さ
れる。鋼材１０のコード番号１２の部分はＩＴＶカメラ
１４で撮像され、その映像出力がコード番号読取装置１
６に供給される。ＩＴＶカメラ１４の映像出力が適当な
解像度を持つように、コンベアの両側には照明装置１８
が設けられている。コード番号読取装置１６は、読取っ
たコード番号をプロセスコンピュータ２０を介して管理
コンピュータ２２に送出する。管理コンピュータ２２は
、生産計画表に基いて予定番号と読取ったコード番号と
を照合する。プロセスコンピュータ２０は、鋼材１０の
コード番号１２が生産計画表の番号と合致する場合、払
出指令をシーケンサ２４に出力する。シーケンサ２４は
コンベアを制御して、鋼材１０を次工程に送り出すと共
に、位置センサ２６のセンサ出力を受けて、次の鋼材１
０の位置決めを行う。The steel material 10 is conveyed in the direction of arrow A by a conveyor. The part of the steel material 10 with the code number 12 is imaged by the ITV camera 14, and the video output is sent to the code number reader 1.
6. Illumination devices 18 are installed on both sides of the conveyor so that the video output of the ITV camera 14 has a suitable resolution.
is provided. The code number reading device 16 sends the read code number to the management computer 22 via the process computer 20. The management computer 22 compares the scheduled number with the read code number based on the production schedule. If the code number 12 of the steel material 10 matches the number in the production schedule, the process computer 20 outputs a delivery command to the sequencer 24. The sequencer 24 controls the conveyor to send the steel material 10 to the next process, and also receives the sensor output from the position sensor 26 and sends the steel material 10 to the next process.
Performs 0 positioning.

【０００９】コード番号読取装置１６は、図３に示すよ
うに、データバス３０を介してＣＰＵ３２に接続された
画像処理プロセッサ３４、画像メモリ３６、ディスプレ
イ３８、プリンタ４０、フロッピードライブ４２、キー
ボード４４等から成るコンピュータで構成されている。コード番号１２を撮像するＩＴＶカメラ１４の出力は、
Ａ／Ｄコンバータ４６でディジタル化され、データバス
３０を介して画像メモリ３６に蓄積される。ＣＰＵ３２
の制御により、画像メモリ３６の画像データは画像処理
プロセッサ３４に転送され、予めプログラムされた認識
アルゴリズムに基いて文字認識が行われる。認識結果は
、上位ＣＰＵ（プロセスコンピュータ２０）に送られる
。As shown in FIG. 3, the code number reading device 16 includes an image processing processor 34, an image memory 36, a display 38, a printer 40, a floppy drive 42, a keyboard 44, etc., which are connected to the CPU 32 via a data bus 30. It consists of a computer consisting of The output of the ITV camera 14 that images code number 12 is:
The data is digitized by the A/D converter 46 and stored in the image memory 36 via the data bus 30. CPU32
Under the control of the image memory 36, the image data in the image memory 36 is transferred to the image processing processor 34, and character recognition is performed based on a preprogrammed recognition algorithm. The recognition result is sent to the upper CPU (process computer 20).

【００１０】図４は、コード番号読取装置１６において
行われる画像処理の概略フロー図であって、ＩＴＶカメ
ラ１４によって取込まれた画像データは、ステップＳ１
で背景平準化の処理を受ける。この処理は照明の反射に
よる画像の濃度むらを除去するために行われる。次にス
テップＳ２で背景を除去した文字の部分の切出しが行わ
れる。次にステップＳ３で各文字ごとに設定した複数（
例えば３つ）の分割セグメントごとに原画像に対する２
値化が行われる。２値化のしきい値は各セグメントごと
に最適化されている。２値化された各セグメントは合成
され、一文字全体でしきい値が補正された後、一文字ご
とに次のステップＳ４で辞書データとのパターンマッチ
ングの処理がなされる。マッチングがとれると、認識出
力が管理コンピュータ２２に送出される。FIG. 4 is a schematic flow diagram of image processing performed in the code number reading device 16, in which image data captured by the ITV camera 14 is processed in step S1.
undergoes background leveling processing. This processing is performed to remove density unevenness in the image due to reflection of illumination. Next, in step S2, the character portion from which the background has been removed is cut out. Next, in step S3, the plurality of characters (
2 for the original image for each divided segment (for example, 3).
Value conversion is performed. The binarization threshold is optimized for each segment. After the binarized segments are combined and the threshold value is corrected for the entire character, pattern matching with dictionary data is performed for each character in the next step S4. When a match is made, a recognition output is sent to the management computer 22.

【００１１】図５は図４の概略フロー図の詳細を示すフ
ロー図である。まずステップＳ１０で読取装置のＣＰＵ
３２内のカウンタ、ワークエリア等を初期化し、またレ
ジスタに読取用の諸定数を設定する。次にステップＳ１
１でＩＴＶカメラ１４から画像データを取込み、更にス
テップＳ１２で濃度補正の処理を行う。この濃度補正処
理は図１の背景除去の処理Ａに対応し、また図４の背景
平準化処理Ｓ１に相当する。FIG. 5 is a flow diagram showing details of the schematic flow diagram of FIG. First, in step S10, the CPU of the reading device
The counter, work area, etc. in 32 are initialized, and various constants for reading are set in the register. Next step S1
Image data is captured from the ITV camera 14 in step S1, and density correction processing is further performed in step S12. This density correction process corresponds to the background removal process A in FIG. 1, and also corresponds to the background leveling process S1 in FIG.

【００１２】濃度補正の処理Ｓ１２は、図６〜図８で示
すようなＭＩＮ−ＭＡＸ差分法及びスムージングにより
行われる。なお図６及び図７の横軸は画像のＸ軸方向の
画素の並びに対応し、縦軸は各画素の濃度に対応する。原画像は図６に示すように文字部ａ、ｂと背景ｃとから
成る。背景ｃは鋼材１０に対する照明の当たり具合や反
射により部分的に異なるレベルを示し、段差ｄを有して
いる。The density correction process S12 is performed by the MIN-MAX difference method and smoothing as shown in FIGS. 6 to 8. Note that the horizontal axis in FIGS. 6 and 7 corresponds to the arrangement of pixels in the X-axis direction of the image, and the vertical axis corresponds to the density of each pixel. As shown in FIG. 6, the original image consists of character parts a and b and a background c. The background c shows partially different levels depending on the degree of illumination and reflection on the steel material 10, and has a step d.

【００１３】この原画像に対し、まずスムージングフィ
ルタをかけ、次に最小値（ＭＩＮ）フィルタをかける。このＭＩＮフィルタ処理においては、図８に示すように
、処理対象画素ｆ２２を中心とした３×３の画素ｆｉｊ
中の最小濃度値を処理結果の画素Ｆ２２の値として画像
メモリに格納し、これを全画素について行う。このＭＩ
Ｎフィルタ処理を文字の基準太さに対応した画素数の１
／２回繰り返すことにより、文字部ａ、ｂを消去し、背
景ｃを取出すことができる。[0013] First, a smoothing filter is applied to this original image, and then a minimum value (MIN) filter is applied. In this MIN filter processing, as shown in FIG.
The minimum density value among them is stored in the image memory as the value of pixel F22 as a processing result, and this is performed for all pixels. This MI
N filter processing is performed using 1 of the number of pixels corresponding to the standard thickness of the character.
By repeating /2 times, character parts a and b can be erased and background c can be taken out.

【００１４】この背景ｃに対し、次に最大値（ＭＡＸ）
フィルタを同一回数だけかけ、ＭＩＮフィルタ処理で生
じた画像の歪みを補正する。このＭＡＸフィルタ処理で
は、図８示す処理と同様に、処理対象の画素を中心とし
た３×３の画素中の最大濃度値を処理結果の画素濃度と
する。ＭＡＸフィルタの処理により、図６のｅで示すよ
うな背景が取出される。For this background c, next the maximum value (MAX)
The filter is applied the same number of times to correct image distortion caused by MIN filter processing. In this MAX filter processing, similarly to the processing shown in FIG. 8, the maximum density value among 3×3 pixels centered on the pixel to be processed is set as the pixel density of the processing result. Through the processing of the MAX filter, a background as shown by e in FIG. 6 is extracted.

【００１５】次に原画像（ａ、ｂ、ｃ）から背景ｅを減
算すると、図７に示すような文字部ａ、ｂを含む画像を
濃度むらのある背景から分離して抽出することができる
。[0015] Next, by subtracting the background e from the original image (a, b, c), it is possible to separate and extract the image including the character parts a and b as shown in FIG. 7 from the uneven density background. .

【００１６】濃度補正処理により抽出された文字画像デ
ータに対し、次に図５のステップＳ１３〜Ｓ１７から成
る文字の切出し処理（図４のステップＳ２）が行われる
。この処理は、図１の２値化、しきい値最適化の手順Ｂ
及び投影分布により文字枠を設定する手順Ｃにおけるデ
ータ処理に相当する。The character image data extracted by the density correction process is then subjected to a character cutting process (step S2 in FIG. 4) consisting of steps S13 to S17 in FIG. This process is step B of binarization and threshold optimization in Figure 1.
This corresponds to the data processing in step C of setting a character frame based on the projection distribution.

【００１７】まず図５のステップＳ１３では、暫定的な
しきい値で文字画像の２値化が行われる。２値化のしき
い値は、図９に示すような周知のパーセンタイル法（Ｐ
タイル法）で決定することができる。このＰタイル法で
は、画像の濃度値（例えば０〜２５５レベル）の夫々に
対応した画素数のヒストグラムを作成し、例えば高濃度
側から矢印Ｎで示す積算方向に画素個数を積算し、積算
値Ｓが画像の全画素数のＰ％に達した位置の濃度ｎをし
きい値とする。First, in step S13 of FIG. 5, a character image is binarized using a provisional threshold value. The binarization threshold is determined by the well-known percentile method (P
(tile method). In this P-tile method, a histogram of the number of pixels corresponding to each density value of the image (for example, 0 to 255 levels) is created, and the number of pixels is integrated, for example, from the high density side in the integration direction indicated by the arrow N. The density n at the position where S reaches P% of the total number of pixels of the image is set as a threshold value.

【００１８】次にステップＳ１４で、画像両端のノイズ
を除去する処理を行う。画像の左右両端分では、図１０
に示すように照明装置１８（図２）の影響を受けて濃度
が不自然に上昇していることがある。このため点線Ｐ、
Ｑで示すＸ座標の両外側を文字認識の画像領域から削除
する。位置Ｐ、Ｑは予め設定した固定位置であってよい
。Next, in step S14, processing is performed to remove noise at both ends of the image. At both left and right ends of the image, Figure 10
As shown in FIG. 2, the density may rise unnaturally due to the influence of the lighting device 18 (FIG. 2). Therefore, the dotted line P,
Both sides of the X coordinate indicated by Q are deleted from the image area for character recognition. The positions P and Q may be fixed positions set in advance.

【００１９】次にステップＳ１５及びＳ１６で文字列の
切出し及び列ごとの各文字の切出し（抽出）を順次行う
。これらの切出し処理は、図１１に示すようにＸ軸方向
及びＹ軸方向に２値化画像を投影することにより、文字
の座標位置を決定することにより行う。まずステップＳ
１５では、Ｙ軸への投影分布をとり、画像の文字列を抽
出する。Ｙ軸への投影分布は、Ｙ軸座標の各画素位置に
おいてＸ軸と平行な全ての画素の値（１又は０）を加算
することにより得られる。この投影分布に対し、Ｙ軸に
沿って隣接画素ごとに差分をとり、差分が一定値（例え
ば５画素分）を越える所を文字列のエッジ（立上り及び
立下り）として、その座標位置を決定する。これにより
、図１１に示す第０列、第１列、第２列に位置する各文
字列を原画像から切出すことができる。Next, in steps S15 and S16, character strings are cut out and each character in each string is cut out (extracted) in sequence. These cutting processes are performed by projecting a binarized image in the X-axis direction and the Y-axis direction to determine the coordinate position of the character, as shown in FIG. First step S
In step 15, the projection distribution on the Y axis is taken and character strings of the image are extracted. The projection distribution onto the Y-axis is obtained by adding the values (1 or 0) of all pixels parallel to the X-axis at each pixel position on the Y-axis coordinate. For this projection distribution, take the difference for each adjacent pixel along the Y axis, and determine the coordinate position of the edge (rising and falling edge) of the character string where the difference exceeds a certain value (for example, 5 pixels) do. Thereby, each character string located in the 0th column, the 1st column, and the 2nd column shown in FIG. 11 can be cut out from the original image.

【００２０】同様にして、ステップＳ１６では、各列ご
との２値化画像に対し、Ｘ軸への投影分布を取り、分布
データの立上り、立下りの各座標位置を決定する。この
位置データを用いて０、１、２の各列を構成する各文字
Ｃ０〜Ｃ４を原画像から個別に抽出することができる。Similarly, in step S16, the projection distribution onto the X axis is obtained for the binarized image for each column, and the coordinate positions of the rising and falling edges of the distribution data are determined. Using this position data, each character C0 to C4 constituting each column of 0, 1, and 2 can be individually extracted from the original image.

【００２１】次にステップＳ１７で、切り出しの列位置
及び文字位置の妥当性のチェックを行う。このチェック
は、図１１に示すＹ軸上の列間距離Ｌ１　、Ｘ軸上の文
字間距離Ｌ２　及び個々の文字間隔を基準値と比較する
ことにより行われる。また各列の文字数もチェックする
。これらのチェックにより切出し位置が不適当であると
判断した場合には、図５に示すようにステップＳ１３に
おける２値化のしきい値を変更し、以後のステップＳ１
４〜Ｓ１７を再度行う（リトライ）。Next, in step S17, the validity of the column position and character position for extraction is checked. This check is performed by comparing the inter-column distance L1 on the Y-axis, the inter-character distance L2 on the X-axis, and the individual character spacing shown in FIG. 11 with reference values. Also check the number of characters in each column. If it is determined that the extraction position is inappropriate through these checks, the threshold value for binarization in step S13 is changed as shown in FIG.
4 to S17 are performed again (retry).

【００２２】リトライにより、切出し位置が妥当である
と判断された場合には、ステップＳ１８から次のステッ
プＳ１９に進む。ステップＳ１８はリトライ回数のチェ
ックを行っていて、設定回数を越えても適正な切出し位
置の決定ができなかった場合には、図１６に示す後述の
リトライオーバの処理に進む。[0022] If it is determined that the cutout position is appropriate after the retry, the process advances from step S18 to the next step S19. In step S18, the number of retries is checked, and if a proper cutting position cannot be determined even after the set number of times has been exceeded, the process proceeds to retry over processing shown in FIG. 16, which will be described later.

【００２３】なお、図５のステップＳ１７における妥当
性のチェックは、ステップＳ１５の文字列の切出し及び
ステップＳ１６の文字抽出の各処理ごとに行ってもよい
。この場合、文字列の切出しに対するチェックで不適当
の判断が生じたとき、しきい値を変更してステップＳ１
３の２値化処理からステップＳ１４、Ｓ１５を再試行す
る。また文字抽出の切出しに対するチェックで不適当の
判断が生じたとき、しきい値を変更してステップＳ１３
の２値化処理を行った後、ステップＳ１６の文字抽出か
ら再試行する。Note that the validity check in step S17 in FIG. 5 may be performed for each of the character string extraction in step S15 and the character extraction in step S16. In this case, when an inappropriate judgment occurs in the check for character string extraction, the threshold value is changed and step S1
Steps S14 and S15 are retried from the binarization process in step 3. In addition, if an inappropriate judgment occurs in the check for character extraction, the threshold value is changed and step S13 is performed.
After performing the binarization process, the character extraction is retried in step S16.

【００２４】図５のステップＳ１９においては、図１２
に示すように、各列の各文字ごとに領域（枠）を設定し
、領域を上、中、下の３つのセグメントに分割する。文字枠Ｗの座標はステップＳ１５、Ｓ１６の投影分布デ
ータに基いて決定する。各分割領域Ｐ１〜Ｐ３のＹ座標
は、文字枠Ｗの両端から一定距離の位置とする。次にス
テップＳ２０で、各分割領域ごとにＰタイル法にて概算
しきい値を決定し、文字枠Ｗの画像を２値化する。次に
ステップＳ２１で、各分割領域ごとに、２値化画像の面
積率（分割領域の全体に対する画素値１を持つ画素の割
合）が予め定められた所定の範囲に入るまで、各分割領
域のしきい値を補正する。In step S19 of FIG.
As shown in the figure, an area (frame) is set for each character in each column, and the area is divided into three segments: top, middle, and bottom. The coordinates of the character frame W are determined based on the projection distribution data in steps S15 and S16. The Y coordinate of each divided area P1 to P3 is a position at a constant distance from both ends of the character frame W. Next, in step S20, an approximate threshold value is determined for each divided region using the P-tile method, and the image of the character frame W is binarized. Next, in step S21, for each divided area, the area ratio of the binarized image (ratio of pixels with pixel value 1 to the entire divided area) falls within a predetermined range. Correct the threshold.

【００２５】次にステップＳ２２で、３つの分割領域か
ら１つの文字を合成し、文字枠Ｗ内で文字の占める割合
が基準値の範囲に入るまで、各分割領域ごとに設定され
ているしきい値を等量ずつ変更する。この処理により、
図１３に示すように、各分割領域のしきい値の相違によ
る不自然な領域境界が是正され、スムーズにされた合成
文字が得られる。これらのステップＳ１９〜Ｓ２２は、
図１の手順Ｄにおける処理に相当する。Next, in step S22, one character is synthesized from the three divided regions, and the threshold set for each divided region is Change values by equal amounts. With this process,
As shown in FIG. 13, the unnatural region boundaries caused by the difference in the threshold values of the divided regions are corrected, and a smooth composite character is obtained. These steps S19 to S22 are
This corresponds to the process in step D in FIG.

【００２６】次にステップＳ２３で、文字枠を最終補正
する。即ち図１２に示すように、初期文字枠Ｗ内の２値
化画像データのＹ軸及びＸ軸への投影分布をとり、各投
影分布波形Ｓｙ　、Ｓｘ　の両端エッジ（立上り、立下
り）を算出し、最終文字枠Ｗｘ　、Ｗｙ　の座標を決定
する。そして最終文字枠においてステップＳ２２で設定
されたしきい値にて原画像に対する２値化を行ない、辞
書と照合するための文字データを得る。Next, in step S23, the character frame is finally corrected. That is, as shown in FIG. 12, the projection distribution of the binarized image data in the initial character frame W onto the Y axis and the Then, the coordinates of the final character frames Wx and Wy are determined. Then, in the final character frame, the original image is binarized using the threshold value set in step S22 to obtain character data to be checked against the dictionary.

【００２７】次に、ステップＳ２４で辞書データとの照
合及び判定が行われる。辞書データはビットパターンデ
ータであり、図１４に一例（数字０）を示すように、基
準位置のデータＤ１　、横及び縦方向に例えば２画素ず
つ変位させた左上、右上、左下、右下の各位置のデータ
Ｄ２　〜Ｄ５　から成る。原画像の２値化によって得ら
れた文字データは、これらの基準位置のデータ及び変位
させたデータの全てとビットマッチングにより比較され
る。即ち、辞書データ列と文字データ列との排他的論理
和をとって、差異のあるビットを検出し、それを計数す
ることにより、相関値（類似度）を得る。この照合処理
を辞書中の全てのデータと抽出された一文字分のデータ
とについて行い、類似度が最も高い照合結果により文字
を特定する。Next, in step S24, comparison with dictionary data and determination are performed. The dictionary data is bit pattern data, and as shown in an example (number 0) in FIG. It consists of position data D2 to D5. The character data obtained by binarizing the original image is compared with all of the data at the reference position and the displaced data by bit matching. That is, a correlation value (similarity) is obtained by calculating the exclusive OR of the dictionary data string and the character data string, detecting bits with a difference, and counting them. This matching process is performed on all the data in the dictionary and the data for one extracted character, and the character is specified based on the matching result with the highest degree of similarity.

【００２８】なお、図１４の基準位置の辞書データのみ
を用意し、辞書メモリの読出し位置のシフト操作により
、辞書データを各斜め方向にずらしながら抽出文字デー
タとの照合処理を順次行うようにしてもよい。或いは、
抽出した文字データの位置を左上、右上、左下、右下の
ようにシフトして、一つの辞書データと比較してもよい
。図１５は上述の辞書データのずらし処理を模式的に示
したものである。Note that only the dictionary data at the reference position shown in FIG. 14 is prepared, and the matching process with the extracted character data is sequentially performed while shifting the dictionary data in each diagonal direction by shifting the reading position of the dictionary memory. Good too. Or,
The extracted character data may be shifted to the upper left, upper right, lower left, or lower right and compared with one dictionary data. FIG. 15 schematically shows the above-mentioned dictionary data shifting process.

【００２９】以上の処理を全列の文字について順次行っ
て各文字を特定する。ステップＳ２４で得られた照合結
果は、次のステップＳ２５でホストコンピュータ（プロ
セスコンピュータ２０又は管理コンピュータ２２）に送
信されると共に、読取装置のディスプレイ３８上に表示
される。The above processing is sequentially performed for all the characters in the string to identify each character. The verification result obtained in step S24 is transmitted to the host computer (process computer 20 or management computer 22) in the next step S25, and is also displayed on the display 38 of the reading device.

【００３０】次に、図１６はリトライオーバーの処理（
基準文字処理）を示す。この処理は、図５のステップＳ
１８において、文字切出し位置が不適当であったときの
しきい値変更のリトライ回数が、規定回数を越えた場合
に行われる。この基準文字処理では、図１１に示す第２
列の文字行Ｃ１及びＣ３の位置に印字された基準文字５
０を使用して文字の切出し位置を算出する。この例では
、基準文字５０は、＋であるが、直交座標上の一点を容
易に確定することができる文字又は図形のパターンであ
ればよい。Next, FIG. 16 shows retry over processing (
standard character processing). This process is performed in step S in FIG.
Step 18 is performed when the number of retries for changing the threshold value when the character cutting position is inappropriate exceeds a predetermined number of times. In this standard character processing, the second
Standard character 5 printed in character rows C1 and C3 of the column
0 is used to calculate the character cutout position. In this example, the reference character 50 is +, but it may be any pattern of characters or figures that allows one point on the orthogonal coordinates to be easily determined.

【００３１】図１６のステップＳ２６では、初期化処理
を行い、リトライ回数をチェックするカウンタやＣＰＵ
のワークエリアをクリアすると共に、文字列の中心位置
（Ｘ座標）を算出し、基準文字処理の準備を行う。次に
ステップＳ２７で、図１７に示すように、注目している
基準文字５０に対する文字枠Ｗを定め、Ｙ軸座標を３つ
の領域Ｐ１〜Ｐ３に分割する。この処理は図１２に示し
たコード番号に対する文字枠の分割処理（図５のステッ
プＳ１９）と同じである。なお、文字枠Ｗの座標は、図
１１に示したＸ軸及びＹ軸への投影分布から大ざっぱに
決定することができる。In step S26 of FIG. 16, initialization processing is performed to check the number of retries and the CPU.
At the same time as clearing the work area, the center position (X coordinate) of the character string is calculated and preparations are made for standard character processing. Next, in step S27, as shown in FIG. 17, a character frame W for the reference character 50 of interest is determined, and the Y-axis coordinate is divided into three regions P1 to P3. This process is the same as the character frame division process for code numbers shown in FIG. 12 (step S19 in FIG. 5). Note that the coordinates of the character frame W can be roughly determined from the projection distribution on the X and Y axes shown in FIG.

【００３２】次に、ステップＳ２８で、各分割領域Ｐ１
〜Ｐ３ごとにＰタイル法により概略しきい値を算出し、
各分割領域において切取られた入力画像を２値化した後
、各分割領域ごとにしきい値を最適化する。この最適化
処理は、図５のステップＳ２１での処理と同じであって
よく、図１８に示すように、各領域Ｐ１、Ｐ２、Ｐ３ご
とに、文字部の面積率が規定の範囲に入るまで夫々のし
きい値を変更する。Next, in step S28, each divided area P1
~ Calculate the approximate threshold value using the P tile method for each P3,
After the input image cut out in each divided area is binarized, the threshold value is optimized for each divided area. This optimization process may be the same as the process in step S21 of FIG. 5, and as shown in FIG. Change each threshold.

【００３３】次にステップＳ２９で最適化されたしきい
値により枠Ｗ内の入力画像を分割領域ごとに２値化する
。更に、ステップＳ３０で、図１８に示すように、各分
割領域の２値化画像を１文字に合成した後、合成した文
字部全体が基準値になるように各領域のしきい値を補正
する。次にステップＳ３１で分割領域の境界のスムージ
ングを行う。このステップＳ３０、Ｓ３１の処理により
、図１９に示すように、基準文字５０に対して領域間の
つながりがスムースにされた合成文字が得られる。Next, in step S29, the input image within the frame W is binarized for each divided area using the optimized threshold value. Furthermore, in step S30, as shown in FIG. 18, after the binarized images of each divided area are combined into one character, the threshold value of each area is corrected so that the entire combined character part has a reference value. . Next, in step S31, smoothing of the boundaries of the divided areas is performed. Through the processing of steps S30 and S31, as shown in FIG. 19, a composite character with smooth connections between regions is obtained with respect to the reference character 50.

【００３４】次にステップＳ３２で、文字枠を最終補正
する。即ち、図１７に示すように、初期文字枠Ｗ内の２
値化画像データのＹ軸及びＸ軸への投影分布をとり、各
投影分布波形ＳＹ　、ＳＸ　の両端エッジを認識し、最
終文字枠ＷＸ　、ＷＹ　の座標を決定する。この処理は
図１２に示した処理と同様である。そして最終文字枠に
おいて入力画像の２値化を行い、基準文字５０の２値化
データを得る。Next, in step S32, the character frame is finally corrected. That is, as shown in FIG.
The projection distribution of the digitized image data onto the Y-axis and the X-axis is taken, the edges at both ends of each projection distribution waveform SY , SX are recognized, and the coordinates of the final character frames WX , WY are determined. This process is similar to the process shown in FIG. Then, the input image is binarized in the final character frame to obtain binarized data of the reference character 50.

【００３５】次にステップＳ３３では、抽出された文字
データと基準文字用辞書データとを照合し、基準文字に
該当するか否かを判定する。この処理では、図２０に示
すように、基準位置の辞書データＤ１　、横及び縦方向
に例えば２画素ずつ変位させた左上、右上、左下、右下
のデータＤ２　〜Ｄ５　と抽出した文字データとのビッ
トマチングをとる。このマッチング処理により得た類似
度が規定値に達しなければ、次のステップで注目文字列
、行を変更し、新たな文字枠を設定して、ステップＳ２
７〜Ｓ３３を繰り返す。Next, in step S33, the extracted character data is compared with reference character dictionary data to determine whether or not it corresponds to the reference character. In this process, as shown in FIG. 20, dictionary data D1 at the reference position, data D2 to D5 at the upper left, upper right, lower left, and lower right, which are displaced by 2 pixels in the horizontal and vertical directions, and the extracted character data are combined. Perform bit matching. If the similarity obtained through this matching process does not reach the specified value, the next step is to change the character string and line of interest, set a new character frame, and step S2
7 to S33 are repeated.

【００３６】以上の処理結果に基きステップＳ３５で類
似度が最大の文字を基準文字と判定する。次にステップ
Ｓ３６で、類似度の高い基準文字に対し妥当性のチェッ
クを行う。このチェックは、図２１に示すＬ字及び逆Ｌ
字のテンプレートパターンＴ１、Ｔ２を基準文字データ
の文字枠に当てはめてマッチングをとることにより行う
。この２つのテンプレートパターンＴ１、Ｔ２は元の基
準文字の一部であり、しかも夫々は、基準文字の中心Ｏ
の座標を決定するのに必要な２点Ｊ、Ｋを含んでいる。従ってこれらのテンプレートパターンＴ１、Ｔ２の双方
と抽出文字データとのマッチングがとれたならば、抽出
した基準文字から算出する中心位置座標の精度が非常に
高いことになる。Based on the above processing results, in step S35, the character with the highest degree of similarity is determined to be the reference character. Next, in step S36, the validity of the reference characters having a high degree of similarity is checked. This check is performed using the L-shape and reverse L-shape shown in Figure 21.
This is performed by applying the character template patterns T1 and T2 to the character frame of the reference character data and performing matching. These two template patterns T1 and T2 are part of the original reference character, and each template pattern is located at the center O of the reference character.
It contains the two points J and K necessary to determine the coordinates of . Therefore, if both of these template patterns T1 and T2 are matched with the extracted character data, the accuracy of the center position coordinates calculated from the extracted reference character will be extremely high.

【００３７】ステップＳ３６の妥当性チェックが済むと
、次にステップＳ３７で基準文字の中心座標より各文字
枠の座標を幾何学的に算出する。基準文字の中心座標は
、図２１の点Ｊ、Ｋの座標から算出することができる。またコード番号１２を構成する各文字の切取り用文字枠
Ｗは、図２２に示すように基準文字５０の中心座標から
算出することができる。なお、基準文字５０は２つ付さ
れているので、これらの中心座標の間隔により、入力画
像の伸縮度を求めることができ、伸縮度に基いて各文字
枠を正確に算出することができる。After completing the validity check in step S36, the coordinates of each character frame are calculated geometrically from the center coordinates of the reference character in step S37. The center coordinates of the reference character can be calculated from the coordinates of points J and K in FIG. Furthermore, the character frame W for cutting out each character constituting the code number 12 can be calculated from the center coordinates of the reference character 50, as shown in FIG. Note that since two reference characters 50 are attached, the degree of expansion/contraction of the input image can be determined from the interval between these center coordinates, and each character frame can be accurately calculated based on the degree of expansion/contraction.

【００３８】文字切取り枠が決定されると、以下図５の
ステップＳ１９の処理から各文字に対する照合処理が行
われる。Once the character cutting frame is determined, a matching process for each character is performed from the process of step S19 in FIG. 5 below.

【００３９】図２３は図１６の基準文字処理に対応する
要素的特徴を示すブロック図である。入力画像は２値化
手段１で２値化され、基準文字５０の部分が照合手段２
において辞書データと照合される。これらの２値化手段
１及び照合手段２は、図１６のステップＳ２７〜Ｓ３６
における処理に対応する。照合により基準文字であると
認識されると、基準文字の２値化画像に基いて座標算出
手段３（図１６のステップＳ３７に対応）により基準座
標が算出される。この基準座標に基いて文字枠が文字枠
設定手段４において設定され、原入力画像から文字部が
切出される。この切出された画像に基いて認識手段５に
おいて辞書との照合により文字が特定される。FIG. 23 is a block diagram showing elemental features corresponding to the standard character processing shown in FIG. 16. The input image is binarized by the binarization means 1, and the portion of the reference character 50 is binarized by the comparison means 2.
It is compared with dictionary data at These binarization means 1 and collation means 2 perform steps S27 to S36 in FIG.
Corresponds to the processing in When the character is recognized as a reference character by comparison, the coordinate calculation means 3 (corresponding to step S37 in FIG. 16) calculates reference coordinates based on the binarized image of the reference character. Based on these reference coordinates, a character frame is set by the character frame setting means 4, and a character portion is cut out from the original input image. Based on this cut out image, the recognition means 5 identifies the character by comparing it with a dictionary.

【００４０】以上の実施例はコード番号１２の認識シス
テムについて述べられているが、本発明は数字だけでな
く、アルファベット、かな文字、漢字等の認識に適用可
能である。Although the above embodiment has been described with respect to a recognition system for code number 12, the present invention is applicable to recognition of not only numbers but also alphabets, kana characters, kanji characters, and the like.

【００４１】[0041]

【発明の効果】本発明の文字認識方法は、上述のように
、文字部全体に対する２値化と、個々の文字枠内の分割
領域ごとの局所的２値化とを順次行っているので、全体
的な濃度むら及び局所的な濃度むらの双方に対応させた
２値化により、パターンマッチングのための対象データ
を非常に高精度に抽出することができ、マッチング精度
が著しく向上する。また本発明の文字認識装置によれば
、文字列に付された基準文字又は基準図形を利用して文
字切取り枠を設定するように成されているので、入力画
像にノイズや欠けがあっても、それらの影響を避けて正
確に文字を切取ることが可能となり、マッチング精度が
上がり認識率が向上する。[Effects of the Invention] As described above, the character recognition method of the present invention sequentially performs binarization for the entire character portion and local binarization for each divided area within each character frame. Binarization that deals with both overall density unevenness and local density unevenness allows target data for pattern matching to be extracted with extremely high accuracy, and matching accuracy is significantly improved. Furthermore, according to the character recognition device of the present invention, a character cutting frame is set using a reference character or a reference figure attached to a character string, so even if there is noise or chipping in the input image, , it becomes possible to avoid these influences and accurately cut out characters, improving matching accuracy and recognition rate.

【図面の簡単な説明】[Brief explanation of the drawing]

【図１】本発明の文字認識装置の要素的特徴を示すブロ
ック図である。FIG. 1 is a block diagram showing the essential features of a character recognition device of the present invention.

【図２】本発明の文字認識装置が適用される鋼材ナンバ
リングシステムの要部ブロック図である。FIG. 2 is a block diagram of main parts of a steel numbering system to which the character recognition device of the present invention is applied.

【図３】図２のコード番号読取装置の構成例を示すブロ
ック図である。FIG. 3 is a block diagram showing a configuration example of the code number reading device shown in FIG. 2;

【図４】文字認識のための画像処理の概略フローチャー
トである。FIG. 4 is a schematic flowchart of image processing for character recognition.

【図５】文字認識のための画像処理の詳細フローチャー
トである。FIG. 5 is a detailed flowchart of image processing for character recognition.

【図６】ＭＩＮ−ＭＡＸ差分法による画像処理を示す画
像濃度のグラフである。FIG. 6 is a graph of image density showing image processing using the MIN-MAX difference method.

【図７】ＭＩＮ−ＭＡＸ差分法により抽出された文字画
像の濃度グラフである。FIG. 7 is a density graph of a character image extracted by the MIN-MAX difference method.

【図８】ＭＩＮフィルタ、ＭＡＸフィルタの画像処理を
説明する図である。FIG. 8 is a diagram illustrating image processing of a MIN filter and a MAX filter.

【図９】Ｐタイル法によるしきい値設定処理を示す画像
ヒストグラムである。FIG. 9 is an image histogram showing threshold setting processing using the P-tile method.

【図１０】２値化画像の両端処理を示す濃度グラフであ
る。FIG. 10 is a density graph showing both end processing of a binarized image.

【図１１】２値化画像のＸ−Ｙ投影分布図である。FIG. 11 is an X-Y projection distribution diagram of a binarized image.

【図１２】文字枠の分割及び最終文字枠の設定を示す図
である。FIG. 12 is a diagram showing division of a character frame and setting of a final character frame.

【図１３】合成文字に対するスムージング処理を示す図
である。FIG. 13 is a diagram showing smoothing processing for composite characters.

【図１４】辞書データの展開例を示す図である。FIG. 14 is a diagram showing an example of expansion of dictionary data.

【図１５】辞書データのシフトを模式的に示す図である
。FIG. 15 is a diagram schematically showing a shift of dictionary data.

【図１６】基準文字処理の手順を示すフローチャートで
ある。FIG. 16 is a flowchart showing the procedure of standard character processing.

【図１７】基準文字の文字枠の設定及び領域分割を説明
する図である。FIG. 17 is a diagram illustrating setting of a character frame of a reference character and area division.

【図１８】基準文字の分割領域ごとのしきい値補正処理
を説明する図である。FIG. 18 is a diagram illustrating threshold correction processing for each divided area of a reference character.

【図１９】基準文字の合成画像についてのしきい値補正
処理を説明する図である。FIG. 19 is a diagram illustrating threshold correction processing for a composite image of reference characters.

【図２０】基準文字の辞書データの展開例を示す図であ
る。FIG. 20 is a diagram showing an example of expansion of dictionary data of reference characters.

【図２１】テンプレートマッチングパターンを示す図で
ある。FIG. 21 is a diagram showing a template matching pattern.

【図２２】基準文字に基いて文字枠を設定する座標系の
図である。FIG. 22 is a diagram of a coordinate system for setting a character frame based on a reference character.

【図２３】文字認識装置の要素的特徴を示す要部ブロッ
ク図である。FIG. 23 is a block diagram of main parts showing elemental features of a character recognition device.

【符号の説明】[Explanation of symbols]

１　　２値化手段２　　照合手段３　　座標算出手段４　　文字枠設定手段５　　認識手段１０　　鋼材１２　　製造コード番号１４　　ＩＴＶカメラ１６　　コード番号読取装置 1 Binarization means 2. Verification means 3 Coordinate calculation means 4 Character frame setting means 5. Recognition means 10 Steel material 12 Manufacturing code number 14 ITV camera 16 Code number reader

Claims

【特許請求の範囲】[Claims]

【請求項１】　　入力画像から文字部の背景を除去して
文字部を抽出する手順Ａと、抽出した文字部を２値化し
、その直交２軸への投影分布データに基いて２値化のし
きい値を最適化する手順Ｂと、最適化された２値化画像
の直交２軸への投影分布データに基いて複数の領域に分
割された文字枠を設定する手順Ｃと、上記文字枠内の各
分割領域ごとに上記入力画像を２値化し、夫々の独立し
た２値化しきい値を最適化する手順Ｄと、上記手順Ｄで
最適化された２値化画像の直交２軸への投影分布データ
に基いて文字枠を再設定する手順Ｅと、上記手順Ｅの文
字枠内の２値化画像と辞書データとを照合して文字を特
定する手順Ｆとから成ることを特徴とする文字認識方法
。Claim 1: Procedure A for removing the background of a character part from an input image and extracting the character part; binarizing the extracted character part; and performing the binarization based on projection distribution data on two orthogonal axes. Step B of optimizing the threshold value, Step C of setting a character frame divided into a plurality of regions based on the projection distribution data of the optimized binarized image on two orthogonal axes, and the above character frame. A procedure D in which the input image is binarized for each divided region within the area and each independent binarization threshold is optimized; The method is characterized by comprising a step E in which a character frame is reset based on projection distribution data, and a step F in which a character is identified by comparing the binarized image within the character frame in step E with dictionary data. Character recognition method.

【請求項２】　　上記手順Ａが、入力画像に対し最小値
フィルタの処理を行い、その処理結果に対し最大値フィ
ルタの処理を行って背景を抽出し、入力画像から背景を
減算して文字部を抽出する各手順から成ることを特徴と
する請求項１に記載の文字認識方法。[Claim 2] The above step A performs minimum value filter processing on the input image, performs maximum value filter processing on the processing result to extract the background, and subtracts the background from the input image to extract the character part. 2. The character recognition method according to claim 1, comprising steps of extracting a character recognition method.

【請求項３】　　上記手順Ｂにおける初期しきい値が抽
出文字部の画像に対するＰタイル法により決定されてい
ることを特徴とする請求項１に記載の文字認識方法。3. The character recognition method according to claim 1, wherein the initial threshold value in step B is determined by a P-tile method for an image of the extracted character portion.

【請求項４】　　上記手順Ｂが、直交２軸への投影分布
データに基く座標情報と予め記憶された基準値とを比較
して、比較結果に基いてしきい値を最適化する各手順か
ら成ることを特徴とする請求項１に記載の文字認識方法
。4. The step B comprises steps of comparing coordinate information based on projection distribution data on two orthogonal axes with a pre-stored reference value and optimizing a threshold value based on the comparison result. The character recognition method according to claim 1, characterized in that:

【請求項５】　　上記手順Ｄが、各分割領域ごとにＰタ
イル法により初期しきい値を決定する手順を含むことを
特徴とする請求項１に記載の文字認識方法。5. The character recognition method according to claim 1, wherein step D includes a step of determining an initial threshold value for each divided region by a P-tile method.

【請求項６】　　上記手順Ｄが、個々の分割領域の２値
画像の文字部の面積比率が所定範囲の値となるようにし
きい値を変更する手順を含むことを特徴とする請求項１
に記載の文字認識方法。6. The step D includes a step of changing the threshold so that the area ratio of the character portion of the binary image of each divided region falls within a predetermined range.
Character recognition method described in.

【請求項７】　　上記手順Ｄが、個々の分割領域ごとに
２値化された画像の合成画像について、その文字部の面
積比率が所定範囲の値となるように上記各分割領域の夫
々のしきい値を変更することを特徴とする請求項１又は
６に記載の文字認識方法。7. In step D, for a composite image of images binarized for each divided region, each of the divided regions is divided so that the area ratio of the character portion thereof falls within a predetermined range. 7. The character recognition method according to claim 1, further comprising changing a threshold value.

【請求項８】　　上記手順Ｃが、認識すべき文字の座標
系を決定するために付加された基準文字又は基準図形に
対応する入力画像に基いて、座標系を決定し、この座標
系に基き上記文字枠を設定することを特徴とする請求項
１に記載の文字認識方法。8. The step C determines a coordinate system based on the input image corresponding to the reference character or reference figure added to determine the coordinate system of the character to be recognized, and determines the coordinate system based on this coordinate system. 2. The character recognition method according to claim 1, further comprising setting the character frame.

【請求項９】　　入力画像を２値化する２値化手段と、
認識すべき文字の座標系を決定するために文字の近傍に
付された基準文字又は基準図形の２値化画像と辞書デー
タとを照合する照合手段と、照合結果に基き、上記基準
文字又は基準図形の２値化画像から基準座標を算出する
座標算出手段と、上記基準座標に基き、認識すべき文字
を切出すための文字枠を設定する文字枠設定手段と、入
力画像から上記文字枠に基いて切出された個々の文字を
認識する認識手段とを備える文字認識装置。[Claim 9] Binarization means for binarizing an input image;
A collating means for collating a binary image of a reference character or reference figure attached near the character with dictionary data in order to determine the coordinate system of the character to be recognized; a coordinate calculation means for calculating reference coordinates from a binary image of a figure; a character frame setting means for setting a character frame for cutting out characters to be recognized based on the reference coordinates; A character recognition device comprising a recognition means for recognizing individual characters cut out based on the base.

【請求項１０】　　上記２値化手段が、上記基準文字又
は基準図形を複数の領域に分割して夫々の分割領域ごと
に入力画像を２値化し、各領域の２値化しきい値を最適
化する手段と、最適化された２値画像の直交２軸への投
影分布データに基いて上記基準文字又は基準図形の切取
り枠を設定する手段とを備える請求項９に記載の文字認
識装置。10. The binarization means divides the reference character or reference figure into a plurality of regions, binarizes the input image for each divided region, and optimizes a binarization threshold for each region. 10. The character recognition device according to claim 9, further comprising means for setting a cutting frame for the reference character or reference figure based on projection distribution data of the optimized binary image onto two orthogonal axes.

【請求項１１】　　上記最適化する手段が、各分割領域
の合成画像について、その文字又は図形部分の面積比率
が所定範囲の値となるように各分割領域のしきい値を変
更する手段を備えることを特徴とする請求項１０に記載
の文字認識装置。11. The optimizing means includes means for changing the threshold value of each divided region so that the area ratio of the character or graphic portion of the composite image of each divided region falls within a predetermined range. 11. The character recognition device according to claim 10.

【請求項１２】　　上記照合手段における辞書データが
上記基準文字又は基準図形の全体に対応する辞書データ
と、上記基準文字又は基準図形の要部に対応する辞書デ
ータとから成り、全体についての照合と要部についての
照合とを行うことを特徴とする請求項９に記載の文字認
識装置。12. The dictionary data in the collation means is comprised of dictionary data corresponding to the entire reference character or reference figure, and dictionary data corresponding to the main part of the reference character or reference figure, and the dictionary data corresponds to the entire reference character or reference figure, and the dictionary data corresponds to the entire reference character or reference figure, and the dictionary data corresponds to the entire reference character or reference figure, and the dictionary data corresponds to the entire reference character or reference figure. 10. The character recognition device according to claim 9, wherein the character recognition device performs verification on main parts.

【請求項１３】　　上記要部に対応する辞書データが、
上記基準文字又は基準図形の一部を共通にする互に異な
る少なくとも２つの部分に対応し、上記共通の一部には
、上記基準文字又は基準図形から得る座標基準点が含ま
れていることを特徴とする請求項１２に記載の文字認識
装置。13. Dictionary data corresponding to the main part is:
It corresponds to at least two mutually different parts that share a part of the reference character or reference figure, and the common part includes a coordinate reference point obtained from the reference character or figure. The character recognition device according to claim 12.

【請求項１４】　　上記基準文字又は基準図形が＋字形
であり、上記要部がＬ字形であることを特徴とする請求
項１３に記載の文字認識装置。14. The character recognition device according to claim 13, wherein the reference character or reference figure is in the shape of a + character, and the main part is in the shape of an L character.