JPH0696288A

JPH0696288A - Character recognizing device and machine translation device

Info

Publication number: JPH0696288A
Application number: JP4240602A
Authority: JP
Inventors: Etsuo Ito; 悦雄伊藤; Kimito Takeda; 公人武田
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1992-09-09
Filing date: 1992-09-09
Publication date: 1994-04-08

Abstract

PURPOSE:To correctly recognize a symbol for showing the end of a sentence by analyzing the parts before and after punctuation marks, a period and a comma, in the case there is ambiguity in a result of recognition of the punctuation marks, the period and the comma. CONSTITUTION:A character recognizing part 5 sets an arbitrary range of image data as an object and executes and executes a pattern collation with a standard character pattern recorded in advance in a recognition dictionary 6. This pattern collation processing is executed to all processings of the image data, and recognition of these corresponding characters is executed. A sentence end symbol priority processing part 7 reads in a result recognized by the recognizing part 5, and in the case where there is ambiguity in punctuation marks, a period and a comma, the parts before and after its symbol are analyzed, and in the case where possibility that this part becomes a pause of a sentence, is high, a symbol for showing the sentence end such as an end point and a period is allowed to take precedence, and in other case, a symbol except the sentence end such as a pause point and a comma is allowed to take precedence. In such a way, a symbol for showing the end of the sentence can be recognized correctly.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、原稿に記載された文字
を読取る文字認識装置，及び第１の言語を第２の言語に
翻訳する機械翻訳装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for reading characters written on a manuscript and a machine translation device for translating a first language into a second language.

【０００２】[0002]

【従来の技術】近年の文書の処理の電子化に伴い、紙に
印刷された文書を読み取り、電子メディア化する文字読
取装置が開発されている。従来の文字認識装置は、各種
文書が記載された原稿を読み取るスキャナなどの読み取
り部により読み取る。このとき、読み取り部は原稿を文
字の集まりとして捕らえるのではなく、点の集まりから
なる画像データとして読み取る。そして、読み取り部が
読み取った画像データから文字認識部は任意の範囲を取
り出し、認識用辞書に格納している標準文字パターン照
合処理を行う。この結果、前記文字認識部で照合ができ
れば、この文字は出力部に出力されると共に記録部に記
録される。これらの処理を行う事によって原稿上の文字
が該装置に入力される。しかし、これらの処理では、数
字の”０”とアルファベットの”Ｏ”、数字の”１”と
小文字の”ｌ（エル）”と大文字の”Ｉ（ア
イ）”、”、（読点）”と”。（句点）”、”．（ピリ
オド）”と”，（カンマ）”など類似した文字の間に曖
昧性が残り、読み間違いが起こる場合があった。このた
め、曖昧性がある部分をオペレータに示し、修正を促す
機能が備えられている。2. Description of the Related Art With the recent computerization of document processing, a character reader for reading a document printed on paper and converting it into an electronic medium has been developed. A conventional character recognition device reads with a reading unit such as a scanner that reads a document in which various documents are described. At this time, the reading unit does not capture the document as a collection of characters, but reads it as image data composed of a collection of dots. Then, the character recognition unit extracts an arbitrary range from the image data read by the reading unit, and performs the standard character pattern matching process stored in the recognition dictionary. As a result, if the character recognition unit can perform the collation, the character is output to the output unit and recorded in the recording unit. By performing these processes, the characters on the document are input to the device. However, in these processes, the number "0" and the alphabet "O", the number "1" and the lower case "l (el)" and the upper case "I (eye)", ", (reading mark)" Ambiguity remained between similar characters such as ". (Phrase)", ". (Period)" and ", (comma)", and misreading sometimes occurred. For this reason, a function is provided to show the ambiguity to the operator and prompt the user to make corrections.

【０００３】また、文字認識装置を計算機を用いて文章
の翻訳を行う機械翻訳装置の入力装置として使用する場
合が増えてきている。機械翻訳装置では、入力された文
章を文に分割し、分割された文毎に一文単位で翻訳を行
う。文への分割は、改行やピリオド、句点等文末を示す
記号によって行う場合が多い。しかし、文字認識装置で
の認識結果には、”、（読点）”と”。（句
点）”、”．（ピリオド）”と”，（カンマ）”の曖昧
性があり、これらを誤って認識する場合が多く、他の文
字より小さい記号であるためオペレータもこれらの認識
誤りを見落とすことが多かった。このため、このよう
に”、（読点）”と”。（句点）”、”．（ピリオ
ド）”と”，（カンマ）”の認識誤りを含むデータを機
械翻訳システムの入力データとして使用すると、文章中
に於ける文の分割に誤りが生じ、機械翻訳の結果が著し
く悪くなるという問題があった。In addition, the character recognition device is increasingly used as an input device of a machine translation device for translating sentences by using a computer. The machine translation device divides an input sentence into sentences, and translates each divided sentence on a sentence-by-sentence basis. The division into sentences is often done by symbols indicating the end of sentences such as line breaks, periods, and punctuation. However, the recognition result by the character recognition device has ambiguity of ", (punctuation)" and ". (Punctuation mark)", ". (Period)" and ", (comma)", and these are erroneously recognized. In many cases, since the symbols are smaller than other characters, operators often overlook these recognition errors. Therefore, when data including recognition errors of ", (punctuation)", ". (Punctuation)", ". (Period)" and ", (comma)" is used as input data of the machine translation system, There was a problem that the result of the machine translation was remarkably deteriorated due to an error in the division of the sentence in the sentence.

【０００４】また、機械翻訳、文書データベース、電子
出版等、計算機を用いた文書処理が盛んに行われるにつ
れ、新聞、雑誌、マニュアルなどの一般に流通している
既存の文書を簡単に入力したいという要求が増えてきた
ため、原稿のレイアウトを自動的に認識する機能を備え
た文字認識装置もある。Further, with the active use of computer-based document processing such as machine translation, document databases, and electronic publishing, there is a demand for easily inputting generally distributed existing documents such as newspapers, magazines, and manuals. Since the number of characters has increased, some character recognition devices have a function of automatically recognizing the layout of a document.

【０００５】例えば、従来装置は図１７に示すように、
まず各種原稿を読み取り部３０１によって読み取る。こ
の時、読み取り部３０１は原稿を文字の集まりとして扱
うのでなく、点の集まりからなる画像データとして入力
する。そして、読み取り部３０１が読み取った画像デー
タから文字認識部３０２は任意の範囲を切出し、図示し
ない認識用辞書に格納されている標準パターンとパター
ン照合処理を行なう。この結果、前記文字認識部３０２
で照合ができれば、その文字コードを出力部３０３に出
力するとともに、記憶部３０４に記憶する。これらの処
理を行なうことで、原稿上の文字が該装置に入力され
る。なお、制御部３０５は各部の制御を行なうものであ
る。For example, the conventional device is shown in FIG.
First, various documents are read by the reading unit 301. At this time, the reading unit 301 does not treat the document as a collection of characters, but inputs it as image data composed of a collection of dots. Then, the character recognition unit 302 cuts out an arbitrary range from the image data read by the reading unit 301, and performs a pattern matching process with a standard pattern stored in a recognition dictionary (not shown). As a result, the character recognition unit 302
If the collation can be performed with, the character code is output to the output unit 303 and stored in the storage unit 304. By performing these processes, the characters on the document are input to the device. The control unit 305 controls each unit.

【０００６】また、一般に流通している既存文書は、紙
の質、紙の表面の加工、文字の背景等の違いで、入力さ
れる画像を一定に保つことができない。このため、予め
定められた読取りレベルを選択して文字画像を読み取る
ことができる。この場合、読み取りレベルは、読み取り
部３０１から原稿を読み取る前に、例えば、薄い原稿、
普通の原稿、濃い原稿等を出力部３０３に表示を行ない
オペレータが選択することで実現されている。この選択
された読み取りレベルは、読み取り濃度の白黒を判定す
るしきい値として読み取り部３０１に送られ、送られた
しきい値に従って、画像データが入力される。[0006] In addition, generally existing existing documents cannot maintain a constant input image due to differences in the quality of paper, the processing of the paper surface, the background of characters, and the like. Therefore, the character image can be read by selecting a predetermined reading level. In this case, the reading level is, for example, a thin original before reading the original from the reading unit 301.
It is realized by displaying an ordinary manuscript, a dark manuscript, etc. on the output unit 303 and selecting by the operator. The selected reading level is sent to the reading unit 301 as a threshold value for determining black and white of the reading density, and the image data is input according to the sent threshold value.

【０００７】しかし、この種の文字読取り装置の読み取
りレベルは、予め定められた値をオペレータが選択する
ようになっている。どの値を選択するかは、オペレータ
の感覚で相違し、どの値で読み取るのが最適であるか知
ることは困難であるという問題点があった。However, as for the reading level of this type of character reading device, the operator selects a predetermined value. There is a problem in that it is difficult to know which value is optimum for reading because the operator has a different sense of what to select.

【０００８】一方、従来における機械翻訳装置では、文
字認識装置によって原稿を読取っている。このような文
字認識装置には、原稿上のどの部分に文字があるか、ま
た、どの部分が記事としてまとまりがあるかを解析する
レイアウト解析機能を有し、その解析結果に従って文字
認識を行う機能を有するものもある。さらに、読み取っ
た原稿を印刷したり、ファイルとして出力したりする際
に、レイアウト解析の結果を利用して、もとの原稿に近
いイメージで出力する機能を有するものもある。On the other hand, in the conventional machine translation device, the document is read by the character recognition device. Such a character recognition device has a layout analysis function that analyzes which part of the manuscript contains characters, and which part of the manuscript is collected as an article, and a function for performing character recognition according to the analysis result. Some have. Further, when printing the read document or outputting it as a file, there is also a function of outputting an image close to the original document by using the result of the layout analysis.

【０００９】また、日本語文書の読取りが行われるにお
よび、縦書き文書の読取りや、レイアウト解析の一部と
して文字が横書きか縦書きかを識別して読み取る機能も
開発されている。Further, as a Japanese document is read, a function of reading a vertically written document and identifying and reading whether a character is horizontally written or vertically written have been developed as part of layout analysis.

【００１０】一方、計算機上で文書をWYSWYGで編集する
DTP アプリケーションが開発されている。こういったツ
ールを使用すると、原稿を見た通りに計算機上に再現で
きる。しかし、こういったツールでは、横書きが中心で
あり、縦書きの表示ができるものは少ない。したがっ
て、文書読取り装置によって、縦書きの文書を読取り、
レイアウトを反映させて出力しても、それをWYSWYGで編
集することができない。On the other hand, edit the document with WYSWYG on the computer.
DTP applications are being developed. By using these tools, you can reproduce a manuscript exactly as you see it on a computer. However, most of these tools are mainly for horizontal writing, and few can display vertically. Therefore, a document reading device reads a vertically written document,
Even if the layout is reflected and output, it cannot be edited with WYSWYG.

【００１１】さらに、機械翻訳システムの入力装置とし
て、文書読取装置を使用する場合には、入力原稿のイメ
ージで翻訳結果を出力するシステムも開発されている
が、例えば日本語の縦書き文書を英語に翻訳する場合に
は、翻訳結果である英文を縦書きすることが出来ないの
で、レイアウト情報を利用することが出来なかった。Further, when a document reading device is used as an input device of a machine translation system, a system for outputting a translation result as an image of an input manuscript has been developed. When translating to, it was not possible to use the layout information because the translated English text could not be written vertically.

【００１２】[0012]

【発明が解決しようとする課題】上述したように、従来
の文字認識装置では、読み誤り箇所の候補をオペレータ
に示していたが、”、（読点）”、”。（句
点）”、”．（ピリオド）”、”，（カンマ）”といっ
た記号は見落とし易く、また、この様な文の終了を示す
記号の認識誤りを含むデータを機械翻訳システムの入力
データとして使用すると、文章中に於ける文の分割に誤
りが生じ、機械翻訳の結果が著しく悪くなるが、従来の
文字認識装置では文の終了を示す記号は他の文字と同程
度の重要性しかなかったため、これらの記号の認識率が
必ずしも高くないという問題が有った。As described above, in the conventional character recognition device, the candidate of the reading error portion is shown to the operator, but ", (reading mark)", ". (Phrase mark)", ". Symbols such as (period) ",", (comma) "are easy to overlook, and if data including the recognition error of such a symbol indicating the end of a sentence is used as input data of a machine translation system, it will be included in the sentence. Although the result of machine translation is badly deteriorated due to the sentence division error, in the conventional character recognition device, the symbol indicating the end of the sentence was as important as other characters, so the recognition rate of these symbols was high. There was a problem that is not necessarily high.

【００１３】また、従来の文字認識装置は、読み取る原
稿の違いをオペレータが感覚的に指示することで実現さ
れ、どの読み取りレベルで原稿を読み取るのが認識装置
にとって最適であるか知ることが困難であった。更に、
原稿に合った最適な読み取りレベルを記憶する方法が提
供されていないため同様な原稿を読み取る場合に簡単に
読み取りベルを指示することができないという問題点が
あった。Further, the conventional character recognition device is realized by the operator sensuously instructing the difference between the documents to be read, and it is difficult to know at which reading level the document is optimally read by the recognition device. there were. Furthermore,
Since a method for storing the optimum reading level suitable for the original is not provided, there is a problem that the reading bell cannot be easily instructed when reading the same original.

【００１４】そして、従来の機械翻訳装置では、縦書き
の文書を読み込んだときそのレイアウト情報を有効に使
用できないという問題があった。Further, the conventional machine translation device has a problem that the layout information cannot be effectively used when a vertically written document is read.

【００１５】この発明はこのような従来の課題を解決す
るためになされたもので、その第１の目的は、文字中の
句読点，ピリオド及びカンマを正しく認識し得る文字認
識装置を提供することである。また、第２の目的は、読
取る原稿に適合した最適な読取りレベルで文字を認識す
ることのできる文字認識装置を提供することである。更
に、第３の目的は、レイアウト情報を有効に利用して翻
訳結果を出力し得る機械翻訳装置を提供することであ
る。The present invention has been made to solve such conventional problems, and a first object thereof is to provide a character recognition device capable of correctly recognizing punctuation marks, periods and commas in characters. is there. A second object is to provide a character recognition device capable of recognizing characters at an optimum reading level suitable for a document to be read. Further, a third object is to provide a machine translation device capable of effectively utilizing layout information and outputting a translation result.

【００１６】[0016]

【課題を解決するための手段】上記目的を達成するた
め、本願第１の発明は、入力されたイメージデータの文
字認識を行なう文字認識装置において、読取られた文字
中に存在する句読点，ピリオド，カンマの認識性に曖昧
性があるか否かを判定する手段と、曖昧性があると判定
された場合には当該曖昧な句読点，ピリオド，カンマの
前後の文章を参照して句読点，ピリオド，カンマのいず
れかを決定する手段と、を有することが特徴である。In order to achieve the above object, the first invention of the present application is a character recognition apparatus for character recognition of input image data, wherein punctuation marks, periods, and A means for determining whether or not there is ambiguity in the recognition of commas, and when it is determined that there is ambiguity, refer to the ambiguous punctuation mark, period, and the text before and after the comma for punctuation marks, periods, commas. And a means for determining any of the above.

【００１７】また、本願第２の発明は、設定された読取
りレベルに従って文書画像を入力する手段と、入力され
た文書画像から文字行を切出し、該文字行から各文字を
切出して文字認識する手段とを備えた文字認識装置にお
いて、文書画像を読取る原稿の質、濃淡等の各属性に応
じた読取りレベルを予め格納する手段と、実際に原稿を
読取る際には当該原稿の属性を識別し、これに対応する
読取りレベルにて読取りを行なう手段と、を有すること
を特徴とする。Further, the second invention of the present application is a means for inputting a document image according to a set reading level, and a means for cutting out a character line from the input document image and cutting out each character from the character line for character recognition. In the character recognition device including, the quality of the document to read the document image, a means for pre-storing the reading level corresponding to each attribute such as light and shade, and when actually reading the document, to identify the attribute of the document, Means for reading at a reading level corresponding to this.

【００１８】更に、本願第３の発明は第１の言語を第２
の言語に翻訳する機械翻訳装置において、原稿に記載さ
れた第１の言語文字の画像データを認識し、当該文字の
レイアウトを解析するレイアウト解析手段と、この第１
の言語文字が縦書きであるか横書きであるかを判定する
文字方向判定手段と、第１の言語文字と、これを翻訳し
た第２の言語文字との文字方向が異なる場合には前記レ
イアウト解析手段にて解析されたレイアウトの方向を文
字方向と同様に変更し、変更後のレイアウトで第２の言
語文字を出力する手段と、を有することを特徴とする。Further, the third invention of the present application uses the first language as the second language.
In a machine translation device for translating into a language, a layout analysis means for recognizing image data of a first language character written in a manuscript and analyzing a layout of the character,
Direction determining means for determining whether the language character is written vertically or horizontally, and the layout analysis is performed when the character directions of the first language character and the translated second language character are different. Means for changing the layout direction analyzed by the means in the same manner as the character direction and outputting the second language character in the changed layout.

【００１９】[0019]

【作用】上述の如く構成された本願第１の発明に係る文
字認識装置では、句読点やピリオド、カンマの認識結果
に曖昧性がある場合に、その句読点やピリオド、カンマ
の前後を解析することにより、その場所で文が終了する
可能性が高い場合には、句点またはピリオドなど文の終
了を示す記号を優先させ、それ以外の場合は、読点・カ
ンマなど文を終了させない記号を優先させる機能を有
し、文の終了を示す記号を正しく認識することができ
る。With the character recognition device according to the first aspect of the present invention configured as described above, when there is ambiguity in the recognition result of a punctuation mark, a period, or a comma, by analyzing the punctuation mark, the period, or before or after the comma. , If there is a high possibility that the sentence ends at that place, the symbol that indicates the end of the sentence such as a punctuation mark or period will be prioritized, and in other cases, the symbol that does not end the sentence such as a punctuation mark or comma will be prioritized. It is possible to correctly recognize a symbol indicating the end of a sentence.

【００２０】また、本願第２の発明に係る文字認識装置
では、読み取る原稿の紙の質、紙の表面の加工、文字の
背景等の違いで一定でない読み取りレベルをオペレータ
が選択する必要がなく、原稿に合った最適な読み取りレ
ベルを求めることができる。かつ、その読み取りレベル
に識別名を付けて記憶することで、同様の原稿を読み取
るとき識別名で簡単に読み取りレベルを指示することが
でき、オペレータの労力を大幅に軽減できる。Further, in the character recognition device according to the second aspect of the present invention, the operator does not need to select a reading level that is not constant due to differences in the quality of the original document to be read, the surface of the paper, the background of the characters, etc. The optimum reading level suitable for the original can be obtained. In addition, by storing the reading level with an identification name, the reading level can be easily instructed by the identification name when reading the same document, and the labor of the operator can be greatly reduced.

【００２１】更に、本願第３の発明に係る機械翻訳装置
では、縦書きの文書の場合はレイアウト情報を回転し、
横書きの文書として出力することによって、レイアウト
情報を有効に利用できる。Further, in the machine translation apparatus according to the third invention of the present application, in the case of a vertically written document, the layout information is rotated,
Layout information can be effectively used by outputting as a horizontally written document.

【００２２】[0022]

【実施例】以下、本発明の実施例を図面に基づいて説明
する。図１は本発明の第１実施例に係る文字認識装置で
ある。同図において、入力部１は、コマンドを入力した
り、認識結果に対する修正文字を入力するためのもので
通常、キーボードやマウス、タッチパネルなどが用いら
れる。表示部２は、前記入力部より入力されたコマンド
や文字列を表示したり、後述する読み取り部からの読み
取り結果や文字認識部による文字認識の途中状態や認識
結果などを表示したりするもので、通常ビットマップデ
ィスプレイなどが用いられるが、オペレータが装着した
ゴーグルや壁面に投射しても良い。なお、画面を分割
し、片方を読み取り部が読み取った画像イメージを表示
させ、他方に本装置が認識した文字を対応させて表示す
ることもできる。記録部３は後述する読み取り部で読み
取った画像データや、文字認識部による文字認識の途中
経過や最終結果等を記録するものである。読み取り部４
は原稿を読み取る為の物で、例えばラインイメージセン
サを備えたスキャナなどが用いられる。なお、このスキ
ャナの解像度により、読み取れる文字の種類が決まる。
文字認識部５は前記読み取り部が原稿を読み取った結果
得られた画像等のデータを文字として認識するものであ
る。前記読み取り部が原稿を読み取る時は、文字単位で
読み取っているのではなく、原稿の一端から順次スキャ
ンしていき、各点に情報があるか無いかの２値で表して
画像データを得る。このため解像度を上げるほど、より
細かく原稿を読み取る事になり、より複雑な文字も読み
取る事ができる。そして、文字認識部５は画像データの
任意の範囲を対象にして認識用辞書６に予め記録してい
る標準文字パターンとのパターン照合を行う。このパタ
ーン照合処理を画像データの全ての処理に対して行い、
これらの対応する文字の認識を行う。文末記号優先処理
部７は、文字認識部によって認識された結果を読み込
み、句読点やピリオド、カンマに曖昧性がある場合に
は、その記号の前後を解析し、この部分が文の切れ目に
なる可能性が高い場合には、句点やピリオドといった文
末を示す記号を優先し、それ以外の場合には読点やカン
マといった文末以外の記号を優先するといった処理を行
うものである。制御部８は、前記各部の制御やデータの
受け渡しを行うためのものである。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows a character recognition device according to a first embodiment of the present invention. In the figure, an input unit 1 is for inputting a command or a correction character for a recognition result, and usually a keyboard, a mouse, a touch panel, or the like is used. The display unit 2 displays a command or a character string input from the input unit, a reading result from a reading unit described later, an intermediate state of character recognition by the character recognition unit, a recognition result, and the like. Usually, a bitmap display or the like is used, but it may be projected on goggles or a wall surface worn by the operator. It is also possible to divide the screen and display the image image read by the reading unit on one side and display the character recognized by the apparatus on the other side. The recording unit 3 records the image data read by the reading unit, which will be described later, the progress of character recognition by the character recognition unit, the final result, and the like. Reading unit 4
Is an object for reading an original, and for example, a scanner having a line image sensor is used. The resolution of the scanner determines the type of characters that can be read.
The character recognition unit 5 recognizes data such as an image obtained as a result of reading the document by the reading unit as a character. When the reading unit reads a document, it does not read characters by character, but sequentially scans from one end of the document and obtains image data by expressing it as a binary value indicating whether or not there is information at each point. Therefore, the higher the resolution, the finer the original can be read, and the more complicated characters can be read. Then, the character recognition unit 5 performs pattern matching with a standard character pattern previously recorded in the recognition dictionary 6 for an arbitrary range of image data. This pattern matching processing is performed for all processing of image data,
Recognize these corresponding characters. The sentence end symbol priority processing unit 7 reads the result recognized by the character recognizing unit, and if there is ambiguity in punctuation marks, periods, or commas, it analyzes the front and back of the symbol, and this part can become a break in the sentence. In the case where the property is high, a symbol indicating the end of a sentence such as a punctuation mark or a period is given priority, and in other cases, a symbol other than the end of the sentence such as a reading point or a comma is given priority. The control unit 8 is for controlling each of the above-mentioned units and transferring data.

【００２３】図２は、本発明に於ける文字認識装置の全
体の処理の流れを示すフローチャートである。原稿を文
字データとして電子化するためには、まず、原稿を読み
取り部により読み取る（ステップＳＴ１）。次に、読み
取った結果に対して文字認識を行う（ステップＳＴ
２）。この認識結果中に認識時に曖昧性が有ると判断さ
れた句読点、カンマ、ピリオドが有るか否かを検査し
（ステップＳＴ３）、ある場合には認識結果を文末記号
優先処理部に転送し、文末記号優先処理を行い（ステッ
プＳＴ４）、最終的な結果を得る。FIG. 2 is a flow chart showing the overall processing flow of the character recognition device according to the present invention. In order to digitize a document as character data, the document is first read by the reading unit (step ST1). Next, character recognition is performed on the read result (step ST
2). It is checked whether or not there is a punctuation mark, a comma, or a period judged to have ambiguity at the time of recognition in this recognition result (step ST3), and if there is, the recognition result is transferred to the sentence end symbol priority processing section, The symbol priority process is performed (step ST4) to obtain the final result.

【００２４】図３は文末記号優先部の構成の一例を示す
ブロック図である。文末記号優先処理部は認識結果を認
識結果入力部１１を通し受けとり、必要な処理を制御部
１２で行い、出力部１３を介して処理結果を出力する。
制御部１２では、句点・ピリオドと読点・カンマのうち
どちらかを優先するかの判断を行う。判断の為に、制御
部は、文末になりやすい語リスト１４、文頭になりやす
い語リスト１５、文の切れ目になりやすい語リスト１
６、切れ目判定条件１７、形態素解析部１８等を利用す
る。文末になりやすい語リスト１４には、日本語の場合
では例えば「です」「ます」「でした」などが記録され
ており、この語が曖昧性のある句読点の直前に在る場合
には、句点を優先するという処理を行う。文頭になりや
すい語リスト１５には、日本語の場合、例えば「しか
し」などの接続詞や間投詞などが記録されており、この
語が曖昧性のある句読点の直後にある場合には句点を優
先するという処理を行う。文の切れ目になりやすい語リ
スト１６には、例えば「？」などの記号が記録されてお
り、この語が曖昧性のある句読点の前後に在る場合には
句点を優先するという処理を行う。切れ目判定条件１７
には、例えば、改行の直前の曖昧性のある句読点は句点
優先、あるいは、大文字の直前の場合はピリオド優先な
どの規則が記述されている。また、形態素解析部では曖
昧性のある句読点の前の部分を形態素解析し、制御部で
はその結果を使用し、句読点の直前が終助詞、助動詞や
用言の終止形で在る場合には句点を優先させる。FIG. 3 is a block diagram showing an example of the structure of the sentence end symbol priority section. The sentence end symbol priority processing unit receives the recognition result through the recognition result input unit 11, the control unit 12 performs necessary processing, and outputs the processing result via the output unit 13.
The control unit 12 determines which of a punctuation mark / period and a reading mark / comma has priority. For the sake of judgment, the control unit has a word list 14 that is likely to be at the end of a sentence, a word list 15 that is likely to be at the beginning of a sentence, and a word list 1 that is likely to be a break in a sentence.
6, the break determination condition 17, the morphological analysis unit 18, etc. are used. In the word list 14 that is likely to be at the end of the sentence, for example, in the case of Japanese, “is”, “masu”, “was”, etc. are recorded. If this word is immediately before an ambiguous punctuation mark, Performs processing that gives priority to punctuation. In the word list 15 that is likely to be the beginning of a sentence, in the case of Japanese, for example, a conjunction such as "but" or an interjection is recorded. When this word is immediately after an ambiguous punctuation, the punctuation is given priority. Is performed. A symbol such as "?" Is recorded in the word list 16 that is prone to breaks in sentences, and when this word is before or after an ambiguous punctuation mark, a process of giving priority to the punctuation mark is performed. Break judgment condition 17
Describes rules such as, for example, punctuation with ambiguity immediately before a line feed has priority of punctuation, or just before capitalization has priority of periods. In addition, the morpheme analysis unit performs morpheme analysis on the part before the ambiguous punctuation mark, and the control unit uses the result, and when the punctuation mark is immediately before the final particle, auxiliary verb or end phrase, the punctuation mark Prioritize.

【００２５】図４，５は文末記号優先部の一実施例の動
作の概要を示すフローチャートである。文末記号優先部
は、認識結果Ａを読み込む事により動作を開始する（ス
テップＳＴ１１）。次に読み込んだデータ中に、文字認
識部で曖昧性が有ると判断された句読点、カンマ、ピリ
オドがあるか検索しこれをＢとする（ステップＳＴ１
２）。Ｂの直前の語句が「文末になりやすい語句リス
ト」に含まれているか否かを検査する（ステップＳＴ１
３）。含まれている場合には、Ｂは文末を表す記号であ
る可能性が高いので、句点・ピリオドを優先させる（ス
テップＳＴ２０）。含まれていない場合には、Ｂの直後
の語句が「文頭になりやすい語句リスト」に含まれてい
るか否かを検査する（ステップＳＴ１４）。含まれてい
る場合には、Ｂは文末を表す記号である可能性が高いの
で、句点・ピリオドを優先させる（ステップＳＴ２
０）。含まれていない場合には、Ｂの直前・直後の語句
が「文の切れ目になりやすい語句リスト」に含まれてい
るか否かを検査する（ステップＳＴ１５）。含まれてい
る場合には、Ｂは文末を表す記号である可能性が高いの
で、句点・ピリオドを優先させる（ステップＳＴ２
０）。含まれていない場合には、Ｂが文の切れ目条件を
満たしているか否かを検査する（ステップＳＴ１６）。
満たしている場合には、Ｂは文末を表す記号である可能
性が高いので、句点・ピリオドを優先させる（ステップ
ＳＴ２０）。満たしていない場合には、Ａ中のＢの全部
を形態素解析を行い、Ｂの直前の単語の品詞と活用形を
得る（ステップＳＴ１７）。これが、終助詞、あるいは
用言・助動詞の終止形である場合には、Ｂは文末を表す
記号である可能性が高いので、句点・ピリオドを優先さ
せる（ステップＳＴ２０）。それ以外の場合は、Ｂは文
末を表す記号でない可能性が高いので、読点・カンマを
優先させる（ステップＳＴ１９）。こういった処理をＡ
中に曖昧性がある句読点、カンマ、ピリオドが無くなる
まで繰り返す（ステップＳＴ２１）。4 and 5 are flowcharts showing the outline of the operation of one embodiment of the sentence end symbol priority section. The sentence end symbol priority section starts the operation by reading the recognition result A (step ST11). The data read next is searched for punctuation marks, commas, or periods that are determined to be ambiguous by the character recognition unit, and this is designated as B (step ST1).
2). It is checked whether or not the word immediately before B is included in the "word list that tends to end the sentence" (step ST1).
3). If it is included, B is likely to be a symbol indicating the end of a sentence, so that the punctuation / period is prioritized (step ST20). If not included, it is checked whether or not the word immediately after B is included in the "word list that tends to be the beginning of a sentence" (step ST14). If it is included, B is likely to be a symbol indicating the end of a sentence, so that punctuation / period is prioritized (step ST2).
0). If not included, it is checked whether or not the words immediately before and after B are included in the "word list that tends to break sentences" (step ST15). If it is included, B is likely to be a symbol indicating the end of a sentence, so that punctuation / period is prioritized (step ST2).
0). If not included, it is checked whether B satisfies the sentence break condition (step ST16).
If so, B is likely to be a symbol indicating the end of a sentence, so that punctuation / period is prioritized (step ST20). If not satisfied, morphological analysis is performed on all of B in A to obtain the part of speech and the inflectional form of the word immediately before B (step ST17). If this is a final particle, or a final form of a noun / auxiliary verb, B is likely to be a symbol representing the end of a sentence, so punctuation / period is prioritized (step ST20). In other cases, since it is highly likely that B is not a symbol indicating the end of a sentence, the reading point / comma is prioritized (step ST19). Such processing is A
Repeat until there are no punctuation marks, commas, or periods with ambiguity in them (step ST21).

【００２６】なお、ステップＳＴ１３からＳＴ１８の条
件判断の順序はこの順番に限らないし、すべての条件を
使用する必要もない。また、これらの条件の論理和や論
理積によって判断を行う事もできる。The order of condition judgment in steps ST13 to ST18 is not limited to this order, and it is not necessary to use all conditions. Further, the judgment can be made by the logical sum or logical product of these conditions.

【００２７】次に、本実施例に係わる文字読み取り装置
の動作を画面の図を参照して説明する。Next, the operation of the character reading apparatus according to the present embodiment will be described with reference to the drawings on the screen.

【００２８】表１に示す様に”I have a pen, book and
bag. You …”文書を認識した際、”I have a pen. bo
ok and bag, You …”としてしまい、「．」と「，」に
は曖昧性があるとする。As shown in Table 1, "I have a pen, book and
bag. You… ”When a document is recognized,“ I have a pen. bo
ok and bag, You… ”, and there is ambiguity between“. ”and“, ”.

【００２９】[0029]

【表１】このとき、”bag ”の次のカンマは次が大文字であるた
めピリオドである可能性が高く、”pen ”の次のピリオ
ドは図４，５に示す条件のどれにも当てはまらないため
カンマである可能性が高い事がわかる。このため、優先
度を付け直して出力したものが表２であり、正しい認識
結果となっていることがわかる。[Table 1] At this time, the comma next to "bag" is likely to be a period because the next capital is, and the period following "pen" is a comma because it does not match any of the conditions shown in Figs. You can see that there is a high possibility. Therefore, Table 2 shows the output with re-prioritization, and it can be seen that the recognition result is correct.

【００３０】[0030]

【表２】なお、第１の実施例では、日本語及び英語を例にあげて
説明しているが、これは、この言語に限定するものでは
なく、各リストや条件を対象言語に適したものに変更す
ることにより他の言語の認識にも適用できる。[Table 2] In the first embodiment, Japanese and English are described as examples, but this is not limited to this language, and each list or condition is changed to one suitable for the target language. It can also be applied to the recognition of other languages.

【００３１】次に、本発明の第２実施例について説明す
る。図６は第２実施例を示す構成図であり、以下、日本
語の原稿を読取る例について説明する。Next, a second embodiment of the present invention will be described. FIG. 6 is a block diagram showing the second embodiment, and an example of reading a Japanese document will be described below.

【００３２】図６において、読み取り部２１は原稿を読
み取るもので、例えばイメージセンサを備えたスキャナ
が用いられる。なお、このスキャナの解像度により読み
取れる文字の種類が決まる。また、しきい値は、例えば
１〜２５６の段階で設定でき、読み取る原稿に合わせて
制御できる。In FIG. 6, the reading unit 21 reads a document, and for example, a scanner having an image sensor is used. The resolution of the scanner determines the type of characters that can be read. Further, the threshold value can be set, for example, in the range of 1 to 256, and can be controlled according to the document to be read.

【００３３】出力部２２は前記読み取り部２１で読み取
られた画像データに対し、後述するパターン照合処理し
た結果を表示するために、例えばビットマップディスプ
レイ等が使用される。なお、ディスプレイの画面上を分
割して、読み取り部２１で入力した画像データと本装置
が認識した文字を対応させて表示することもできる。The output unit 22 uses, for example, a bitmap display or the like in order to display the result of the pattern matching process described later on the image data read by the reading unit 21. It is also possible to divide the screen of the display and display the image data input by the reading unit 21 and the characters recognized by the apparatus in association with each other.

【００３４】入力部２３は出力部２２に表示され認識さ
れた文字の修正、読み取りレベルに付けられた識別名の
選択等の文字認識装置を制御するための各種コマンドの
入力を可能にしたもので、キーボード、マウス等が使用
される。The input unit 23 enables input of various commands for controlling the character recognition device such as correction of the characters displayed and recognized on the output unit 22 and selection of an identification name given to the reading level. , Keyboard, mouse, etc. are used.

【００３５】文字認識部２４は前記読み取り部２１が原
稿を読み取った画像データを文字として認識するもので
ある。前記読み取り部２１が原稿を読み取るときは、文
字単位で読み取るのでなく、原稿の一端から順次スキャ
ン（走査）していき、各点に情報があるかないかの２値
で表された画像データを得る。このため解像度を上げる
ほどより細かく原稿を読み取ることになり、より複雑な
文字を読み取ることができる。そして文字認識部２４は
画像データの任意の範囲を対象にして（つまり文字単
位、単語単位で切出して）文字認識部２４が予め記憶し
ている標準文字パターンとのパターン照合処理を行な
う。このパターン照合処理を画像データの全てに対して
行ない、これらに対応する文字認識を行なう。The character recognition unit 24 recognizes the image data obtained by reading the document by the reading unit 21 as characters. When the reading section 21 reads a document, it does not read character by character, but sequentially scans from one end of the document to obtain image data represented by binary values indicating whether or not there is information at each point. . Therefore, the higher the resolution is, the finer the original is read, and more complicated characters can be read. Then, the character recognition unit 24 performs a pattern matching process with a standard character pattern stored in advance by the character recognition unit 24, targeting an arbitrary range of image data (that is, cutting out in character units or word units). This pattern matching process is performed on all the image data, and character recognition corresponding to these is performed.

【００３６】最適読取りレベル判定部２５は文字認識部
２４で認識された結果どれだけ正しく認識されているか
形態素解析をすることで判定する。形態素解析で失敗す
る文字がないときは、最適な読取りレベルと判定する。
形態素解析で失敗する文字数が多いときは、読取りレベ
ルを変更し、再度読取り部２１から原稿を読み取る。The optimum reading level determining unit 25 determines by the morphological analysis how correctly the character is recognized as a result of being recognized by the character recognizing unit 24. If there is no character that fails in morphological analysis, it is determined as the optimum reading level.
When a large number of characters fail in the morphological analysis, the reading level is changed and the document is read again from the reading unit 21.

【００３７】形態素解析辞書部２６は形態素解析をする
ための辞書で単語単位の見出し語情報、付属語情報等を
記憶する。記憶部２７は、読み取り部２１で読み取った
画像データ、文字認識部２３による文字認識結果、読取
りレベルと対応する識別名等を記憶するものである。制
御部２８は前記各部の制御やデータの仲介を行なうもの
である。The morphological analysis dictionary unit 26 is a dictionary for performing morphological analysis and stores headword information, adjunct word information and the like in word units. The storage unit 27 stores the image data read by the reading unit 21, the character recognition result by the character recognition unit 23, the identification name corresponding to the reading level, and the like. The control unit 28 controls the above-mentioned units and mediates data.

【００３８】図７〜図１１は制御部２８の動作を示すフ
ローチャートである。オペレータが読取り部２１に読み
取らせる原稿をセットすると、出力部２２にどの読取り
レベルで行なうかが表示される。実施例では、既に原稿
に合った最適な読取りレベルに識別名を付けられて記憶
されている場合はその識別名が一覧表として表示され
る。最適な読取りレベルが判らないときは最適な読取り
レベルの設定モードが選択できるようになっている（ス
テップＳＴ３１）。7 to 11 are flowcharts showing the operation of the control unit 28. When the operator sets a document to be read by the reading unit 21, the output unit 22 displays the reading level. In the embodiment, when the optimum reading level suitable for the document is already stored with the identification name, the identification name is displayed as a list. When the optimum read level is not known, the optimum read level setting mode can be selected (step ST31).

【００３９】入力部２３で識別名が選択されたときは識
別名に対応した読取りレベルを読取り部２１に設定する
（ステップＳＴ３２〜ＳＴ３３）。その後、読取りモー
ドｓｗを０にする（ステップＳＴ３４）。読取りモード
ｓｗは、原稿の読取り、文字認識、形態素解析を共通に
使用するためのスイッチである。When the identification name is selected by the input unit 23, the reading level corresponding to the identification name is set in the reading unit 21 (steps ST32 to ST33). After that, the reading mode sw is set to 0 (step ST34). The reading mode sw is a switch for commonly using document reading, character recognition, and morphological analysis.

【００４０】最適な読取りレベルの設定モードが選択さ
れたときは、標準の読取りレベルを読取り部２１に設定
する（ステップＳＴ３５）。その後、読取りモードｓｗ
を１に設定する（ステップＳＴ３６）。When the optimum reading level setting mode is selected, the standard reading level is set in the reading section 21 (step ST35). After that, read mode sw
Is set to 1 (step ST36).

【００４１】設定された読取りレベルで原稿の読取りを
読取り部２１で行なう（ステップＳＴ３７）。読み取っ
た画像データは記憶部２７に記憶され読み取りが終了
後、文字認識部２４でレイアウト解析、文字認識と順番
に行われる（ステップＳＴ３８）。The reading unit 21 reads the original at the set reading level (step ST37). The read image data is stored in the storage unit 27, and after the reading is completed, the character recognition unit 24 sequentially performs layout analysis and character recognition (step ST38).

【００４２】文字認識された結果は記憶部２７に記憶さ
れる。この文字認識された文字列は形態解析され形態素
解析に失敗した文字列はマーキングされる（ステップＳ
Ｔ３９）。レイアウト解析、文字認識、形態素解析は公
知の技術であるので詳細な説明は省略する。The result of character recognition is stored in the storage unit 27. The character string thus recognized is morphologically analyzed, and the character string for which the morphological analysis has failed is marked (step S
T39). Layout analysis, character recognition, and morphological analysis are known techniques, and thus detailed description thereof will be omitted.

【００４３】設定された読み取りレベルで入力された画
像データの文字認識はここで終了する。読み取りモード
ｓｗが０のときは、識別名が付けられて記憶されていた
読み取りレベルで入力する要求がなされたときで、認識
された文字及び形態素解析で失敗した文字列を他の文字
と区別して表示する（ステップＳＴ４０）。Character recognition of image data input at the set reading level ends here. When the reading mode sw is 0, the recognized character and the character string failed in the morphological analysis are distinguished from other characters when a request is made to input at the reading level stored with the identification name. It is displayed (step ST40).

【００４４】読み取りモードｓｗが１のときは、最適な
読取りレベルの設定モードが選択されたときで、形態素
解析で失敗した文字列をカウントする（ステップＳＴ４
１）。When the reading mode sw is 1, the character string which has failed in the morphological analysis when the optimum reading level setting mode is selected is counted (step ST4).
1).

【００４５】形態素解析で失敗した文字数が一定数以下
（実施例ではオペレータが設定を変えられるようになっ
ているが、この説明は省略する。例えば５文字と設定さ
れ、失敗文字数が、５文字以下のときは読み取りレベル
の変更は行なわない）のときは、他にもっと最適な読み
取りレベルが存在する可能性もあるが、実施例では最適
な読み取りレベルで画像データを読み取れたとする（ス
テップＳＴ４２）。この失敗許容文字数を０に設定する
と形態素解析に失敗する文字数が一番少ない読み取りレ
ベルが求められる。The number of characters failed in the morphological analysis is less than a certain number (in the embodiment, the setting can be changed by the operator, but this description is omitted. For example, it is set as 5 characters, and the number of failed characters is less than 5 characters. In this case, the reading level is not changed), but there is a possibility that there is another more optimum reading level, but in the embodiment it is assumed that the image data could be read at the optimum reading level (step ST42). If the number of allowable characters for failure is set to 0, the reading level with the smallest number of characters for which morphological analysis fails is required.

【００４６】形態素解析に失敗した文字数が一定数以上
のときで、かつ、読み取りレベルが標準のときは標準値
を基に増減を行なう（ステップＳＴ４３〜ＳＴ４４）。
読み取りレベルの増減は、例えば、文字毎に標準的な文
字構成ビット数を記憶し、このビット数と設定されてい
る読み取りレベルで読み取った文字毎のビット数を比較
する。比較する対象の文字の種類は全ての文字で行なう
と精度は高いが、代表的な文字を対象に行なうことが可
能である。よく出現する漢字、平仮名、数字を対象に比
較することで増減の方向を知ることもできる。When the number of characters for which the morphological analysis has failed is a certain number or more and the reading level is standard, the number is increased or decreased based on the standard value (steps ST43 to ST44).
To increase / decrease the reading level, for example, a standard character constituting bit number is stored for each character, and this bit number is compared with the bit number for each character read at the set reading level. Although the accuracy of the characters to be compared is high when all the characters are compared, it is possible to perform the typical characters. It is also possible to know the direction of increase or decrease by comparing the frequently used kanji, hiragana, and numbers.

【００４７】変更された読み取りレベルが標準値に比
べ、増加しているときは、増減フラグに１を設定する。
減少しているときは、増減フラグに−１を設定する（ス
テップＳＴ４５〜ＳＴ４７）。増減する量はＮで、実施
例ではオペレータが設定を変えられるようになっている
が、この説明は省略する。このＮの値を小さくすると紙
面の微妙な変化に対応した読み取りレベルを求めること
ができる。次の読み取りレベルが求まったら、読み取り
レベルをスキャナに設定し、再度、原稿の読み取りを行
なう。When the changed read level is higher than the standard value, the increase / decrease flag is set to 1.
If it is decreasing, the increase / decrease flag is set to -1 (steps ST45 to ST47). The amount of increase / decrease is N, and the operator can change the setting in the embodiment, but this description is omitted. By reducing the value of N, it is possible to obtain a reading level corresponding to a subtle change in the paper surface. When the next reading level is obtained, the reading level is set to the scanner and the original is read again.

【００４８】標準の読み取りレベルで読み取ったときで
ないときは、前回の読み取りレベルでの形態素解析の失
敗文字数と今回の新たに設定された読み取りレベル形態
素解析の失敗文字数と比較を行なう（ステップＳＴ４
８）。前回の読み取りレベルに比べ増加したときは読み
取りレベルの変更が正しくなかったので逆方向の読み取
りレベルに変更する。但し、このとき既に逆方向に対し
て読み取りレベルの変更を行なっているときは、失敗文
字数が最小である一つ前の読み取りレベルを最適値とす
る（ステップＳＴ４９、ステップＳＴ５８）。逆方向に
対して読み取りレベルの変更を行なっていないときは、
逆方向増減フラグを１にする（ステップＳＴ５０）。When the reading is not performed at the standard reading level, the number of failed characters of the morphological analysis at the previous reading level is compared with the newly set number of failed characters of the reading level morphological analysis at this time (step ST4).
8). When the reading level increased compared to the previous reading level, the reading level was not changed correctly, so change the reading level to the opposite direction. However, if the reading level has already been changed in the opposite direction at this time, the reading level immediately before which the number of failed characters is the minimum is set to the optimum value (step ST49, step ST58). If you have not changed the reading level in the opposite direction,
The reverse direction increase / decrease flag is set to 1 (step ST50).

【００４９】増減フラグの値が１のときは、標準値に増
加したときであるので逆方向すなわち標準値−Ｎの値に
変更する。増減フラグが−１のときは、標準値＋Ｎに変
更する（ステップＳＴ５１〜ＳＴ５３）。When the value of the increase / decrease flag is 1, it means that the value has increased to the standard value. Therefore, the value is changed in the reverse direction, that is, the value of standard value-N. When the increase / decrease flag is -1, it is changed to the standard value + N (steps ST51 to ST53).

【００５０】ステップＳＴ４８の比較で失敗文字数が前
回に比べ減少したときは、または、等しいときは読み取
りレベルの変更が正しかったのでさらに同一方向の読み
取りレベルに変更する。増減フラグの値が１のときは、
前回の読み取りレベル＋Ｎに変更する。増減フラグが−
１のときは、前回の読み取りレベル−Ｎに変更する（ス
テップＳＴ５４〜ＳＴ５６）。ステップＳＴ４８の比較
で失敗文字数が等しいときは実施例では、減少したと扱
っているが逆方向に読み取りレベルを変更することもで
きる。または、読み取りレベルが最適な値であるとする
こともできる。If the number of failed characters has decreased compared to the previous time in the comparison in step ST48, or if they are equal to each other, the change in the reading level was correct, so the reading level is changed to the same direction. When the value of the increase / decrease flag is 1,
Change to the previous reading level + N. Increase / decrease flag is −
When it is 1, the reading level is changed to the previous reading level -N (steps ST54 to ST56). When the number of failed characters is equal in the comparison in step ST48, it is considered that the number has decreased in the embodiment, but the reading level can be changed in the opposite direction. Alternatively, the reading level may be set to an optimum value.

【００５１】増減された読み取りレベルをスキャナに設
定し、再度、原稿の読み取りを行なう（ステップＳＴ５
７）。The increased or decreased reading level is set in the scanner, and the original is read again (step ST5).
7).

【００５２】ステップＳＴ４２の比較で形態素解析の失
敗文字数が許容文字数のときは、許容文字数以下となっ
た読み取りレベルに識別名を付けて記憶する。オペレー
タが識別名を入力すると読み取りレベルと対応させて記
憶部２７に記憶する（ステップＳＴ５９〜ＳＴ６０）。When the number of failed characters in the morphological analysis is the allowable number of characters in the comparison in step ST42, the reading level which is equal to or less than the allowable number of characters is given an identification name and stored. When the operator inputs the identification name, it is stored in the storage unit 27 in association with the reading level (steps ST59 to ST60).

【００５３】文字認識された文字列は出力部２２に表示
される。このとき、形態素解析で失敗した文字列は他と
区別して表示する（ステップＳＴ６１）。この表示でオ
ペレータは形態素解析に失敗した文字列の前後を注意し
て見れば認識誤りを簡単に発見できる。The character string recognized as a character is displayed on the output unit 22. At this time, the character string that failed in the morphological analysis is displayed separately from the others (step ST61). With this display, the operator can easily find the recognition error by carefully looking before and after the character string for which the morphological analysis has failed.

【００５４】なお、ここまでの説明は、スキャナから２
値（０または１）で画像データを読み取り文字認識をす
る方法について述べたが、多値で読み取ることができる
スキャナでは、読み取りレベルに応じて多値から２値に
変換することもできる。この場合は、図７において、読
み取りレベルを変更する度に原稿を読み取っていたステ
ップＳＴ３７の替わりに読み取りレベルに応じて多値か
ら２値に変換する構成にすることで実現できる。Incidentally, the explanation so far is made from the scanner.
Although the method of reading image data with a value (0 or 1) and performing character recognition has been described, a multi-value scanner can also convert from multi-value to binary according to the reading level. In this case, in FIG. 7, each time the reading level is changed, instead of step ST37 in which the document is read, the multivalue is converted into the binary according to the reading level.

【００５５】また、第２実施例では、読み取りレベルを
読み取った頁全体に１つ設定することで説明したが、読
み取る原稿によっては、部分的に濃淡が異なる場所や文
字の背景が相違することがある。このような原稿のとき
は、例えばレイアウト認識されたブロック単位に最適な
読み取りレベルで読み取ることもできる。例えば、ブロ
ック１は読み取りレベルを濃い原稿レベルで、ブロック
２と３は薄い原稿レベルで等、ブロック毎に形態素解析
で失敗する文字数が最小な読み取りレベルで読み取った
画像データの認識結果を出力できる。In the second embodiment, one reading level is set for the entire read page. However, depending on the document to be read, the place where the shade is different and the background of the character may be different. is there. In the case of such an original, for example, it is possible to read at the optimum reading level for each block whose layout has been recognized. For example, the recognition result of the image data read at the reading level in which the number of characters failed in the morphological analysis is small for each block can be output, for example, the reading level of the block 1 is a dark original level, and the blocks 2 and 3 are a thin original level.

【００５６】この他、読み取りレベルを変更して複数回
読み取り、文字認識、形態素解析を行なう場合は、ある
読み取りレベルで読み取ったとき形態素解析に失敗した
文字列が、別の読み取りレベルでは失敗しない場合に複
数の読み取りレベルの成功した文字列を集めて１つの認
識文字列とすることもできる。このように構成すること
により、原稿の濃淡のバラツキを吸収することができ
る。In addition, when the reading level is changed and reading is performed a plurality of times to perform character recognition and morphological analysis, a character string that fails morphological analysis when read at a certain reading level does not fail at another reading level. It is also possible to collect successful character strings of a plurality of reading levels into one recognized character string. With this configuration, it is possible to absorb variations in light and shade of the document.

【００５７】次に、本発明の第３実施例について説明す
る。図１２は第３実施例に係る機械翻訳装置の構成を示
すブロック図である。Next, a third embodiment of the present invention will be described. FIG. 12 is a block diagram showing the configuration of the machine translation device according to the third embodiment.

【００５８】同図において、入力部３１は、本発明に於
ける文字認識装置の動作に必要なコマンドを入力した
り、認識結果に対する修正文字を入力したりするもので
あり、通常キーボードやマウス、タッチパネルなどが用
いられる。表示部３２は、前記入力部から入力されたコ
マンドや文字列を表示したり、後述する読取り部からの
読取り結果や文字認識部による文字認識の途中状態や認
識結果などを表示したりするもので、通常ビットマップ
ディスプレイやCRT ディスプレイなどが用いられるが、
オペレータが装着したゴーグルや壁面に投射してもよ
い。なお、画面を分割して片方を読取り部が読み取った
画像イメージを表示させ、他方に本装置が認識した結果
を対応させて表示することもできる。レイアウト解析部
３３は、画像イメージから文字の記述されている部分を
識別したり、文字が縦書きであるか横書きであるかを判
別するためのものである。レイアウト記録部３４はレイ
アウト解析部３３で認識されたレイアウト情報を記録す
るためのものである。読取り部３５は原稿を読み取るた
めのもので、例えばラインイメージセンサを備えたスキ
ャナなどが用いられる。なお、このスキャナの解像度に
より、読み取れる文字の種類が決まる。文字認識部３６
は前記読取り部が原稿を読み取った結果得られた画像な
どのデータを文字として認識するものである。認識用辞
書３７は、前記文字認識部に於ける文字認識の時にパタ
ーン照合を行うために用いる。翻訳部３８は、文字認識
部で認識した結果を翻訳するためのものである。レイア
ウト反映部３９は、前記レイアウト記録部に記録されて
いるレイアウト情報と文字認識の結果を翻訳部によって
翻訳した結果得られた文字情報とを組み合わせて、元の
原稿のイメージに近い状態で出力するための処理を行う
ものである。レイアウト回転部４０は、レイアウト認識
部で原稿が縦書きであると判定された場合には、レイア
ウト情報を回転させ、横書きのレイアウトに変換するた
めのものである。記録部４１は、認識したデータや読み
取ったデータ画像データなどを記録するためのものであ
り、通常磁気的、電気的に実現される。制御部４２は前
記各部の制御やデータの受け渡しを行うためのものであ
る。In the figure, an input unit 31 is for inputting a command necessary for the operation of the character recognition device according to the present invention and for inputting a correction character for the recognition result. A touch panel or the like is used. The display unit 32 displays a command or a character string input from the input unit, a reading result from a reading unit described later, an intermediate state of character recognition by the character recognition unit, a recognition result, or the like. , Usually a bitmap display or CRT display is used,
It may also be projected onto the goggles or the wall surface worn by the operator. It is also possible to divide the screen so that one side displays the image image read by the reading unit and the other side displays the result recognized by the present apparatus. The layout analysis unit 33 is for identifying a portion in which a character is described from an image image and determining whether the character is in vertical writing or horizontal writing. The layout recording unit 34 is for recording the layout information recognized by the layout analysis unit 33. The reading unit 35 is for reading a document, and for example, a scanner having a line image sensor is used. The resolution of the scanner determines the type of characters that can be read. Character recognition unit 36
Is for recognizing data such as an image obtained as a result of reading the document by the reading unit as characters. The recognition dictionary 37 is used for pattern matching at the time of character recognition in the character recognition unit. The translation unit 38 is for translating the result recognized by the character recognition unit. The layout reflection unit 39 combines the layout information recorded in the layout recording unit and the character information obtained as a result of the translation of the character recognition result by the translation unit, and outputs it in a state close to the image of the original document. This is to perform processing for. The layout rotation unit 40 rotates the layout information and converts the layout information into a horizontal layout when the layout recognition unit determines that the document is vertically written. The recording unit 41 is for recording recognized data, read data image data, and the like, and is usually realized magnetically and electrically. The control unit 42 is for controlling each of the above-mentioned units and transferring data.

【００５９】図１３は、第３実施例の動作を示すフロー
チャートである。原稿を文字データとして電子化するた
めには、まず、原稿を読取り部により読み取る（ステッ
プＳＴ７１）。つぎに、読み取った結果に対してレイア
ウト認識を行う（ステップＳＴ７２）。レイアウト認識
では、文字がある場所をブロックとして捕らえる他、文
字が縦書きであるか横書きであるかをも認識する。次に
レイアウト認識の結果を参考に文字認識を行う（ステッ
プＳＴ７３）。そして、文字認識結果を翻訳し（ステッ
プＳＴ７４）、文字認識の結果をレイアウト認識の結果
に反映させ、原稿に近いイメージで出力する（ステップ
ＳＴ７５）。FIG. 13 is a flow chart showing the operation of the third embodiment. In order to digitize a document as character data, the document is first read by the reading section (step ST71). Next, layout recognition is performed on the read result (step ST72). In the layout recognition, the place where the character is present is recognized as a block, and it is also recognized whether the character is written vertically or horizontally. Next, character recognition is performed with reference to the result of layout recognition (step ST73). Then, the result of character recognition is translated (step ST74), the result of character recognition is reflected in the result of layout recognition, and an image close to the original is output (step ST75).

【００６０】図１４は、図１３におけるステップＳＴ７
５のレイアウト反映機能の処理の流れを示すフローチャ
ートである。レイアウト反映機能では、まず、レイアウ
ト認識で得られた結果を読み込む（ステップＳＴ８
１）。次に、文字認識の翻訳結果を読み込む（ステップ
ＳＴ８２）。そして、ステップＳＴ８１で読み込んだ情
報が縦書きのデータであれば（ステップＳＴ８３）、レ
イアウト情報を回転させる（ステップＳＴ８４）。ステ
ップＳＴ８２で読み込んだ文字データをレイアウトデー
タの該当箇所に挿入することによってレイアウト反映が
終了する（ステップＳＴ８５）。FIG. 14 shows step ST7 in FIG.
6 is a flowchart showing a processing flow of the layout reflection function of FIG. In the layout reflection function, first, the result obtained by the layout recognition is read (step ST8).
1). Next, the translation result of character recognition is read (step ST82). If the information read in step ST81 is vertical writing data (step ST83), the layout information is rotated (step ST84). The layout reflection is completed by inserting the character data read in step ST82 into the corresponding portion of the layout data (step ST85).

【００６１】次に、第３実施例で示した機械翻訳装置の
全体の動作の概要について説明する。Next, an outline of the overall operation of the machine translation device shown in the third embodiment will be described.

【００６２】いま、表３に示すような縦書きの原稿を電
子化する例で説明する。An example of digitizing a vertically written document as shown in Table 3 will now be described.

【００６３】[0063]

【表３】この原稿を読み取り部から読取り、レイアウト認識する
と図１５のようになる。図１５では、文字が書かれてい
る部分の塊を矩形のブロックとして認識し、その頂点の
座標を記録している。また、このデータが縦書きである
ということも記録している。文字認識後、レイアウト反
映部では、このデータが縦書きであるため図１６に示す
ように、座標を変換する。そして、これのデータに対し
翻訳結果を対応するブロックに反映させると表４のよう
な結果を出力することができる。[Table 3] When this document is read from the reading unit and the layout is recognized, the result is as shown in FIG. In FIG. 15, a block of a portion in which characters are written is recognized as a rectangular block, and the coordinates of its vertices are recorded. It also records that this data is written vertically. After character recognition, the layout reflection unit converts the coordinates as shown in FIG. 16 because this data is written vertically. Then, when the translation result is reflected in the corresponding block with respect to this data, the result as shown in Table 4 can be output.

【表４】 [Table 4]

【００６４】[0064]

【発明の効果】以上説明したように、本願第１の発明に
よる文字認識装置では、句読点やピリオド、カンマの認
識結果に曖昧性がある場合に、その句読点やピリオド、
カンマの前後を解析することにより、その場所で文が終
了する可能性が高い場合には、句点またはピリオドなど
文の終了を示す記号を優先させ、それ以外の場合は、読
点・カンマなど文を終了させない記号を優先させる機能
を有し、文の終了を示す記号を正しく認識することがで
きる。As described above, in the character recognition device according to the first aspect of the present invention, when there is ambiguity in the recognition result of a punctuation mark, a period, or a comma, the punctuation mark, the period, the
By parsing before and after the comma, if there is a high possibility that the sentence will end at that location, the symbol that indicates the end of the sentence such as a punctuation mark or period will be given priority. It has the function of giving priority to the symbols that do not end, and can correctly recognize the symbols that indicate the end of a sentence.

【００６５】また、本願第２の発明に係る文字認識装置
によれば、原稿毎に異なるスキャナの読み取りレベルを
最適な値に設定することができ、誤った読取りレベルで
の認識をなくしオペレータの負担を大幅に軽減できる。
また、原稿にあった最適な読み取りレベルに識別名を付
けて記憶することで読み取りレベルを意識する必要がな
くなる。Further, according to the character recognition apparatus in the second aspect of the present invention, the reading level of the scanner which is different for each document can be set to an optimum value, eliminating the recognition at the wrong reading level and burdening the operator. Can be significantly reduced.
Further, by storing the optimum reading level suitable for the original document with an identification name, it is not necessary to be aware of the reading level.

【００６６】また、本願第３の発明に係る機械翻訳装置
では、縦書きの文書の場合はレイアウト情報を回転し、
横書きの文書として出力することによって、レイアウト
情報を有効に利用することができるという効果が得られ
る。Further, in the machine translation device according to the third invention of the present application, in the case of a vertically written document, the layout information is rotated,
By outputting as a horizontally-written document, it is possible to effectively use the layout information.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の第１実施例に係る文字認識装置の構成
を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a character recognition device according to a first embodiment of the present invention.

【図２】第１実施例の動作を示すフローチャートであ
る。FIG. 2 is a flowchart showing the operation of the first embodiment.

【図３】文末記号優先部の概略を表すブロック図であ
る。FIG. 3 is a block diagram showing an outline of a sentence end symbol priority unit.

【図４】文末記号優先部の動作を示すフローチャートの
第１の分図である。FIG. 4 is a first partial diagram of a flowchart showing an operation of a sentence end symbol priority section.

【図５】文末記号優先部の動作を示すフローチャートの
第２の分図である。FIG. 5 is a second partial diagram of the flowchart showing the operation of the sentence end symbol priority section.

【図６】本発明の第２実施例に係る文字認識装置の構成
を示すブロック図である。FIG. 6 is a block diagram showing a configuration of a character recognition device according to a second embodiment of the present invention.

【図７】第２実施例の動作を示すフローチャートの第１
の分図である。FIG. 7 is a first flowchart showing the operation of the second embodiment.
FIG.

【図８】第２実施例の動作を示すフローチャートの第２
の分図である。FIG. 8 is a second flowchart showing the operation of the second embodiment.
FIG.

【図９】第２実施例の動作を示すフローチャートの第３
の分図である。FIG. 9 is a third flowchart showing the operation of the second embodiment.
FIG.

【図１０】第２実施例の動作を示すフローチャートの第
４の分図である。FIG. 10 is a fourth partial diagram of the flowchart showing the operation of the second embodiment.

【図１１】第２実施例の動作を示すフローチャートの第
５の分図である。FIG. 11 is a fifth partial diagram of the flowchart showing the operation of the second embodiment.

【図１２】本発明の第３実施例に係る機械翻訳装置の構
成を示すブロック図である。FIG. 12 is a block diagram showing a configuration of a machine translation device according to a third embodiment of the present invention.

【図１３】第３実施例の動作を示すフローチャートであ
る。FIG. 13 is a flowchart showing the operation of the third embodiment.

【図１４】レイアウト反映機能の処理動作を示すローチ
ャートである。FIG. 14 is a row chart showing a processing operation of a layout reflection function.

【図１５】レイアウトの認識結果の例を示す説明図であ
る。FIG. 15 is an explanatory diagram illustrating an example of a layout recognition result.

【図１６】レイアウトデータの回転例を示す説明図であ
る。FIG. 16 is an explanatory diagram showing a rotation example of layout data.

【図１７】従来における文字認識装置を示すブロック図
である。FIG. 17 is a block diagram showing a conventional character recognition device.

【符号の説明】[Explanation of symbols]

１入力部２表示部３記録部４読取部５文字認識部６認識用辞書７文末記号優先処理部８制御部２１読み取り部２２出力部２３入力部２４文字認識部２５最適読み取りレベル判定部２６形態素解析辞書２７記憶部２８制御部３１入力部３２表示部３３レイアウト解析部３４レイアウト記録部３５読取り部３６文字認識部３７認識用辞書３８翻訳部３９レイアウト反映部４０レイアウト回転部４１記録部４２制御部 DESCRIPTION OF SYMBOLS 1 input section 2 display section 3 recording section 4 reading section 5 character recognition section 6 recognition dictionary 7 sentence ending symbol priority processing section 8 control section 21 reading section 22 output section 23 input section 24 character recognition section 25 optimal reading level determination section 26 morpheme Analysis dictionary 27 Storage unit 28 Control unit 31 Input unit 32 Display unit 33 Layout analysis unit 34 Layout recording unit 35 Reading unit 36 Character recognition unit 37 Recognition dictionary 38 Translation unit 39 Layout reflection unit 40 Layout rotation unit 41 Recording unit 42 Control unit

Claims

【特許請求の範囲】[Claims]

【請求項１】入力されたイメージデータの文字認識を
行なう文字認識装置において、読取られた文字中に存在する句読点，ピリオド，カンマ
の認識性に曖昧性があるか否かを判定する手段と、曖昧
性があると判定された場合には当該曖昧な句読点，ピリ
オド，カンマの前後の文章を参照して句読点，ピリオ
ド，カンマのいずれかを決定する手段と、を有すること
を特徴とする文字認識装置。1. A character recognition device for character recognition of input image data, and means for determining whether or not there is ambiguity in the recognizability of punctuation marks, periods, and commas present in a read character, Character recognition characterized by having a means for determining one of a punctuation mark, a period and a comma by referring to the ambiguous punctuation mark, a sentence before and after the period and the comma when it is determined that there is ambiguity. apparatus.

【請求項２】設定された読取りレベルに従って文書画
像を入力する手段と、入力された文書画像から文字行を
切出し、該文字行から各文字を切出して文字認識する手
段とを備えた文字認識装置において、文書画像を読取る原稿の質、濃淡等の各属性に応じた読
取りレベルを予め格納する手段と、実際に原稿を読取る際には当該原稿の属性を識別し、こ
れに対応する読取りレベルにて読取りを行なう手段と、を有することを特徴とする文字認識装置。2. A character recognition device comprising means for inputting a document image according to a set reading level, and means for cutting out a character line from the input document image and cutting out each character from the character line to perform character recognition. In the above, the means for pre-storing the reading level corresponding to each attribute such as the quality and shade of the original for reading the document image, and the attribute of the original when actually reading the original are identified, and the reading level corresponding to this is identified. A character recognition device comprising:

【請求項３】第１の言語を第２の言語に翻訳する機械
翻訳装置において、原稿に記載された第１の言語文字の
画像データを認識し、当該文字のレイアウトを解析する
レイアウト解析手段と、この第１の言語文字が縦書きであるか横書きであるかを
判定する文字方向判定手段と、第１の言語文字と、これを翻訳した第２の言語文字との
文字方向が異なる場合には前記レイアウト解析手段にて
解析されたレイアウトの方向を文字方向と同様に変更
し、変更後のレイアウトで第２の言語文字を出力する手
段と、を有することを特徴とする機械翻訳装置。3. A machine translation device for translating a first language into a second language, and layout analysis means for recognizing image data of a first language character written in a manuscript and analyzing a layout of the character. When the character direction determining means for determining whether the first language character is vertical writing or horizontal writing, and when the character direction of the first language character and the second language character that translates the first language character are different from each other, And a means for changing the layout direction analyzed by the layout analysis means in the same manner as the character direction and outputting the second language character in the changed layout.