JPH06139277A

JPH06139277A - Electronic dictionary device

Info

Publication number: JPH06139277A
Application number: JP4287890A
Authority: JP
Inventors: Ayako Itsubo; 綾子伊坪
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1992-10-26
Filing date: 1992-10-26
Publication date: 1994-05-20

Abstract

PURPOSE:To provide an electronic dictionary device in which a character string being the object of a dictionary retrieval can be inputted at a high speed. CONSTITUTION:The image of the character sting described in a document 101 is read by a reader 102. Next, the character image of one character unit is extracted by a character recognizing and Chinese character/Japanese converting device 103, corrected into recognition data, compared with standard recognition data corresponding to the recognition data, and a character recognition is operated. Then, the dictionary retrieval is operated by using a Chinese character/ Japanese dictionary 104 based on the recognized result, and the retrieved result is displayed at a display device 105.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文書等に記載された文字
をスキャンし読み、意味等を出力する電子辞書装置に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an electronic dictionary device which scans and reads characters written in a document or the like and outputs the meaning and the like.

【０００２】[0002]

【従来の技術】電子辞書においては、辞書検索対象の文
字列をキーボード等により入力している。このキーボー
ドは複数のキーに対し、それぞれ１文字または複数の文
字を割り当てておき、少なくとも１個のキーを操作する
事により文字を入力するものである。2. Description of the Related Art In an electronic dictionary, a character string to be searched for in a dictionary is input with a keyboard or the like. In this keyboard, one character or a plurality of characters is assigned to each of a plurality of keys, and a character is input by operating at least one key.

【０００３】そこで、前記電子辞書における入力所要時
間の短縮化を図るために、英文をスキャンして文字認識
を行うことにより高速入力を実現し、単語の意味等を出
力する方法（特開Ｓ６３−２７３１６６）や、漢字一文
字をスキャンし文字認識を行い、入力した一文字の漢字
の読み等を出力する方法（特開Ｓ６２−１３４７６５）
が示されている。Therefore, in order to shorten the time required for input in the electronic dictionary, a method of realizing high-speed input by scanning an English sentence and performing character recognition and outputting the meaning of a word (Japanese Patent Laid-Open No. S63- 273166), or a method of scanning one character of Kanji and recognizing the character and outputting reading of the input Kanji (Japanese Patent Laid-Open No. S62-134765).
It is shown.

【０００４】[0004]

【発明が解決しようとする課題】このような従来技術で
は、（１）日本語を高速に入力しても一文字づつの入力検索
しか行なえず、検索に時間を要してしまう。In such a conventional technique, (1) even if Japanese is input at high speed, only character-by-character input retrieval can be performed, and the retrieval takes time.

【０００５】（２）日本語は英文のように単語の区切り
がないので、入力文字列からの単語の抽出が困難であ
る。(2) Since there is no word division in Japanese unlike in English, it is difficult to extract a word from an input character string.

【０００６】（３）文字列が縦書きか横書きかを判断す
ることができない。(3) It is impossible to determine whether the character string is written vertically or horizontally.

【０００７】以上のような問題がある。There are problems as described above.

【０００８】本発明はこのような事情のもとに成された
もので、その目的とするところは、辞書検索対象の文字
列を高速に入力し、縦書きであるか横書きであるか判断
をし、単語を抽出し辞書検索し得る電子辞書装置を提供
することにある。The present invention has been made under such circumstances, and its purpose is to input a character string to be searched for in a dictionary at a high speed and determine whether it is vertical writing or horizontal writing. However, it is another object of the present invention to provide an electronic dictionary device capable of extracting a word and searching a dictionary.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するため
に、本発明の電子辞書装置は文書等に記載された文字の
読み取りを行う読み取り処理と、読み取り処理で読み取
られた文書の一文字単位の文字イメージ抽出を行う文字
抽出処理と、文字抽出処理で抽出された文字イメージか
ら文字認識のための特徴である認識用データを抽出する
特徴抽出処理と、特徴抽出処理で得られた認識用データ
と標準認識用データとを比較することにより文字認識を
行う文字認識処理と、文字認識処理で得られた認識結果
に基づいて辞書に対して検索を行う辞書検索処理と、辞
書検索処理で得られた検索結果を表示装置に表示する表
示処理とを備えている。In order to solve the above-mentioned problems, the electronic dictionary device of the present invention uses a reading process for reading characters written in a document or the like, and a character unit for each character read by the reading process. A character extraction process for extracting a character image, a feature extraction process for extracting recognition data that is a feature for character recognition from the character image extracted by the character extraction process, and a recognition data obtained by the feature extraction process. Character recognition processing that performs character recognition by comparing with the standard recognition data, dictionary search processing that searches the dictionary based on the recognition results obtained by the character recognition processing, and dictionary search processing Display processing for displaying the search result on the display device.

【００１０】[0010]

【実施例】【Example】

（実施例１）以下、本発明の一実施例を図面に基づいて
説明する。(Embodiment 1) An embodiment of the present invention will be described below with reference to the drawings.

【００１１】図１は本発明の電子辞書装置を実行するた
めの制御システムの一実施例及び処理制御状態を示す図
である。本実施例では辞書として漢和辞書を用いた例を
示す。この実施例に係わる制御システムは、活字印刷等
の文書１０１を読み取るための読み取り装置１０２と、
読み取り装置１０２によって読み取られたイメージデー
タを認識して漢和変換する文字認識・漢和変換装置１０
３と、文字認識・漢和変換装置１０３によって検索され
るデータがファイルされた漢和辞書１０４と、文字認識
・漢和変換装置で得られた結果が表示される表示装置１
０５と、電子辞書装置の電源や読み取りスイッチや認識
した漢字の読み・意味を順次表示するスイッチなどのス
イッチ群１０６とから成る。FIG. 1 is a diagram showing an embodiment of a control system for executing the electronic dictionary device of the present invention and a process control state. In this embodiment, an example in which a Hanwa dictionary is used as the dictionary is shown. A control system according to this embodiment includes a reading device 102 for reading a document 101 such as type printing.
Character recognition / Kanwa conversion device 10 for recognizing image data read by the reading device 102 and converting the data into Hanwa
3, a Hanwa dictionary 104 in which data searched by the character recognition / Kanwa conversion device 103 is filed, and a display device 1 for displaying results obtained by the character recognition / Kanwa conversion device
05, and a switch group 106 such as a power supply of the electronic dictionary device, a reading switch, and a switch for sequentially displaying the reading and meaning of the recognized kanji.

【００１２】図２は文字認識・漢和変換装置１０３の構
成例を示す図である。文字認識・漢和変換装置１０３
は、読み取り装置１０２に接続されて文書イメージの読
み取りを行うイメージ読み取り部２０１と、イメージ読
み取り部２０１で読み取られた文書のイメージデータが
格納されるイメージメモリ２０２と、イメージメモリ２
０２に格納されたデータに一文字切り出し処理を施す文
字抽出部２０３と、文字抽出部２０３で抽出された一文
字単位のイメージデータが格納される文字イメージメモ
リ２０４と、文字イメージメモリ２０４に格納された一
文字単位のイメージデータを認識用のデータに変換する
特徴抽出部２０５と、特徴抽出部２０５によって得られ
た認識用データを格納する認識用データメモリ２０６
と、標準文字イメージから事前に抽出した標準認識用デ
ータと標準文字イメージが示す文字コードが組になって
格納された認識用辞書メモリ２０８と、認識用データメ
モリ２０６に格納された認識用データを認識用辞書メモ
リ２０８に格納された標準認識用データと比較すること
により、抽出した文字イメージが示す文字コードを決定
する文字認識部２０７と、文字認識部２０７で得られた
文字コードを格納する認識結果データメモリ２０９と、
認識結果データメモリ２０９に格納された認識結果デー
タである文字コードに基づいて漢和辞書１０４に対して
検索を行う漢和辞書検索部２１０と、漢和辞書検索部２
１０で得られた検索結果を表示装置１０５に表示する表
示部２１１と、文字認識・漢和変換装置１０３を構成す
る各部の制御を行う制御部２１２とを有している。FIG. 2 is a diagram showing a configuration example of the character recognition / Kanawa conversion device 103. Character recognition and Hanwa conversion device 103
Is an image reading unit 201 that is connected to the reading device 102 to read a document image, an image memory 202 that stores image data of a document read by the image reading unit 201, and an image memory 2.
A character extraction unit 203 for performing a character extraction process on the data stored in 02, a character image memory 204 in which the image data of each character extracted by the character extraction unit 203 is stored, and a character stored in the character image memory 204. A feature extraction unit 205 that converts unit image data into recognition data, and a recognition data memory 206 that stores the recognition data obtained by the feature extraction unit 205.
A recognition dictionary memory 208 in which the standard recognition data previously extracted from the standard character image and the character code indicated by the standard character image are stored as a set, and the recognition data stored in the recognition data memory 206. A character recognition unit 207 that determines the character code indicated by the extracted character image by comparing with the standard recognition data stored in the recognition dictionary memory 208, and a recognition that stores the character code obtained by the character recognition unit 207. A result data memory 209,
A Hanwa dictionary search unit 210 that searches the Hanwa dictionary 104 based on the character code that is the recognition result data stored in the recognition result data memory 209, and a Hanwa dictionary search unit 2
It has a display unit 211 for displaying the search result obtained in 10 on the display device 105, and a control unit 212 for controlling each unit constituting the character recognition / Kanawa conversion device 103.

【００１３】図３は本実施例の処理の概要を示す流れ図
である。FIG. 3 is a flow chart showing an outline of the processing of this embodiment.

【００１４】始めに読み取り処理Ｓ３０１において、イ
メージ読み取り部２０１は文書の読み取りを行いイメー
ジメモリ２０２に格納する。First, in the reading process S301, the image reading unit 201 reads a document and stores it in the image memory 202.

【００１５】次に文字抽出処理Ｓ３０２において、文字
抽出部２０３はイメージメモリ２０２に格納された文書
の一文字単位の文字イメージ抽出を行い、文字イメージ
メモリ２０４に格納する。Ｓ３０２での抽出処理の一例
を図６を参照しながら以下に示す。横書きの文字列を読
み取った場合、まず読み取った文書画像全体の横方向
（垂直軸に対する）の射影を求め、求めた射影の値が設
定した値を越える範囲を認識対象となる文字列と判断す
る。そして認識対象と判断された文字列内の縦方向（水
平軸に対する）の射影（図６の６０５）を求め、求めた
射影の値が設定した値（図６の６０６）を越える範囲に
文字が存在すると考え、文字が存在しない部分を文字と
文字との境界と判断する。次に一文字づつ外接矩形を求
め抽出処理を終える。但し、縦方向の（水平軸に対す
る）射影（図６の６０５）を求めたとき、求めた射影の
値が設定した値（図６の６０６）を越える範囲であって
も、読み取り範囲の端に引っかかっていれば、抽出処理
の対象としない。Next, in the character extraction processing S302, the character extraction unit 203 extracts a character image of the document stored in the image memory 202 in units of one character and stores it in the character image memory 204. An example of the extraction process in S302 will be described below with reference to FIG. When a horizontally written character string is read, first the projection of the entire read document image in the horizontal direction (with respect to the vertical axis) is obtained, and the range in which the value of the obtained projection exceeds the set value is determined as the character string to be recognized. . Then, the projection (605 in FIG. 6) in the vertical direction (with respect to the horizontal axis) in the character string determined as the recognition target is obtained, and the character is within the range in which the value of the obtained projection exceeds the set value (606 in FIG. 6). Considering that the character exists, the portion where the character does not exist is determined as the boundary between the characters. Then, a circumscribed rectangle is obtained for each character, and the extraction process is completed. However, when the projection in the vertical direction (with respect to the horizontal axis) (605 in FIG. 6) is obtained, even if the value of the obtained projection exceeds the set value (606 in FIG. 6), it will be at the end of the reading range. If it is caught, it will not be subject to extraction processing.

【００１６】文字抽出処理ｓ３０２において文字抽出処
理を行った一例を図６に示す。図中の実線で示した矩形
が上記文字抽出処理で抽出した文字イメージである。こ
の例ような文字イメージを読み取った際に、読み取り範
囲の端に引っかかった文字（点線で示した６０１・６０
２）は以下の処理の対象とせず、文字イメージ「で漢字
を」のみ抽出される。FIG. 6 shows an example of the character extraction processing performed in the character extraction processing s302. The rectangle shown by the solid line in the figure is the character image extracted by the character extraction processing. When a character image like this example is read, the characters caught on the edge of the reading range (601.60 shown by the dotted line
In 2), only the character image “de Kanji” is extracted without subjecting it to the following processing.

【００１７】縦書きの文字列を読み取った例を図６の
（ｂ）に示す。縦書きの場合も横書きのときと同様の処
理により、６０３・６０４のような読み取り範囲の端に
引っかかった文字イメージを除くことができる。An example of reading a vertically written character string is shown in FIG. 6 (b). In the case of vertical writing as well, by the same processing as in horizontal writing, it is possible to remove character images such as 603 and 604 that are caught at the end of the reading range.

【００１８】次に特徴抽出処理Ｓ３０３において、特徴
抽出部２０５は文字イメージメモリ２０４に格納された
文字イメージから文字認識のための特徴である認識用デ
ータを抽出し、認識用データメモリ２０６に格納する。
ここで、文字認識のための特徴としてペリフェラル特徴
を用いた例を以下に示す。まず、文字パターンの外接枠
の４辺をそれぞれｎ分割し、分割された外接枠と、外接
枠からみて最初に出会う文字部で囲まれた白領域の面積
を計数し、これを全体の面積で規格化することによっ
て、ペリフェラル特徴量が抽出される。Next, in the feature extraction processing S303, the feature extraction unit 205 extracts the recognition data which is the feature for character recognition from the character image stored in the character image memory 204 and stores it in the recognition data memory 206. .
Here, an example using the peripheral feature as a feature for character recognition is shown below. First, each of the four sides of the circumscribing frame of the character pattern is divided into n parts, and the area of the divided circumscribing frame and the area of the white area surrounded by the first character portion that is encountered from the circumscribing frame are counted, and this is calculated as By normalizing, the peripheral feature amount is extracted.

【００１９】次に文字認識処理ｓ３０４において、文字
認識部２０７は認識用データメモリ２０６に格納された
認識用データと認識用辞書メモリ２０８に格納された標
準認識用データとを比較することにより、縦書き横書き
の判断をするとともに文字認識を行い、認識結果の文字
コードを認識結果データメモリ２０９に格納する。文字
認識処理ｓ３０４における縦書き横書きの判断方法は、
抽出した最初のｎ文字の文字イメージを９０度回転させ
認識した結果と、そのままの文字イメージを認識した結
果を比較し、どちらが標準認識用データとの類似度が高
くなるかを判断することで、読み込まれた文字列が縦書
きか横書きかを判断するという方法である。そして判断
した結果に基づいてｎ＋１文字目以降の文字イメージは
横書きと判断した場合はそのままで、縦書きと判断した
場合は文字イメージを９０度回転させて文字認識を行
う。例えばｎ＝２とすると、図６（ａ）において、最初
の２つの文字イメージをそれぞれ時計回りに９０度回転
させ認識した結果と、そのままの文字イメージを認識し
た結果を比較し、９０度回転させ認識した結果の方が標
準認識用データとの類似度が高ければ縦書き、そのまま
の文字イメージを認識した結果の方が標準認識用データ
との類似度が高ければ横書きと判断する。図６（ａ）は
横書き、図６（ｂ）は縦書きと判断され、３文字目以降
の文字イメージはその結果に基づいて、そのまま、もし
くは時計回りに９０度回転して文字認識を行う。Next, in the character recognition processing s304, the character recognition unit 207 compares the recognition data stored in the recognition data memory 206 with the standard recognition data stored in the recognition dictionary memory 208, and thereby the vertical recognition is performed. The writing or horizontal writing is determined and the character recognition is performed, and the character code of the recognition result is stored in the recognition result data memory 209. The method of determining vertical writing and horizontal writing in the character recognition processing s304 is as follows.
By comparing the result of recognizing the extracted character image of the first n characters by rotating it by 90 degrees and the result of recognizing the character image as it is, by judging which has a higher similarity to the standard recognition data, This is a method of determining whether the read character string is vertical writing or horizontal writing. Then, based on the result of the judgment, the character image after the (n + 1) th character is left as it is if it is judged to be horizontal writing, and if it is judged to be vertical writing, the character image is rotated by 90 degrees for character recognition. For example, if n = 2, in FIG. 6 (a), the result of recognizing the first two character images rotated 90 degrees clockwise is compared with the result of recognizing the character image as it is, and rotated 90 degrees. If the recognized result has a higher degree of similarity with the standard recognition data, vertical writing is performed. If the result of recognizing the character image as it is has a higher degree of similarity with the standard recognition data, it is determined to be horizontal writing. It is determined that FIG. 6A is horizontal writing and FIG. 6B is vertical writing, and the character images of the third and subsequent characters are recognized as they are or by rotating clockwise by 90 degrees based on the result.

【００２０】次に辞書検索処理Ｓ３０５において、漢和
辞書検索部２１０は、認識結果データメモリ２０９に格
納されたデータに基づいて漢和辞書１０４に対して検索
を行い、単語の抽出をし、抽出した単語の読み・意味等
を抽出する。Next, in the dictionary search processing S305, the Hanwa dictionary search unit 210 searches the Hanwa dictionary 104 based on the data stored in the recognition result data memory 209, extracts the words, and extracts the extracted words. Extract the reading and meaning of.

【００２１】最後に表示処理ｓ３０６において、表示部
２１１は漢和辞書検索部で検索した結果を表示装置１０
５に表示する。検索結果が表示装置１０５に表示しきれ
ない場合は、スイッチ群１０６の表示送りスイッチによ
り次の表示ができるものとする。Finally, in the display processing s306, the display unit 211 displays the result of the search by the Hanwa dictionary search unit on the display device 10.
Display in 5. If the search results cannot be displayed on the display device 105, the next display can be performed by the display advance switch of the switch group 106.

【００２２】図４は図３の各ステップの処理の概要を入
力例を用いて示した図である。例えば「で漢字を」とい
う文字列中の「漢字」の２文字を辞書検索したい場合に
ついてみる。まず読み取り処理ｓ３０１によって「で漢
字を」という文字列を読み取ると、読み取ったままのイ
メージデータｄ３０１としてイメージメモリ２０２に格
納される。次に文字抽出処理ｓ３０２によってイメージ
メモリ２０２に格納されたイメージデータｄ３０１の文
字抽出を行う。文字抽出処理が終了すると文字イメージ
データｄ３０２として文字イメージメモリ２０４に格納
される。次に特徴抽出処理ｓ３０３によって文字イメー
ジメモリ２０４に格納された文字イメージデータｄ３０
２の認識のための特徴を抽出し、抽出結果を認識用デー
タｄ３０３として認識用データメモリ２０６に格納す
る。次に文字認識処理ｓ３０４によって、認識用辞書メ
モリ２０８に格納された標準認識用データと認識用デー
タメモリ２０６に格納された認識用データｄ３０３を比
較することにより認識を行い、認識結果を認識結果デー
タｄ３０４として認識結果データメモリ２０９に格納す
る。次に辞書検索処理Ｓ３０５において、認識結果デー
タメモリ２０９に格納された認識結果データｄ３０４に
基づいて漢和辞書１０４に対して検索を行う。そしてｓ
３０６によってディスプレイ１０５に辞書検索処理ｓ３
０５で得られた検索結果を表示する。本実施例では認識
した漢字の読みと意味を表示する。FIG. 4 is a diagram showing an outline of the processing of each step of FIG. 3 by using an input example. For example, let's consider a case where a user wants to perform a dictionary search for two characters of "Kanji" in a character string "de Kanji". First, when the character string “de-kanji” is read by the reading process s301, it is stored in the image memory 202 as the read image data d301. Next, the character extraction processing s302 extracts characters from the image data d301 stored in the image memory 202. When the character extraction processing is completed, the character image data d302 is stored in the character image memory 204. Next, the character image data d30 stored in the character image memory 204 by the feature extraction processing s303.
The second recognition feature is extracted, and the extraction result is stored in the recognition data memory 206 as the recognition data d303. Next, the character recognition processing s304 performs recognition by comparing the standard recognition data stored in the recognition dictionary memory 208 with the recognition data d303 stored in the recognition data memory 206, and the recognition result is recognized. It is stored in the recognition result data memory 209 as d304. Next, in the dictionary search process S305, the Hanwa dictionary 104 is searched based on the recognition result data d304 stored in the recognition result data memory 209. And s
The dictionary search process s3 on the display 105 by 306
The search result obtained in 05 is displayed. In this embodiment, the reading and meaning of the recognized kanji are displayed.

【００２３】図５は本発明の電子辞書装置の概略図の例
であり、５０１は電子辞書装置の電源、５０２は文字読
み取り部、５０３は読み取りスイッチ、５０４は認識し
た漢字の読み・意味を順次表示するスイッチ、５０５は
認識結果の次候補を順次表示するスイッチ、５０６は認
識結果・読み・意味の表示部である。FIG. 5 is an example of a schematic view of the electronic dictionary device of the present invention. 501 is a power source of the electronic dictionary device, 502 is a character reading unit, 503 is a reading switch, and 504 is a reading and meaning of the recognized kanji. A display switch, 505 is a switch for sequentially displaying the next candidate of the recognition result, and 506 is a display part of the recognition result / reading / meaning.

【００２４】なお、本実施例では、漢和辞書を用いて漢
字の読み・意味等の検索例を示したが、本発明は国語辞
書を用いた日本語の意味の検索、中国語辞書を用いた読
み・意味等の検索も可能である。In this embodiment, an example of retrieval of kanji readings and meanings is shown using the Hanwa dictionary, but the present invention uses the Japanese dictionary to retrieve Japanese meanings and the Chinese dictionary. You can also search for reading and meaning.

【００２５】（実施例２）本実施例では、実施例１にお
ける辞書検索処理ｓ３０５での単語抽出処理の実施方法
について説明する。(Embodiment 2) In this embodiment, a method of executing the word extraction processing in the dictionary search processing s305 in Embodiment 1 will be described.

【００２６】図７は図３の辞書検索処理ｓ３０５におけ
る単語抽出処理の流れ図を説明するための図である。ま
た図７の（ｂ）は漢和辞書に登録されている文字列（単
語）の一部を表す図である。本実施例では読み取ること
ができる文字数に限りがあり構文解析はできないため図
７に示すような処理を行う。例えば認識結果データメモ
リ２０９に７０１のようなｎ文字の文字認識結果がある
場合、まず７０２でｉ＝１とし、７０３でｉ番目の文字
コードをキーとし、７０４でキーから始まる単語を漢和
辞書１０４から抽出する。次に７０５で７０４によって
抽出された単語とｉ番目以降の文字コードとで一致する
文字列（単語）があるかどうかを判断し、ＹＥＳであれ
ば７０６で一致した文字列（単語）を検索の対象として
抽出し、７０７で終了する。７０５でＮＯの場合は、７
０８でｉ＝ｉ＋１とし、７０９でｉがｎ以下であるかを
判断し、ＹＥＳであればまだ７０１内にキーになり得る
文字があるとし、７０３にもどり処理を続ける。ＮＯの
場合は漢和辞書１０４内に一致する文字列（単語）がな
かったということで７１０で終了する。７０１の例にお
いては２文字目の「漢」というコードをキーとして「漢
字」という文字列が漢和辞書１０４と一致した時点で単
語の抽出を終了する。FIG. 7 is a diagram for explaining a flow chart of the word extraction process in the dictionary search process s305 of FIG. Further, FIG. 7B is a diagram showing a part of a character string (word) registered in the Hanwa dictionary. In this embodiment, since the number of characters that can be read is limited and parsing is not possible, the processing shown in FIG. 7 is performed. For example, when the recognition result data memory 209 has a character recognition result of n characters such as 701, first, i = 1 in 702, the i-th character code in 703 is used as a key, and the word starting from the key is used in 704 as the Hanwa dictionary 104. Extract from. Next, in 705, it is determined whether or not there is a character string (word) that matches the word extracted by 704 and the i-th and subsequent character codes. If YES, the matching character string (word) is searched in 706. The target is extracted and the processing ends at 707. If NO at 705, 7
In step 08, i = i + 1 is set, and in step 709, it is determined whether i is n or less. If YES, it is determined that there is a character that can be a key in 701, and the processing is returned to 703 and continued. If NO, it means that there is no matching character string (word) in the Hanwa dictionary 104, and the process ends at 710. In the example of 701, when the character string “Kanji” matches the Hanwa dictionary 104 with the second character “Kan” as a key, the word extraction is terminated.

【００２７】前記辞書検索処理ｓ３０５の処理におい
て、７０６で抽出した単語以外にも７０１内に単語があ
るかどうかを見る場合、７１１においてｉ＝ｉ＋ｍ（ｍ
＝７０６で抽出した単語の文字数）とし、７０９からの
処理を続ける。すると、７０１の２つ目の単語「仮名」
も抽出される。In the processing of the dictionary search processing s305, if it is checked whether or not there is a word in 701 other than the word extracted in 706, i = i + m (m
= The number of characters of the word extracted in 706), and the processing from 709 is continued. Then, the second word of 701, "Kana"
Is also extracted.

【００２８】更に前記辞書検索処理ｓ３０５の処理にお
いて、１文字目が漢字で始まる単語のみを抽出する、す
なわちキーになる文字を漢字のみと限定した処理であれ
ば、７０３でｉ番目の文字が漢字以外の文字であった場
合は、７１２でｉ＝ｉ＋１とし、７０３にもどり処理を
続ける。Further, in the dictionary search process s305, if only the word whose first character starts with a Chinese character is extracted, that is, if the key character is limited to the Chinese character only, the i-th character is the Chinese character in 703. If the character is other than i, i = i + 1 is set in 712, and the process returns to 703 and continues.

【００２９】以上のようにして、辞書検索処理ｓ３０５
では、認識結果データメモリ２０９内の認識結果を基に
単語の抽出を行う。As described above, the dictionary search processing s305
Then, the words are extracted based on the recognition result in the recognition result data memory 209.

【００３０】[0030]

【発明の効果】以上説明したように、本発明によれば、
辞書検索対象の文字列を、キーボードにより１文字１文
字入力することなく、イメージ読み取り処理にて読み取
り、その中から一文字単位で抽出し、その一文字単位の
イメージを自動的に文字コードに補正して、辞書検索対
象の文字列として処理するように構成したので、イメー
ジ読み取り処理により辞書検索対象の文字列の入力が一
括して行われ、辞書検索対象の文字列を高速に入力する
ことができ、ひいては、高速に所望の情報（読み、意味
等）を知得することができる電子辞書装置を実現するこ
とが可能となる。As described above, according to the present invention,
The character string to be searched for in the dictionary is read by the image reading process without inputting each character with the keyboard, and it is extracted character by character from the image, and the image of each character is automatically corrected to the character code. Since it is configured to process as a character string to be searched for in a dictionary, the character strings to be searched for in a dictionary are collectively input by the image reading process, and the character string to be searched for in a dictionary can be input at high speed. As a result, it is possible to realize an electronic dictionary device that can obtain desired information (reading, meaning, etc.) at high speed.

【００３１】また、文字認識処理にて入力した文字列が
縦書きであるか横書きであるか判断をし、辞書検索処理
にて単語を抽出し辞書検索することで、日本語の単語の
意味等を得ることができる電子辞書装置を実現すること
が可能となる。Further, it is determined whether the character string input in the character recognition processing is vertical writing or horizontal writing, and the dictionary search processing extracts the words and searches the dictionary to determine the meaning of the Japanese words. It is possible to realize an electronic dictionary device that can obtain.

【図面の簡単な説明】[Brief description of drawings]

【図１】電子辞書装置を実行するための制御システムの
一実施例及び処理制御状態を示す図である。FIG. 1 is a diagram showing an embodiment of a control system for executing an electronic dictionary device and a process control state.

【図２】文字認識・漢和変換装置の構成を示す図であ
る。FIG. 2 is a diagram showing a configuration of a character recognition / Kanawa conversion device.

【図３】本実施例の処理の概要を示す流れ図である。FIG. 3 is a flowchart showing an outline of processing of this embodiment.

【図４】図３の各ステップの処理の概要を入力例を用い
て示した図である。FIG. 4 is a diagram showing an outline of processing of each step of FIG. 3 by using an input example.

【図５】本発明の電子辞書装置の概略図である。FIG. 5 is a schematic diagram of an electronic dictionary device of the present invention.

【図６】図３の文字抽出処理において文字抽出処理を行
った一例の図である。FIG. 6 is a diagram showing an example of character extraction processing performed in the character extraction processing of FIG.

【図７】図３の辞書検索処理における単語抽出処理の一
例を示す流れ図である。FIG. 7 is a flowchart showing an example of word extraction processing in the dictionary search processing of FIG.

【符号の説明】[Explanation of symbols]

１０１・・・文書１０２・・・読み取り装置１０３・・・文字認識・漢和変換装置１０４・・・漢和辞書１０５・・・表示装置１０６・・・スイッチ群 101 ... Document 102 ... Reading device 103 ... Character recognition / Kanawa conversion device 104 ... Kanawa dictionary 105 ... Display device 106 ... Switch group

Claims

【特許請求の範囲】[Claims]

【請求項１】文書等に記載された文字の読み取りを行
う読み取り処理と、前記読み取り処理で読み取られた文書の一文字単位の文
字イメージ抽出を行う文字抽出処理と、前記文字抽出処理で抽出された文字イメージから文字認
識のための特徴である認識用データを抽出する特徴抽出
処理と、前記特徴抽出処理で得られた認識用データと標準認識用
データとを比較することにより文字認識を行う文字認識
処理と、前記文字認識処理で得られた認識結果に基づいて辞書に
対して検索を行う辞書検索処理と、前記辞書検索処理で得られた検索結果を表示装置に表示
する表示処理とを備えていることを特徴とする電子辞書
装置。1. A reading process for reading characters described in a document, a character extracting process for extracting a character image of each character of the document read by the reading process, and a character extracting process for extracting the character image Feature extraction processing for extracting recognition data, which is a feature for character recognition, from a character image, and character recognition for performing character recognition by comparing the recognition data obtained by the feature extraction processing with the standard recognition data. Processing, a dictionary search processing for searching a dictionary based on the recognition result obtained by the character recognition processing, and a display processing for displaying the search result obtained by the dictionary search processing on a display device. An electronic dictionary device characterized in that

【請求項２】文書等に記載された文字の読み取りを行
う読み取り処理と、前記読み取り処理で読み取られた文
書の一文字単位の文字イメージ抽出を行う文字抽出処理
と、前記文字抽出処理で抽出された文字イメージから文
字認識のための特徴である認識用データを抽出する特徴
抽出処理と、前記特徴抽出処理で得られた認識用データ
と標準認識用データとを比較することにより文字認識を
行う文字認識処理と、前記文字認識処理で得られた認識
結果に基づいて辞書に対して検索を行う辞書検索処理
と、前記辞書検索処理で得られた検索結果を表示装置に
表示する表示処理とを備えた電子辞書装置において、前記文字抽出処理には、文字イメージを読み取った際に
読み取り範囲の端に引っかかった文字イメージは抽出処
理の対象としない機能を備えていることを特徴とする請
求項１記載の電子辞書装置。2. A reading process for reading characters written in a document, a character extracting process for extracting a character image of each character of the document read by the reading process, and a character extracting process for extracting the character image. Character recognition for performing character recognition by comparing feature recognition processing for extracting recognition data that is a feature for character recognition from a character image with recognition data obtained by the feature extraction processing and standard recognition data Processing, a dictionary search processing for searching a dictionary based on the recognition result obtained by the character recognition processing, and a display processing for displaying the search result obtained by the dictionary search processing on a display device. In the electronic dictionary device, the character extraction process does not include a character image caught at the end of the reading range when the character image is read as a target of the extraction process. That it comprises an electronic dictionary apparatus according to claim 1, wherein.

【請求項３】文書等に記載された文字の読み取りを行
う読み取り処理と、前記読み取り処理で読み取られた文
書の一文字単位の文字イメージ抽出を行う文字抽出処理
と、前記文字抽出処理で抽出された文字イメージから文
字認識のための特徴である認識用データを抽出する特徴
抽出処理と、前記特徴抽出処理で得られた認識用データ
と標準認識用データとを比較することにより文字認識を
行う文字認識処理と、前記文字認識処理で得られた認識
結果に基づいて辞書に対して検索を行う辞書検索処理
と、前記辞書検索処理で得られた検索結果を表示装置に
表示する表示処理とを備えた電子辞書装置において、前記文字認識処理には、前記文字抽出処理で抽出した最
初のｎ文字の文字イメージを９０度回転させ認識した結
果と、そのままの文字イメージを認識した結果を比較
し、どちらが標準認識用データとの類似度が高くなるか
を判断することで、読み込まれた文字列が縦書きか横書
きかを判断する機能を備えていることを特徴とする請求
項１記載の電子辞書装置。3. A reading process for reading characters written in a document, a character extracting process for extracting a character image of each character of the document read by the reading process, and a character extracting process for extracting the character image. Character recognition for performing character recognition by comparing feature recognition processing for extracting recognition data that is a feature for character recognition from a character image with recognition data obtained by the feature extraction processing and standard recognition data Processing, a dictionary search processing for searching a dictionary based on the recognition result obtained by the character recognition processing, and a display processing for displaying the search result obtained by the dictionary search processing on a display device. In the electronic dictionary device, in the character recognition processing, a result of recognizing a character image of the first n characters extracted in the character extraction processing by rotating 90 degrees, and the character It features a function to judge whether the read character string is vertical writing or horizontal writing by comparing the results of recognizing images and judging which has a higher similarity to the standard recognition data. The electronic dictionary device according to claim 1.

【請求項４】文書等に記載された文字の読み取りを行
う読み取り処理と、前記読み取り処理で読み取られた文
書の一文字単位の文字イメージ抽出を行う文字抽出処理
と、前記文字抽出処理で抽出された文字イメージから文
字認識のための特徴である認識用データを抽出する特徴
抽出処理と、前記特徴抽出処理で得られた認識用データ
と標準認識用データとを比較することにより文字認識を
行う文字認識処理と、前記文字認識処理で得られた認識
結果に基づいて辞書に対して検索を行う辞書検索処理
と、前記辞書検索処理で得られた検索結果を表示装置に
表示する表示処理とを備えた電子辞書装置において、前記辞書検索処理には、前記文字認識処理で得られた認
識結果の各文字を単語の一文字目のキーとして前記辞書
への検索を行い、単語を抽出する機能を備えていること
を特徴とする請求項１記載の電子辞書装置。4. A reading process for reading a character written in a document, a character extracting process for extracting a character image of each character of the document read by the reading process, and a character extracting process for extracting the character image. Character recognition for performing character recognition by comparing feature recognition processing for extracting recognition data that is a feature for character recognition from a character image with recognition data obtained by the feature extraction processing and standard recognition data Processing, a dictionary search processing for searching a dictionary based on the recognition result obtained by the character recognition processing, and a display processing for displaying the search result obtained by the dictionary search processing on a display device. In the electronic dictionary device, in the dictionary search process, each character of the recognition result obtained in the character recognition process is searched in the dictionary using the first character key of the word as a key, and the word Electronic dictionary apparatus according to claim 1, characterized in that it comprises a function for extracting.