JP2014146183A

JP2014146183A - Information processing device, information processing method, and program

Info

Publication number: JP2014146183A
Application number: JP2013014550A
Authority: JP
Inventors: Tomoharu Yamaguchi; 倫治山口
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2013-01-29
Filing date: 2013-01-29
Publication date: 2014-08-14

Abstract

PROBLEM TO BE SOLVED: To correct a recognition result of a recognition character string with high accuracy.SOLUTION: A result acquisition unit 610 acquires a recognition result of a character string. A candidate acquisition unit 630 acquires a candidate character string registered in a word dictionary. When the same character appears in a word and the recognition result at similar positions, a comparison unit 640 compares a candidate character string with the recognition result from each of the positions to calculate a partial score. A determination unit 650 calculates a total degree of similarity from the partial score, and a selection unit 660 determines an output character string in accordance with the degree of similarity. An output unit 670 outputs the output character string.

Description

本発明は、文字列を処理する技術に関する。 The present invention relates to a technique for processing a character string.

ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅｃｏｇｎｉｔｉｏｎ）や手書き認識による文字認識結果を補正して文字認識の精度を高める技術が知られている。例えば、特許文献１は各文字に複数の文字候補が定義された認識結果と、単語辞書に登録された単語とを比較し、一致度が高い単語を選択して出力する単語認識方法を開示している。特許文献１に記載の技術では、単語の各位置に現れる文字と、対応する位置の認識結果を位置毎に比較する。 There is known a technique for improving the accuracy of character recognition by correcting a character recognition result by OCR (Optical Character Recognition) or handwriting recognition. For example, Patent Document 1 discloses a word recognition method that compares a recognition result in which a plurality of character candidates are defined for each character and a word registered in a word dictionary, and selects and outputs a word having a high degree of matching. ing. In the technique described in Patent Document 1, a character appearing at each position of a word is compared with a recognition result of a corresponding position for each position.

特開２０１１−１３８４３５号公報JP 2011-138435 A

特許文献１に記載の技術では、認識対象となった文字列（認識文字列）の認識結果と、単語辞書の単語とを、文字が現れる位置毎に局所的に比較するのみである。そのため、認識結果の位置が認識文字列とずれていた場合や、一部の文字の認識に失敗した場合に補正の精度が大きく低下するなど、文字認識結果を補正する精度が低かった。 In the technique described in Patent Document 1, the recognition result of the character string (recognized character string) that is a recognition target is only compared locally with the word in the word dictionary for each position where the character appears. Therefore, the accuracy of correcting the character recognition result is low, such as when the position of the recognition result is shifted from the recognized character string, or when the recognition of some characters fails, the accuracy of correction is greatly reduced.

本発明は、このような状況に鑑みてなされたものであり、認識文字列の認識結果を高い精度で補正する情報処理装置を提供することを目的とする。 The present invention has been made in view of such a situation, and an object thereof is to provide an information processing apparatus that corrects a recognition result of a recognized character string with high accuracy.

上記目的を達成するため、本発明に係る情報処理装置は、
文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得する結果取得手段と、
前記認識文字列に対応する複数の候補単語を取得する候補取得手段と、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定する決定手段と、
前記複数の候補単語について前記決定手段が決定した類似度に基づいて候補単語の１つを選択し出力する出力手段と、
を備えることを特徴とする。 In order to achieve the above object, an information processing apparatus according to the present invention provides:
A result acquisition means for acquiring a recognition character string having a plurality of recognition candidate characters ranked for each character as a word recognition result from the character recognition device;
Candidate acquisition means for acquiring a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. A decision means to
Output means for selecting and outputting one of the candidate words based on the similarity determined by the determining means for the plurality of candidate words;
It is characterized by providing.

本発明によれば、認識文字列の認識結果を高い精度で補正することができる。 According to the present invention, the recognition result of the recognized character string can be corrected with high accuracy.

本発明の実施形態１に係る文字認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the character recognition apparatus which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る文字認識装置の情報処理部の物理構成を示すブロック図である。It is a block diagram which shows the physical structure of the information processing part of the character recognition apparatus which concerns on Embodiment 1 of this invention. 実施形態１に係る文字認識装置が実行する処理を説明するための図である。It is a figure for demonstrating the process which the character recognition apparatus which concerns on Embodiment 1 performs. 実施形態１に係る文字列処理部の機能構成を示すブロック図である。It is a block diagram which shows the function structure of the character string process part which concerns on Embodiment 1. FIG. 実施形態１に係る文字認識装置が実行する認識結果出力処理を示すフローチャートである。It is a flowchart which shows the recognition result output process which the character recognition apparatus which concerns on Embodiment 1 performs. 実施形態１に係る文字認識装置が実行する類似度決定処理を示すフローチャートである。It is a flowchart which shows the similarity determination process which the character recognition apparatus which concerns on Embodiment 1 performs. 実施形態１に係る文字認識装置が実行する詳細スコア算出処理を示すフローチャートである。It is a flowchart which shows the detailed score calculation process which the character recognition apparatus which concerns on Embodiment 1 performs. 実施形態１に係るヒット位置の例を示す図である。It is a figure which shows the example of the hit position which concerns on Embodiment 1. FIG. 実施形態１に係るスコア表の例を示す図である。It is a figure which shows the example of the score table which concerns on Embodiment 1. FIG. 実施形態１に係る認識結果と比較文字列の比較例を示す図である。It is a figure which shows the comparative example of the recognition result which concerns on Embodiment 1, and a comparison character string. 実施形態１に係る認識結果と比較文字列の比較例を示す図である。It is a figure which shows the comparative example of the recognition result which concerns on Embodiment 1, and a comparison character string.

以下、本発明を実施するための形態に係る文字認識装置を、図を参照して説明する。なお、図中同一又は相当する部分には同一符号を付す。 Hereinafter, a character recognition device according to an embodiment for carrying out the present invention will be described with reference to the drawings. In the drawings, the same or corresponding parts are denoted by the same reference numerals.

（実施形態１）
実施形態１に係る文字認識装置１は、ｉ）解析対象となる文字列を記載した紙等を撮影する撮影機能、ｉｉ）撮影した画像の文字列を認識して認識結果を出力する機能、ｉｉｉ）認識結果を単語辞書に登録された単語と比較する機能、ｉｖ)比較結果に応じて認識結果を補正する機能、等を備える。 (Embodiment 1)
The character recognition device 1 according to the first embodiment includes: i) a photographing function for photographing paper or the like on which a character string to be analyzed is recorded; ii) a function for recognizing a character string of a photographed image and outputting a recognition result; A) a function for comparing the recognition result with a word registered in the word dictionary; and iv) a function for correcting the recognition result according to the comparison result.

文字認識装置１は図１に示すように画像入力部１０と、ＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）２１０と文字列処理部６０と文字辞書記憶部２２０と単語辞書記憶部２４０とを含む情報処理部２０と、表示部４０と、操作部３０と、を備える。 As shown in FIG. 1, the character recognition device 1 includes an image input unit 10, an OCR (Optical Character Reader) 210, a character string processing unit 60, a character dictionary storage unit 220, and a word dictionary storage unit 240. The display unit 40 and the operation unit 30 are provided.

画像入力部１０は、カメラと画像処理部とから構成され、このような物理構成により文字列を撮影した画像を取得する。画像入力部１０は、取得した画像を情報処理部２０のＯＣＲ２１０に伝達する。 The image input unit 10 includes a camera and an image processing unit, and acquires an image obtained by capturing a character string with such a physical configuration. The image input unit 10 transmits the acquired image to the OCR 210 of the information processing unit 20.

情報処理部２０は、物理的には、図２に示すようにＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２１と、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２２と、ＲＯＭ（Ｒｅａｄ−ｏｎｌｙｍｅｍｏｒｙ）２３と、Ｉ／Ｏ（Ｉｎｐｕｔ／Ｏｕｔｐｕｔ）部２４と、外部記憶装置２５と、内部バス２６と、から構成される。 As shown in FIG. 2, the information processing unit 20 physically includes a CPU (Central Processing Unit) 21, a RAM (Random Access Memory) 22, a ROM (Read-only memory) 23, an I / O (Input). / Output) unit 24, an external storage device 25, and an internal bus 26.

ＣＰＵ２１は、文字認識装置１の諸機能を制御する中央演算装置であり、ＲＯＭ２３からＲＡＭ２２にロードされたプログラムを実行して、後述する処理を実行する。 The CPU 21 is a central processing unit that controls various functions of the character recognition device 1, and executes a program loaded from the ROM 23 to the RAM 22 to execute processing to be described later.

ＲＡＭ２２は、ＣＰＵ２１が実行するプログラムや中間データを一時的に保存し、ＣＰＵ２１が各種処理を行う際の作業領域として使用する揮発性メモリである。
ＲＯＭ２３は、ＣＰＵ２１が実行するプログラムを格納している不揮発性メモリである。 The RAM 22 is a volatile memory that temporarily stores programs executed by the CPU 21 and intermediate data and is used as a work area when the CPU 21 performs various processes.
The ROM 23 is a non-volatile memory that stores a program executed by the CPU 21.

Ｉ／Ｏ部２４は、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）やシリアルポートによって情報処理部２０に接続された画像入力部１０、表示部４０、操作部３０やその他外部装置等との情報の入出力を制御する入出力デバイスである。 The I / O unit 24 controls input / output of information to / from the image input unit 10, the display unit 40, the operation unit 30, and other external devices connected to the information processing unit 20 by USB (Universal Serial Bus) or a serial port. I / O device to be used.

外部記憶装置２５は、ハードディスクドライブ、ＣＤ−ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ等の大容量記憶装置であり、文字辞書や単語辞書等の後述する処理に用いる大容量データを記憶している。
なお、ＣＰＵ２１と、ＲＡＭ２２と、ＲＯＭ２３と、Ｉ／Ｏ部２４と、外部記憶装置２５とは内部バス２６によってそれぞれ接続され、相互に情報の送信が可能である。 The external storage device 25 is a large-capacity storage device such as a hard disk drive, a CD-ROM (Compact Disc Read Only Memory), or a flash memory, and stores large-capacity data used for later-described processing such as a character dictionary and a word dictionary. Yes.
The CPU 21, the RAM 22, the ROM 23, the I / O unit 24, and the external storage device 25 are respectively connected by an internal bus 26 and can transmit information to each other.

情報処理部２０は、上記物理構成によりＯＣＲ２１０と、文字列処理部６０と、文字辞書記憶部２２０と、単語辞書記憶部２４０と、として機能する。 The information processing unit 20 functions as the OCR 210, the character string processing unit 60, the character dictionary storage unit 220, and the word dictionary storage unit 240 with the above physical configuration.

ＯＣＲ２１０は、入力部１０から伝達された画像（図３の入力画像ＴＩ）の文字を認識し、画像に含まれる文字列（文字認識の対象となる認識文字列）の各文字を文字辞書記憶部２２０に登録された文字情報を用いて光学認識により認識する。ＯＣＲ２１０は、認識文字列に含まれる少なくとも一部の文字について信頼度が異なる複数の認識候補が定義された認識結果（例えば、図３の認識結果Ｒ）を出力する。ＯＣＲ２１０は、加重方向ヒストグラムを特長量として利用するベイズ識別法を用いた認識方法や、外郭方向寄与度特徴を特徴量として利用する部分空間法を用いた認識方法など、既知の任意の方法を用いて文字認識を行ってよいが、ここでは特開平９−１２８４８９号公報に記載された方法を用いる。
各文字に対して、最大ｎ個の認識候補が定義される。本実施形態ではｎ＝２である。認識結果では、認識結果の各認識候補に、認識候補に対応する文字が認識文字列に現れる位置を示す情報として、文字番号が対応付けられている。ＯＣＲ２１０は、スペースによって認識結果を単語単位に分割し、単語毎に文字列処理部６０に伝達する。 The OCR 210 recognizes characters in the image transmitted from the input unit 10 (input image TI in FIG. 3), and stores each character of the character string (recognized character string to be character recognition target) included in the image as a character dictionary storage unit. Recognition is performed by optical recognition using the character information registered in 220. The OCR 210 outputs a recognition result (for example, recognition result R in FIG. 3) in which a plurality of recognition candidates having different reliability levels are defined for at least some characters included in the recognized character string. The OCR 210 uses a known arbitrary method such as a recognition method using a Bayesian identification method using a weighted direction histogram as a feature amount, or a recognition method using a subspace method using a contour direction contribution feature as a feature amount. Character recognition may be performed, but here, the method described in JP-A-9-128489 is used.
A maximum of n recognition candidates are defined for each character. In this embodiment, n = 2. In the recognition result, a character number is associated with each recognition candidate of the recognition result as information indicating a position where a character corresponding to the recognition candidate appears in the recognized character string. The OCR 210 divides the recognition result into words by spaces and transmits the recognition result to the character string processing unit 60 for each word.

文字列処理部６０は、図３の中段に示すように、ＯＣＲ２１０から伝達された認識結果を単語辞書記憶部２４０に登録された単語と比較して、最も類似度が高い単語を認識結果に対応する出力文字列として選択する。文字列処理部６０は、選択した出力文字列を表示部４０に出力する。 As shown in the middle part of FIG. 3, the character string processing unit 60 compares the recognition result transmitted from the OCR 210 with the word registered in the word dictionary storage unit 240, and corresponds the word having the highest similarity to the recognition result. Select as output string to be used. The character string processing unit 60 outputs the selected output character string to the display unit 40.

表示部４０は、液晶ディスプレイ等から構成され、文字列処理部６０から伝達された情報を表示する。 The display unit 40 is composed of a liquid crystal display or the like, and displays information transmitted from the character string processing unit 60.

操作部３０は、タッチパネル、キーボード、ボタン、ポインティングデバイス等の、ユーザの操作を受付ける操作受付装置と、操作受付装置が受け付けた操作の情報を情報処理部２０に伝達する伝達部と、から構成される。操作部３０はこのような物理構成によりユーザの操作を受け付けて、その情報を情報処理部２０に伝達する。 The operation unit 30 includes an operation reception device that accepts a user operation, such as a touch panel, a keyboard, a button, and a pointing device, and a transmission unit that transmits information on an operation received by the operation reception device to the information processing unit 20. The The operation unit 30 accepts a user operation with such a physical configuration and transmits the information to the information processing unit 20.

ここで文字認識装置１が文字列を撮影した画像と、認識結果と、候補文字列と、出力文字列と、の関係を図３を参照して説明する。
文字認識装置１は、ユーザが画像入力部１０を用いて文字列を含む画像（たとえば、レストランのメニュー）を撮影すると、図３の上段に示すような画像ＴＩを取得する。 Here, the relationship between an image obtained by photographing the character string by the character recognition device 1, a recognition result, a candidate character string, and an output character string will be described with reference to FIG.
When the user captures an image including a character string (for example, a restaurant menu) using the image input unit 10, the character recognition device 1 acquires an image TI as shown in the upper part of FIG.

そして、画像ＴＩをＯＣＲ２１０が光学認識する。図３の認識結果Ｒは、入力画像の一つの単語(sweet)を認識した結果である。認識結果において、認識文字列の各位置に現れる文字の認識候補が信頼度順に定義されている。文字番号は各認識候補が、認識文字列の認識結果として何番目に認識されたかを示す。候補番号は、各文字の何番目に信頼度が高い認識候補であるかを示す。図３では、文字番号０に対して小文字の「ｓ」が最も信頼度が高い認識候補として、大文字の「Ｓ」が次に信頼度が高い認識候補として設定されている。 Then, the OCR 210 optically recognizes the image TI. The recognition result R in FIG. 3 is a result of recognizing one word (sweet) in the input image. In the recognition result, recognition candidates of characters appearing at respective positions in the recognized character string are defined in order of reliability. The character number indicates what number each recognition candidate is recognized as the recognition result of the recognized character string. The candidate number indicates what number of each character is the recognition candidate with the highest reliability. In FIG. 3, lowercase “s” is set as the recognition candidate with the highest reliability for character number 0, and uppercase “S” is set as the recognition candidate with the next highest reliability.

単語辞書記憶部２４０は、認識結果に対応する出力文字列の候補となる候補文字列として、複数の単語を記録している（図３中段右）。文字列処理部６０は、単語辞書に登録されている単語を順次その処理における候補文字列として取得し、認識結果と候補文字列が類似する程度を示す類似度を算出する。そして、単語辞書に登録されている単語のうち、最も類似度が高い単語を出力文字列として選択する。そして、ＯＣＲ２１０が出力する各単語について出力文字列を選択して、表示部４０に出力する（図３下段）。 The word dictionary storage unit 240 records a plurality of words as candidate character strings that are candidates for the output character string corresponding to the recognition result (right in the middle of FIG. 3). The character string processing unit 60 sequentially acquires words registered in the word dictionary as candidate character strings in the processing, and calculates a similarity indicating the degree of similarity between the recognition result and the candidate character string. And the word with the highest similarity is selected as an output character string among the words registered in the word dictionary. Then, an output character string is selected for each word output from the OCR 210 and output to the display unit 40 (lower part in FIG. 3).

ここで、ＣＰＵ２１は、ＲＯＭ２３内の各種プログラムを読み出し、ＲＡＭ２２にロードすることにより、図４に示すような文字列処理部６０の各部の機能を発揮することができる。
具体的には、文字列処理部６０は機能単位として結果取得部６１０、特定部６２０、候補取得部６３０、比較部６４０、決定部６５０、選択部６６０及び出力部６７０を含む。 Here, the CPU 21 can exhibit the functions of each part of the character string processing unit 60 as shown in FIG. 4 by reading various programs in the ROM 23 and loading them into the RAM 22.
Specifically, the character string processing unit 60 includes a result acquisition unit 610, a specification unit 620, a candidate acquisition unit 630, a comparison unit 640, a determination unit 650, a selection unit 660, and an output unit 670 as functional units.

結果取得部６１０は、ＯＣＲ２１０から認識結果Ｒを単語単位で一つずつ取得する。候補取得部６３０は、単語辞書記憶部２４０が記憶する単語辞書から、現在のループで処理対象となる注目候補文字列を取得する。 The result acquisition unit 610 acquires the recognition results R from the OCR 210 one by one for each word. The candidate acquisition unit 630 acquires a target candidate character string to be processed in the current loop from the word dictionary stored in the word dictionary storage unit 240.

特定部６２０は、認識結果と候補文字列とで同じ文字が互いに類似する位置に現れるヒット位置を特定する。ここで、候補文字列に含まれ、かつ認識結果の認識候補にも含まれる文字のうち、文字番号の差が所定の閾値（例えば２）以下である文字を共通文字とする。そして、共通文字の認識結果における文字番号が認識結果におけるヒット位置となる。また、共通文字の候補文字列における文字番号が、候補文字列におけるヒット位置となる。 The identifying unit 620 identifies a hit position where the same character appears at a position similar to each other in the recognition result and the candidate character string. Here, among the characters included in the candidate character string and also included in the recognition candidate of the recognition result, a character whose character number difference is a predetermined threshold value (for example, 2) or less is set as a common character. The character number in the recognition result of the common character becomes the hit position in the recognition result. Further, the character number in the candidate character string of the common character becomes the hit position in the candidate character string.

比較部６４０は、認識結果の複数の位置を基点として、候補文字列に現れかつ認識候補に含まれる文字の、候補文字列における位置と、認識結果における位置及び信頼度に基づいて、認識結果と候補文字列とを比較する。具体的には、特定部６２０が特定した認識結果のヒット位置を、比較の起点（注目認識番号）として順次選択する。そして、比較の起点から連続して、認識候補の何れかが、候補文字列の対応する位置に現れる文字の何れかと一致する範囲について比較注目候補文字列と認識結果を比較する。具体的には、信頼度が高い認識候補が、候補文字列のより類似した位置に現れる程度を示す部分スコアを求める。部分スコアを求める具体的な方法については後述する。 The comparison unit 640 uses the plurality of positions of the recognition result as a base point, the recognition result based on the position in the candidate character string, the position in the recognition result, and the position and reliability of the character that appears in the candidate character string and is included in the recognition candidate. Compare with candidate string. Specifically, the hit position of the recognition result specified by the specifying unit 620 is sequentially selected as a comparison starting point (attention recognition number). Subsequently, the comparison target candidate character string is compared with the recognition result for a range in which any of the recognition candidates coincides with any of the characters appearing at the corresponding positions of the candidate character string continuously from the comparison starting point. Specifically, a partial score indicating the degree to which recognition candidates with high reliability appear at more similar positions in the candidate character string is obtained. A specific method for obtaining the partial score will be described later.

決定部６５０は、比較部６４０の比較結果に基づいて、認識結果と候補文字列の類似度を決定する。具体的には、比較部６４０が認識結果のヒット位置のそれぞれについて求めた部分スコアを合算した値を、類似度として決定する。 The determination unit 650 determines the similarity between the recognition result and the candidate character string based on the comparison result of the comparison unit 640. Specifically, a value obtained by adding the partial scores obtained by the comparison unit 640 for each hit position of the recognition result is determined as the similarity.

選択部６６０は、候補取得部６３０が単語辞書から取得した複数の候補文字列から、決定部６５０が決定した類似度に応じて出力文字列を選択する。具体的には、複数の候補文字列のうち、決定部６５０が決定した類似度が最も高い候補文字列を、出力文字列として選択する。 The selection unit 660 selects an output character string according to the similarity determined by the determination unit 650 from the plurality of candidate character strings acquired by the candidate acquisition unit 630 from the word dictionary. Specifically, the candidate character string having the highest similarity determined by the determination unit 650 among the plurality of candidate character strings is selected as the output character string.

出力部６７０は、選択部６６０が選択した出力文字列を出力する。具体的には、出力文字列の情報を表示部４０に表示させる。あるいは、出力文字列を補正した認識結果として外部記憶装置２５に記憶する、Ｉ／Ｏ部２４を介して外部装置に出力するなど、ユーザの設定操作に応じた出力態様により出力してもよい。 The output unit 670 outputs the output character string selected by the selection unit 660. Specifically, information on the output character string is displayed on the display unit 40. Or you may output by the output mode according to a user's setting operation, such as memorize | storing in the external storage device 25 as a recognition result which correct | amended the output character string, and outputting to an external device via the I / O part 24.

文字認識装置１が実行する処理を、フローチャートを用いて説明する。文字認識装置１は、ユーザが画像入力部１０を用いて文字列を含む画像（入力画像ＴＩ）を撮影すると、図５に示す認識結果出力処理を開始する。 Processing executed by the character recognition device 1 will be described with reference to a flowchart. When the user captures an image (input image TI) including a character string using the image input unit 10, the character recognition device 1 starts a recognition result output process shown in FIG.

認識結果出力処理では、まずＯＣＲ２１０が画像入力部１０から入力画像ＴＩを取得する（ステップＳ１０１）。そして、ＯＣＲ２１０が、入力画像ＴＩに含まれる各文字を認識する（ステップＳ１０２）。 In the recognition result output process, first, the OCR 210 acquires the input image TI from the image input unit 10 (step S101). Then, the OCR 210 recognizes each character included in the input image TI (step S102).

次に、ＯＣＲ２１０が認識結果をスペース文字を区切として単語単位に分割する（ステップＳ１０３）。そして、分割された単語の一つが現在のループにおいて処理対象となる注目認識文字列として選択される。選択された注目認識文字列の認識結果Ｒを結果取得部６１０が取得する（ステップＳ１０４）。 Next, the OCR 210 divides the recognition result into word units with a space character as a delimiter (step S103). Then, one of the divided words is selected as an attention recognized character string to be processed in the current loop. The result acquisition unit 610 acquires the recognition result R of the selected attention recognition character string (step S104).

次に、候補取得部６３０が、単語辞書記憶部２４０が記憶する単語辞書から、現在のループにおいて比較対象となる候補文字列（注目候補文字列）を取得する（ステップＳ１０５）。本実施形態では処理速度を向上するため、認識結果の文字の長さを基準として、文字長の差が所定の閾値以下である単語を候補文字列として、その他の単語は処理対象としない。例えば、認識結果の長さが５文字である場合、３文字以上７文字以下の単語を、候補文字列とする。 Next, the candidate acquisition unit 630 acquires a candidate character string (target candidate character string) to be compared in the current loop from the word dictionary stored in the word dictionary storage unit 240 (step S105). In this embodiment, in order to improve the processing speed, a word whose difference in character length is equal to or less than a predetermined threshold is set as a candidate character string on the basis of the length of the character as a recognition result, and other words are not processed. For example, when the length of the recognition result is 5 characters, a word of 3 to 7 characters is set as a candidate character string.

そして、特定部６２０、比較部６４０及び決定部６５０が、注目認識文字列の認識結果と注目候補文字列の類似度を決定するための処理（類似度決定処理）を実行する（ステップＳ１０６）。以下、必要に応じて図３の認識結果Ｒと、注目候補文字列“ｓｗｅｅｔ”の類似度を決定する場合を例にとって説明する。 Then, the specifying unit 620, the comparison unit 640, and the determination unit 650 execute a process (similarity determination process) for determining the similarity between the recognition result of the attention recognized character string and the attention candidate character string (step S106). Hereinafter, the case where the similarity between the recognition result R in FIG. 3 and the attention candidate character string “sweet” is determined as necessary will be described as an example.

ステップＳ１０６で実行される類似度決定処理について、図６を参照して説明する。類似度決定処理では、文字列処理部６０による使用変数の初期化等に続いて、特定部６２０が認識結果の文字番号（認識番号）から、処理対象となる注目認識番号を選択する（ステップＳ２０１）。この例では、認識文字列の“ｓ”の位置に対応する「０番」から、最後の“ｔ”に対応する「４番」まで順に選択するものとする。注目認識番号は、処理対象となる認識結果の位置、即ち認識文字列の何番目の文字が処理対象であるかを示す。 The similarity determination process executed in step S106 will be described with reference to FIG. In the similarity determination process, following the initialization of the variables used by the character string processing unit 60, the specifying unit 620 selects a target recognition number to be processed from the character number (recognition number) of the recognition result (step S201). ). In this example, it is assumed that “No. 0” corresponding to the position of “s” in the recognized character string to “No. 4” corresponding to the last “t” are selected in order. The attention recognition number indicates the position of the recognition result to be processed, that is, what number character in the recognized character string is the processing target.

ステップＳ２０１にて注目認識番号を選択すると、次に特定部６２０は注目候補文字列の文字番号（比較番号）のうち、処理対象となる注目比較番号を選択する（ステップＳ２０２）。ここでは、比較処理のループ毎に候補文字列の“ｓ”の位置に対応する「０番」から、最後の“ｔ”に対応する「４番」まで順に選択するものとする。注目比較番号は、処理対象となる注目候補文字列の位置、即ち注目候補文字列の何番目の文字が処理対象であるかを示す。 When the attention recognition number is selected in step S201, the specifying unit 620 next selects the attention comparison number to be processed among the character numbers (comparison numbers) of the attention candidate character strings (step S202). Here, it is assumed that “0” corresponding to the position of “s” in the candidate character string is sequentially selected from “No. 4” corresponding to the last “t” for each comparison processing loop. The attention comparison number indicates the position of the attention candidate character string to be processed, that is, what number character of the attention candidate character string is the processing target.

ステップＳ２０２にて注目比較番号を選択すると、次に特定部６２０は注目認識番号に対応する認識候補から、処理対象となる注目候補を選択する（ステップＳ２０３）。ここでは、信頼度が高いものを優先して処理するため、ループ毎に候補番号が若い順に選択する。この例では、０番の認識候補のうち候補番号０に対応する“ｓ”が最初に、１番に対応する“Ｓ”が次に選択される。 When the attention comparison number is selected in step S202, the specifying unit 620 next selects the attention candidate to be processed from the recognition candidates corresponding to the attention recognition number (step S203). Here, in order to prioritize processing with high reliability, the candidate numbers are selected in ascending order for each loop. In this example, among the 0th recognition candidates, “s” corresponding to candidate number 0 is selected first, and “S” corresponding to number 1 is selected next.

そして、特定部６２０は注目候補が注目比較番号に対応する比較文字と一致するか否か判別する（ステップＳ２０４）。一致すると判別すると（ステップＳ２０４；ＹＥＳ）、処理対象の位置はヒット部位であるので、比較部６４０が注目認識番号及び注目比較番号の位置を基点に詳細スコアを算出するための詳細スコア算出処理を開始する（ステップＳ２０５）。この場合、注目比較番号に対応する候補文字列の位置が候補文字列におけるヒット部位、認識結果の注目認識番号の位置に対応する位置が認識結果におけるヒット部位となる。 Then, the specifying unit 620 determines whether the attention candidate matches the comparison character corresponding to the attention comparison number (step S204). If it is determined that they match (step S204; YES), since the processing target position is a hit site, a detailed score calculation process is performed for the comparison unit 640 to calculate a detailed score based on the position of the attention recognition number and the attention comparison number. Start (step S205). In this case, the position of the candidate character string corresponding to the attention comparison number is the hit part in the candidate character string, and the position corresponding to the position of the attention recognition number of the recognition result is the hit part in the recognition result.

例えば、図３の認識結果Ｒと、注目候補文字列"sweet"を比較すると、ヒット部位は図８のように示すことができる。図８の左上の円は、認識結果の先頭位置（文字番号０）の、候補番号０に対応する候補“ｓ”が、候補文字列の先頭位置（文字番号０）に現れる文字と一致することを示す。この場合、認識結果の０番に相当する位置（文字番号０の行）が、認識結果のヒット部位である。また、候補文字列の０番に相当する位置（文字番号０の列）が、候補文字列のヒット部位である。 For example, when the recognition result R of FIG. 3 is compared with the candidate character string “sweet”, the hit site can be shown as shown in FIG. The upper left circle in FIG. 8 indicates that the candidate “s” corresponding to the candidate number 0 in the start position (character number 0) of the recognition result matches the character appearing at the start position (character number 0) of the candidate character string. Indicates. In this case, the position corresponding to number 0 of the recognition result (the line of character number 0) is the hit part of the recognition result. Further, the position corresponding to the number 0 of the candidate character string (character number 0) is the hit part of the candidate character string.

本実施形態では、認識結果のｎ番に対応する認識候補の何れかが、点線長方形でしめした候補文字列の文字番号が（ｎ−２）〜（ｎ＋２）までの何れかの位置に現れる文字と一致した場合に、その行は認識結果のヒット部位とする。即ち、許容誤差を２文字とする。 In the present embodiment, any of the recognition candidates corresponding to the nth recognition result is a character that appears in any position from (n-2) to (n + 2) where the character number of the candidate character string indicated by the dotted rectangle is If it matches, the line is set as a hit part of the recognition result. That is, the allowable error is 2 characters.

なお、ステップＳ２０４で注目候補は注目比較番号の文字と一致しないと判別された場合（ステップＳ２０４；ＮＯ）は、特定部６２０が注目認識番号に定義された全認識候補を処理済みであるか判別し（ステップＳ２０６）、未処理の認識候補が残っている場合は（ステップＳ２０６；ＮＯ）、次の認識候補についてステップＳ２０３から処理を繰り返す。一方、全認識候補が処理済である場合は（ステップＳ２０６；ＹＥＳ）、ステップＳ２０７にスキップする。これにより、例えば文字番号２の位置の、候補番号０に対応する候補“ｃ”について処理を終えた後、未処理の認識候補“ｅ”を注目候補としてステップＳ２０３から処理が繰り返される。 When it is determined in step S204 that the attention candidate does not match the character of the attention comparison number (step S204; NO), it is determined whether all the recognition candidates defined in the attention recognition number have been processed by the specifying unit 620. However, if an unprocessed recognition candidate remains (step S206; NO), the process is repeated from step S203 for the next recognition candidate. On the other hand, when all the recognition candidates have been processed (step S206; YES), the process skips to step S207. Thus, for example, after the process for the candidate “c” corresponding to the candidate number 0 at the position of the character number 2 is completed, the process is repeated from step S203 with the unprocessed recognition candidate “e” as the attention candidate.

このように、定義された認識文字のいずれかが、注目候補文字列の対応する位置及びその位置を基準として許容範囲内（図８ではその行の点線四角形に含まれる何れかの位置）に現れる文字の何れかに一致する場合に、該当位置はヒット部位となり、その位置を基点に詳細スコアが算出される。特定部６２０は各ループで注目認識番号、注目比較番号、注目候補が更新される毎にステップＳ２０４で注目候補と比較文字とが一致するかを判別する処理を実行する。このようにして、特定部６２０は認識結果と候補文字列とで同じ文字が、互いに類似した位置で現れるヒット位置（認識番号と比較番号）を特定する。 In this way, any of the defined recognized characters appears within the permissible range based on the corresponding position of the candidate character string of interest and the position thereof (in FIG. 8, any position included in the dotted rectangle in the row). If it matches any of the characters, the corresponding position becomes a hit site, and a detailed score is calculated based on that position. Each time the attention recognition number, the attention comparison number, and the attention candidate are updated in each loop, the specifying unit 620 executes a process of determining whether the attention candidate matches the comparison character in step S204. In this way, the specifying unit 620 specifies hit positions (recognition numbers and comparison numbers) where the same character appears in similar positions in the recognition result and the candidate character string.

ステップＳ２０５で実行される詳細スコア算出処理について、図７を参照して説明する。詳細スコア算出処理では、比較部６４０がまず比較の起点となるヒット部位に対応して、注目候補の信頼度と位置のズレ（認識番号と比較番号の差）に応じたスコアを詳細スコアに加算する（ステップＳ３０１）。 The detailed score calculation process executed in step S205 will be described with reference to FIG. In the detailed score calculation process, the comparison unit 640 first adds a score according to the reliability of the candidate candidate and the positional deviation (difference between the recognition number and the comparison number) to the detailed score in correspondence with the hit site that is the starting point of the comparison. (Step S301).

より信頼度が高い認識候補が、候補文字列のより類似した位置に現れるほど、候補文字列と認識結果は類似していると判断できる。そこで、ヒット位置のそれぞれに対応してステップＳ３０１とＳ３０５で加算するスコアを図９のように設定する。図９の例では、認識番号と選択番号が一致し、かつ信頼度が高い（候補番号０）文字に対応するヒット位置に最も高いスコア（１０点）が設定されている。一方、認識番号と選択番号が一致し、信頼度が低い（候補番号１）文字に対応するヒット位置には、これよりも低いスコア（５点）が設定されている。さらに、これを基準として認識番号と選択番号が１違う場合には、同一の場合よりも小さくスコア（候補番号０に対して６点、候補番号１に対して３点）が、２違う場合には、さらに小さくスコア（候補番号０に対して４点、候補番号１に対して２点）がそれぞれ設定されている。その他の各位置については位置の差が大きすぎるとの判断のもと、０点が設定されている。０点の部分は比較範囲外に設定されるため、たとえ共通文字が現れていても詳細スコアは算出されない。 It can be determined that the recognition result is more similar to the candidate character string as the recognition candidate with higher reliability appears at a more similar position in the candidate character string. Therefore, the score to be added in steps S301 and S305 corresponding to each hit position is set as shown in FIG. In the example of FIG. 9, the highest score (10 points) is set at the hit position corresponding to the character whose recognition number matches the selection number and has high reliability (candidate number 0). On the other hand, a lower score (5 points) is set at the hit position corresponding to the character whose recognition number matches the selected number and has low reliability (candidate number 1). Furthermore, when the recognition number and the selection number are different from each other on the basis of this, the score (6 points for candidate number 0 and 3 points for candidate number 1) is 2 different from the case of the same case. Are set with smaller scores (4 points for candidate number 0 and 2 points for candidate number 1). For each of the other positions, 0 point is set based on the determination that the position difference is too large. Since the 0 point portion is set outside the comparison range, the detailed score is not calculated even if a common character appears.

スコアの設定はこれに限らず、位置のズレにより厳しく、比較番号と認識番号が一致していない部分（一文字以上ずれている位置）にはすべて０点を設定しても良い。このような設定は、入力画像が鮮明で、文字の位置がズレる割合が少ない場合などに好適である。あるいは、候補番号の数値が上がった場合によりスコアの減少幅を大きくするなど、条件に合わせて任意に設定可能である。 The setting of the score is not limited to this, and it is severe due to the position shift, and 0 points may be set for all portions where the comparison number and the recognition number do not match (a position shifted by one or more characters). Such a setting is suitable for a case where the input image is clear and the ratio of character positions is small. Alternatively, it can be arbitrarily set according to the conditions, such as increasing the decrease range of the score when the numerical value of the candidate number increases.

例えば図９のようにスコアが設定されている場合であって、注目認識番号が３番、注目比較番号が３番の候補番号０（図１０の星型六角形）が比較の起点となるヒット部位であったときには、ステップＳ３０１で１０点が加算される。一方、注目認識番号が３番、注目比較番号が２番の候補番号０（図１０の三角形）が詳細スコアの算出基準であったときには６点が加算される。 For example, in the case where a score is set as shown in FIG. 9, the candidate number 0 (star hexagon in FIG. 10) having the attention recognition number 3 and the attention comparison number 3 is the starting point of the comparison. If it is a part, 10 points are added in step S301. On the other hand, when candidate number 0 (triangle in FIG. 10) with attention recognition number 3 and attention comparison number 2 is the detailed score calculation criterion, 6 points are added.

ステップＳ３０１で詳細スコアに加算すると、次に比較部６４０は起点からの比較処理を開始する。具体的には、まず調査方向に一つずらした位置、即ち一つ前（又は後ろ）の位置を調査位置として選択する（ステップＳ３０２）。本実施形態ではまず前方向に調査し、前方向の調査が終われば後ろ方向に変更する。例えば前に調査する場合、認識番号と比較番号とをデクリメントした位置が新たな調査位置として選択される。例えば、図１０の星型６角形から、認識番号２、比較番号２の位置（斜線の部分）で認識結果と候補文字列を比較する。 If it adds to a detailed score by step S301, the comparison part 640 will start the comparison process from a starting point next. Specifically, first, a position shifted by one in the investigation direction, that is, the position immediately before (or behind) is selected as the investigation position (step S302). In this embodiment, the survey is first performed in the forward direction, and when the survey in the forward direction is finished, the direction is changed to the backward direction. For example, when the survey is performed before, the position where the recognition number and the comparison number are decremented is selected as a new survey position. For example, the recognition result and the candidate character string are compared at the position of the recognition number 2 and the comparison number 2 (shaded portion) from the star hexagon in FIG.

認識結果と候補文字列の比較の具体例として、まず比較部６４０は調査位置の認識候補から、比較対象となる調査候補を選択する（ステップＳ３０３）。ここでは、信頼度が高い順に選択する。例えば、認識番号２、比較番号２の位置（図１０の斜線部）の場合、認識候補“ｃ”が先に選択される。そして、比較部６４０は調査候補と対応文字（候補文字列の調査位置における文字）が一致するか否か判別する（ステップＳ３０４）。例えば、調査位置が認識番号２、比較番号２であり、調査候補が１番目の認識候補“ｃ”であった場合、対応文字は“ｅ”であるので、文字は一致しないと判別される。 As a specific example of the comparison between the recognition result and the candidate character string, the comparison unit 640 first selects a survey candidate to be compared from the recognition candidates of the survey position (step S303). Here, selection is performed in descending order of reliability. For example, in the case of the positions of the recognition number 2 and the comparison number 2 (shaded area in FIG. 10), the recognition candidate “c” is selected first. Then, the comparison unit 640 determines whether or not the survey candidate matches the corresponding character (the character at the survey position of the candidate character string) (step S304). For example, if the survey position is the recognition number 2 and the comparison number 2 and the survey candidate is the first recognition candidate “c”, the corresponding character is “e”, so it is determined that the characters do not match.

調査候補と対応文字が一致しないと判別された場合には（ステップＳ３０４；ＮＯ）、次に調査位置の全認識候補が処理済であるか否かを判別する（ステップＳ３０６）。未処理の認識候補が存在する場合（ステップＳ３０６；ＮＯ）、次の認識候補を調査候補としてステップＳ３０３から処理を繰り返す。例えば、調査位置が認識番号２、比較番号２であり、調査候補が１番目の認識候補“ｃ”であった場合、２番目の認識候補“ｅ”が未処理であるので、ステップＳ３０３にて２番目の認識候補“ｅ”が調査候補として選択される。 If it is determined that the survey candidate and the corresponding character do not match (step S304; NO), it is then determined whether all the recognition candidates at the survey position have been processed (step S306). If there is an unprocessed recognition candidate (step S306; NO), the process is repeated from step S303 with the next recognition candidate as the investigation candidate. For example, if the survey position is the recognition number 2 and the comparison number 2 and the survey candidate is the first recognition candidate “c”, the second recognition candidate “e” is unprocessed, so in step S303. The second recognition candidate “e” is selected as the survey candidate.

一方、ステップＳ３０６にて全認識候補が処理済であると判別された場合は（ステップＳ３０６；ＹＥＳ）、調査位置のどの認識候補も、対応文字（認識文字列の対応する位置の文字）と一致しないので、現在の調査方向については調査を終了し、ステップＳ３０８にスキップする。このため、比較部６４０は起点となる位置から連続して、認識候補の何れかが、候補文字列の対応する位置に現れる文字の何れかと一致する範囲で、認識結果と候補文字列とを比較することとなる。 On the other hand, if it is determined in step S306 that all recognition candidates have been processed (step S306; YES), any recognition candidate at the survey position matches the corresponding character (the character at the corresponding position in the recognized character string). Therefore, the survey is terminated for the current survey direction, and the process skips to step S308. For this reason, the comparison unit 640 compares the recognition result with the candidate character string in a range in which any of the recognition candidates matches with any of the characters appearing at the corresponding position of the candidate character string continuously from the starting position. Will be.

このように本実施形態では、起点となる位置から認識候補の何れかが、候補文字列の対応する位置に現れる文字の何れかと一致する範囲で認識結果と候補文字列とを比較する。一致する範囲外については比較を打ち切ることで、処理負担を軽減することが出来る。また、複数の起点からこのような比較を行うことで、認識文字列の一部に認識不能な文字が有った場合や、認識結果の位置がずれていた場合でも、文字列全体を比較して類似度を求めることが出来る。 Thus, in the present embodiment, the recognition result and the candidate character string are compared within a range in which any of the recognition candidates from the starting position matches any of the characters appearing at the corresponding position of the candidate character string. Processing out of the matching range can be reduced by terminating the comparison. In addition, by making such a comparison from multiple starting points, even if there are unrecognizable characters in some of the recognized character strings, or even if the recognition results are misaligned, the entire character strings are compared. The similarity can be obtained.

ステップＳ３０４で調査候補と対応文字が一致すると判別された場合（ステップＳ３０４；ＹＥＳ）、ステップＳ３０１と同様に、調査候補の信頼度と位置のズレ（認識番号と比較番号の差）に応じたスコアを詳細スコアに加算する（ステップＳ３０５）。例えば、調査位置が認識番号２、比較番号２であり、調査候補が２番目の認識候補“ｅ”であった場合、認識番号と比較番号が一致し、かつ調査候補の認識番号が２番であることに対応して、５点が詳細スコアに加算される。なお、一つの認識番号に３つ以上の認識候補が定義されている場合には、３番目の認識候補には１番目の認識候補の１／３のスコアを設定すればよい。 When it is determined in step S304 that the survey candidate matches the corresponding character (step S304; YES), the score corresponding to the reliability of the survey candidate and the positional deviation (difference between the recognition number and the comparison number) is the same as in step S301. Is added to the detailed score (step S305). For example, when the survey position is the recognition number 2 and the comparison number 2 and the survey candidate is the second recognition candidate “e”, the recognition number matches the comparison number, and the survey candidate recognition number is 2. Correspondingly, 5 points are added to the detailed score. When three or more recognition candidates are defined for one recognition number, a score that is 1/3 of the first recognition candidate may be set for the third recognition candidate.

ステップＳ３０５でスコアを加算すると、次に比較部６４０は調査位置が文字列の端部に到達したか否か判別する（ステップＳ３０７）。具体的には、ステップＳ３０２の調査方向が前方向の場合であって調査位置が認識結果又は候補文字列の先端の何れかに到達したときには、それ以上調査方向に比較を続けることが出来ないので、端部に到達したと判別する。調査方向が後方向の場合であって調査位置が認識結果又は候補文字列のいずれかの後端に到達したときには、それ以上調査方向に比較を続けることが出来ないので、端部に到達したと判別する。 When the scores are added in step S305, the comparison unit 640 next determines whether or not the survey position has reached the end of the character string (step S307). Specifically, if the investigation direction in step S302 is the forward direction and the investigation position reaches either the recognition result or the leading end of the candidate character string, the comparison cannot be continued further in the investigation direction. It is determined that the end has been reached. When the survey direction is the backward direction and the survey position reaches the trailing end of either the recognition result or the candidate character string, the comparison cannot be continued in the survey direction any further, so the end has been reached. Determine.

文字列の端部に到達していない場合には（ステップＳ３０７；ＮＯ）、ステップＳ３０２に戻って、次の位置を調査位置として、現在の調査方向への比較を続ける。例えば、現在の調査位置が図１０の斜線部（認識番号２、比較番号２に対応）であった場合で、調査方向が前方向のときは、矢印の示すように認識番号１、比較番号１を調査位置として比較を続ける。 If the end of the character string has not been reached (step S307; NO), the process returns to step S302, and the comparison in the current survey direction is continued with the next position as the survey position. For example, when the current survey position is the hatched portion in FIG. 10 (corresponding to recognition number 2 and comparison number 2) and the survey direction is forward, recognition number 1 and comparison number 1 as indicated by the arrows. Continue the comparison with the survey position.

一方、端部に到達した場合は（ステップＳ３０７；ＹＥＳ）、現在の調査方向へは比較を続けられないので、現在の調査方向に対する比較は中止する。そして、比較部６４０は前後の両方向に比較処理を終えたか否か判別する（ステップＳ３０８）。前後のいずれかについて比較処理を終えていない場合は（ステップＳ３０８；ＮＯ）、比較部３４０は調査方向の前後を入れ替える（ステップＳ３０９）。そして、ステップＳ３０２に戻って新たな調査方向（今まで前方向だった場合には後ろ方向）へ比較を続ける。このとき、一旦比較を開始した基準位置に戻って、基準位置から新たな調査方向に一つ進んだ位置を新たな調査位置とする。
例えば、図１０の比較処理が前方向に進んで、認識番号０、比較番号０に対応する位置まで到達した場合に、次に基点である星型六角形の位置まで戻り、次に後ろ方向（認識番号４、比較番号４）に比較処理を進める。 On the other hand, if the end has been reached (step S307; YES), the comparison cannot be continued in the current survey direction, so the comparison for the current survey direction is stopped. Then, the comparison unit 640 determines whether the comparison process has been completed in both the front and rear directions (step S308). If the comparison process has not been completed for either the front or rear (step S308; NO), the comparison unit 340 switches the front and rear in the investigation direction (step S309). Then, the process returns to step S302, and the comparison is continued in a new investigation direction (in the case where it has been forward until now, backward). At this time, returning to the reference position where the comparison has been started once, a position advanced by one in the new investigation direction from the reference position is set as a new investigation position.
For example, when the comparison process of FIG. 10 proceeds in the forward direction and reaches the position corresponding to the recognition number 0 and the comparison number 0, it returns to the position of the star hexagon that is the next base point, and then the backward direction ( The comparison process proceeds to recognition number 4 and comparison number 4).

ステップＳ３０８で前後の両方向に比較を終えたと判別した場合は（ステップＳ３０８；ＹＥＳ），比較部６４０はその時点まで加算されたスコアを詳細スコアとして、詳細スコア算出処理を終了する。 When it is determined in step S308 that the comparison has been completed in both the front and rear directions (step S308; YES), the comparison unit 640 ends the detailed score calculation process using the score added up to that point as the detailed score.

例えば、比較の起点が図１０の星型六角形である場合には、前後に実線矢印にそって比較処理を進めた結果、前方に比較した結果として（１０点＋５点＋１０点＋１０点）が加算され、後方に比較した結果としてさらに１０点が加算されるため、４５点が詳細スコアとなる。一方、同じ比較結果の３番目を比較の基準とした場合でも、図１０の三角形に対応する候補文字列の２文字目を起点とした場合には、前後の何れの方向にもヒット位置が無いので、詳細スコアはステップＳ３０１にて加算された起点のスコア６点が最終的な詳細スコアとなる。 For example, when the starting point of the comparison is the star-shaped hexagon in FIG. 10, the comparison process is advanced along the solid arrows before and after, and as a result of comparison ahead (10 points + 5 points + 10 points + 10 points). Since 10 points are added as a result of comparison after the addition, 45 points become the detailed score. On the other hand, even when the third comparison result is used as a reference for comparison, there is no hit position in any direction before or after the second character of the candidate character string corresponding to the triangle in FIG. Therefore, the detailed score is the final score obtained from the starting score of 6 added in step S301.

図６に戻って、ステップＳ２０５にて詳細スコアを算出するか、ステップＳ２０６にてＹＥＳと判別するか、何れかの場合は次に文字列処理部６０は比較範囲の全比較番号について上記処理済であるか否か判別する（ステップＳ２０７）。例えば、注目認識番号がｎであって、許容誤差が２である場合に、比較番号（ｎ−２）〜（ｎ＋２）の範囲が比較範囲となる。例えば、注目認識番号が３である場合に、図８の下から２行目の点線四角形の部分（候補文字列の１〜４）が比較範囲である。この比較範囲に、現在のステップＳ２０２〜ステップＳ２０７までのループにおいて処理されていない比較番号が有る場合は（ステップＳ２０７；ＮＯ）、次の未処理の比較番号についてステップＳ２０２から処理を繰り返す。 Returning to FIG. 6, the detailed score is calculated in step S205, or YES is determined in step S206. In either case, the character string processing unit 60 next processes all the comparison numbers in the comparison range. Is determined (step S207). For example, when the attention recognition number is n and the allowable error is 2, the range of comparison numbers (n−2) to (n + 2) is the comparison range. For example, when the attention recognition number is 3, the dotted square part (1 to 4 of the candidate character string) in the second line from the bottom of FIG. 8 is the comparison range. If there is a comparison number not processed in the current loop from step S202 to step S207 in this comparison range (step S207; NO), the process is repeated from step S202 for the next unprocessed comparison number.

例えば、注目認識番号が「３」である場合、ステップＳ２０１からステップＳ２０９のループの中で、ステップＳ２０２〜ステップＳ２０７までのループで図１０の星型六角形の位置について詳細スコア（４５点）を算出した後、次のループで三角形の位置（注目比較番号２）について詳細スコア（６点）を算出する。 For example, when the attention recognition number is “3”, the detailed score (45 points) is obtained for the position of the star-shaped hexagon in FIG. 10 in the loop from step S202 to step S207 in the loop from step S201 to step S209. After the calculation, a detailed score (6 points) is calculated for the position of the triangle (attention comparison number 2) in the next loop.

一方、全比較番号が処理済であった場合（ステップＳ２０７；ＹＥＳ）、詳細スコアに基づいて注目認識番号の部分スコアを決定する（ステップＳ２０８）。具体的には、現在の注目認識番号についてステップＳ２０２〜ステップＳ２０７のループで算出した詳細スコアのうち、最も高い値を部分スコアとして決定する。このようにして、より精度が高い比較結果に対応する詳細スコアを、注目認識番号における比較結果として採用する。 On the other hand, when all the comparison numbers have been processed (step S207; YES), the partial score of the attention recognition number is determined based on the detailed score (step S208). Specifically, the highest value among the detailed scores calculated in the loop of step S202 to step S207 for the current attention recognition number is determined as the partial score. In this way, the detailed score corresponding to the comparison result with higher accuracy is adopted as the comparison result in the attention recognition number.

部分スコアは、現在の認識番号の位置から認識結果と候補文字列を比較した場合に、認識結果のより信頼度が高い認識候補が、候補文字列のより類似した位置に現れる程度を示す。部分スコアが高いほど、より信頼度が高い認識候補が、候補文字列のより類似した位置に現れる。 The partial score indicates the degree to which a recognition candidate with higher reliability of the recognition result appears at a more similar position in the candidate character string when the recognition result is compared with the candidate character string from the position of the current recognition number. The higher the partial score, the more reliable the recognition candidate appears at a more similar position in the candidate character string.

ステップＳ２０８が終わると、次に文字列処理部６０は注目識別文字列の認識結果の全位置について上記処理済であるか否か判別する（ステップＳ２０９）。未処理の位置がある場合（ステップＳ２０９；ＮＯ）、例えば図１０の認識結果の文字番号３の行の次に４の行を処理するといったように、次の未処理の認識番号についてステップＳ２０１から処理を繰り返す。 When step S208 ends, the character string processing unit 60 then determines whether or not the above processing has been completed for all positions of the recognition result of the attention identification character string (step S209). If there is an unprocessed position (step S209; NO), for example, the next unprocessed recognition number is processed from step S201, such as processing the 4th line after the line of character number 3 in the recognition result of FIG. Repeat the process.

一方、全位置が処理済の場合（ステップＳ２０９；ＹＥＳ）、次に決定部６５０が、比較部６４０が算出した部分スコアに基づいて、注目認識文字列と注目候補文字列の類似度を決定する（ステップＳ２１０）。具体的には、注目認識文字列と、注目候補文字列について算出した部分スコアを合算した値（本例では４５×５＝２２５点）を類似度とする。そして、類似度決定処理を終了する。なお、文字数によって類似スコアが変わるのを防ぐために、類似度は、合算した値を文字数の２乗で除算して正規化した値としてもよい。 On the other hand, when all the positions have been processed (step S209; YES), the determination unit 650 determines the similarity between the attention recognized character string and the attention candidate character string based on the partial score calculated by the comparison unit 640. (Step S210). Specifically, a value (45 × 5 = 225 points in this example) obtained by adding the attention recognized character string and the partial score calculated for the attention candidate character string is set as the similarity. Then, the similarity determination process ends. In order to prevent the similarity score from changing depending on the number of characters, the similarity may be a normalized value obtained by dividing the sum by the square of the number of characters.

図５に戻って、ステップＳ１０６にて注目候補文字列について類似度を決定すると、次に文字列処理部６０は全候補文字列について類似度を決定済みであるか否か判別する（ステップＳ１０７）。未処理の候補文字列が有る場合（ステップＳ１０７；ＮＯ）は、ステップＳ１０５に戻って次の候補文字列を注目候補文字列として処理を繰り返す。 Returning to FIG. 5, when the similarity is determined for the candidate character string of interest in step S106, the character string processing unit 60 then determines whether or not the similarity has been determined for all candidate character strings (step S107). . If there is an unprocessed candidate character string (step S107; NO), the process returns to step S105, and the process is repeated using the next candidate character string as the attention candidate character string.

図１１は、図１０とは異なる候補文字列（“ewder”）について類似度決定処理をした場合の算出結果を示す。この場合、認識番号１、２、４が認識結果のヒット位置である。認識結果のヒット位置を基点に部分スコアを算出すると、認識番号１については１０点、認識番号２については２点、認識番号４については５点となる。共通文字が認識候補に含まれない認識番号１はヒット部位が無い。認識番号３については、共通文字が認識候補に含まれてはいるが、共通文字が現れる位置が比較範囲に含まれないので、やはりヒット位置は無い。ヒット位置が無い位置については比較部６４０は部分スコアを算出しない。その結果、類似度は１７点となり、認識候補“ｅｗｄｅｒ”は認識候補“ｓｗｅｅｔ” （類似度２２５点）よりも認識結果に類似する程度が低いことがわかる。 FIG. 11 shows a calculation result when similarity determination processing is performed for a candidate character string (“ewder”) different from that in FIG. In this case, recognition numbers 1, 2, and 4 are hit positions of recognition results. When the partial score is calculated based on the hit position of the recognition result, it is 10 points for the recognition number 1, 2 points for the recognition number 2, and 5 points for the recognition number 4. The recognition number 1 in which the common character is not included in the recognition candidates has no hit part. For the recognition number 3, although the common character is included in the recognition candidates, the position where the common character appears is not included in the comparison range, so there is no hit position. The comparison unit 640 does not calculate a partial score for a position where there is no hit position. As a result, the similarity is 17 points, and it can be seen that the recognition candidate “ewder” is less similar to the recognition result than the recognition candidate “sweet” (similarity 225 points).

一方、全候補文字列が処理済である場合（ステップＳ１０７；ＹＥＳ）は、選択部６６０が出力文字列を選択する（ステップＳ１０８）。具体的には、候補文字列のうち、最も類似度が高いものを、出力文字列として選択する。ただし、同じ類似度が決定された複数の候補文字列が有る場合は、認識結果の文字長に最も近い候補文字列を選択する。類似度が同じ場合には、文字長が最も近い候補文字列が、認識結果と最も相関性が高いと判断できるからである。 On the other hand, if all candidate character strings have been processed (step S107; YES), the selection unit 660 selects an output character string (step S108). Specifically, the candidate character string having the highest similarity is selected as the output character string. However, if there are a plurality of candidate character strings for which the same degree of similarity is determined, the candidate character string closest to the character length of the recognition result is selected. This is because if the degree of similarity is the same, it can be determined that the candidate character string having the closest character length has the highest correlation with the recognition result.

次に文字列処理部６０はステップＳ１０３で分割した全単語について上記処理済みであるか否か判別する（ステップＳ１０９）。未処理の単語が有る場合（ステップＳ１０９；ＮＯ）は、ステップＳ１０４に戻って次の単語を注目認識文字列として処理を繰り返す。一方、全単語が処理済である場合（ステップＳ１０９；ＹＥＳ）は、出力部６７０が出力文字列を出力する（ステップＳ１１０）。具体的には、出力部６７０が出力文字列を単語順に並べて、補正済みの認識結果として表示部４０に出力する Next, the character string processing unit 60 determines whether or not all the words divided in step S103 have been processed (step S109). If there is an unprocessed word (step S109; NO), the process returns to step S104 and the process is repeated with the next word as the attention recognition character string. On the other hand, when all the words have been processed (step S109; YES), the output unit 670 outputs an output character string (step S110). Specifically, the output unit 670 arranges the output character strings in the order of words and outputs them to the display unit 40 as corrected recognition results.

以上説明したとおり、本実施形態の文字認識装置１は認識対象となった文字列（認識文字列）の認識結果と、単語辞書の単語とを、複数の起点から比較した結果に基づいてそれぞれ実行する。そのため、認識結果と候補文字列をより包括的に比較した結果に基づいて認識結果を精度良く補正できる。 As described above, the character recognition device 1 of the present embodiment executes the recognition result of the character string (recognized character string) that is the recognition target and the word dictionary word based on the results of comparison from a plurality of starting points. To do. Therefore, the recognition result can be corrected with high accuracy based on the result of more comprehensive comparison between the recognition result and the candidate character string.

また、本実施形態の文字認識装置１は前記複数の位置のそれぞれを起点として、認識結果のより信頼度が高い認識候補が、候補文字列のより類似した位置に現れる程度を示す部分スコアを求め、各スコアを合算して類似度を求める。この部分スコアは、信頼度が高い認識候補が、起点から比較した認識結果と候補文字列が一致する程度を示す。そのため、各部分スコアを合算することで、各起点から比較した一致する程度をそれぞれ反映した精度の高い類似度を求めることが出来る。また、各起点からの比較結果を示す部分スコアを総合して類似度を求めるため、例え認識結果の位置が実際とはずれていたり、一部の文字が認識不能であった場合でも認識精度の低下を押さえることが出来る。そのため、精度良く類似度を決定することが出来る。 Further, the character recognition device 1 according to the present embodiment obtains a partial score indicating the degree to which recognition candidates with higher reliability of the recognition result appear at more similar positions in the candidate character string, starting from each of the plurality of positions. Then, the scores are combined to obtain the similarity. This partial score indicates the degree to which a recognition candidate having a high reliability matches the recognition result compared with the starting point and the candidate character string. Therefore, by adding the partial scores, it is possible to obtain a high-accuracy similarity that reflects the degree of matching compared from the starting points. In addition, since the similarity is obtained by combining the partial scores indicating the comparison results from each starting point, even if the position of the recognition result is different from the actual position or some characters are unrecognizable, the recognition accuracy decreases. Can be suppressed. Therefore, the similarity can be determined with high accuracy.

また、本実施形態の文字認識装置１は複数の起点となる位置から連続して、認識候補の何れかが、候補文字列の対応する位置に現れる文字の何れかと一致する範囲で、認識結果と候補文字列とを比較した部分スコアに基づいて類似度を決定する。言い換えれば、認識結果と候補文字列が類似する部分のそれぞれついて比較スコアを求め、それぞれの比較スコアを合算して全体の類似度を決定する。そのため、認識結果と認識文字列の類似度が高い場合でも低い場合でも、全体の類似度を精度良く求めることが出来る。また、起点から比較して、類似しない文字に到達すると比較処理を中断するため、処理量を低減することができる。 In addition, the character recognition device 1 according to the present embodiment continuously recognizes the recognition result within a range in which any of the recognition candidates matches any of the characters appearing at the corresponding position of the candidate character string, starting from a plurality of starting positions. The degree of similarity is determined based on the partial score obtained by comparing the candidate character string. In other words, a comparison score is obtained for each portion where the recognition result and the candidate character string are similar, and the overall similarity is determined by adding the comparison scores. Therefore, the overall similarity can be obtained with high accuracy regardless of whether the similarity between the recognition result and the recognized character string is high or low. In addition, the comparison processing is interrupted when characters that are not similar to each other are reached from the starting point, so that the processing amount can be reduced.

さらに、一つの認識結果に複数のヒット位置があった場合に、各ヒット位置を基点として詳細スコアを求め、最も大きい詳細スコアを部分スコアとして採用する。そのため、認識結果と単語とが一致する程度を正確に表す比較結果に基づいて、認識結果を補正することが出来る。 Furthermore, when there are a plurality of hit positions in one recognition result, a detailed score is obtained from each hit position as a base point, and the largest detailed score is adopted as a partial score. Therefore, the recognition result can be corrected based on the comparison result that accurately represents the degree to which the recognition result matches the word.

また、特定部が特定した認識結果と候補文字列とで同じ文字が、互いに類似した位置で現れる複数の位置（ヒット位置）を起点として比較する。即ち、認識結果と候補文字列とが一致する複数の部位について、認識結果と候補文字列とを比較した結果に基づいて認識結果を補正することが出来る。このため、認識結果と候補文字列の一部が大きく類似し、一部で類似度が小さい場合などでも、それぞれの類似部分を総合的に評価して、認識結果を補正することが出来る。 In addition, a comparison is made using a plurality of positions (hit positions) where the same character appears in positions similar to each other in the recognition result specified by the specifying unit and the candidate character string. That is, the recognition result can be corrected based on the result of comparing the recognition result with the candidate character string for a plurality of portions where the recognition result matches the candidate character string. For this reason, even if the recognition result and a part of the candidate character string are largely similar and the degree of similarity is small in part, the similar result can be comprehensively evaluated to correct the recognition result.

（変形例）
本発明の実施形態１について説明したが、本発明の実施形態はこれに限られず、さまざまな変形が可能である。例えば、上記実施形態では、ＯＣＲ２１０が出力する認識結果には複数の認識候補に対して信頼度の順番が設定されていた。しかし、これに限らず信頼度の数値や、信頼度のレベル（信頼度低、信頼度中、信頼度高の３レベルなど）が定義された認識結果を出力するＯＣＲ２１０を用いても良い。
この場合、ステップＳ３０１やステップＳ３０５で加算するスコアを、信頼度の順位（候補番号）に応じて決定する代わりに、正規化した信頼度の数値やレベルに応じて決定すれば良い。例えば、スコア表では信頼度に関わらず位置のズレ（認識番号と比較番号の差）に応じたスコアを設定し、ステップＳ３０１やステップＳ３０５において加算する際に、信頼度の大きさに応じた重み係数を用いてスコアを調整すればよい。 (Modification)
Although Embodiment 1 of this invention was demonstrated, embodiment of this invention is not restricted to this, A various deformation | transformation is possible. For example, in the above embodiment, the order of reliability is set for a plurality of recognition candidates in the recognition result output by the OCR 210. However, the present invention is not limited to this, and an OCR 210 that outputs a recognition result in which a numerical value of reliability and a reliability level (three levels of low reliability, medium reliability, and high reliability) are defined may be used.
In this case, the score to be added in step S301 or step S305 may be determined according to the normalized numerical value or level of reliability, instead of being determined according to the rank (candidate number) of reliability. For example, in the score table, a score corresponding to a positional shift (difference between a recognition number and a comparison number) is set regardless of the reliability, and a weight corresponding to the magnitude of the reliability when adding in step S301 or step S305. The score may be adjusted using the coefficient.

また上記実施形態１では、ＯＣＲ２１０が、一文字に対して最大値２つの認識候補を出力する場合について説明した。しかしこれに変えて、より多くの認識候補（例えば各１０文字づつ）を出力するＯＣＲや、所定の信頼度以上の認識候補を無制限に出力するＯＣＲを用いても良い。 In the first embodiment, the case has been described in which the OCR 210 outputs a recognition candidate having two maximum values for one character. However, instead of this, an OCR that outputs more recognition candidates (for example, 10 characters each) or an OCR that outputs an unlimited number of recognition candidates of a predetermined reliability may be used.

これに関連して、多くの認識候補を出力するＯＣＲを用いる場合は、結果取得部が認識候補の数を制限してもよい。通常、認識結果の補正に当たっては、各文字の認識結果に正しい文字が含まれるように、多くの認識候補を出力させる。しかし、本実施形態では、複数の起点から比較して結果に基づいて類似度を決定するため、認識候補の数を制限しても、補正精度の低下の割合は小さいため、認識候補の数を制限することが可能となる。これにより、処理対象となる認識候補の数を制限することにより、類似度の算出処理に必要な計算量を抑えることが出来る。また、多くの認識候補に基づいて補正する場合、信頼度の低い認識結果によってノイズが発生するが、認識候補の数を制限することでノイズが低減する場合もある。 In relation to this, when using OCR that outputs many recognition candidates, the result acquisition unit may limit the number of recognition candidates. Usually, when correcting the recognition result, many recognition candidates are output so that the recognition result of each character includes a correct character. However, in this embodiment, since the similarity is determined based on the result by comparing from a plurality of starting points, even if the number of recognition candidates is limited, the rate of decrease in correction accuracy is small, so the number of recognition candidates is It becomes possible to restrict. As a result, by limiting the number of recognition candidates to be processed, the amount of calculation required for the similarity calculation process can be suppressed. Further, when correction is performed based on many recognition candidates, noise is generated due to a recognition result with low reliability, but noise may be reduced by limiting the number of recognition candidates.

認識候補の数を制限する手段としては、結果取得部が得られた認識結果に所定数以上の認識候補が含まれている場合に、信頼度が低いものを削除すればよい。あるいは、ＯＣＲ２１０の設定を変更して、信頼度が高い認識候補のみを出力するように制御しても良い。また、認識候補の数を制限する代わりに、信頼度の下限を設定してもよい。 As a means for limiting the number of recognition candidates, if the recognition result obtained by the result acquisition unit includes a predetermined number or more of recognition candidates, the one with low reliability may be deleted. Alternatively, the setting of the OCR 210 may be changed so that only recognition candidates with high reliability are output. Further, instead of limiting the number of recognition candidates, a lower limit of reliability may be set.

また、上記実施形態では候補文字列は一般的な単語辞書として説明したが、これに限らず、熟語辞書や、テンプレートとなる文書を登録したテンプレート辞書であってもよい。また、文字認識装置を用いて認識する認識文字列が、料理メニューや人名など、現れる単語が既知の特定のジャンルに限定されるものである場合には、一般的な単語辞書に代わって特定のジャンルに対応する専門の単語辞書を用いても良い。このような構成によれば、一般的な辞書を用いる場合よりも、認識結果の補正の精度を高めることが出来る。 In the above embodiment, the candidate character string has been described as a general word dictionary. However, the candidate character string is not limited to this, and may be a idiom dictionary or a template dictionary in which a template document is registered. In addition, when the recognized character string to be recognized using the character recognition device is such that the appearing word is limited to a known specific genre such as a cooking menu or a person name, a specific word dictionary is used instead of a specific word dictionary. A specialized word dictionary corresponding to the genre may be used. According to such a configuration, the accuracy of correction of the recognition result can be improved as compared with the case of using a general dictionary.

また、上記実施形態ではＯＣＲの認識結果を補正する場合について説明したが、本発明は手書き認識の結果など、少なくとも一部の文字に対して複数の認識候補が定義された任意の文字認識の結果を補正する場合にも応用できる。 Moreover, although the said embodiment demonstrated the case where the recognition result of OCR was correct | amended, this invention is the result of the arbitrary character recognition in which several recognition candidates were defined with respect to at least one part character, such as the result of handwriting recognition. It can also be applied when correcting

また上記実施例では、決定部は類似度を、部分スコアを合算して求めた。しかしこれに限定されず、部分スコアを重み付け加算するなど、部分スコアが大きくなるにつれて類似度が大きくなるような代替の方法を用いて部分スコアに応じた類似度を決定しても良い。例えば、誤認識が発生しやすい文字に対応する認識候補については、比較結果への影響を低減するため、重みを小さくすることが考えられる。また、文字列の類似度の算出段階で、認識結果と候補文字列の長さの違いに応じて類似度を加減しても良い。例えば、文字列の長さが同じ場合には類似度を１．２倍にし、２文字以上異なる場合は０．９倍する。この加減率は、文字長の認識誤差が発生する程度に応じて自由に設定可能である。このような構成によれば、文字長の違いを反映した精度の高い類似度を決定できる。 Moreover, in the said Example, the determination part calculated | required similarity by adding a partial score. However, the present invention is not limited to this, and the degree of similarity according to the partial score may be determined using an alternative method in which the degree of similarity increases as the partial score increases, such as weighted addition of partial scores. For example, for a recognition candidate corresponding to a character that is likely to be erroneously recognized, it is conceivable to reduce the weight in order to reduce the influence on the comparison result. Further, in the step of calculating the similarity between character strings, the similarity may be adjusted according to the difference between the recognition result and the length of the candidate character string. For example, when the lengths of character strings are the same, the similarity is increased by 1.2, and when two or more characters are different, the similarity is increased by 0.9. This adjustment rate can be freely set according to the degree of occurrence of character length recognition errors. According to such a configuration, it is possible to determine a high-accuracy similarity that reflects the difference in character length.

また、選択部は候補文字列のうち、類似度が最も高い候補文字列を出力文字列として選択した。しかし、これに限らず、選択部は類似度が高い順に所定数（例えば３つ）の候補文字列を出力する、類似度が所定の閾値以上の候補文字列を全て出力する、等ユーザの設定操作により指定された任意の選択条件を用いて出力文字列を選択してよい。 Moreover, the selection part selected the candidate character string with the highest similarity among candidate character strings as an output character string. However, the present invention is not limited to this, and the selection unit outputs a predetermined number (for example, three) of candidate character strings in descending order of similarity, and outputs all candidate character strings having a similarity equal to or higher than a predetermined threshold. The output character string may be selected using an arbitrary selection condition designated by the operation.

また、ＣＰＵ２１、ＲＡＭ２２，ＲＯＭ２３、等から構成される文字認識装置が実行する認識結果の補正処理（認識結果出力処理）を行う中心となる部分は、専用のシステムによらず、通常のコンピュータシステムを用いて実現可能である。たとえば、前記の動作を実行するためのコンピュータプログラムを、コンピュータが読み取り可能な記録媒体（フレキシブルディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋＲｅａｄＯｎｌｙＭｅｍｏｒｙ）等）に格納して配布し、当該コンピュータプログラムをコンピュータにインストールすることにより、前記の処理を実行する情報端末を構成してもよい。また、インターネット等の通信ネットワーク上のサーバ装置が有する記憶装置に当該コンピュータプログラムを格納しておき、通常のコンピュータシステムがダウンロード等することで情報処理装置を構成してもよい。 In addition, a central part for performing recognition result correction processing (recognition result output processing) executed by a character recognition device including the CPU 21, RAM 22, ROM 23, etc. is not a dedicated system, but a normal computer system. It can be realized using. For example, a computer program for executing the above operation is stored and distributed in a computer-readable recording medium (flexible disk, CD-ROM, DVD-ROM (Digital Versatile Disk Read Only Memory, etc.) You may comprise the information terminal which performs the said process by installing a computer program in a computer. Alternatively, the computer program may be stored in a storage device included in a server device on a communication network such as the Internet, and the information processing device may be configured by being downloaded by a normal computer system.

また、文字認識装置の機能を、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）とアプリケーションプログラムの分担、またはＯＳとアプリケーションプログラムとの協働により実現する場合などには、アプリケーションプログラム部分のみを記録媒体や記憶装置に格納してもよい。 In addition, when the functions of the character recognition device are realized by sharing an OS (Operating System) and an application program, or by cooperation between the OS and the application program, only the application program portion is stored in a recording medium or a storage device. May be.

また、搬送波にコンピュータプログラムを重畳し、通信ネットワークを介して配信することも可能である。たとえば、通信ネットワーク上の掲示板（ＢＢＳ：ＢｕｌｌｅｔｉｎＢｏａｒｄＳｙｓｔｅｍ）に前記コンピュータプログラムを掲示し、ネットワークを介して前記コンピュータプログラムを配信してもよい。そして、このコンピュータプログラムを起動し、ＯＳの制御下で、他のアプリケーションプログラムと同様に実行することにより、前記の処理を実行できるように構成してもよい。 It is also possible to superimpose a computer program on a carrier wave and distribute it via a communication network. For example, the computer program may be posted on a bulletin board (BBS: Bulletin Board System) on a communication network, and the computer program may be distributed via the network. The computer program may be started and executed in the same manner as other application programs under the control of the OS, so that the above-described processing may be executed.

また、上記メニュー表示装置が実行する処理の一部を、文字認識装置とは独立したコンピュータを用いて実現しても良い。 A part of the processing executed by the menu display device may be realized by using a computer independent of the character recognition device.

以上、本発明の好ましい実施形態について説明したが、本発明は係る特定の実施形態に限定されるものではなく、本発明には、特許請求の範囲に記載された発明とその均等の範囲が含まれる。以下に、本願出願の当初の特許請求の範囲に記載された発明を付記する。 As mentioned above, although preferable embodiment of this invention was described, this invention is not limited to the specific embodiment which concerns, This invention includes the invention described in the claim, and its equivalent range It is. Hereinafter, the invention described in the scope of claims of the present application will be appended.

（付記１）
文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得する結果取得手段と、
前記認識文字列に対応する複数の候補単語を取得する候補取得手段と、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定する決定手段と、
前記複数の候補単語について前記決定手段が決定した類似度に基づいて候補単語の１つを選択し出力する出力手段と、
を備えることを特徴とする情報処理装置。 (Appendix 1)
A result acquisition means for acquiring a recognition character string having a plurality of recognition candidate characters ranked for each character as a word recognition result from the character recognition device;
Candidate acquisition means for acquiring a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. A decision means to
Output means for selecting and outputting one of the candidate words based on the similarity determined by the determining means for the plurality of candidate words;
An information processing apparatus comprising:

（付記２）
前記決定手段は、前記認識候補文字が前記候補単語の文字列上に現れる位置を起点として、前後方向に位置する文字について比較することにより、前記認識候補文字のより順位が高い認識候補文字が、前記候補単語のより類似した位置に現れる程度を示すスコアを求め、求めたスコアに基づいて前記類似度を決定する、
ことを特徴とする付記１に記載の情報処理装置。 (Appendix 2)
The determination unit compares a character positioned in the front-rear direction with a position where the recognition candidate character appears on the character string of the candidate word as a starting point, so that the recognition candidate character having a higher rank than the recognition candidate character, Obtaining a score indicating the degree of appearing at a more similar position of the candidate word, and determining the similarity based on the obtained score;
The information processing apparatus according to appendix 1, wherein

（付記３）
前記決定手段は、前記起点となる位置から連続して、前記認識候補文字の何れかが、前記候補単語の文字列の対応する位置に現れる文字と一致する範囲で、前記認識候補文字と前記候補単語を比較する、
ことを特徴とする付記２に記載の情報処理装置。 (Appendix 3)
The determining means includes the recognition candidate character and the candidate within a range in which any of the recognition candidate characters matches a character appearing at a corresponding position in the character string of the candidate word continuously from the starting position. Compare words,
The information processing apparatus according to Supplementary Note 2, wherein

（付記４）
複数の単語を記憶している単語記憶手段をさらに含み、
前記候補取得手段は、前記認識文字列の文字数に応じて、前記単語記憶手段が記憶している単語の中から複数の候補単語を取得する、
ことを特徴とする付記１乃至３の何れかに記載の情報処理装置。 (Appendix 4)
A word storage means for storing a plurality of words;
The candidate acquisition unit acquires a plurality of candidate words from the words stored in the word storage unit according to the number of characters of the recognized character string.
The information processing apparatus according to any one of appendices 1 to 3, characterized in that:

（付記５）
文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得し、
前記認識文字列に対応する複数の候補単語を取得し、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定し、
前記複数の候補単語について前記決定した類似度に基づいて候補単語の１つを選択し出力する、
ことを特徴とする情報処理方法。 (Appendix 5)
As a recognition result of the word from the character recognition device, obtain a recognition character string having a plurality of recognition candidate characters ranked for each character,
Obtaining a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. And
Selecting and outputting one of the candidate words based on the determined similarity for the plurality of candidate words;
An information processing method characterized by the above.

（付記６）
コンピュータを、
文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得する結果取得手段と、
前記認識文字列に対応する複数の候補単語を取得する候補取得手段と、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定する決定手段と、
前記複数の候補単語について前記決定手段が決定した類似度に基づいて候補単語の１つを選択し出力する出力手段、
として機能させるためのプログラム。 (Appendix 6)
Computer
A result acquisition means for acquiring a recognition character string having a plurality of recognition candidate characters ranked for each character as a word recognition result from the character recognition device;
Candidate acquisition means for acquiring a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. A decision means to
Output means for selecting and outputting one of the candidate words based on the similarity determined by the determining means for the plurality of candidate words;
Program to function as.

１…文字認識装置、１０…画像入力部、２０…情報処理部、２１…ＣＰＵ、２２…ＲＡＭ、２３…ＲＯＭ、２４…Ｉ／Ｏ部、２５…外部記憶装置、２６…内部バス、２１０…ＯＣＲ、２２０…文字辞書記憶部、２４０…単語辞書記憶部、３０…操作部、４０…表示部、６０…文字列処理部、６１０…結果取得部、６２０…特定部、６３０…候補取得部、６４０…比較部、６５０…決定部、６６０…選択部、６７０…出力部、ＴＩ…入力画像、Ｒ…認識結果 DESCRIPTION OF SYMBOLS 1 ... Character recognition apparatus, 10 ... Image input part, 20 ... Information processing part, 21 ... CPU, 22 ... RAM, 23 ... ROM, 24 ... I / O part, 25 ... External storage device, 26 ... Internal bus, 210 ... OCR, 220 ... character dictionary storage unit, 240 ... word dictionary storage unit, 30 ... operation unit, 40 ... display unit, 60 ... character string processing unit, 610 ... result acquisition unit, 620 ... identification unit, 630 ... candidate acquisition unit, 640 ... comparison unit, 650 ... determination unit, 660 ... selection unit, 670 ... output unit, TI ... input image, R ... recognition result

Claims

文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得する結果取得手段と、
前記認識文字列に対応する複数の候補単語を取得する候補取得手段と、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定する決定手段と、
前記複数の候補単語について前記決定手段が決定した類似度に基づいて候補単語の１つを選択し出力する出力手段と、
を備えることを特徴とする情報処理装置。 A result acquisition means for acquiring a recognition character string having a plurality of recognition candidate characters ranked for each character as a word recognition result from the character recognition device;
Candidate acquisition means for acquiring a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. A decision means to
Output means for selecting and outputting one of the candidate words based on the similarity determined by the determining means for the plurality of candidate words;
An information processing apparatus comprising:

前記決定手段は、前記認識候補文字が前記候補単語の文字列上に現れる位置を起点として、前後方向に位置する文字について比較することにより、前記認識候補文字のより順位が高い認識候補文字が、前記候補単語のより類似した位置に現れる程度を示すスコアを求め、求めたスコアに基づいて前記類似度を決定する、
ことを特徴とする請求項１に記載の情報処理装置。 The determination unit compares a character positioned in the front-rear direction with a position where the recognition candidate character appears on the character string of the candidate word as a starting point, so that the recognition candidate character having a higher rank than the recognition candidate character, Obtaining a score indicating the degree of appearing at a more similar position of the candidate word, and determining the similarity based on the obtained score;
The information processing apparatus according to claim 1.

前記決定手段は、前記起点となる位置から連続して、前記認識候補文字の何れかが、前記候補単語の文字列の対応する位置に現れる文字と一致する範囲で、前記認識候補文字と前記候補単語を比較する、
ことを特徴とする請求項２に記載の情報処理装置。 The determining means includes the recognition candidate character and the candidate within a range in which any of the recognition candidate characters matches a character appearing at a corresponding position in the character string of the candidate word continuously from the starting position. Compare words,
The information processing apparatus according to claim 2.

複数の単語を記憶している単語記憶手段をさらに含み、
前記候補取得手段は、前記認識文字列の文字数に応じて、前記単語記憶手段が記憶している単語の中から複数の候補単語を取得する、
ことを特徴とする請求項１乃至３の何れかに記載の情報処理装置。 A word storage means for storing a plurality of words;
The candidate acquisition unit acquires a plurality of candidate words from the words stored in the word storage unit according to the number of characters of the recognized character string.
The information processing apparatus according to claim 1, wherein the information processing apparatus is an information processing apparatus.

文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得し、
前記認識文字列に対応する複数の候補単語を取得し、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定し、
前記複数の候補単語について前記決定した類似度に基づいて候補単語の１つを選択し出力する、
ことを特徴とする情報処理方法。 As a recognition result of the word from the character recognition device, obtain a recognition character string having a plurality of recognition candidate characters ranked for each character,
Obtaining a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. And
Selecting and outputting one of the candidate words based on the determined similarity for the plurality of candidate words;
An information processing method characterized by the above.

コンピュータを、
文字認識装置からの単語の認識結果として、文字毎に順位付けされた複数の認識候補文字を有する認識文字列を取得する結果取得手段と、
前記認識文字列に対応する複数の候補単語を取得する候補取得手段と、
前記認識文字列の各文字に対応する複数の認識候補文字が前記候補単語の文字列上に現れる位置と当該認識候補文字の順位に基づいて、前記認識文字列と前記候補単語の類似度を決定する決定手段と、
前記複数の候補単語について前記決定手段が決定した類似度に基づいて候補単語の１つを選択し出力する出力手段、
として機能させるためのプログラム。 Computer
A result acquisition means for acquiring a recognition character string having a plurality of recognition candidate characters ranked for each character as a word recognition result from the character recognition device;
Candidate acquisition means for acquiring a plurality of candidate words corresponding to the recognized character string;
The similarity between the recognized character string and the candidate word is determined based on a position where a plurality of recognition candidate characters corresponding to each character of the recognized character string appear on the character string of the candidate word and the rank of the recognized candidate character. A decision means to
Output means for selecting and outputting one of the candidate words based on the similarity determined by the determining means for the plurality of candidate words;
Program to function as.