JP5344338B2

JP5344338B2 - Program, information storage medium, and character string recognition device

Info

Publication number: JP5344338B2
Application number: JP2008199388A
Authority: JP
Inventors: 碧蘭朱; 正樹中川
Original assignee: NATIONAL UNIVERSITY CORPORATION TOKYO UNIVERSITY OF AGRICULUTURE & TECHNOLOGY
Current assignee: NATIONAL UNIVERSITY CORPORATION TOKYO UNIVERSITY OF AGRICULUTURE & TECHNOLOGY
Priority date: 2008-08-01
Filing date: 2008-08-01
Publication date: 2013-11-20
Anticipated expiration: 2028-08-01
Also published as: JP2010039609A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a program, an information storage medium and a character string recognition device, for improving a recognition rate by weakening tendency to interpret a character string into the small number of characters, or tendency to interpret the character string into the large number of characters. <P>SOLUTION: An off-stroke to be set as a temporary segmentation point is determined from among off-strokes between strokes, and character recognition is carried out for each temporary segmentation unit segmented by the temporary segmentation point, and a candidate character string is generated. The temporary segmentation unit includes at least one minimum temporary segmentation unit to be segmented by the adjacent temporary segmentation point. As for the connection of the minimum temporary segmentation unit included in the temporary segmentation unit corresponding to each candidate character of the candidate character string and the adjacent minimum segmentation unit, the certainty of a plurality of items is evaluated, and as for the certainty of the plurality of items, the value of weight to be given to each item is set. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、プログラム、情報記憶媒体及び文字列認識装置に関する。 The present invention relates to a program, an information storage medium, and a character string recognition device.

従来から、与えられた文字列パタンを１文字ごとに分割して切出し候補とし、切出し候補に対応する文字パタンを文字認識して候補文字を出力し、候補文字列を評価して最適な文字列を認識結果として出力するオンライン文字列認識装置が知られている（例えば特許文献１参照）。
特開平１１−３２８３１６号公報 Conventionally, a given character string pattern is divided into characters to be extracted candidates, character patterns corresponding to the extracted candidates are recognized, candidate characters are output, the candidate character strings are evaluated, and the optimal character string Is known as an on-line character string recognition device (see, for example, Patent Document 1).
JP-A-11-328316

しかしながら従来の文字列認識装置では、文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向があった。 However, the conventional character string recognition apparatus has a tendency to interpret a character string into a small number of characters or to interpret a character string into a large number of characters.

本発明は、以上のような課題に鑑みてなされたものであり、その目的とするところは、
文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向を弱めて、認識率を向上させることが可能なプログラム、情報記憶媒体及び文字列認識装置を提供することにある。 The present invention has been made in view of the problems as described above, and the object is as follows.
An object of the present invention is to provide a program, an information storage medium, and a character string recognition device capable of improving the recognition rate by weakening the tendency to interpret a character string into a small number of characters and the tendency to interpret a character string into a large number of characters.

（１）本発明は、手書き入力された文字パタンのストロークから文字列を認識するためのプログラムであって、
ストローク間のオフストロークのうち、仮切出しポイントとして設定するオフストロークを決定する仮切出しポイント決定部と、
前記仮切出しポイントによって切り出された仮切出しユニット毎に文字認識を行い、候補文字列を生成する候補文字列生成部と、
前記候補文字列について複数項目の確からしさを評価し、評価結果に基づき文字列を認識する認識部としてコンピュータを機能させ、
前記仮切出しユニットは、隣接する前記仮切出しポイントにより切り出される最小仮切出しユニットを少なくとも１つ含み、
前記認識部は、
前記候補文字列の各候補文字に対応する前記仮切出しユニットに含まれる前記最小仮切出しユニット、および隣接する前記最小切出しユニット間の連結について、前記複数項目の確からしさを評価し、前記複数項目の確からしさについて各項目に与える重みの値を設定することを特徴とする。 (1) The present invention is a program for recognizing a character string from a stroke of a character pattern input by handwriting,
Of the off-strokes between strokes, a provisional cut-out point determination unit that determines an off-stroke to be set as a temporary cut-out point;
Character recognition for each provisional cutout unit cut out by the provisional cutout point, a candidate character string generation unit that generates a candidate character string,
Evaluating the probability of multiple items for the candidate character string, causing the computer to function as a recognition unit that recognizes the character string based on the evaluation result,
The temporary cutting unit includes at least one minimum temporary cutting unit cut out by the adjacent temporary cutting point,
The recognition unit
For the connection between the minimum provisional cutout unit included in the provisional cutout unit corresponding to each candidate character of the candidate character string and the adjacent minimum cutout unit, the probability of the plurality of items is evaluated, and the plurality of items A feature is to set a weight value to be given to each item for the probability.

また本発明は、上記各部を含む文字列認識装置に関係する。また本発明はコンピュータ読み取り可能な情報記憶媒体であって、上記各部としてコンピュータを機能させるプログラムを記憶した情報記憶媒体に関係する。 The present invention also relates to a character string recognition device including the above-described parts. The present invention is also a computer-readable information storage medium, and relates to an information storage medium storing a program that causes a computer to function as each of the above-described units.

本発明によれば、各最小仮切出しユニットと隣接する最小仮切出しユニット間の連結について複数項目の確からしさを評価し、前記複数項目の各項目に適切な重みの値を設定することで、候補文字列の尤度評価を最小仮切出しユニットに基づき行うことができ、文字列候補の文字数に影響されず文字を認識することができる。すなわち、文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向を弱めて認識率を向上させることができる。 According to the present invention, by evaluating the likelihood of a plurality of items for the connection between each minimum provisional cutout unit and the adjacent minimum provisional cutout unit, and setting an appropriate weight value for each item of the plurality of items, The likelihood evaluation of the character string can be performed based on the minimum provisional cutout unit, and the character can be recognized without being influenced by the number of characters of the character string candidate. That is, the recognition rate can be improved by weakening the tendency of interpreting a character string to a small number of characters and the tendency to interpret a character string to a large number of characters.

（２）また本発明に係るプログラム及び情報記憶媒体では、
前記認識部は、
前記候補文字列の前記最小仮切出しユニット、および隣接する前記最小切出しユニット間の連結について、言語文脈の確からしさ、文字パタンサイズの確からしさ、文字パタン内分割の確からしさ、文字認識の確からしさ、単一文字パタン位置の確からしさ、ペア文字パタン位置の確からしさ及び前記仮切出しポイントの確からしさを含む複数項目の確からしさを評価するようにしてもよい。 (2) In the program and information storage medium according to the present invention,
The recognition unit
Regarding the connection between the minimum provisional cutout unit of the candidate character string and the adjacent minimum cutout unit, the accuracy of language context, the probability of character pattern size, the probability of division within character patterns, the probability of character recognition, You may make it evaluate the probability of several items including the probability of a single character pattern position, the probability of a pair character pattern position, and the probability of the said provisional cutout point.

本発明によれば、各候補文字列について上記７項目の確からしさを評価することで、認識率を向上させることができる。 According to the present invention, the recognition rate can be improved by evaluating the likelihood of the above seven items for each candidate character string.

（３）また本発明に係るプログラム及び情報記憶媒体では、
前記認識部は、
前記仮切出しポイントを、隣接する文字パタン間の仮切出しポイントとして判断する場合には、前記仮切出しポイントが切出しポイントとして識別される確からしさを評価し、前記仮切出しポイントを、文字パタン内部の仮切出しポイントとして判断する場合には、前記仮切出しポイントが非切出しポイントとして識別される確からしさを評価するようにしてもよい。 (3) In the program and information storage medium according to the present invention,
The recognition unit
When determining the temporary cutting point as a temporary cutting point between adjacent character patterns, the probability of identifying the temporary cutting point as a cutting point is evaluated, and the temporary cutting point is determined as a temporary cutting point inside the character pattern. When determining as a cut-out point, the probability that the temporary cut-out point is identified as a non-cut-out point may be evaluated.

本発明によれば、仮切出しポイントが文字パタン間にあるか、或いは文字パタン内部にあるかによって、それぞれ切出しポイントとして識別される確からしさ、或いは非切出しポイントとして識別される確からしさを評価することによって、文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向を弱めて認識率を向上させることができる。 According to the present invention, the probability of being identified as a cut-out point or the probability of being identified as a non-cut-out point is evaluated depending on whether the provisional cut-out point is between character patterns or inside a character pattern. Accordingly, the recognition rate can be improved by weakening the tendency of interpreting a character string into a small number of characters and the tendency of interpreting a character string into a large number of characters.

（４）また本発明に係るプログラム及び情報記憶媒体では、
前記仮切出しポイント決定部は、
前記オフストロークから抽出した多次元の特徴値をサポートベクターマシン（ＳＶＭ）に適用することで、前記仮切出しポイントのＳＶＭ出力値を求め、
前記認識部は、
前記ＳＶＭ出力値に基づき前記仮切出しポイントの確からしさを評価するようにしてもよい。 (4) In the program and information storage medium according to the present invention,
The temporary cut point determination unit
By applying a multi-dimensional feature value extracted from the off-stroke to a support vector machine (SVM), an SVM output value of the temporary cut point is obtained.
The recognition unit
You may make it evaluate the certainty of the said temporary extraction point based on the said SVM output value.

本発明によれば、仮切出しポイントを多角的に評価することができる。 According to the present invention, the provisional cut-out point can be evaluated from various perspectives.

（５）また本発明に係るプログラム及び情報記憶媒体では、
前記認識部は、
遺伝的アルゴリズムにより学習した複数の重みの値を前記複数項目の確からしさに与えるようにしてもよい。 (5) In the program and information storage medium according to the present invention,
The recognition unit
A plurality of weight values learned by a genetic algorithm may be given to the probability of the plurality of items.

以下、本実施形態について説明する。なお、以下に説明する本実施形態は、特許請求の範囲に記載された本発明の内容を不当に限定するものではない。また本実施形態で説明される構成の全てが、本発明の必須構成要件であるとは限らない。 Hereinafter, this embodiment will be described. In addition, this embodiment demonstrated below does not unduly limit the content of this invention described in the claim. In addition, all the configurations described in the present embodiment are not necessarily essential configuration requirements of the present invention.

１．構成
図１に本実施形態の文字列認識装置の機能ブロック図の一例を示す。なお本実施形態の文字列認識装置は図１の構成要素（各部）の一部を省略した構成としてもよい。 1. Configuration FIG. 1 shows an example of a functional block diagram of the character string recognition apparatus of this embodiment. Note that the character string recognition apparatus of this embodiment may have a configuration in which some of the components (each unit) in FIG. 1 are omitted.

文字入力部１６０は、ユーザがペンで手書き文字を入力するためのものであり、その機能は、タブレット（筆記面）などにより実現できる。文字入力部１６０は、ペンが筆記面に触れてから離れるまでのペン先の位置を表す座標データを一定時間間隔で検出し、検出された座標データを筆点情報取得部１１０に出力する。 The character input unit 160 is for the user to input handwritten characters with a pen, and the function can be realized by a tablet (writing surface) or the like. The character input unit 160 detects coordinate data representing the position of the pen tip from when the pen touches the writing surface until it leaves, and outputs the detected coordinate data to the writing point information acquisition unit 110.

記憶部１７０は、処理部１００や通信部１９６などのワーク領域となるもので、その機能はＲＡＭなどにより実現できる。 The storage unit 170 serves as a work area for the processing unit 100, the communication unit 196, and the like, and its function can be realized by a RAM or the like.

情報記憶媒体１８０（コンピュータにより読み取り可能な媒体）は、プログラムやデータなどを格納するものであり、その機能は、光ディスク（ＣＤ、ＤＶＤ）、光磁気ディスク（ＭＯ）、磁気ディスク、ハードディスク、磁気テープ、或いはメモリ（ＲＯＭ）などにより実現できる。処理部１００は、情報記憶媒体１８０に格納されるプログラム（データ）に基づいて本実施形態の種々の処理を行う。即ち情報記憶媒体１８０には、本実施形態の各部としてコンピュータを機能させるためのプログラム（各部の処理をコンピュータに実行させるためのプログラム）が記憶される。 The information storage medium 180 (computer-readable medium) stores programs, data, and the like, and functions as an optical disk (CD, DVD), magneto-optical disk (MO), magnetic disk, hard disk, and magnetic tape. Alternatively, it can be realized by a memory (ROM). The processing unit 100 performs various processes of the present embodiment based on a program (data) stored in the information storage medium 180. That is, the information storage medium 180 stores a program for causing a computer to function as each unit of the present embodiment (a program for causing a computer to execute processing of each unit).

表示部１９０は、本実施形態により生成された画像を出力するものであり、その機能は、ＣＲＴ、ＬＣＤ、タッチパネル型ディスプレイ、或いはＨＭＤ（ヘッドマウントディスプレイ）などにより実現できる。 The display unit 190 outputs an image generated according to the present embodiment, and its function can be realized by a CRT, LCD, touch panel display, HMD (head mounted display), or the like.

処理部１００（プロセッサ）は、文字入力部１６０からの座標データやプログラムなどに基づいて、認識処理、表示処理などの処理を行う。この処理部１００は記憶部１７０内の主記憶部１７２をワーク領域として各種処理を行う。処理部１００の機能は各種プロセッサ（ＣＰＵ、ＤＳＰ等）、ＡＳＩＣ（ゲートアレイ等）などのハードウェアや、プログラムにより実現できる。 The processing unit 100 (processor) performs processing such as recognition processing and display processing based on coordinate data from the character input unit 160, a program, and the like. The processing unit 100 performs various processes using the main storage unit 172 in the storage unit 170 as a work area. The functions of the processing unit 100 can be realized by hardware such as various processors (CPU, DSP, etc.), ASIC (gate array, etc.), and programs.

処理部１００は、筆点情報取得部１１０、仮切出しポイント決定部１１２、候補文字列生成部１１４、認識部１１６、表示処理部１２０を含む。なおこれらの一部を省略する構成としてもよい。 The processing unit 100 includes a writing point information acquisition unit 110, a provisional cutout point determination unit 112, a candidate character string generation unit 114, a recognition unit 116, and a display processing unit 120. Note that some of these may be omitted.

筆点情報取得部１１０は、ペンが筆記面に触れてから離れるまでの座標データ列（筆点座標系列）をストロークのデータとして文字入力部１６０から取得する。また筆点情報取得部１１０は、他の情報処理装置からネットワーク及び通信部１９６を介してストロークのデータを取得するようにしてもよい。 The writing point information acquisition unit 110 acquires, from the character input unit 160, a coordinate data string (writing point coordinate series) from when the pen touches the writing surface until it leaves. The writing point information acquisition unit 110 may acquire stroke data from another information processing apparatus via a network and communication unit 196.

仮切出しポイント決定部１１２は、ストローク間のオフストローク（ペンが筆記面から離れてから再び触れるまでのベクトル）のうち、仮切出しポイントとして設定するオフストロークを決定する。具体的には、オフストロークから距離や交わりを表す特徴値を抽出し、抽出した特徴値に基づき仮切出しポイントとして設定するオフストロークを決定する。 The temporary cut point determination unit 112 determines an off stroke to be set as a temporary cut point among the off strokes between the strokes (vectors from when the pen leaves the writing surface until it touches again). Specifically, feature values representing distances and intersections are extracted from the off-stroke, and an off-stroke to be set as a temporary cut-out point is determined based on the extracted feature values.

また仮切出しポイント決定部１１２は、前記オフストロークから抽出した多次元の特徴値をサポートベクターマシン（ＳＶＭ）に適用することで、前記仮切出しポイントのＳＶＭ出力値を求めるようにしてもよい。 The temporary cut point determination unit 112 may obtain the SVM output value of the temporary cut point by applying a multidimensional feature value extracted from the off-stroke to a support vector machine (SVM).

候補文字列生成部１１４は、前記仮切出しポイントによって切り出された仮切出しユニット（隣接する前記仮切出しポイントにより切り出される最小仮切出しユニットを少なくとも１つ含む仮切出しユニット）毎に文字認識を行い、候補文字列を生成する。 The candidate character string generation unit 114 performs character recognition for each temporary cutting unit cut out by the temporary cutting point (a temporary cutting unit including at least one minimum temporary cutting unit cut out by the adjacent temporary cutting point) Generate a string.

認識部１１６は、前記候補文字列について複数項目の確からしさを評価し、評価結果に基づき文字列を認識する。また認識部１１６は、前記候補文字列の各候補文字に対応する前記仮切出しユニットに含まれる前記最小仮切出しユニット、および隣接する前記最小切出しユニット間の連結について、前記複数項目の確からしさを評価し、前記複数項目の確からしさについて各項目に適切な重みの値を設定する。 The recognition unit 116 evaluates the probability of a plurality of items for the candidate character string, and recognizes the character string based on the evaluation result. The recognizing unit 116 evaluates the probability of the plurality of items regarding the minimum provisional cutout unit included in the provisional cutout unit corresponding to each candidate character of the candidate character string and the connection between the adjacent minimum cutout units. Then, an appropriate weight value is set for each item for the probability of the plurality of items.

また認識部１１６は、前記候補文字列の各前記最小仮切出しユニットと隣接する前記最小切出しユニット間の連結について、言語文脈の確からしさ、文字パタンサイズの確からしさ、文字パタン内分割の確からしさ、文字認識の確からしさ、単一文字パタン位置の確からしさ、ペア文字パタン位置の確からしさ及び前記仮切出しポイントの確からしさを含む複数項目の確からしさを評価するようにしてもよい。 The recognizing unit 116, for the connection between each of the minimum provisional cutout units of the candidate character string and the adjacent minimum cutout unit, the accuracy of language context, the probability of character pattern size, the probability of division within character patterns, You may make it evaluate the probability of several items including the probability of character recognition, the probability of a single character pattern position, the probability of a pair character pattern position, and the probability of the said provisional cutout point.

また認識部１１６は、前記仮切出しポイントを、隣接する文字パタン間の仮切出しポイントとして判断する場合には、前記仮切出しポイントが切出しポイントとして識別される確からしさを評価し、前記仮切出しポイントを、文字パタン内部の仮切出しポイントとして判断する場合には、前記仮切出しポイントが非切出しポイントとして識別される確からしさを評価するようにしてもよい。 Further, when the recognition unit 116 determines the temporary cut point as a temporary cut point between adjacent character patterns, the recognition unit 116 evaluates the probability that the temporary cut point is identified as the cut point, and determines the temporary cut point as the temporary cut point. When determining as a temporary cut point inside a character pattern, the probability of identifying the temporary cut point as a non-cut point may be evaluated.

また認識部１１６は、前記ＳＶＭ出力値に基づき前記仮切出しポイントの確からしさを評価するようにしてもよい。 Further, the recognition unit 116 may evaluate the probability of the temporary cut point based on the SVM output value.

また認識部１１６は、遺伝的アルゴリズムにより学習した複数の重みの値を前記複数項目の確からしさに与えるようにしてもよい。 The recognition unit 116 may give a plurality of weight values learned by a genetic algorithm to the probability of the plurality of items.

表示処理部１２０は、認識部１１６により認識された文字列（認識結果）を表示部１９０に出力する。なお処理部１００は、認識部１１６による認識結果をネットワーク及び通信部１９６を介して他の情報処理装置に送信するようにしてもよい。 The display processing unit 120 outputs the character string (recognition result) recognized by the recognition unit 116 to the display unit 190. Note that the processing unit 100 may transmit the recognition result by the recognition unit 116 to another information processing apparatus via the network and communication unit 196.

通信部１９６は、他の情報処理装置との間で通信を行うための各種制御を行うものであり、その機能は、各種プロセッサ又は通信用ＡＳＩＣなどのハードウェアや、プログラムなどにより実現できる。 The communication unit 196 performs various controls for communicating with other information processing apparatuses, and functions thereof can be realized by hardware such as various processors or communication ASICs, programs, and the like.

なお本実施形態の各部としてコンピュータを機能させるためのプログラム（データ）は、サーバが有する情報記憶媒体からネットワーク及び通信部１９６を介して記憶部１７０、情報記憶媒体１８０に配信してもよい。このようなサーバの情報記憶媒体の使用も本発明の範囲内に含めることができる。 Note that a program (data) for causing a computer to function as each unit of the present embodiment may be distributed from the information storage medium of the server to the storage unit 170 and the information storage medium 180 via the network and the communication unit 196. Use of such server information storage media can also be included within the scope of the present invention.

２．本実施形態の手法
次に本実施形態の手法について図面を用いて説明する。 2. Next, the method of this embodiment will be described with reference to the drawings.

２−１．仮切出しポイントの決定
本実施形態では、入力された各ストロークを、隣接するストローク間の距離や重なりなどの特徴によって、切出しポイント、非切出しポイントと不確定ポイントに分類する。切出しポイントは、２つの文字パタン間にあると判断されたポイントであり、非切出しポイントは、１つの文字パタンの内部にあると判断されたポイントである。また不確定ポイントは、切出しポイントであるか非切出しポイントであるかを決定できないポイントである。本実施形態では、切出しポイントと不確定ポイントとを仮切出しポイントとする。 2-1. Determination of Temporary Cutting Point In this embodiment, each input stroke is classified into a cutting point, a non-cutting point, and an indeterminate point according to features such as the distance between adjacent strokes and overlap. The cut-out point is a point determined to be between two character patterns, and the non-cut-off point is a point determined to be inside one character pattern. An indeterminate point is a point that cannot be determined whether it is a cut point or a non-cut point. In the present embodiment, the cut-out point and the indeterminate point are set as temporary cut-out points.

本実施形態では、入力されたオフストロークから水平方向距離と交差長さの２つの特徴値を抽出する。オフストロークの水平方向距離特徴ｆ_ｄは、次式により求めることができる。 In the present embodiment, two feature values of the horizontal distance and the intersection length are extracted from the input off-stroke. The off-stroke horizontal distance feature f _d can be obtained by the following equation.

ｆ_ｄ＝Ｄ_Ｂｘ／ａｃｓ（１）
ここでａｃｓは平均文字サイズを示し、各ストロークの外接矩形における長い方の辺長をもとめ、全てのストロークについてその辺長をソートし、それらの辺長の中において長い方から３分の１のものに対して平均を取ることによって求めることができる。 f _d = D _Bx / acs (1)
Here, acs indicates the average character size, finds the longer side length in the circumscribed rectangle of each stroke, sorts the side lengths for all strokes, and is one-third from the longer side among those side lengths. It can be determined by taking an average over things.

またＤ_Ｂｘは、図２に示すように、注目するオフストロークの前の全ての外接矩形Ｂｂ_{ｐ＿ａｌｌ}と、注目するオフストロークの後の全ての外接矩形Ｂｂ_{ｓ＿ａｌｌ}とのＸ方向距離を示し、Ｄ_Ｂｘは、次式により求めることができる。 The _{D Bx,} as shown in FIG. 2, shows all of the circumscribed rectangles _{Bb P_ALL} previous off stroke of interest, the X direction between the all enclosing rectangles _{Bb S_all} after off stroke of _{interest, D Bx} Can be obtained by the following equation.

Ｄ_Ｂｘ＝Ｂｂ_{ｓ＿ａｌｌ}の左位置Ｘ座標−Ｂｂ_{ｐ＿ａｌｌ}の右位置Ｘ座標（２）
また、オフストロークの交差長さ特徴ｆ_ｉは、以下のように求める。 _D _Bx = _Bb s_all the left position X coordinate _{-BB P_ALL} the right position X-coordinate (2)
The off-stroke crossing length feature f _i is obtained as follows.

すわなち、図３に示すように、注目するオフストロークを基準として、前の全てのストローク列をＳ_{ｐ＿ａｌｌ}とし、後ろの全てのストローク列をＳ_{ｓ＿ａｌｌ}とし、Ｓ_{ｐ＿ａｌｌ}に属する１つのストロークをＳ_ｐとし、Ｓ_{ｓ＿ａｌｌ}に属する１つのストロークをＳ_ｓとし、Ｓ_ｐとＳ_ｓとが交差する点をｐとし、ｐからＳ_ｐの右側の端点までのＳ_ｐ上の長さとｐからＳ_ｓの左側の端点までのＳ_ｓ上の長さの短い方をｌ_ｎとし、Ｓ_ｐの長さとＳ_ｓの長さの長い方をｌ_ｄとして、Ｓ_ｐとＳ_ｓの交差長さｌ_ｉを、次式により求める。 _That is, as shown in FIG. 3, with reference to the off-stroke of interest, all previous stroke sequences are set to _{Sp_all} , all subsequent stroke sequences are set to _{Ss_all,} and one stroke belonging to _{Sp_all} is set to _{S_all.} and _{_p,} one stroke belonging to _{S S_all} and _{S s,} the point where the _{S p} and _{S s} intersects the p, p from the length and p on _{S p} to the right end point of the _{S p} of _{S s} The shorter length on S _s to the left end point is _defined as _ln , the longer length of S _p and the longer length of S _s is _defined as l _d , and the intersection length l _i of S _p and S _s is _expressed as follows: Obtained by the following equation.

ｌ_ｉ＝−ｌ_ｎ／ｌ_ｄ（３）
なお、Ｓ_ｐとＳ_ｓが交差していない場合にはｌ_ｉ＝０とする。 l _i = −l _n / l _d (3)
Note that l _i = 0 when _Sp and S _s do not intersect.

次に、交差長さの和Ｌ_ｓｕｍを、Ｓ_ｐとＳ_ｓの組み合わせペアの数をｎとして、次式により求める。 Then, the sum _{L sum} of cross length, the number of combinations pairs _{S p} and _{S s} is n, calculated by the following equation.

そしてオフストロークの交差長さ特徴ｆ_ｉを、Ｌ_ｓｕｍ＜０の場合には、ｆ_ｉ＝Ｌ_ｓｕｍとし、ｆ_ｄ（水平方向距離特徴）＞０の場合には、ｆ_ｉ＝ｆ_ｄとし、それ以外の場合には、ｆ_ｉ＝０として求める。 The cross length feature _{f i} off _{stroke, L sum} _<0, the a _{_{f i = L sum, f d}} ( horizontal distance characteristic)> _0, the a f i = _{f d,} In other cases, it is determined as f _i = 0.

図４は、水平方向距離特徴と交差長さ特徴の学習パタンの分布を示す図である。図４において、横軸は水平方向距離特徴ｆ_ｄを示し、縦軸は交差長さ特徴ｆ_ｉを示す。また、実線と黒点は、学習パタンにおける切出しポイント（文字パタン間のオフストローク）の特徴値の分布を示し、グレーの領域は、学習パタンにおける非切出しポイント（文字パタン内部のオフストローク）の特徴値の分布を示す。 FIG. 4 is a diagram showing a distribution of learning patterns of horizontal distance features and intersection length features. In FIG. 4, the horizontal axis represents the horizontal distance feature f _d , and the vertical axis represents the intersection length feature f _i . The solid lines and black dots show the distribution of feature values of the cut points (off strokes between character patterns) in the learning pattern, and the gray area shows the feature values of the non-cut points (off strokes inside the character pattern) in the learning patterns. The distribution of.

本実施形態では、入力されたオフストロークから抽出した水平方向距離特徴と交差長さ特徴が０以上であり、或いはＯＡＢＣＤＥ（学習パタンにおいて切出しポイントが密集する２つの長方形を重ねた形）のエリアに位置する場合に、当該オフストロークを不確定ポイントに分類し、それ以外のオフストロークを非切出しポイントに分類する。 In the present embodiment, the horizontal distance feature and the intersection length feature extracted from the input off-stroke are 0 or more, or in the area of OABCDE (a shape in which two rectangles where cut points are densely gathered in the learning pattern are overlapped). If it is located, the off-stroke is classified as an indeterminate point, and the other off-stroke is classified as an uncut point.

その後、全ての隣接する２つの不確定ポイントについて、２つの不確定ポイント間のストローク列の外接矩形の幅を求め、当該外接矩形の幅／ａｃｓ（平均文字サイズ）が所定の閾値より大きい場合には、当該２つの不確定ポイント間に存在する非切出しポイントの特徴値がＯＦＧＨ（学習パタンにおいて切出しポイントが存在し得る長方形）のエリアに位置する場合に、当該非切出しポイントを不確定ポイントに修正する。 After that, for all two adjacent uncertain points, the width of the circumscribed rectangle of the stroke sequence between the two uncertain points is obtained, and the width / acs (average character size) of the circumscribed rectangle is larger than a predetermined threshold. If the feature value of a non-cut point that exists between the two uncertain points is located in the area of OFGH (a rectangle where a cut point can exist in the learning pattern), the non-cut point is corrected to an indeterminate point. To do.

次に、各不確定ポイント（仮切出しポイント）から、交差長さ特徴値ｆ_ｉに加えて、１８個の特徴値と、広さ特徴値を含む２０個の特徴値を抽出する。１８個の特徴値ｆ_１〜ｆ_１８は、それぞれ次式（５）〜（２２）により求めることができる。 Next, in addition to the intersection length feature value f _i , 18 feature values and 20 feature values including the width feature value are extracted from each indeterminate point (provisional cut point). The 18 feature values f _{1 to} f ₁₈ can be obtained by the following equations (5) to (22), respectively.

ｆ_１＝オフストロークの経過時間（５）
ｆ_２＝Ｄ_Ｂｘ／ａｃｓ（６）
ｆ_３＝Ｂ_{ｂｐ＿ａｌｌ}とＢ_{ｂｓ＿ａｌｌ}の重なり面積／(ａｃｓ)^２（７）
ｆ_４＝Ｄ_ｂｘ／Ｂｂ_ｐ１の幅（８）
ｆ_５＝Ｄ_ｂｘ／Ｂｂ_ｓ１の幅（９）
ｆ_６＝Ｄ_ｂｘ／ａｃｓ（１０）
ｆ_７＝Ｄ_ｂｙ／Ｂｂ_ｐ１の高さ（１１）
ｆ_８＝Ｄ_ｂｙ／Ｂｂ_ｓ１の高さ（１２）
ｆ_９＝Ｄ_ｂｙ／ａｃｓ（１３）
ｆ_１０＝Ｏ_ｂ／(Ｂｂ_ｓ１の幅×高さ) （１４）
ｆ_１１＝Ｏ_ｂ／(ａｃｓ)^２（１５）
ｆ_１２＝Ｄ_ｂｓｙ／ａｃｓ（１６）
ｆ_１３＝Ｄ_ｂｓ／ａｃｓ（１７）
ｆ_１４＝Ｄｆ_ｂ／ａｃｓ（１８）
ｆ_１５＝オフストロークの長さ／ａｃｓ（１９）
ｆ_１６＝オフストロークの正弦値（２０）
ｆ_１７＝オフストロークの余弦値（２１）
ｆ_１８＝ｆ_２／文字列中の最大ｆ_２（２２）
ここで、Ｂｂ_ｐ１は、１つ前のストロークの外接矩形を示し、Ｂｂ_ｓ１は、１つ後のストロークの外接矩形を示す。またＤ_Ｂｙは、Ｂｂ_{p＿ａｌｌ}とＢｂ_{ｓ＿ａｌｌ}のＹ方向距離（Ｄ_Ｂｙ＝Ｂｂ_{ｓ＿ａｌｌ}の上位置Ｙ座標−Ｂｂ_{ｐ＿ａｌｌ}の下位置Ｙ座標）を示す。またＤ_ｂｘは、Ｂｂ_ｐ１とＢｂ_ｓ１のＸ方向距離（Ｄ_ｂｘ＝Ｂｂ_ｓ１の左位置Ｘ座標−Ｂｂ_ｐ１の右位置Ｘ座標）を示す。またＤ_ｂｙは、Ｂｂ_ｐ１とＢｂ_ｓ１のＹ方向距離（Ｄ_ｂｙ＝Ｂｂ_ｓ１の上位置Ｙ座標−Ｂｂ_ｐ１の下位置Ｙ座標）を示す。またＯ_ｂは、Ｂｂ_ｐ１とＢｂ_ｓ１の重なり面積を示す。またＤ_ｂｓｙは、Ｂｂ_ｐ１とＢｂ_ｓ１の中心のＹ方向距離（Ｄ_ｂｓｙ＝Ｂｂ_ｓ１の中心のＹ座標−Ｂｂ_ｐ１の中心のＹ座標）を示す。またＤ_ｂｓは、Ｂｂ_ｐ１とＢｂ_ｓ１の中心の距離を示す。またＤｆ_ｂは、Ｂｂ_{ｐ＿ａｌｌ}とＢｂ_ｓ１の上位置Ｙ座標のずれ（Ｄｆ_ｂ＝ａｂｓ（Ｂｂ_{ｐ＿ａｌｌ}の上位置Ｙ座標−Ｂｂ_ｓ１の上位置Ｙ座標)）を示す。 f ₁ = elapsed time of off-stroke (5)
f ₂ = D _Bx / acs (6)
f ₃ = B _{bp_all} and B _{bs_all} overlap area / (acs) ² (7)
f ₄ = width of D _bx / Bb _p1 (8)
f ₅ = width of D _bx / Bb _s1 (9)
f ₆ = D _bx / acs (10)
f ₇ = D _by / Bb _p1 height (11)
The height of the _{_{_{f 8 = D by / Bb s1}}} (12)
f ₉ = D _by / acs (13)
f ₁₀ = O _b / (width of Bb _s1 × height) (14)
f ₁₁ = O _b / (acs) ² (15)
f ₁₂ = D _bsy / acs (16)
f ₁₃ = D _bs / acs (17)
f ₁₄ = Df _b / acs (18)
f ₁₅ = length of off-stroke / acs (19)
f ₁₆ = off stroke sine value (20)
f ₁₇ = cosine value of off-stroke (21)
f ₁₈ = f ₂ / maximum f ₂ in the character string (22)
Here, Bb _p1 represents the circumscribed rectangle of the previous stroke, and Bb _s1 represents the circumscribed rectangle of the next stroke. The _{D By A} shows the Y-direction distance of Bb _{P_ALL} and _{_Bb s_all} _(D _{By =} _Bb position Y coordinate _{-BB P_ALL} the lower position Y-coordinate on the _{s_all).} D _bx represents the distance in the X direction between Bb _p1 and Bb _s1 (D _bx = the left position X coordinate of Bb _s1− the right position X coordinate of Bb _p1 ). D _by indicates a distance in the Y direction between Bb _p1 and Bb _s1 (D _by = the upper position Y coordinate of Bb _s1− the lower position Y coordinate of Bb _p1 ). In addition, O _b indicates the overlapping area of Bb _p1 and Bb _s1 . D _bsy represents a Y-direction distance between the centers of Bb _p1 and Bb _s1 (D _bsy = the Y coordinate of the center of Bb _s1− the Y coordinate of the center of Bb _p1 ). D _bs indicates the distance between the centers of Bb _p1 and Bb _s1 . Further, Df _b indicates a _shift between the upper position Y coordinates of Bb _{p_all} and Bb _s1 (Df _b = abs (the upper position Y coordinate of Bb _{p_all−} the upper position Y coordinate of Bb _s1 )).

また、広さ特徴値ｆ_ｗは、次式により求めることができる。 Further, the breadth feature value f _w can be calculated by the following equation.

ｆ_ｗ＝ｗ／ａｃｓ（２３）
ここで、ｗは、１つ前の仮切出しポイントから１つ後の仮切出しポイントまでのストローク列の外接矩形の幅である。 f _w = w / acs (23)
Here, w is the width of the circumscribed rectangle of the stroke sequence from the previous temporary cut point to the next temporary cut point.

本実施形態では、学習パタンに対して、切出しポイントに教師信号１、非切出しポイントに教師信号−１を与えて、オフストロークから抽出した２０個の特徴値（１８個の特徴値ｆ_１〜ｆ_１８、交差長さ特徴値ｆ_ｉ、広さ特徴値ｆ_ｗ）をＳＶＭに適用し、ＳＶＭモデルを学習しておく。 In the present embodiment, 20 feature values (18 feature values f _{1 to} f) extracted from an off-stroke by applying a teacher signal 1 to a cut-out point and a teacher signal -1 to a non-cut-out point with respect to a learning pattern. ₁₈ , the intersection length feature value f _i and the area feature value f _w ) are applied to the SVM to learn the SVM model.

そして、不確定ポイント（仮切出しポイント）から抽出した上記２０個の特徴値を、学習したＳＶＭに適用することで、不確定ポイントのＳＶＭ出力値を求める。ＳＶＭ出力値が所定の閾値より大きく、且つ広さ特徴値ｆ_ｗが所定の閾値より大きい不確定ポイントを切出しポイントとして確定する。 Then, the SVM output value of the indeterminate point is obtained by applying the 20 feature values extracted from the indeterminate point (provisional cut point) to the learned SVM. SVM output value is greater than a predetermined threshold value, and wide feature value f _w is determined as a point cut indeterminate point greater than a predetermined threshold value.

２−２．候補ラティスの構築
図５に、本実施形態の候補ラティスの一例を示す。 2-2. Construction of Candidate Lattice FIG. 5 shows an example of a candidate lattice of the present embodiment.

本実施形態では、図５に示すように、仮切出しポイント（不確定ポイントＵＰ及び切出しポイントＤＰ）によりストローク列を切り出し、切り出されたストローク列（仮切出しユニット）毎に文字認識を行って候補文字（認識候補）を並べ、それらをリンクして複数の候補文字列からなる候補ラティスを構築する。 In the present embodiment, as shown in FIG. 5, a stroke sequence is cut out by temporary cut points (indeterminate point UP and cut point DP), and character recognition is performed for each cut stroke sequence (temporary cut unit) to obtain candidate characters. (Recognition candidates) are arranged and linked to construct a candidate lattice composed of a plurality of candidate character strings.

ここで不確定ポイントＵＰは、切出しポイントと非切出しポイントの２通りに仮定され、切出しポイントとして仮定される場合には、切出しユニットの抽出に利用される。例えば図５において、不確定ポイントＵＰを切出しユニットとして仮定した場合に、「明」という手書き文字パタンから２つの仮切出しユニットを抽出し、それぞれ「日、目、月」、「月、口、目、旦」という候補文字を生成する。また不確定ポイントＵＰを非切出しユニットとして仮定した場合には、「明」という手書き文字パタンから１つの仮切出しユニットを抽出して「朋、胴、明」という候補文字を生成する。なお、１つの仮切出しユニットの内部に仮切出しポイントを含まないもの、すなわち隣接する仮切出しポイントによって切り出されたユニットを最小仮切出しユニットとする。 Here, the indeterminate point UP is assumed to be a cut-out point and a non-cut-out point, and when it is assumed as a cut-out point, it is used for extraction of a cut-out unit. For example, in FIG. 5, assuming that the indeterminate point UP is a cut-out unit, two temporary cut-out units are extracted from the handwritten character pattern “Akira”, and “day, eye, month”, “month, mouth, eye” are extracted. , "Canon" is generated. If the indeterminate point UP is assumed to be a non-cutout unit, one temporary cutout unit is extracted from the handwritten character pattern “Ming”, and a candidate character “という, torso, light” is generated. Note that a unit that does not include a temporary cutting point in one temporary cutting unit, that is, a unit cut out by an adjacent temporary cutting point is defined as a minimum temporary cutting unit.

候補ラティスにおいて、各ノードＮＤは文字認識によって1つの仮切出しユニットに関して生成される1つの候補文字を示し、各連結線ＡＬは1つの仮切出しポイントを文字の境として文字列の連接を示す。後述する複数項目の確からしさからなる尤度を評価するスコアは、各連結線ＡＬとノードＮＤに与えられる。本実施形態では、閾値より低い確からしさを持つノードＮＤを削除する。このようにすると、文字列候補パスの数を削減して認識処理を高速化することができる。 In the candidate lattice, each node ND indicates one candidate character generated for one provisional cutout unit by character recognition, and each connection line AL indicates connection of character strings with one provisional cutout point as a character boundary. A score for evaluating the likelihood composed of the probability of a plurality of items to be described later is given to each connection line AL and node ND. In the present embodiment, the node ND having a probability lower than the threshold is deleted. In this way, the number of character string candidate paths can be reduced to speed up the recognition process.

２−３．文字列認識
本実施形態では、仮切出しポイントから構築した候補ラティスの各文字列候補パス（候補文字列）について複数項目の確からしさを評価する。 2-3. Character String Recognition In this embodiment, the probability of a plurality of items is evaluated for each character string candidate path (candidate character string) of a candidate lattice constructed from temporary cut points.

図６（Ａ）に示すように、１つの文字列パタンＸに対して、ある切出しＳ＝Ｓ_１Ｓ_２．．Ｓ_ｉ．．Ｓ_ｎ（ｎは仮切出しユニットの数）を選択して、ある候補文字列Ｃ＝Ｃ_１Ｃ_２．．Ｃ_ｉ．．Ｃ_ｎに認識するとする。ここで仮切出しユニットＳ_ｉはＣ_ｉに対応している。図６（Ｂ）に示すように、仮切出しユニットＳ_ｉは、最小仮切出しユニットの列Ｓ_i１Ｓ_i２．．Ｓ_ｉj．．Ｓ_ikiからなる。Ｓ_ｉjは、１つの最小仮切出しユニットである。 As shown in FIG. 6A, with respect to one character string pattern X, a certain cutout S = S ₁ S ₂ . . S _i . . S _n (where n is the number of provisional cut-out units) is selected, and a candidate character string C = C ₁ C ₂ . . C _i. . _Let C _n be recognized. Here, the provisional cutting unit S _i corresponds to C _i . As shown in FIG. 6B, the temporary cut-out unit S _i includes a row of minimum temporary cut-out units S _i1 S _i2 . . S _ij . . It _{consists of Siki} . S _ij is one minimum provisional cutout unit.

文字列パタンＸが候補文字列Ｃに認識される条件付き確立Ｐ（Ｃ｜Ｘ）は、次式により求めることができる。 The conditional establishment P (C | X) in which the character string pattern X is recognized by the candidate character string C can be obtained by the following equation.

ここで、最小仮切出しユニットＳ_ｉjは、外接矩形ｂ_ｉｊ、内分割ｑ_ｉｊと位置情報ｐ^ｕ _ｉｊを持ち、隣接する最小仮切出しユニットにおいて、位置関係情報p^ｂ _ｉｊと仮切出しポイントｇ_ｉｊをもつ。Ｃ_ｉｊは、Ｓ_ｉｊの認識候補の１つであり、文字Ｃ_ｉの全部か一部である。ｂ，ｑ，Ｓ，ｐ^ｕ，ｐ^ｂ，ｇは仮切出しの特徴値の列である。 Here, the minimum provisional cut unit _{S ij} is circumscribed rectangle _{b ij,} having position information ^p _{u ij} and the inner dividing _{q ij,} at the minimum provisional cut units adjacent, the positional relationship information ^p _{b ij} and a temporary cut point _{g ij} Have. C _ij is one of the recognition candidates of S _ij and is all or part of the character C _i . b, q, S, p ^u , p ^b , and g are sequences of feature values of provisional cutout.

式（２４）において、分母Ｐ（ｂ，ｑ，Ｓ，ｐ^ｕ，ｐ^ｂ，ｇ）は、文字列候補パスによらない。そこで特徴値間の独立を仮定して、文字列候補パスの尤度ｆ（Ｘ，Ｃ）を、次式のように近似する。 In the equation (24), the denominator P (b, q, S, p ^u , p ^b , g) does not depend on the character string candidate path. Therefore, assuming the independence between feature values, the likelihood f (X, C) of the character string candidate path is approximated by the following equation.

ここで、Ｃ^−１ _ｉｊとＣ^−２ _ｉｊは、直前の２つの認識候補を示し、ｋ_ｉは仮切出しユニットＳ_ｉに含まれる最小仮切出しユニットの数を示し、Ｌ_ｉｊは、切出しポイントか非切出しポイントであるかを示すラベルである。 Here, C ⁻¹ _ij and C ⁻² _ij indicate the previous two recognition candidates, k _i indicates the number of minimum provisional cutout units included in the provisional cutout unit S _i , and L _ij is a cutout point. It is a label which shows whether it is an uncut point.

Ｃ_ｉｊが文字の一部である場合、式（２５）の第７項目（仮切出しポイントに関わる項目）以外については、求めるのが困難である。これを解決するために、Ｃ_ｉｊをそれを含む文字Ｃ_ｉに置き換えて、式（２５）の第１項目について、次式のように近似する。 When C _ij is a part of a character, it is difficult to obtain other than the seventh item (item related to the provisional cut-out point) in Expression (25). In order to solve this, C _ij is replaced with a character C _i including it, and the first item of Expression (25) is approximated as the following expression.

ここで、λ_１１とλ_１２は重みパラメータを示し、λ_１はバイアスを示す。 Here, λ ₁₁ and λ ₁₂ indicate weight parameters, and λ ₁ indicates a bias.

式（２５）の第７項目については、ラベルＬ_ｉｊを、切出しｓｐと非切出しｎ−ｓｐの２種類に分類してその推定誤差を調節するための重みを付け、それ以外の第２〜第６項目については、式（２６）と同じ方法で近似すると、文字列候補パスの尤度ｆ（Ｘ，Ｃ）は、次式により求めることができる。 As for the seventh item of the expression (25), the label L _ij is classified into two types of cut-out sp and non-cut-out n-sp, and weights for adjusting the estimation error are given, and the other second through second labels When the six items are approximated by the same method as in equation (26), the likelihood f (X, C) of the character string candidate path can be obtained by the following equation.

ここで、ｂ_ｉ、ｑ_ｉ、ｐ^ｕ _ｉは、それぞれ仮切出しユニットＳ_ｉの外接矩形、内分割、位置情報に関する特徴ベクトルであり、ｐ^ｂ _ｉは、隣接する仮切出しユニット間の位置関係に関する特徴ベクトルを示し、ｇ_ｉ１、ｇ_ｉｊ（ｊ＝２〜ｋ_ｉ）は、仮切出しポイントの特徴ベクトルを示す。これらの詳細については後述する。またλ_１１〜λ_７１、λ_１２〜λ_７２とλは、重みパラメータを示し、後述する遺伝的アルゴリズムを用いて学習されたものである。 Here, b _i , q _i , and p ^u _i are feature vectors related to the circumscribed rectangle, inner division, and position information of the temporary cutting unit S _i , respectively, and p ^b _i is related to the positional relationship between adjacent temporary cutting units. A feature vector is indicated, and g _i1 and g _ij (j = 2 to k _i ) indicate feature vectors of provisional cut points. Details of these will be described later. Further, λ _{11 to} λ ₇₁ , λ ₁₂ to λ _72, and λ indicate weight parameters, which are learned using a genetic algorithm described later.

式（２７）において、第１項目は言語文脈の確からしさ、第２項目は文字パタンサイズの確かしさ、第３項目は文字パタン内分割の確からしさ、第４項目は、文字認識の確からしさ、第５項目は単一文字パタン位置の確からしさ、第６項目はペア文字パタン位置の確からしさ、第７、第８項目（以下、第７項目という）は仮切出しポイントの確からしさを評価するための項目である。 In Equation (27), the first item is the probability of language context, the second item is the accuracy of character pattern size, the third item is the probability of character pattern division, and the fourth item is the probability of character recognition. The fifth item is used to evaluate the accuracy of the single character pattern position, the sixth item is used to evaluate the probability of the paired character pattern position, and the seventh and eighth items (hereinafter referred to as the seventh item) are used to evaluate the accuracy of the provisional cutout point. It is an item.

本実施形態では、式（２７）に示すように、文字列候補パスの尤度は最小切出しユニットに基づいて尤度を積み上げたものであり、文字列候補の文字数に影響されない。従って、文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向を弱めて認識率を向上させることができる。 In this embodiment, as shown in Expression (27), the likelihood of the character string candidate path is obtained by accumulating the likelihood based on the minimum cutout unit, and is not affected by the number of characters of the character string candidate. Accordingly, the recognition rate can be improved by weakening the tendency of interpreting a character string into a small number of characters and the tendency of interpreting a character string into a large number of characters.

本実施形態では、式（２７）によって求めた文字列候補パスの尤度を候補ラティスの各ノードと各連結線に与え、当該候補ラティスに対してビタビ（Ｖｉｔｅｒｂｉ）探索を行うことによって、最適な候補文字列（最適な切出しと文字認識結果）を決定する。 In the present embodiment, the likelihood of the character string candidate path obtained by the equation (27) is given to each node and each connection line of the candidate lattice, and a Viterbi search is performed on the candidate lattice, so that the optimum Candidate character strings (optimum clipping and character recognition results) are determined.

以下、式（２７）の各項目の詳細について説明する。 Hereinafter, details of each item of Expression (27) will be described.

第１項目のＰ（Ｃ_ｉ｜Ｃ_ｉ−１，Ｃ_ｉ−２）は、Ｃ_ｉ−１，Ｃ_ｉ−２からＣ_ｉへの移転的確率（ｔｒｉ−ｇｒａｍ確率）である。なお、１番目の文字Ｃ_１については、ｕｎｉ−ｇｒａｍ確率Ｐ（Ｃ_１）、２番目の文字Ｃ_２については、ｂｉ−ｇｒａｍ確率Ｐ（Ｃ_２｜Ｃ_１）により求める。ｔｒｉ−ｇｒａｍ確率を学習するには膨大なテキストが必要なため、Ｐ（Ｃ_ｉ｜Ｃ_ｉ−１，Ｃ_ｉ−２）の代わりに、次式に示すＰ’（Ｃ_ｉ｜Ｃ_ｉ−１，Ｃ_ｉ−２）を用いる。 The first item P (C _i | C _i-1 , C _i-2 ) is a transfer probability (tri-gram probability) from C _i _-1 , C _i-2 to C _i . The first character C ₁ is determined by the uni-gram probability P (C ₁ ), and the second character C ₂ is determined by the bi-gram probability P (C ₂ | C ₁ ). For tri-gram probabilities required enormous text to learning _{_{a, P (C i | C i}} -1, C i-2) instead of, in the following equation _{_{P '(C i | C i}} -1 , C _i-2 ).

ここで、β_１、β_２、β_３、β_４は、ｔｒｉ−ｇｒａｍ確率の学習用テキストと異なるテキストによって認識率が最大になるように求める。 Here, β ₁ , β ₂ , β ₃ , and β ₄ are determined so that the recognition rate is maximized by a text different from the text for learning with the tri-gram probability.

第２項目のＰ（ｂ_ｉ｜Ｃ_ｉ）は、文字パタンサイズの条件付き確率である。ｂ_ｉは、平均文字サイズａｃｓで正規化した文字パタン（仮切出しユニット）の外接矩形の幅と高さからなる幾何学的な特徴ベクトルである。 The second item P (b _i | C _i ) is a conditional probability of the character pattern size. b _i is a geometric feature vector composed of the width and height of the circumscribed rectangle of the character pattern (temporary cutout unit) normalized with the average character size acs.

第３項目のＰ（ｑ_ｉ｜Ｃ_ｉ）は、文字パタン内分割の条件付き確率である。ｑ_ｉは、図７に示すｑ_ｉｋ（ｋ＝１〜６）からなる幾何的特徴ベクトルである。ｑ_ｉｋ（ｋ＝１〜３）は各垂直の分割での文字パタンの垂直方向の射影におけるギャップｄ_ｖの長さを平均文字サイズａｃｓで正規化した値であり、ｑ_ｉｋ（ｋ＝４〜６）は各水平の分割での文字パタンの水平方向の射影におけるギャップｄ_ｖの長さを平均文字サイズａｃｓで正規化した値である。各分割において２つ以上のギャップがある場合、これらのギャップの和が使用される。 The third item P (q _i | C _i ) is a conditional probability of character pattern division. q _i is a geometric feature vector composed of q _ik (k = 1 to 6) shown in FIG. q _ik (k = 1~3) are normalized values of the length in average character size acs gaps _{d v} in the vertical projection of the character patterns in the division of each _vertical, q ik (k = _4~ 6) is the normalized value of the length in average character size acs gaps d _v in the horizontal direction of the projection of the character patterns in the division of each horizontal. If there are more than one gap in each partition, the sum of these gaps is used.

第４項目のＰ（Ｓ_ｉ｜Ｃ_ｉ）は、仮切出しユニットの条件付き確率である。Ｓ_ｉは、文字パタンとしての仮切出しユニットの特徴ベクトルを示す。Ｐ（Ｓ_ｉ｜Ｃ_ｉ）は、文字認識のスコアから近似することができる。 The fourth item P (S _i | C _i ) is the conditional probability of the provisional cut-out unit. S _i indicates a feature vector of the temporary cut-out unit as a character pattern. P (S _i | C _i ) can be approximated from the score of character recognition.

第５項目のＰ（ｐ^ｕ _ｉ｜Ｃ_ｉ）は、単一文字パタン位置の条件付き確率である。ｐ^ｕ _ｉは、図８（Ａ）に示すように、単一文字パタン位置（ｕｎａｒｙ位置）の２つの要素からなる幾何学的な特徴ベクトルである。２つの要素は、それぞれ文字パタンの上部から文字列の中心線ＣＬまでの距離を平均文字サイズａｃｓで正規化した値と、文字パタンの底部から文字列の中心線ＣＬまでの距離を平均文字サイズａｃｓで正規化した値である。 The fifth item P (p ^u _i | C _i ) is the conditional probability of the single character pattern position. As shown in FIG. 8A, p ^u _i is a geometric feature vector composed of two elements of a single character pattern position (unary position). The two elements are the value obtained by normalizing the distance from the top of the character pattern to the center line CL of the character string by the average character size acs, and the distance from the bottom of the character pattern to the center line CL of the character string as the average character size. It is a value normalized by acs.

第６項目のＰ（ｐ^ｂ _ｉ｜Ｃ_ｉ，Ｃ_ｉ−１）は、ペア文字パタン位置の条件付き確率である。ｐ^ｂ _ｉは、図８（Ａ）に示すように、２つの隣接する文字パタン（仮切出しユニット）間におけるペア文字パタン位置（ｂｉｎａｒｙ位置）の２つの要素からなる幾何学的な特徴ベクトルである。２つの要素は、それぞれ隣接する文字パタンの上部同士の距離を平均文字サイズａｃｓで正規化した値と、隣接する文字パタンの下部同士の距離を平均文字サイズａｃｓで正規化した値である。 The sixth item, P (p ^b _i | C _i , C _i−1 ), is a conditional probability of the pair character pattern position. As shown in FIG. 8A, p ^b _i is a geometric feature vector composed of two elements of a pair character pattern position (binary position) between two adjacent character patterns (temporary cutout unit). . The two elements are a value obtained by normalizing the distance between the upper parts of the adjacent character patterns with the average character size acs and a value obtained by normalizing the distance between the lower parts of the adjacent character patterns with the average character size acs.

第７項目のＰ（ｇ_ｉ１｜ｓｐ）は、切出しポイントがｇ_ｉ１を生じる条件付き確率であり、Ｐ（ｇ_ｉｊ｜ｎ−ｓｐ）（ｊ＝２〜ｋ_ｉ）は、非切出しポイントがｇ_ｉｊを生じる条件付き確率である。図８（Ｂ）に示すように、ｇ_ｉ１は、仮切出しユニットＳ_ｉの先頭に位置し、切出しポイントとして識別される仮切出しポイント（隣接する文字パタン間の仮切出しポイント）から抽出した２０次元の幾何学的な特徴ベクトル（２０個の特徴値）であり、ｇ_ｉｊ（ｊ＝２〜ｋ_ｉ）は、仮切出しユニットＳ_ｉの内部に位置し、非切出しポイントして識別される仮切出しポイント（文字パタン内部の仮切出しポイント）から抽出した２０次元の幾何学的な特徴ベクトルである。 The seventh item P (g _i1 | sp) is a conditional probability that the cut-out point gives g _i1 , and P (g _ij | n-sp) (j = 2 to k _i ) is the conditional probability that yields _ij . As shown in FIG. 8B, g _i1 is a 20-dimensional pattern extracted from a temporary cutting point (temporary cutting point between adjacent character patterns) that is located at the beginning of the temporary cutting unit S _i and is identified as a cutting point. , And g _ij (j = 2 to k _i ) is located inside the temporary cutting unit S _i and is identified as a non-cutting point. This is a 20-dimensional geometric feature vector extracted from a point (a temporary cutout point inside a character pattern).

このように本実施形態では、切出しポイントして識別される仮切出しポイントのみならず、非切出しポイントして識別される仮切出しポイントについても、非切出しポイントとして識別される確からしさを評価することで、文字列を少ない文字数に解釈する傾向や、文字列を多い文字数に解釈する傾向を弱めて認識率を向上させることができる。 Thus, in the present embodiment, not only the provisional cut point identified as the cut-out point but also the provisional cut-out point identified as the non-cut-out point is evaluated by evaluating the probability of being identified as the non-cut-out point. The recognition rate can be improved by weakening the tendency of interpreting a character string into a small number of characters and the tendency of interpreting a character string into a large number of characters.

なお本実施形態では、Ｐ（ｇ_ｉ１｜ｓｐ）を、Ｐ（ｏ_ｉ１＝ＳＶＭ（ｇ_ｉ１）｜ｓｐ）により近似し、Ｐ（ｇ_ｉｊ｜ｎ−ｓｐ）（ｊ＝２〜ｋ_ｉ）を、Ｐ（ｏ_ｉｊ＝ＳＶＭ（ｇ_ｉｊ）｜ｎ−ｓｐ）により近似する。ここでは、ｏ_ｉｊはｇ_ｉｊに関するＳＶＭ出力値である。Ｐ（ｏ_ｉ１｜ｓｐ）は切出しポイントに対して、学習パタンで学習したＳＶＭが出力値ｏ_ｉ１を生じる条件付き確率である。Ｐ（ｏ_ｉｊ｜ｎ−ｓｐ）（ｊ＝２〜ｋ_ｉ）は非切出しポイントに対して、学習パタンで学習したＳＶＭが出力値ｏ_ｉｊを生じる条件付き確率である。なお、Ｐ（ｏ_１１｜ｓｐ）＝１とする。 In this embodiment, P (g _i1 | sp) is approximated by P (o _i1 = SVM (g _i1 ) | sp), and P (g _ij | n-sp) (j = 2 to k _i ) is , P (o _ij = SVM (g _ij ) | n-sp). Here, o _ij is the SVM output value for g _ij . P (o _i1 | sp) is a conditional probability that the SVM learned by the learning pattern generates the output value o _i1 with respect to the extraction point. P (o _ij | n-sp) (j = 2 to k _i ) is a conditional probability that the SVM learned by the learning pattern generates the output value o _ij with respect to the uncut point. Note that P (o ₁₁ | sp) = 1.

以下、説明を簡略化するためにｏ_ｉｊの代わりに符号ｏ_ｉを使用する。図９（Ａ）に示すようにＳＶＭの出力値ｏ_ｉについて、学習パタンからＰ（ｏ_ｉ｜ｓｐ）とＰ（ｏ_ｉ｜ｎ−ｓｐ）を学習することができる。ここで、ＳＶＭの出力値が１か−１に近づくと確率値が激しく降下する。そこで、Ｐ（ｏ_ｉ｜ｓｐ）とP（ｏ_ｉ｜ｎ−ｓｐ）の代わりに、次式に示す累積の関数Ｐ’（ｏ_ｉ｜ｓｐ）とＰ’（ｏ_ｉ｜ｎ−ｓｐ）を使用する。 Hereinafter, in order to simplify the description, the symbol o _i is used instead of o _ij . As shown in FIG. 9A, P (o _i | sp) and P (o _i | n-sp) can be learned from the learning pattern for the output value o _i of the SVM. Here, when the output value of the SVM approaches 1 or -1, the probability value drops drastically. Therefore, instead of P (o _i | sp) and P (o _i | n-sp), cumulative functions P ′ (o _i | sp) and P ′ (o _i | n-sp) shown in the following equations are used. use.

図９（Ｂ）に、Ｐ’（ｏ_ｉ｜ｓｐ）とＰ’（ｏ_ｉ｜ｎ−ｓｐ）の確率値を示す。さらに、次式に示すシグモンド関数によってこれらを近似することができる。 FIG. 9B shows the probability values of P ′ (o _i | sp) and P ′ (o _i | n-sp). Furthermore, these can be approximated by a sigmond function expressed by the following equation.

図９（ｂ）に示すＳＶＭの出力値に対して、−１０から１０まで０．０２の間隔でＳＶＭの出力値ｏ_ｋ（ｋ＝１〜Ｎｓ）を取り、確率Ｐ’（ｏ_ｉ｜ｓｐ）とＰ’（ｏ_ｉ｜ｎ−ｓｐ）を得る。ここで、ＮｓはＳＶＭ出力値を取る個数である。そして、最急降下法により次式に示すエラー関数の最小化の尺度に基づいてパラメータα_１０，α_１１，α_２０，α_２１を求める。 With respect to the SVM output value shown in FIG. 9B, SVM output values o _k (k = 1 to Ns) are taken at intervals of 0.02 from −10 to 10, and the probability P ′ (o _i | sp ) And P ′ (o _i | n-sp). Here, Ns is the number of SVM output values. Then, parameters α ₁₀ , α ₁₁ , α ₂₀ , and α ₂₁ are obtained by the steepest descent method based on the error function minimization scale shown in the following equation.

２−４．重みパラメータの学習
本実施形態では、学習パタンを利用して遺伝的アルゴリズムにより、式（２７）における複数の重みパラメータ（λ_１１〜λ_７１、λ_１２〜λ_７２、λ）を学習する。ここでは、複数の重みパラメータからなる重み系列を１つの染色体とみなす。１つの重みパラメータは染色体の１つの遺伝子である。以下、学習の処理手順について説明する。 2-4. Weight Parameter Learning In this embodiment, a plurality of weight parameters (λ _{11 to} λ ₇₁ , λ _{12 to} λ ₇₂ , λ) in Equation (27) are learned by a genetic algorithm using a learning pattern. Here, a weight series composed of a plurality of weight parameters is regarded as one chromosome. One weight parameter is one gene of the chromosome. Hereinafter, the learning process procedure will be described.

まず、Ｎ個の染色体をランダムで０〜１の値に初期化する。Ｎ個の染色体の古い平均評価値ｆ_ｏｌｄを０、繰返し数ｔを１に初期化する。 First, N chromosomes are initialized to a value of 0 to 1 at random. The old average evaluation value f _old of N chromosomes is initialized to 0, and the repetition number t is initialized to 1.

次に、Ｎ個の染色体の中からランダムで選択した２つの染色体について、ランダムで決定した２位置の中の遺伝子を交叉させることで新しい２つの染色体を生成する。これをＭ個の新染色体が生成されるまで繰り返す。 Next, two new chromosomes are generated by crossing genes in two randomly determined positions for two chromosomes randomly selected from N chromosomes. This is repeated until M new chromosomes are generated.

次に、Ｎ＋Ｍ個の染色体の各遺伝子に確率Ｐ_ｍｕｔで突然変異を起こさせる。具体的には−１〜１のランダム値を加算し、［０，１］に丸める。 Next, each gene of N + M chromosomes is mutated with probability _Pmut . Specifically, random values of −1 to 1 are added and rounded to [0, 1].

次に、各染色体を学習パタンに関する認識率により評価する。ここでは、各染色体の遺伝子は式（２７）における重みに設定される。 Next, each chromosome is evaluated by the recognition rate regarding the learning pattern. Here, the gene of each chromosome is set to the weight in equation (27).

次に、各染色体の評価値によりそのルーレット確率を求める。ここでは、評価値が高い染色体ほどルーレット確率が大きくなるようにする。そして、評価値が最高の２つの染色体を選択し、Ｎ個の新染色体を得るまでルーレット確率により染色体を選択する。Ｎ個の新染色体でＮ個の古い染色体を置換する。 Next, the roulette probability is obtained from the evaluation value of each chromosome. Here, the higher the evaluation value, the higher the roulette probability. Then, the two chromosomes having the highest evaluation value are selected, and the chromosome is selected by the roulette probability until N new chromosomes are obtained. Replace N old chromosomes with N new chromosomes.

次に、Ｎ個の新染色体の平均評価値ｆ_ｎｅｗを求める。もし(ｆ_ｎｅｗ−ｆ_ｏｌｄ＜閾値)がｎ_ｓｔｏｐ回以上発生するかそれともｔ＞Ｔであれば、評価値が最高の染色体を返す。それ以外の場合には、ｆ_ｎｅｗをｆ_ｏｌｄに、ｔをｔ＋１に置換し、前記交叉の処理に戻る。ここでは、Ｎを５０に、Ｍを１００に、Ｐ_ｍｕｔを０．０３に、ｎ_ｓｔｏｐを２５に、Ｔを１００００にセットする。 Next, determine the average evaluation value _{f new} of the N new chromosomes. If (f _new _−fold <threshold) occurs n _stop times or more or t> T, the chromosome with the highest evaluation value is returned. Otherwise, f _new is replaced with f _old and t is replaced with t + 1, and the process returns to the crossover process. Here, N is set to 50, M is set to 100, P _{mut is set} to 0.03, n _stop is set to 25, and T is set to 10,000.

染色体を評価するために学習パタンに対して手書き文字認識（単文字認識）を適用し、候補ラティスを生成し、そこでＶｉｔｅｒｂｉ探索を行う必要がある。仮切出しと単文字認識は1回だけで済むが、Ｖｉｔｅｒｂｉ探索は重み系列（染色体）ごとに行う必要があるが、多大な処理時間を要することになる。そこで、まず各重みを１にセットし、学習パタンの各文字列からＶｉｔｅｒｂｉ探索を用いて上位１００個の文字列候補パスを選択する。 In order to evaluate the chromosome, it is necessary to apply handwritten character recognition (single character recognition) to the learning pattern to generate a candidate lattice and perform a Viterbi search there. Temporary extraction and single character recognition need only be performed once, but Viterbi search needs to be performed for each weight sequence (chromosome), but it takes a lot of processing time. Therefore, first, each weight is set to 1, and the top 100 character string candidate paths are selected from each character string of the learning pattern using Viterbi search.

そして遺伝的アルゴリズムにより重み系列を発生し、これらの選択した文字列候補パスに発生した重み系列を適用して評価を行うことで、可能性の高い重みを選択する。収束したら、学習した新しい重み系列を適用して、学習パタンの各文字列からＶｉｔｅｒｂｉ探索を用いて上位１００個の文字列候補パスを選択する。遺伝的アルゴリズムにより重み系列を学習する。このプロセスをもう1度繰り返す。このようにすることで、重み系列の組み合わせを１つずつ試すことなく、効率よく最適な重みの組み合わせを求めることができる。 A weight sequence is generated by a genetic algorithm, and a weight sequence having a high probability is selected by applying the generated weight sequence to the selected character string candidate path. After convergence, the learned new weight sequence is applied, and the top 100 character string candidate paths are selected from each character string of the learning pattern using Viterbi search. A weight sequence is learned by a genetic algorithm. Repeat this process one more time. By doing in this way, it is possible to obtain an optimum weight combination efficiently without trying one combination of weight series.

３．性能評価結果
性能評価に先立って、データベースを利用してＰ（Ｓ_ｉ｜Ｃ_ｉ）について文字認識のモデルを学習した。またデータベースを利用して文字パタンサイズ、文字パタン内分割、単一文字パタン位置（ｕｎａｒｙ位置特徴）とペア文字パタン位置（ｂｉｎａｒｙ位置特徴）における４つの二次識別関数（ＱＤＦ）を学習した。また新聞データにより言語文脈の確からしさのためのｔｒｉ−ｇｒａｍ表を作った。そして、データベースによりβ_１、β_２、β_３、β_４を学習した。 3. Performance Evaluation Results Prior to performance evaluation, a character recognition model was learned for P (S _i | C _i ) using a database. Also, four secondary discriminant functions (QDF) at the character pattern size, character pattern division, single character pattern position (unary position feature) and pair character pattern position (binary position feature) were learned using a database. In addition, a tri-gram table for the accuracy of language context was created from newspaper data. Then, β ₁ , β ₂ , β ₃ , and β ₄ were learned from the database.

さらに、１００人分の文字列方向と文字方向自由のオンライン日本語手書き文字列データベースから１行ずつの左から右への文字列を抽出した。このうち７５人分のデータから２０次元の仮切出し特徴値を抽出し、これらのデータを利用することで、仮切出しのための閾値、仮切出しポイント確かさしさのためのＳＶＭモデルと式（２７）における重みパラメータを学習した。そして、残った２５人分の文字列データをテストパタンとして本実施形態の方法（式（２７）に示す文字列候補パス評価方法）の性能を評価した。 Furthermore, a character string from left to right was extracted line by line from an online Japanese handwritten character string database of 100 character strings and free character direction. Of these, 20-dimensional temporary cutout feature values are extracted from data for 75 persons, and by using these data, the threshold value for temporary cutout and the SVM model and formula (27 ) Weight parameters were learned. Then, the performance of the method of the present embodiment (character string candidate path evaluation method shown in Expression (27)) was evaluated using the remaining 25 character string data as test patterns.

なお比較対象として、式（３２）、式（３３）に示す文字列候補パス評価方法（方法１、方法２）についても評価した。 For comparison, the character string candidate path evaluation methods (Method 1 and Method 2) shown in Expression (32) and Expression (33) were also evaluated.

ここで、式（３２）、式（３３）におけるλ_ｉ（ｉ＝１〜７）とλは、本実施形態と同様の重みパラメータであり、遺伝的アルゴリズムにより学習されたものである。式（３２）に示すように、方法１では、最小仮切出しユニットではなく仮切出しユニットに基づいて尤度を積み上げたもので、文字列候補の文字数に影響され、文字列を少ない文字数に解釈する傾向がある。つまり、そこでは、学習した重みパラメータをそのまま各項目に与えており、本実施形態のように、最小切出しユニットに基づいて尤度を積み上げていない。また式（３３）に示すように、方法２では、文字列候補の文字数によって文字列候補パスの尤度を正規化するため、文字列を多い文字数に解釈する傾向がある。また方法１、方法２では、非切出しポイントして識別される確からしさを評価していない。 Here, λ _i (i = 1 to 7) and λ in the equations (32) and (33) are the same weighting parameters as in this embodiment, and are learned by a genetic algorithm. As shown in Expression (32), in Method 1, the likelihood is accumulated based on the temporary cutout unit, not the minimum temporary cutout unit, and is influenced by the number of characters of the character string candidate, and the character string is interpreted as a small number of characters. Tend. That is, the learned weight parameter is given to each item as it is, and the likelihood is not accumulated based on the minimum cutout unit as in this embodiment. Further, as shown in Expression (33), in Method 2, since the likelihood of the character string candidate path is normalized by the number of characters of the character string candidate, the character string tends to be interpreted as a large number of characters. In the method 1 and the method 2, the probability of being identified as an uncut point is not evaluated.

図１０に、本実施形態の方法、方法１、方法２それぞれの性能評価結果を示す。 FIG. 10 shows the performance evaluation results of the method, method 1, and method 2 of this embodiment.

なお、図１０に示すｆは、次式に示す文字切出し性能のｆ尺度である。 In addition, f shown in FIG. 10 is an f scale of the character extraction performance shown in the following equation.

図１０に示すように、本実施形態の方法では、方法１、方法２に比べて、文字切出し性能と文字認識率が向上していることが確認できる。文字認識率の差は１％程度であるが、９０％を越えた文字認識率をさらに１％向上させることは容易なことではない。 As shown in FIG. 10, in the method of this embodiment, it can be confirmed that the character extraction performance and the character recognition rate are improved as compared with the method 1 and the method 2. The difference in the character recognition rate is about 1%, but it is not easy to further improve the character recognition rate exceeding 90% by 1%.

４．本実施形態の処理
図１１は、本実施形態の文字列認識装置の処理の一例を示すフローチャートである。 4). Processing of this Embodiment FIG. 11 is a flowchart showing an example of processing of the character string recognition device of this embodiment.

まず、筆点情報取得部１１０は、文字入力部１６０からストロークのデータ（筆点座標系列）を取得する（ステップＳ１０）。 First, the writing point information acquisition unit 110 acquires stroke data (writing point coordinate series) from the character input unit 160 (step S10).

次に、仮切出しポイント決定部１１２は、各オフストロークから抽出した水平方向距離と交差長さの特徴値に基づいて、不確定ポイント（仮切出しポイント）を決定する（ステップＳ１２）。次に、各オフストロークから抽出した２０個の特徴値を学習したＳＶＭに適用して仮切出しポイントのＳＶＭ出力値を求める（ステップＳ１４）。次に、仮切出しポイントのＳＶＭ出力値に基づき切出しポイントを確定する（ステップＳ１６）。 Next, the temporary cut point determination unit 112 determines an indeterminate point (temporary cut point) based on the horizontal distance extracted from each off-stroke and the feature value of the intersection length (step S12). Next, the 20 feature values extracted from each off-stroke are applied to the learned SVM to obtain the SVM output value of the temporary cut point (step S14). Next, the cutting point is determined based on the SVM output value of the temporary cutting point (step S16).

次に、候補文字列生成部１１４は、仮切出しポイント（不確定ポイント、切出しポイント）によって切出された仮切出しユニット毎に文字認識を行い、候補ラティスを生成する（ステップＳ１８）。 Next, the candidate character string generation unit 114 performs character recognition for each temporary cutting unit cut out by the temporary cutting point (indeterminate point, cutting point), and generates a candidate lattice (step S18).

次に、認識部１１６は、候補ラティスの各文字列候補パス（候補文字列）について、それぞれ重み付けされた複数項目の確からしさを考慮した尤度を求める（ステップＳ２０）。次に、候補ラティスに対してＶｉｔｅｒｂｉ探索を行って、最適な文字列候補パスを選択する認識処理を行う（ステップＳ２２）。 Next, the recognizing unit 116 obtains a likelihood in consideration of the probabilities of weighted items for each character string candidate path (candidate character string) of the candidate lattice (step S20). Next, a Viterbi search is performed on the candidate lattice, and a recognition process for selecting an optimum character string candidate path is performed (step S22).

なお、本発明は、上述の実施の形態に限定されるものではなく、種々の変形が可能である。本発明は、実施の形態で説明した構成と実質的に同一の構成（例えば、機能、方法及び結果が同一の構成、あるいは目的及び効果が同一の構成）を含む。また、本発明は、実施の形態で説明した構成の本質的でない部分を置き換えた構成を含む。また、本発明は、実施の形態で説明した構成と同一の作用効果を奏する構成又は同一の目的を達成することができる構成を含む。また、本発明は、実施の形態で説明した構成に公知技術を付加した構成を含む。 In addition, this invention is not limited to the above-mentioned embodiment, A various deformation | transformation is possible. The present invention includes configurations that are substantially the same as the configurations described in the embodiments (for example, configurations that have the same functions, methods, and results, or configurations that have the same objects and effects). In addition, the invention includes a configuration in which a non-essential part of the configuration described in the embodiment is replaced. In addition, the present invention includes a configuration that exhibits the same operational effects as the configuration described in the embodiment or a configuration that can achieve the same object. Further, the invention includes a configuration in which a known technique is added to the configuration described in the embodiment.

本実施形態の文字列認識装置の機能ブロック図の一例。An example of the functional block diagram of the character string recognition apparatus of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 本実施形態の手法について説明するための図。The figure for demonstrating the method of this embodiment. 性能評価結果を示す図。The figure which shows a performance evaluation result. 本実施形態の文字列認識装置の処理の流れを示すフローチャート図。The flowchart figure which shows the flow of a process of the character string recognition apparatus of this embodiment.

符号の説明Explanation of symbols

１００処理部、１１０筆点情報取得部、１１２仮切出しポイント決定部、１１４候補文字列生成部、１１６認識部、１２０表示処理部、１６０文字入力部、１７０記憶部、１８０情報記憶媒体、１９０表示部、１９６通信部 DESCRIPTION OF SYMBOLS 100 Processing part, 110 Writing point information acquisition part, 112 Temporary extraction point determination part, 114 Candidate character string generation part, 116 Recognition part, 120 Display processing part, 160 Character input part, 170 Storage part, 180 Information storage medium, 190 Display Part, 196 communication part

Claims

手書き入力された文字パタンのストロークから文字列を認識するためのプログラムであって、
ストローク間のオフストロークのうち、仮切出しポイントとして設定するオフストロークを決定する仮切出しポイント決定部と、
前記仮切出しポイントによって切り出された仮切出しユニット毎に文字認識を行い、候補文字列を生成する候補文字列生成部と、
前記候補文字列について複数項目の確からしさを評価し、評価結果に基づき文字列を認識する認識部としてコンピュータを機能させ、
前記仮切出しユニットは、隣接する前記仮切出しポイントにより切り出される最小仮切出しユニットを少なくとも１つ含み、
前記認識部は、
前記候補文字列の各候補文字に対応する前記仮切出しユニットに含まれる前記最小仮切出しユニット、および隣接する前記最小切出しユニット間の連結について、前記複数項目の確からしさを評価し、前記複数項目の確からしさについて各項目に与える重みの値を、前記仮切出しユニットに含まれる前記最小仮切出しユニットの数に応じて設定することを特徴とするプログラム。 A program for recognizing a character string from a stroke of a character pattern input by handwriting,
Of the off-strokes between strokes, a provisional cut-out point determination unit that determines an off-stroke to be set as a temporary cut-out point;
Character recognition for each provisional cutout unit cut out by the provisional cutout point, a candidate character string generation unit that generates a candidate character string,
Evaluating the probability of multiple items for the candidate character string, causing the computer to function as a recognition unit that recognizes the character string based on the evaluation result,
The temporary cutting unit includes at least one minimum temporary cutting unit cut out by the adjacent temporary cutting point,
The recognition unit
For the connection between the minimum provisional cutout unit included in the provisional cutout unit corresponding to each candidate character of the candidate character string and the adjacent minimum cutout unit, the probability of the plurality of items is evaluated, and the plurality of items A program that sets a weight value to be given to each item with respect to the probability in accordance with the number of the minimum provisional extraction units included in the provisional extraction unit.

請求項１において、
前記複数項目の確からしさは、単一文字パタン位置の確からしさを含み、
前記認識部は、
文字パタンの上部から文字列の中心線までの距離を平均文字サイズで正規化した値と、文字パタンの底部から文字列の中心線までの距離を平均文字サイズで正規化した値とからなる特徴ベクトルに基づいて、前記単一文字パタン位置の確からしさを評価することを特徴とするプログラム。 In claim 1,
The certainty of the plurality of items includes the certainty of a single character pattern position,
The recognition unit
Characteristic consisting of a value obtained by normalizing the distance from the top of the character pattern to the center line of the character string by the average character size, and a value obtained by normalizing the distance from the bottom of the character pattern to the center line of the character string by the average character size. A program for evaluating the probability of the single character pattern position based on a vector.

請求項１において、
前記複数項目の確からしさは、ペア文字パタン位置の確からしさを含み、
前記認識部は、
隣接する文字パタンの上部同士の距離を平均文字サイズで正規化した値と、隣接する文字パタンの下部同士の距離を平均文字サイズで正規化した値とからなる特徴ベクトルに基づいて、前記ペア文字パタン位置の確からしさを評価することを特徴とするプログラム。 In claim 1,
The certainty of the plurality of items includes the certainty of the pair character pattern position,
The recognition unit
Based on a feature vector consisting of a value obtained by normalizing the distance between the upper parts of adjacent character patterns with the average character size and a value obtained by normalizing the distance between the lower parts of adjacent character patterns with the average character size, the pair characters A program characterized by evaluating the probability of pattern position.

請求項１において、
前記複数項目の確からしさは、文字パタン内分割の確からしさを含み、
前記認識部は、
文字パタンの垂直方向の射影におけるギャップの長さを平均文字サイズで正規化した値と、文字パタンの水平方向の射影におけるギャップの長さを平均文字サイズで正規化した値とからなる特徴ベクトルに基づいて、前記文字パタン内分割の確からしさを評価することを特徴とするプログラム。 In claim 1,
The certainty of the plurality of items includes the certainty of division within the character pattern,
The recognition unit
A feature vector consisting of a value obtained by normalizing the gap length in the vertical projection of the character pattern by the average character size and a value obtained by normalizing the gap length in the horizontal projection of the character pattern by the average character size. A program for evaluating the probability of division within the character pattern based on the above.

請求項２乃至４のいずれかにおいて、
前記複数項目の確からしさは、言語文脈の確からしさ、文字パタンサイズの確からしさ及び文字認識の確からしさを更に含むことを特徴とするプログラム。 In any of claims 2 to 4,
The likelihood of multiple items, the probability of language context, program characterized by further comprising a likelihood of certainty and character recognition of the character pattern size.

請求項１において、
前記認識部は、
前記候補文字列について、前記複数項目の確からしさの他に、前記仮切出しポイントの確からしさを評価し、前記仮切出しポイントを、隣接する文字パタン間の仮切出しポイントとして判断する場合には、前記仮切出しポイントが切出しポイントとして識別される確からしさを評価し、前記仮切出しポイントを、文字パタン内部の仮切出しポイントとして判断する場合には、前記仮切出しポイントが非切出しポイントとして識別される確からしさを評価することを特徴とするプログラム。 In claim 1 ,
The recognition unit
For the candidate character string, in addition to the certainty of the plurality of items, when evaluating the certainty of the provisional cut point, and determining the provisional cut point as a provisional cut point between adjacent character patterns, When assessing the probability that a temporary cut point is identified as a cut point, and determining that the temporary cut point is a temporary cut point inside a character pattern, the probability that the temporary cut point is identified as a non-cut point A program characterized by evaluating

請求項６において、
前記仮切出しポイント決定部は、
前記オフストロークから抽出した多次元の特徴値をサポートベクターマシン（ＳＶＭ）に適用することで、前記仮切出しポイントのＳＶＭ出力値を求め、
前記認識部は、
前記ＳＶＭ出力値に基づき前記仮切出しポイントの確からしさを評価することを特徴とするプログラム。 In claim 6 ,
The temporary cut point determination unit
By applying a multi-dimensional feature value extracted from the off-stroke to a support vector machine (SVM), an SVM output value of the temporary cut point is obtained.
The recognition unit
A program for evaluating the probability of the provisional cut-out point based on the SVM output value.

請求項１乃至７のいずれかにおいて、
前記認識部は、
遺伝的アルゴリズムにより学習した複数の重みの値を前記複数項目の確からしさに与えることを特徴とするプログラム。 In any one of Claims 1 thru | or 7,
The recognition unit
A program characterized in that a plurality of weight values learned by a genetic algorithm are given to the probabilities of the plurality of items.

コンピュータ読み取り可能な情報記憶媒体であって、請求項１乃至８のいずれかのプログラムを記憶したことを特徴とする情報記憶媒体。 A computer-readable information storage medium, wherein the program according to any one of claims 1 to 8 is stored.

手書き入力された文字パタンのストロークから文字列を認識する文字列認識装置であって、
ストローク間のオフストロークのうち、仮切出しポイントとして設定するオフストロークを決定する仮切出しポイント決定部と、
前記仮切出しポイントによって切り出された仮切出しユニット毎に文字認識を行い、候補文字列を生成する候補文字列生成部と、
前記候補文字列について複数項目の確からしさを評価し、評価結果に基づき文字列を認識する認識部とを含み、
前記仮切出しユニットは、隣接する前記仮切出しポイントにより切り出される最小仮切
出しユニットを少なくとも１つ含み、
前記認識部は、
前記候補文字列の各候補文字に対応する前記仮切出しユニットに含まれる前記最小仮切出しユニット、および隣接する前記最小切出しユニット間の連結について、前記複数項目の確からしさを評価し、前記複数項目の確からしさについて各項目に与える重みの値を、前記仮切出しユニットに含まれる前記最小仮切出しユニットの数に応じて設定することを特徴とする文字列認識装置。 A character string recognition device for recognizing a character string from a stroke of a character pattern input by handwriting,
Of the off-strokes between strokes, a provisional cut-out point determination unit that determines an off-stroke to be set as a temporary cut-out point;
Character recognition for each provisional cutout unit cut out by the provisional cutout point, a candidate character string generation unit that generates a candidate character string,
A recognition unit that evaluates the probability of a plurality of items for the candidate character string and recognizes the character string based on the evaluation result;
The temporary cutting unit includes at least one minimum temporary cutting unit cut out by the adjacent temporary cutting point,
The recognition unit
For the connection between the minimum provisional cutout unit included in the provisional cutout unit corresponding to each candidate character of the candidate character string and the adjacent minimum cutout unit, the probability of the plurality of items is evaluated, and the plurality of items A character string recognizing apparatus, wherein a value of a weight given to each item regarding the probability is set according to the number of the minimum provisional cutout units included in the provisional cutout unit.