JPH0950490A

JPH0950490A - Handwritten character recognizing device

Info

Publication number: JPH0950490A
Application number: JP7201119A
Authority: JP
Inventors: Ikuhiko Nishio; 郁彦西尾
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1995-08-07
Filing date: 1995-08-07
Publication date: 1997-02-18

Abstract

PROBLEM TO BE SOLVED: To improve the discrimination rate of characters which are similar to each other in shape and to improve the recognition rate of a right character. SOLUTION: Input holograph is normalized in a linear normalization circuit 1 and a nonlinear normalization circuit 2, supplied to a bit map plotting circuit 3, and the plotted pattern is supplied to a feature vector generation circuit 4 and the generated feature vector is stored in a storage circuit 5. Further, the stored feature vector is supplied to a comparison and calculation circuit 6, the feature vector is compared with the feature vector of the recognition object character from a reference pattern data base 7 and the evaluation value of a degree of similarity is calculated. These calculated evaluation values are stored ascending order of the evaluation value in descending order of value having the higher degree of similarity) in a candidate character string storage circuit 8. Further, the number of strokes of the stroke string by the input holograph is stored in a number of strokes storage circuit 9, this number of strokes is supplied to a number of strokes certification circuit 10 and the number of strokes is compared with the number of strokes of the recognition object character from a number of strokes information data base 11. By this comparison result, the order of the candidate character strings stored in the storage circuit 8 is rearranged and the character strings are outputted.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は手書き文字認識装
置、特に認識結果の候補列の検定に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwritten character recognition apparatus, and more particularly to a verification of candidate strings of recognition results.

【０００２】[0002]

【従来の技術】例えば携帯型の情報端末装置において
は、液晶ディスプレイ等による表示装置の表面に透明タ
ッチパネル状の入力装置を設けて、いわゆる手書きで情
報等の入力を行うことができるようにした装置が実施さ
れている。2. Description of the Related Art For example, in a portable information terminal device, an input device in the form of a transparent touch panel is provided on the surface of a display device such as a liquid crystal display so that information can be input by so-called handwriting. Is being implemented.

【０００３】このような装置において、手書きの文字等
を認識する方法としては、従来からいわゆるビットマッ
プパターンによる文字認識が多く用いられている。すな
わちこのビットマップパターンによる文字認識では、文
字パターンの形そのものを特徴として、標準パターンと
の重ね合わせによって認識処理を行うものである。As a method for recognizing a handwritten character or the like in such an apparatus, a character recognition by a so-called bitmap pattern has been widely used conventionally. That is, in the character recognition using this bitmap pattern, the recognition process is performed by superimposing it on the standard pattern, which is characterized by the shape of the character pattern itself.

【０００４】従ってこの方法では、正しく重ね合わせを
行うためにパターンの位置ずれを吸収することが重要で
ある。そのため一般にビットマップパターンによる文字
認識では、パターンの正規化処理、ぼかし処理、パター
ンのモザイク化処理などが行われている。Therefore, in this method, it is important to absorb the positional deviation of the patterns in order to perform the correct overlay. Therefore, generally, in character recognition using a bitmap pattern, pattern normalization processing, blurring processing, pattern mosaicing processing, and the like are performed.

【０００５】ここで例えば上述の正規化の処理として
は、例えば特願昭６２−１８５８２６号、あるいは「昭
６２信学情報システム全国大会、Ｎｏ．６８，１９８
７、手書き漢字認識における非線形正規化」に示される
ような手段が提案されている。As the above-mentioned normalization processing, for example, Japanese Patent Application No. 62-185826 or "Sho 62 National Conference on Information Systems, No. 68, 198" is used.
7. Non-linear normalization in handwritten Chinese character recognition "has been proposed.

【０００６】すなわちこの手段では、上述の入力装置に
ペンが降されて（ペンダウン）から離される（ペンアッ
プ）までの入力筆跡を折れ線で近似し、この近似された
座標点列をストロークと称する。そしてこのストローク
ごとに大きさを補正し、大きさの補正された複数のスト
ロークの位置のバランスを補正し、バランスの補正され
たストローク列を用いてビットマップパターンを描画
し、この描画されたパターンと標準パターンとの重ね合
わせによって認識処理を行うものである。[0006] That is, in this means, the input handwriting from when the pen is dropped on the above-mentioned input device (pen down) to when it is released (pen up) is approximated by a polygonal line, and this approximated coordinate point sequence is called a stroke. Then, the size is corrected for each stroke, the balance of the positions of the plurality of size-corrected strokes is corrected, and a bitmap pattern is drawn using the balance-corrected stroke sequence. The recognition process is performed by superimposing the standard pattern with the standard pattern.

【０００７】[0007]

【発明が解決しようとする課題】ところが上述のような
ビットマップパターンによる認識処理においては、パタ
ーンの重ね合わせが正しく行われない場合には形の類似
した別の文字と重なり合ってしまい、誤った認識結果と
なる場合がある。However, in the recognition processing using the bit map pattern as described above, if the patterns are not correctly superimposed, another character having a similar shape is overlapped, resulting in incorrect recognition. It may result.

【０００８】すなわち、例えば「追」の文字を入力しよ
うとした場合に、ビットマップパターンによる認識処理
では、例えば類似度の高い順に「遅、追、遼、進、逓、
通、・・・」の文字が候補列として取り出される。ここ
で「追」の文字は必ずしも最初の候補とは成っておら
ず、従ってこのような候補列では、文字の入力を円滑に
行えない恐れが生じるものであった。That is, for example, when a character "Oi" is to be input, in the recognition processing using the bitmap pattern, for example, "slow, Oi, Liao, advance, multiply,
The letters "..., ..." Are extracted as a candidate string. Here, the character "Oi" is not always the first candidate, and therefore, in such a candidate string, there is a fear that the characters cannot be smoothly input.

【０００９】この出願はこのような点に鑑みて成された
ものであって、解決しようとする問題点は、ビットマッ
プパターンによる認識処理では、パターンの重ね合わせ
が正しく行われなかった場合に、誤った認識結果を生じ
易いというものである。This application has been made in view of such a point, and the problem to be solved is that in the recognition processing by the bitmap pattern, when the patterns are not superposed correctly, It is likely that an incorrect recognition result is generated.

【００１０】[0010]

【課題を解決するための手段】このため本発明において
は、ビットマップパターンによる認識処理に加えて、入
力筆跡のストロークの数と認識対象文字の画数のデータ
を用いて候補文字列の並べ替えを行うようにしたもので
あって、これによれば形の似た文字の識別率を高め、正
しい文字の認識率を向上させることができる。Therefore, in the present invention, in addition to the recognition processing by the bitmap pattern, the candidate character string is rearranged by using the data of the strokes of the input handwriting and the stroke number of the recognition target character. According to this, the recognition rate of characters having similar shapes can be increased and the recognition rate of correct characters can be improved.

【００１１】[0011]

【発明の実施の形態】すなわち本発明においては、ビッ
トマップパターンによる認識処理を用いる手書き文字認
識装置において、入力筆跡のストロークの数を記憶する
画数記憶手段と、認識対象文字の画数のデータを格納す
る画数データベースとを設け、画数記憶手段からの入力
筆跡のストロークの数と、画数データベースからの認識
対象文字の画数に従って、候補文字列記憶手段に記憶さ
れた認識対象文字の並べ替えを行うものである。BEST MODE FOR CARRYING OUT THE INVENTION That is, according to the present invention, in a handwritten character recognizing device using a recognition process using a bit map pattern, a stroke number storing means for storing the number of strokes of an input handwriting and a stroke number data for a character to be recognized are stored. A stroke number database is provided to sort the recognition target characters stored in the candidate character string storage means according to the number of strokes of the input handwriting from the stroke number storage means and the stroke number of the recognition target character from the stroke number database. is there.

【００１２】以下、図面を参照して本発明を説明する
に、図１は本発明による手書き文字認識装置の一例の構
成を示すブロック図である。The present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of an example of a handwritten character recognition apparatus according to the present invention.

【００１３】すなわちこの図１において、例えば上述の
入力装置（図示せず）に書き込まれた入力筆跡によるス
トローク列が線形正規化回路１に供給される。なおスト
ロークは、例えば上述したように入力装置のペンダウン
からペンアップまでの入力筆跡を折れ線で近似し、この
近似された座標点列で表されるものである。That is, in FIG. 1, for example, the stroke sequence based on the input handwriting written in the above-mentioned input device (not shown) is supplied to the linear normalization circuit 1. The stroke is represented by a sequence of coordinate points obtained by approximating the input handwriting from pen-down to pen-up of the input device with a polygonal line as described above.

【００１４】そしてこの線形正規化回路１では、入力さ
れたストロークの筆跡の大きさが補正され、大きさの補
正されたストローク列が出力される。さらにこの線形正
規化回路１で大きさの補正されたストローク列が非線形
正規化回路２に供給される。そしてこの非線形正規化回
路２では、大きさの補正されたストローク列を入力とし
て、各ストロークの位置のバランスを補正したストロー
ク列が出力される。Then, the linear normalization circuit 1 corrects the size of the handwriting of the input stroke, and outputs the size-corrected stroke sequence. Further, the stroke sequence whose size is corrected by the linear normalization circuit 1 is supplied to the non-linear normalization circuit 2. Then, the non-linear normalization circuit 2 receives the stroke sequence having the corrected size, and outputs the stroke sequence in which the position balance of each stroke is corrected.

【００１５】さらに、この非線形正規化回路２で位置の
バランスの補正されたストローク列が、筆跡ビットマッ
プ描画回路３に供給され、上述の正規化されたストロー
ク列を入力としてビットマップパターンに描画されたパ
ターンが形成される。なお線形及び非線形の正規化の処
理は例えば上述の文献に示されたものであり、また本願
の発明の本質には直接関係しないので詳細な説明は省略
する。Further, the stroke sequence whose position balance has been corrected by the non-linear normalization circuit 2 is supplied to the handwriting bitmap drawing circuit 3 and drawn in a bitmap pattern using the above-mentioned normalized stroke sequence as an input. Pattern is formed. Note that the linear and non-linear normalization processing is shown in, for example, the above-mentioned document, and since it is not directly related to the essence of the invention of the present application, detailed description thereof is omitted.

【００１６】そしてこの筆跡ビットマップ描画回路３で
描画されたパターンが、特徴ベクタ生成回路４に供給さ
れる。また、この特徴ベクタ生成回路４で生成された特
徴ベクタが特徴ベクタ記憶回路５に記憶される。The pattern drawn by the handwriting bitmap drawing circuit 3 is supplied to the feature vector generating circuit 4. In addition, the feature vector generated by the feature vector generation circuit 4 is stored in the feature vector storage circuit 5.

【００１７】さらに、この特徴ベクタ記憶回路５に記憶
された特徴ベクタが、特徴ベクタ比較計算回路６に供給
される。そしてこの特徴ベクタ比較計算回路６では、参
照パターンデータベース７からの認識対象文字の特徴ベ
クタと特徴ベクタ記憶回路５に記憶された特徴ベクタと
が比較され、例えばこれらが一致しない部分の点数が計
算されて、これらの類似度の評価値が計算される。Further, the feature vector stored in the feature vector storage circuit 5 is supplied to the feature vector comparison calculation circuit 6. Then, the feature vector comparison / calculation circuit 6 compares the feature vector of the recognition target character from the reference pattern database 7 with the feature vector stored in the feature vector storage circuit 5, and calculates, for example, the score of the portion where these do not match. Then, the evaluation value of these similarities is calculated.

【００１８】なお、上述の参照パターンデータベース７
には、認識対象文字カテゴリーのそれぞれの特徴ベクタ
の値が格納されている。また各文字カテゴリーの特徴ベ
クタは、その文字カテゴリーに属するいくつかのサンプ
ル筆跡の特徴ベクタを平均化して求められるものであ
る。The above-mentioned reference pattern database 7
Stores the value of each feature vector of the recognition target character category. The feature vector of each character category is obtained by averaging the feature vectors of some sample handwritings belonging to that character category.

【００１９】従って上述の特徴ベクタ比較計算回路６の
計算では、類似度の高いものほど評価値は低くなる。そ
こでこの計算された評価値の低い（類似度は高い）もの
から順に、計算の行われた認識対象文字が候補文字列記
憶回路８に記憶される。なおこの候補文字列記憶回路８
には、認識対象文字の例えばＪＩＳコード等の文字コー
ドと、計算された評価値等の情報が記憶される。Therefore, in the calculation of the above-described feature vector comparison / calculation circuit 6, the higher the degree of similarity, the lower the evaluation value. Therefore, the recognition target characters for which the calculation has been performed are stored in the candidate character string storage circuit 8 in order from the one with the lowest evaluation value (the highest similarity). The candidate character string storage circuit 8
The character code such as the JIS code of the recognition target character and the information such as the calculated evaluation value are stored in the.

【００２０】そしてさらにこの装置において、上述の入
力筆跡によるストローク列が入力筆跡画数記憶回路９に
供給されて、その個数が入力筆跡画数記憶回路９に格納
される。また、この入力筆跡画数記憶回路９に格納され
た入力筆跡の画数が、画数検定回路１０に供給される。Further, in this apparatus, the stroke sequence based on the above-mentioned input handwriting is supplied to the input handwriting stroke number storage circuit 9, and the number thereof is stored in the input handwriting stroke number storage circuit 9. Further, the stroke number of the input handwriting stored in the input stroke number storage circuit 9 is supplied to the stroke number verification circuit 10.

【００２１】さらにこの画数検定回路１０では、画数情
報データベース１１からの認識対象文字の画数と入力筆
跡画数記憶回路９に記憶された画数とが比較される。そ
してこの比較によって候補文字列記憶回路８に記憶され
た候補文字列の順位が並べ替えられ、並べ替えられた候
補文字列が最終的な認識結果として出力される。Further, in the stroke number verification circuit 10, the stroke number of the character to be recognized from the stroke number information database 11 is compared with the stroke number stored in the input handwriting stroke number storage circuit 9. By this comparison, the ranks of the candidate character strings stored in the candidate character string storage circuit 8 are rearranged, and the rearranged candidate character strings are output as the final recognition result.

【００２２】次に図２は、上述の認識装置をソフトウェ
アで実現する場合の処理手順を示すフローチャートであ
る。Next, FIG. 2 is a flowchart showing a processing procedure when the above-mentioned recognition device is realized by software.

【００２３】この図２において、処理が開始されると、
まずステップ〔１〕では、上述の線形正規化回路１に相
当する入力筆跡の線形正規化処理が行われる。またステ
ップ〔２〕では、上述の非線形正規化回路２に相当する
入力筆跡の非線形正規化処理が行われる。さらにステッ
プ〔３〕では、入力筆跡の画数の算出が行われ、入力筆
跡画数記憶回路９への入力筆跡の画数の記憶が行われ
る。In FIG. 2, when the processing is started,
First, in step [1], a linear normalization process of the input handwriting corresponding to the above-described linear normalization circuit 1 is performed. In step [2], the non-linear normalization processing of the input handwriting corresponding to the non-linear normalization circuit 2 is performed. Further, in step [3], the number of strokes of the input handwriting is calculated, and the number of strokes of the input handwriting is stored in the input handwriting stroke number storage circuit 9.

【００２４】またステップ〔４〕では、上述の筆跡ビッ
トマップ描画回路３に相当する正規化されたストローク
列のビットマップパターンへの描画が行われる。そして
ステップ〔５〕では、上述の特徴ベクタ生成回路４に相
当する描画されたビットマップパターンからの特徴ベク
タの生成が行われ、特徴ベクタ記憶回路５への入力筆跡
の特徴ベクタの記憶が行われる。In step [4], drawing of a normalized stroke sequence corresponding to the handwriting bitmap drawing circuit 3 into a bitmap pattern is performed. Then, in step [5], a feature vector is generated from the drawn bitmap pattern corresponding to the above-mentioned feature vector generation circuit 4, and the feature vector of the input handwriting is stored in the feature vector storage circuit 5. .

【００２５】さらにステップ〔６〕では、認識結果の候
補を表す集合が空集合に初期化される。すなわち上述の
候補文字列記憶回路８が初期化される。またステップ
〔７〕では、上述の参照パターンデータベース７の読み
出し位置が先頭の位置に初期化される。Further, in step [6], the set representing the recognition result candidates is initialized to an empty set. That is, the above-mentioned candidate character string storage circuit 8 is initialized. In step [7], the read position of the reference pattern database 7 is initialized to the leading position.

【００２６】そしてステップ〔８〕では、参照パターン
データベース７の指定された読み出し位置に格納された
１つのレコード（特徴ベクタ）が読み出される。さらに
ステップIn step [8], one record (feature vector) stored in the designated read position of the reference pattern database 7 is read. Further steps

〔９〕では、参照パターンデータベース７から
読み出された特徴ベクタと、入力筆跡の特徴ベクタとの
距離（評価値）が算出される。In [9], the distance (evaluation value) between the feature vector read from the reference pattern database 7 and the feature vector of the input handwriting is calculated.

【００２７】またステップ〔１０〕では、算出された距
離に応じて候補集合への追加が行われる。すなわち上述
の候補文字列記憶回路８に記憶されている候補列の、評
価値の順位で決定される位置に、認識対象文字の文字コ
ードと計算された評価値等の情報の記憶が行われる。In step [10], addition to the candidate set is performed according to the calculated distance. That is, information such as the character code of the recognition target character and the calculated evaluation value is stored at the position determined by the rank of the evaluation value in the candidate string stored in the above-mentioned candidate character string storage circuit 8.

【００２８】さらにステップ〔１１〕では、参照パター
ンデータベース７から読み出された特徴ベクタが最終の
レコードであるか否か判断される。ここで最終でないと
きは、ステップ〔１２〕で参照パターンデータベース７
の読み出し位置が一つ進められてステップ〔８〕に戻さ
れ、上述の処理が繰り返される。Further, in step [11], it is judged whether or not the feature vector read from the reference pattern database 7 is the final record. If it is not the final one, the reference pattern database 7 is selected in step [12].
The reading position of 1 is advanced by 1 and returned to step [8], and the above-mentioned processing is repeated.

【００２９】また、ステップ〔１１〕で参照パターンデ
ータベース７から読み出された特徴ベクタが最終のレコ
ードのときは、ステップ〔１３〕で後述する画数検定の
処理が行われて処理が終了される。When the feature vector read out from the reference pattern database 7 in step [11] is the final record, the stroke number test process described later is performed in step [13] and the process ends.

【００３０】そこでステップ〔１３〕の画数検定の処理
は、例えば図３のフローチャートに示すようにして行わ
れる。Therefore, the stroke number verification process in step [13] is performed as shown in the flowchart of FIG. 3, for example.

【００３１】すなわち図３において、処理が開始される
と、まずステップ〔２１〕では、任意の変数ｉに１が代
入される。ここでｉは検定対象となる候補の番号を表
す。次にステップ〔２２〕では、１位候補とｉ位候補と
の評価値の差が計算され、この差がある閾値Ｔ以上のと
きは、画数検定を打ち切る処理が行われる。これによっ
て１位候補との類似度の高い候補についてのみ画数検定
を行うことができる。That is, in FIG. 3, when the process is started, first, in step [21], 1 is assigned to an arbitrary variable i. Here, i represents the number of the candidate to be tested. Next, in step [22], the difference between the evaluation values of the first-ranked candidate and the i-th candidate is calculated, and when the difference is equal to or larger than a threshold value T, the process of stopping the stroke number test is performed. Thereby, the stroke number test can be performed only on the candidate having a high degree of similarity with the first-ranked candidate.

【００３２】さらにステップ〔２３〕では、入力画数と
ｉ位候補の画数差が計算され、この画数差が保存され
る。またステップ〔２４〕では、ｉの値が候補数を越え
たか否かが判断され、越えていないときはステップ〔２
５〕で変数ｉに１が加算されてステップ〔２２〕に戻さ
れ、上述の処理が繰り返される。Further, in step [23], the difference between the input stroke number and the stroke number of the i-th candidate is calculated, and this stroke number difference is stored. In step [24], it is judged whether or not the value of i exceeds the number of candidates.
In step 5], 1 is added to the variable i, the process returns to step [22], and the above process is repeated.

【００３３】また、ステップ〔２４〕でｉの値が候補数
を越えたときは、ステップ〔２６〕で１位からｉ位まで
の候補が画数差と入力筆跡との距離でソートされて、上
述の画数検定回路１０での、候補文字列記憶回路８に記
憶された候補文字列の並べ替えが行われる。なお並べ替
えの方法としては、例えば画数差の小さいものを先頭と
し、画数差の等しいものについては評価値の小さいもの
から先頭とする。When the value of i exceeds the number of candidates in step [24], the candidates from the first place to the i-th place are sorted by the distance between the stroke number and the input handwriting in step [26]. In the stroke number verification circuit 10, the candidate character strings stored in the candidate character string storage circuit 8 are rearranged. As a rearrangement method, for example, the one having the smallest stroke number difference is first, and the one having the same stroke number difference is the smallest evaluation value.

【００３４】従って上述の装置において、例えば「追」
の文字を入力しようとした場合に、ビットマップパター
ンによる認識処理では、各認識対象文字に対して、例え
ば次に示すような評価値が得られる。Therefore, in the above-mentioned device, for example, "addition"
In the recognition process using the bitmap pattern, the following evaluation value is obtained for each recognition target character when the character is input.

【００３５】遅 [18530]、追 [18558]、遼 [18572]、進 [18991] 逓 [19532]、通 [19564]、遜 [19594]、造 [19844] 違 [19948]、遍 [20079]、逼 [20114]、逗 [20121] 避 [20134]、遇 [20245]、渥 [20305]、遁 [20393] 遣 [20396]、退 [20444]、迫 [20502]、蓮 [20536]Slow [18530], Oi [18558], Liao [18572], Susumu [18991] Increment [19532], Tsu [19564], Haruka [19594], [19844] Diff [19948], Hen [20079] , 逼 [20114], 逗 [20121] Evacuation [20134], Oops [20245], Atsushi [20305], Haruka [20393] Sending [20396], Retiring [20444], Sako [20502], Lotus [20536]

【００３６】すなわちこの場合に、候補文字列は、〔遅追遼進逓通遜造違遍逼逗避遇渥遁遣退迫蓮〕となる。In other words, in this case, the candidate character string is [delaying, delaying, passing, passing, construction, and alienation, and avoiding, lotus].

【００３７】これに対して、例えば上述の閾値Ｔの値を
Ｔ＝１０００と定めると、先頭の４文字が画数検定の対
象となり、それぞれの画数は、遅〔12画〕、追〔９画〕、遼〔15画〕、進〔11画〕のように画数情報データベース１１に格納されている。On the other hand, if the threshold value T is set to T = 1000, for example, the first four characters are subject to the stroke number test, and the stroke numbers are slow [12 strokes] and additional strokes (9 strokes). , Ryo [15 strokes] and Shin [11 strokes] are stored in the stroke number information database 11.

【００３８】一方、入力筆跡の画数（ストローク数）が
例えば８画の場合、上述の画数検定対象の各文字との画
数差は、遅〔４画〕、追〔１画〕、遼〔７画〕、進〔３画〕となる。On the other hand, when the stroke number of the input handwriting is, for example, 8 strokes, the stroke number difference from each character of the stroke number test is slow [4 strokes], additional [1 stroke], Liao [7 strokes]. ], And progress [3 strokes].

【００３９】従ってこの画数検定によって、上述の候補
文字列は、〔追進遅遼逓通遜造違遍逼逗避遇渥遁遣退迫蓮〕となり、この並べ替えられた文字列が最終的な認識結果
として出力される。[0039] Therefore by this stroke count assay, the candidate character strings described above, [Tsuishin OsoRyo逓通Herikudazo違遍逼逗避遇渥遁Yashisa SakoRen], and the final this sorted string It is output as a recognition result.

【００４０】すなわちこの場合に、下線で示すように先
頭の４文字の並べ替えが行われて、例えば入力しようと
した「追」の文字が先頭になった候補文字列が最終的な
認識結果として出力される。That is, in this case, the leading four characters are rearranged as shown by underlining, and, for example, the candidate character string in which the character of the "additional" that is to be input is at the beginning is the final recognition result. Is output.

【００４１】従ってこの装置において、従来のビットマ
ップパターンによる認識処理のみでは、パターンの重ね
合わせが正しく行われなかった場合に誤った認識結果を
生じ易い問題があったものを、本発明においては、入力
筆跡のストロークの数と認識対象文字の画数のデータを
用いて候補文字列の並べ替えを行うことにより、形の似
た文字の識別率を高め、正しい文字の認識率を向上させ
ることができる。Therefore, in the present invention, there is a problem that an erroneous recognition result is apt to occur when the patterns are not correctly superposed only by the conventional recognition process using the bitmap pattern. By rearranging the candidate character strings using the data of the number of strokes of the input handwriting and the number of strokes of the recognition target character, it is possible to increase the recognition rate of characters with similar shapes and improve the recognition rate of correct characters. .

【００４２】なお、上述の装置において、図３のフロー
チャートのステップ〔２２〕における閾値Ｔは、値を無
限大として全ての候補について画数検定を行うようにし
てもよい。In the above apparatus, the threshold T in step [22] of the flowchart of FIG. 3 may be set to infinity and the stroke number test may be performed for all candidates.

【００４３】またステップ〔２６〕における並べ替え
は、実施例のように画数差の小さいものを先頭とする以
外にも、例えば画数差をｄ_i、評価値をｅ_iとして、Ｅ_i＝α×ｄ_i＋β×ｅ_i（α、βは定数）のように２つの値を合成して、この値Ｅ_iを新たな評価
値として、値Ｅ_iの小さいものから先頭に並べ替えを行
うようにしてもよい。Further, the rearrangement in step [26] is not limited to the one having the smallest difference in the number of strokes as in the embodiment, but E _i = α ×, where the difference in the number of strokes is d _i and the evaluation value is e _i. Two values such as d _i + β × e _i (α and β are constants) are combined, and this value E _i is used as a new evaluation value, and the smallest value E _i is rearranged from the beginning. May be.

【００４４】これによれば、例えば画数は筆跡の連続な
どによって変動しやすいものであり、これを元の評価値
に合成することで、画数の変動の影響を少なくすること
ができるものである。According to this, for example, the number of strokes is likely to fluctuate due to the continuation of handwriting, etc. By combining this with the original evaluation value, it is possible to reduce the influence of the variation in the number of strokes.

【００４５】さらに、上述の画数情報データベース１１
に格納されている画数に、例えば遅〔10−12画〕、追〔７−９画〕、遼〔12−15画〕、進
〔10−11画〕のように幅を持たせることによって、画数の影響を少な
くすることもできる。なおこの場合には、上述のステッ
プ〔１３〕の画数検定の処理は、例えば図４のフローチ
ャートに示すようにして行われる。Further, the stroke number information database 11 described above is used.
By giving the number of strokes stored in the space such as slow [10-12 strokes], additional [7-9 strokes], Liao [12-15 strokes], and progression [10-11 strokes], It is also possible to reduce the influence of the number of strokes. In this case, the stroke number verification process in step [13] is performed as shown in the flowchart of FIG. 4, for example.

【００４６】すなわち図４において、処理が開始される
と、まずステップ〔３１〕では、任意の変数ｉに１が代
入される。ここでｉは検定対象となる候補の番号を表
す。次にステップ〔３２〕では、１位候補とｉ位候補と
の評価値の差が計算され、この差がある閾値Ｔ以上のと
きは、画数検定を打ち切る処理が行われる。これによっ
て１位候補との類似度の高い候補についてのみ画数検定
を行うことができる。That is, in FIG. 4, when the process is started, first, in step [31], 1 is substituted into an arbitrary variable i. Here, i represents the number of the candidate to be tested. Next, in step [32], the difference between the evaluation values of the first-ranked candidate and the i-th candidate is calculated. Thereby, the stroke number test can be performed only on the candidate having a high degree of similarity with the first-ranked candidate.

【００４７】さらにステップ〔３３〕では、例えば入力
画数とｉ位候補の最小値（ｍｉｎ）が計算され、この値
が画数差として保存される。またステップ〔３４〕で
は、ｉの値が候補数を越えたか否かが判断され、越えて
いないときはステップ〔３５〕で変数ｉに１が加算され
てステップ〔３２〕に戻され、上述の処理が繰り返され
る。Further, in step [33], for example, the input stroke number and the minimum value (min) of the i-th candidate are calculated, and this value is stored as the stroke number difference. In step [34], it is judged whether or not the value of i exceeds the number of candidates, and if it does not exceed 1, the variable i is incremented by 1 in step [35] and returned to step [32]. The process is repeated.

【００４８】また、ステップ〔３４〕でｉの値が候補数
を越えたときは、ステップ〔３６〕で１位からｉ位まで
の候補が画数差と入力筆跡との距離でソートされて、上
述の画数検定回路１０での、候補文字列記憶回路８に記
憶された候補文字列の並べ替えが行われる。このように
して、画数情報データベース１１に格納されている画数
に幅を持たせることによって、画数の影響を少なくする
ことができる。When the value of i exceeds the number of candidates in step [34], the candidates from the 1st place to the i-th place are sorted by the distance between the stroke number and the input handwriting in step [36]. In the stroke number verification circuit 10, the candidate character strings stored in the candidate character string storage circuit 8 are rearranged. In this way, by making the number of strokes stored in the stroke number information database 11 wide, the influence of the number of strokes can be reduced.

【００４９】こうして上述の手書き文字認識装置によれ
ば、ビットマップパターンによる認識処理を用いる手書
き文字認識装置において、入力筆跡のストロークの数を
記憶する画数記憶手段と、認識対象文字の画数のデータ
を格納する画数データベースとを設け、画数記憶手段か
らの入力筆跡のストロークの数と、画数データベースか
らの認識対象文字の画数に従って、候補文字列記憶手段
に記憶された認識対象文字の並べ替えを行うことによ
り、形の似た文字の識別率を高め、正しい文字の認識率
を向上させることができるものである。Thus, according to the above-mentioned handwritten character recognition device, in the handwritten character recognition device using the recognition processing by the bitmap pattern, the stroke number storage means for storing the number of strokes of the input handwriting and the stroke number data of the recognition target character are stored. A stroke number database to be stored is provided, and the recognition target characters stored in the candidate character string storage means are rearranged according to the number of strokes of the handwriting input from the stroke number storage means and the stroke number of the recognition target character from the stroke number database. This makes it possible to increase the identification rate of characters having similar shapes and improve the recognition rate of correct characters.

【００５０】なお上述の実施例においては、前置処理と
して入力筆跡をパターンの正規化処理をしてビットマッ
プパターンによる認識処理を行う場合について説明した
が、この前置処理は、ぼかし処理やパターンのモザイク
化処理などの、他の処理を含んでいてもよい。In the above-described embodiment, the case where the input handwriting is subjected to the pattern normalization processing and the recognition processing by the bitmap pattern is performed as the pre-processing is explained. Other processing such as the mosaic processing of the above may be included.

【００５１】[0051]

【発明の効果】この発明によれば、従来のビットマップ
パターンによる認識処理のみでは、パターンの重ね合わ
せが正しく行われなかった場合に誤った認識結果を生じ
易い問題があったものを、入力筆跡のストロークの数と
認識対象文字の画数のデータを用いて候補文字列の並べ
替えを行うことにより、形の似た文字の識別率を高め、
正しい文字の認識率を向上させることができるようにな
った。As described above, according to the present invention, there is a problem that an erroneous recognition result is apt to occur when pattern superimposition is not correctly performed only by the conventional recognition processing using a bitmap pattern. By rearranging the candidate character strings using the data of the number of strokes and the number of strokes of the recognition target character, the recognition rate of characters with similar shapes is increased,
It has become possible to improve the recognition rate of correct characters.

【００５２】すなわちこの発明によれば、ビットマップ
パターンによる分類結果を、ビットマップパターンには
表れない画数の情報を使って検定することで、類似パタ
ーンの識別能力を向上することができ、これによって認
識率の改善を図ることができるものである。That is, according to the present invention, the classification result by the bitmap pattern is tested by using the information of the number of strokes that does not appear in the bitmap pattern, so that the ability to identify similar patterns can be improved. It is possible to improve the recognition rate.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の適用される手書き文字認識装置の一例
の構成図である。FIG. 1 is a configuration diagram of an example of a handwritten character recognition device to which the present invention is applied.

【図２】その動作の説明のためのフローチャート図であ
る。FIG. 2 is a flowchart for explaining the operation.

【図３】その動作の説明のためのフローチャート図であ
る。FIG. 3 is a flowchart for explaining the operation.

【図４】その動作の説明のためのフローチャート図であ
る。FIG. 4 is a flowchart for explaining the operation.

【符号の説明】[Explanation of symbols]

１線形正規化回路２非線形正規化回路３筆跡ビットマップ描画回路４特徴ベクタ生成回路５特徴ベクタ記憶回路６特徴ベクタ比較計算回路７参照パターンデータベース８候補文字列記憶回路９入力筆跡画数記憶回路１０画数検定回路１１画数情報データベース 1 linear normalization circuit 2 non-linear normalization circuit 3 handwriting bitmap drawing circuit 4 feature vector generation circuit 5 feature vector storage circuit 6 feature vector comparison calculation circuit 7 reference pattern database 8 candidate character string storage circuit 9 input handwriting stroke count storage circuit 10 strokes Test circuit 11 Stroke information database

Claims

【特許請求の範囲】[Claims]

【請求項１】入力筆跡を表すストローク列を入力とし
てこのストローク列を正規化するストローク列正規化手
段と、この正規化手段で正規化されたストローク列をビットマ
ップパターンに描画してその特徴ベクタを生成する特徴
ベクタ生成手段と、この生成手段で生成された特徴ベクタを記憶する特徴ベ
クタ記憶手段と、認識対象文字の特徴ベクタを格納する参照パターンデー
タベースとを有し、上記特徴ベクタ記憶手段からの入力ストローク列の特徴
ベクタと、上記参照パターンデータベースからの認識対
象文字の特徴ベクタとの類似度を計算する比較計算手段
と、この比較計算手段で計算された類似度の高い順に上記認
識対象文字を記憶する候補文字列記憶手段とが設けられ
た手書き文字認識装置において、上記入力筆跡のストロークの数を記憶する入力筆跡画数
記憶手段と、上記認識対象文字の画数の情報を格納する画数情報デー
タベースとを設け、上記画数記憶手段からの上記入力筆跡のストロークの数
と、上記画数情報データベースからの上記認識対象文字
の画数に従って、上記候補文字列記憶手段に記憶された
上記認識対象文字の並べ替えを行うことを特徴とする手
書き文字認識装置。1. A stroke sequence normalizing means for normalizing a stroke sequence representing an input handwriting as an input, and a stroke sequence normalized by the normalizing means is drawn in a bit map pattern and its feature vector is drawn. And a reference vector database for storing the feature vector of the recognition target character. The feature vector generation means for generating the feature vector, the feature vector storage means for storing the feature vector generated by the generation means, and the reference pattern database for storing the feature vector of the recognition target character Comparison calculation means for calculating the similarity between the feature vector of the input stroke sequence and the feature vector of the recognition target character from the reference pattern database, and the recognition target character in descending order of similarity calculated by this comparison calculation means. In the handwritten character recognition device provided with a candidate character string storage means for storing An input handwriting stroke number storage means for storing the number of strokes and a stroke number information database for storing information on the stroke number of the recognition target character are provided, and the stroke number of the input handwriting from the stroke number storage means and the stroke number information database are provided. A handwritten character recognition device for rearranging the recognition target characters stored in the candidate character string storage means according to the number of strokes of the recognition target characters from.

【請求項２】請求項１記載の手書き文字認識装置にお
いて、上記比較計算手段で計算された類似度の高い候補に対し
てのみ、上記入力筆跡のストロークの数と上記認識対象
文字の画数に従った上記候補文字列記憶手段に記憶され
た上記認識対象文字の並べ替えを行うことを特徴とする
手書き文字認識装置。2. The handwritten character recognition device according to claim 1, wherein the number of strokes of the input handwriting and the number of strokes of the recognition target character are used only for a candidate having a high degree of similarity calculated by the comparison calculation means. A handwritten character recognition device, characterized in that the recognition target characters stored in the candidate character string storage means are rearranged.