JPH08305697A

JPH08305697A - Device and method for document preparation

Info

Publication number: JPH08305697A
Application number: JP7113042A
Authority: JP
Inventors: Toshiya Tamura; 村俊哉田; Kenji Suzuki; 木謙二鈴; Yukihiro Fukunaga; 永幸弘福
Original assignee: Toshiba Corp; Toshiba AVE Co Ltd
Current assignee: Toshiba Corp; Toshiba AVE Co Ltd
Priority date: 1995-05-11
Filing date: 1995-05-11
Publication date: 1996-11-22

Abstract

PURPOSE: To divide input data into clauses by gestures and to perform the character recognition processing by these divided clauses. CONSTITUTION: An input means 11 which inputs input data like characters or gestures, a pattern recognition means 14 which recognizes input data inputted by the input means 11 as a character string pattern, a gesture pattern, or the like, a clause division means 20 which divides the character string pattern recognized by the pattern recognition means 14 into clauses by the gesture pattern, a character recognition post-processing means 19 which performs word retrieval of the character string pattern divided by the clause division means 20 in the unit of clauses and evaluates connections of clauses, and a recognition candidate control means 21 which takes out clauses in accordance with priorities of clauses whose connections are evaluated by the character recognition post- processing means 19 are provided.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は文書作成装置および文
書作成方法に係り、特に、文字パターンをジェスチャー
パターンにより文節に分かつとともにその文節の接続評
価を行い優先度の高いものから文書の作成をするように
した文書作成装置および文書作成方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document creating apparatus and a document creating method, and more particularly, it divides a character pattern into phrases by a gesture pattern and evaluates the connection of the phrases to create a document from the highest priority. And a document creating method.

【０００２】[0002]

【従来の技術および発明が解決しようとする課題】一般
に手書の文書作成装置には文字認識後処理手段が備えら
れ、入力データを文書認識の高いものにより文書を作成
するようにしている。2. Description of the Related Art Generally, a handwritten document preparation device is provided with a character recognition post-processing means so that a document can be prepared by using input data having high document recognition.

【０００３】この文字認識後処理手段としては、１）文字認識の確信度の高い文字は１つの候補に絞り文
字を選択する方法、２）文字認識の確信度の低い文字は候補を１つに絞らず
に文字の接続情報を利用して全ての候補文字の組み合わ
せについて接続評価を行い評価の高い文字を選択する方
法、３）候補文字を選択し候補文を形成し構文的制約の高い
文字を選択する方法、４）文字認識結果の１位候補を選択し、その文字列に対
する校正支援機能を応用し誤りを評価し高い文字を選択
する方法、５）単語辞書と文法法則とを参照しながら候補文字の組
み合わせから接続度の高い文字を選択する方法、等が採用されていた。As the post-processing for character recognition, 1) a method of selecting characters narrowed down to one candidate for a character having a high certainty of character recognition, and 2) selecting one candidate for a character having a low certainty of character recognition. A method of performing connection evaluation for all combinations of candidate characters using character connection information without narrowing down and selecting a character with a high evaluation, 3) Selecting candidate characters to form candidate sentences and selecting characters with high syntactic constraints Method of selecting, 4) Method of selecting the first candidate of the character recognition result, applying the proofreading support function for the character string to evaluate the error and selecting a high character, 5) Referring to the word dictionary and grammatical rules A method of selecting a character with a high degree of connection from a combination of candidate characters was adopted.

【０００４】しかし、これらの手法では認識した候補文
字から組み合わせができる文字列の単語照合をするに
は、各文字列の候補文字数、単語の長さにより定まる回
数の照合を行わなければならないと言う面倒な問題があ
った。However, according to these methods, in order to perform word matching of character strings that can be combined from recognized candidate characters, it is necessary to perform matching times determined by the number of candidate characters in each character string and the length of the word. There was a troublesome problem.

【０００５】例えば、入力データとして“かなかんじへ
んかん”と入力した場合にはかな漢字候補には“かな・
かんじ（漢字）・へんかん（変換）”、“かな・かんじ
（感じ）・へんかん（変換）”、“かな・かん（間）・
じへん（事変）・かん（刊）”等があるため、これらの
文字列の全てについて照合しなければならない。For example, if "Kanakanjihenkan" is entered as input data, "Kana-
Kanji (Kanji) / Henkan (Conversion) ”,“ Kana / Kanji (Feeling) / Henkan (Conversion) ”,“ Kana / Kan (Between) /
Since there are "Jihen (incident)" and "Kan (published)", etc., all of these character strings must be collated.

【０００６】そこで本発明は入力される文字パターンを
ジェスチャーにより文節に分かち、その文節間の接続評
価を行つて接続度の高い候補から文書の作成をするうに
し、文字認識後処理の効率化を図った手書文書作成装置
および手書文書作成方法を提供することを目的とするも
のである。Therefore, the present invention divides the input character pattern into phrases by gestures, evaluates the connection between the phrases, and creates a document from candidates with a high degree of connection to improve the efficiency of post-character recognition processing. An object of the present invention is to provide a handwritten document creation device and a handwritten document creation method that are designed.

【０００７】[0007]

【課題を解決するための手段】本発明は、文字、ジェス
チャー等の入力データを入力する入力手段と、この入力
手段が入力した入力データを文字パターン、ジェスチャ
ーパターン等として認識するパターン認識手段と、この
パターン認識手段が認識した文字パターンをジェスチャ
ーパターンにより文節に分かつ文節分かち手段と、文節
分かち手段が分かった文字パターンを文節単位で単語検
索を行うとともにこれらの接続評価を行う文字認識後処
理手段と、この文字認識後処理手段が接続評価した文節
の優先度に応じてその文節の読み出す認識候補制御手段
とを備えたことを特徴とする文書作成装置を提供するも
のである。SUMMARY OF THE INVENTION The present invention comprises input means for inputting input data such as characters and gestures, and pattern recognition means for recognizing the input data input by the input means as character patterns, gesture patterns and the like. The character pattern recognized by this pattern recognition means is divided into bunsetsu by gesture patterns, and the bunsetsu splitting means, and the character recognition post-processing means for performing word search on the bunsetsu unit basis and performing connection search for these. An object of the present invention is to provide a document creation apparatus characterized in that the character recognition post-processing means is provided with a recognition candidate control means for reading out a phrase according to the priority of the phrase evaluated for connection.

【０００８】また、認識候補制御手段には文節をかな漢
字変換するかな漢字変換手段を備えたことを特徴とする
文書作成装置を提供するものである。Further, the present invention provides a document creation apparatus characterized in that the recognition candidate control means is provided with kana-kanji conversion means for converting a phrase into kana-kanji conversion.

【０００９】さらに、かな漢字変換手段には表示手段を
備えたことを特徴とする文書作成装置を提供するもので
ある。Furthermore, the kana-kanji conversion means is provided with a display means to provide a document preparation apparatus.

【００１０】さらにまた、文字、ジェスチャー等の入力
データを入力し、この入力した入力データを文字パター
ン、ジェスチャーパターン等として認識し、この認識し
た文字パターンをジェスチャーパターンにより文節に分
かち、この分かった文字パターンを文節単位で単語検索
を行うとともにこれらの接続評価を行い、この接続評価
した文節の優先度に応じてその読み出をすることを特徴
とする文書作成方法を提供するものである。Furthermore, input data such as characters and gestures is input, the input data input is recognized as a character pattern, a gesture pattern, etc., and the recognized character pattern is divided into phrases by the gesture pattern. (EN) A document creating method characterized in that a pattern is searched for words in units of phrases, these connections are evaluated, and the phrases are read out in accordance with the priority of the phrases evaluated by the connections.

【００１１】さらに、文節の読み出し順にかな漢字変換
することを特徴とする文書作成方法を提供するものであ
る。Further, the present invention provides a document creating method characterized by performing Kana-Kanji conversion in the reading order of clauses.

【００１２】[0012]

【作用】文字、ジェスチャー等の入力データを入力す
る。この入力した入力データを文字パターン、ジェスチ
ャーパターン等のパターンとして認識する。この認識し
た文字パターンをジェスチャーパターンによりそれぞれ
文節に分かち、この分かった文節の単語検索を行うとと
もにこれらの接続評価を行う。この接続評価した文節の
優先度に応じてその配列を行い文書の作成を行う。Function: Inputs input data such as characters and gestures. The input data thus input is recognized as a pattern such as a character pattern or a gesture pattern. Each of the recognized character patterns is divided into bunsetsu by the gesture pattern, and the word search of the found bunsetsu is performed and the connection of these is evaluated. A document is created by arranging the clauses according to the priority of the clauses evaluated for connection.

【００１３】また、接続評価した文節を優先度の高いも
のから文書を作成する。Further, a document is created from the clauses having the highest priority of the clauses evaluated for connection.

【００１４】[0014]

【実施例】以下、本発明文書作成装置および文書作成方
法の一実施例を添付図面について説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the document creating apparatus and the document creating method of the present invention will be described below with reference to the accompanying drawings.

【００１５】図１は本発明文書作成装置および文書作成
方法を実施するためのブロック線図である。FIG. 1 is a block diagram for carrying out the document creating apparatus and the document creating method of the present invention.

【００１６】この文書作成装置１０にはスタイラスペン
等を備えた入力部１１が設けられ、手書の文字、図面、
ジェスチャー等の入力データを入力するようになってい
る。この入力部１１には図示しないが座標位置検出器が
積層して設けられ、入力データの座標・時間等の入力情
報を検出するようになっている。The document creating apparatus 10 is provided with an input unit 11 including a stylus pen, etc.
Input data such as gestures is input. Although not shown, a coordinate position detector is laminated on the input unit 11 to detect input information such as coordinates and time of input data.

【００１７】この座標位置検出器には入力制御部１２が
接続され、入力情報を受け制御するようになっている。
この入力制御部１２には入力バッファ１３が接続され、
入力データが入力部１１から入力されている間中にその
入力データを格納するようになっている。An input control unit 12 is connected to the coordinate position detector so as to receive and control input information.
An input buffer 13 is connected to the input control unit 12,
The input data is stored while it is being input from the input unit 11.

【００１８】この入力制御部１２にはパターン認識部１
４が接続され、入力データを文字パターン、ジェスチャ
ーパターン等として認識処理するようになっている。ま
た、この入力制御部１２には書式制御部１５が接続さ
れ、入力データの書式を設定するようになっている。The input control section 12 includes a pattern recognition section 1
4 is connected to recognize input data as a character pattern, a gesture pattern, or the like. A format control unit 15 is connected to the input control unit 12 to set the format of input data.

【００１９】このパターン認識部１４には候補文字バッ
ファ１６、文節テーブル１７、認識用辞書１８が接続さ
れている。候補文字バッファ１６にはパターン認識部１
４が認識した文字パターンの優先度の高い候補を順に格
納するようになっている。文節テーブル１７にはパター
ン認識部１４が認識したジェスチャーパターンを格納す
るようになっている。認識用辞書１８にはパターン認識
部１４が入力データを認識するために必要な文字デー
タ、ジェスチャーデータを格納するようになっている。A candidate character buffer 16, a phrase table 17, and a recognition dictionary 18 are connected to the pattern recognition section 14. The pattern recognition unit 1 is provided in the candidate character buffer 16.
The candidates of the character patterns with high priority recognized by No. 4 are stored in order. The gesture table recognized by the pattern recognition unit 14 is stored in the phrase table 17. The recognition dictionary 18 stores character data and gesture data required for the pattern recognition unit 14 to recognize input data.

【００２０】また、このパターン認識部１４には文字認
識後処理部１９が接続され、入力データをパターン認識
した後の文節わかち処理部２０等の制御処理をするよう
になっている。この文字認識後処理部１９には文節わか
ち処理部２０、認識候補制御部２１、認識文節テーブル
２２、後処理用バッファ２３、認識候補バッファ２４、
単語辞書２５が接続されている。Further, a character recognition post-processing unit 19 is connected to the pattern recognition unit 14 so as to perform control processing of the phrase segmentation processing unit 20 etc. after pattern recognition of the input data. This character recognition post-processing unit 19 includes a phrase segment processing unit 20, a recognition candidate control unit 21, a recognition phrase table 22, a post-processing buffer 23, a recognition candidate buffer 24,
A word dictionary 25 is connected.

【００２１】文節わかち処理部２０には候補文字バッフ
ァ１６に格納した文字パターンを文節テーブル１７に格
納したジェスチャーパターンにより文節に分かつように
なっている。認識候補制御部２１は文字パターンの接続
度の高い候補を選択制御するようになっている。認識文
節テーブル２２は文節わかち処理部２０がわかった文字
パターンの番号を格納するようになっている。後処理用
バッファ２３は認識文節テーブル２２に格納した文字パ
ターンの番号を参照して候補文字バッファ１６から文字
パターンを文節毎に取り出して格納するようになってい
る。認識候補バッファ２４には各認識候補文字列の接続
評価、認識候補文字列を構成する各単語、各単語の文字
列における位置、アドレス等を格納するようになってい
る。単語辞書２５には文字認識後処理部１９が制御処理
をするのに必要な文字、ジェスチャー等のデータを格納
するようになっている。The bunsetsu-wakachi processing unit 20 divides the character pattern stored in the candidate character buffer 16 into bunsetsu by the gesture pattern stored in the bunsetsu table 17. The recognition candidate control unit 21 is configured to selectively control a candidate having a high degree of connection of character patterns. The recognition phrase table 22 is adapted to store the numbers of the character patterns found by the phrase word processor 20. The post-processing buffer 23 refers to the number of the character pattern stored in the recognition phrase table 22 and extracts the character pattern from the candidate character buffer 16 for each phrase and stores it. The recognition candidate buffer 24 stores the connection evaluation of each recognition candidate character string, each word that constitutes the recognition candidate character string, the position of each word in the character string, the address, and the like. The word dictionary 25 is configured to store data such as characters and gestures necessary for the character recognition post-processing unit 19 to perform control processing.

【００２２】文字認識後処理部１９には漢字混じりかな
漢字変換部２６が接続され、入力した文字パターンをか
な漢字変換処理をするようになっている。漢字混じりか
な漢字変換部２６には変換候補用制御部２７、変換文節
バッファ２８、候補バッファ２９、単語辞書２５が接続
されている。The character recognition post-processing unit 19 is connected to a Kana-Kanji conversion unit 26 that mixes Kanji characters, and performs Kana-Kanji conversion processing on the input character pattern. The kanji / kanji / kanji conversion unit 26 is connected to a conversion candidate control unit 27, a conversion clause buffer 28, a candidate buffer 29, and a word dictionary 25.

【００２３】変換候補用制御部２７は文字パターンの変
換候補を処理するようになっている。変換文節バッファ
２８は文字パターンの認識候補を格納するようになって
いる。候補バッファ２９は文字パターンの変換候補を格
納するようになっている。The conversion candidate control section 27 processes character pattern conversion candidates. The conversion clause buffer 28 stores character pattern recognition candidates. The candidate buffer 29 is adapted to store character pattern conversion candidates.

【００２４】漢字混じりかな漢字変換部２６および書式
用制御部１５には文書バッファ３０が接続され、表示す
べき文字パターンを所定の書式により格納するようにな
っている。この文書バッファ３０には表示制御部３１が
接続され、文字パターンを表示制御するようになってい
る。この表示制御部３１には表示用バッファ３２と表示
部３３とが接続され、表示用バッファ３２には表示すべ
きり文字パターンを一時格納し、表示部３３には文字パ
ターンを表示するようになっている。A document buffer 30 is connected to the Kanji / Kana character conversion unit 26 and the format control unit 15 to store a character pattern to be displayed in a predetermined format. A display control unit 31 is connected to the document buffer 30 to control the display of character patterns. A display buffer 32 and a display unit 33 are connected to the display control unit 31, and a character pattern to be displayed is temporarily stored in the display buffer 32, and the character pattern is displayed on the display unit 33. There is.

【００２５】このように構成した文書作成装置１０の作
用を図３のフロチャートおよび図４以下の説明図を参照
しながら説明する。The operation of the document creating apparatus 10 thus constructed will be described with reference to the flowchart of FIG. 3 and the explanatory views of FIG. 4 and subsequent figures.

【００２６】いま、入力部１１に図３に示すように手書
文字の入力データ“ていあんしたいけん”を入力する
（Ｓ１）。この入力データは“ていあんしたい”を変換
ジェスチャー“Ａ”により文節指定し、“けん”を無変
換ジェスチャー“Ｂ”により文節指定する。この入力デ
ータを座標位置検出器が検出し入力制御部１２に送る。
この入力制御部１２ではその制御により入力データを入
力バッファ１３に格納する（Ｓ２）。Now, as shown in FIG. 3, the handwritten character input data "TEENANSIKEN" is input to the input section 11 (S1). In this input data, "tei antai" is specified by the conversion gesture "A" and "ken" is specified by the non-conversion gesture "B". The coordinate position detector detects this input data and sends it to the input control unit 12.
Under this control, the input control unit 12 stores the input data in the input buffer 13 (S2).

【００２７】また、入力制御部１２では入力バッファ１
３に格納した入力データのパターンをパターン認識部１
４に送る。このパターンを受けたパターン認識部１４で
は認識用辞書１８を参考にし文字パターンであるかジェ
スチャーパターンであるかのパターン認識を行う（Ｓ
３）。In the input control unit 12, the input buffer 1
The pattern of the input data stored in 3 is recognized by the pattern recognition unit 1
Send to 4. The pattern recognition unit 14 that has received this pattern recognizes whether it is a character pattern or a gesture pattern with reference to the recognition dictionary 18 (S).
3).

【００２８】このパターンが文字パターンである場合に
は（Ｓ４）、その文字パターン番号順に領域、候補文字
等を候補文字バッファ１６に格納する（Ｓ５）。If this pattern is a character pattern (S4), the area, candidate characters, etc. are stored in the candidate character buffer 16 in the order of the character pattern numbers (S5).

【００２９】この格納は図３、図４および図５に示すよ
うに行う。すなわち、文字パターン“ていあんしたいけ
ん”を１字毎に四辺形の領域で区切り、この区切った文
字、例えば“て”を文字パターン番号１、領域（左上座
標Ｘl1、Ｙl1、右下座標Ｘr1、Ｙr1）、候補文字テーブ
ルのポインタ（０×００００）として格納する。以下同
様に“い”“あ”“ん”…の文字パターン番号、領域、
候補文字テーブルのポインタを格納する。This storage is performed as shown in FIGS. 3, 4 and 5. That is, the character pattern “teitenshitai” is separated for each character by a quadrilateral area, and the separated characters, for example, “te” are character pattern number 1, area (upper left coordinate Xl1, Yl1, lower right coordinate Xr1, Yr1), and stores it as a pointer (0x0000) of the candidate character table. Similarly, "i", "a", "n" ... character pattern number, area,
Stores the pointer of the candidate character table.

【００３０】ポインタ（０×００００）の候補文字テー
ブルには図５に示すように“て”が格納されるとともに
これに類似する“７”…等を格納する。また、ポインタ
（０×００１０）の候補文字テーブルには図６に示すよ
うに“い”を格納するとともにこれに類似する“り”…
等を格納する。以下同様にして“あ”“ん”…の候補文
字を順次格納する。As shown in FIG. 5, "te" is stored in the candidate character table of the pointer (0x0000), and "7", etc. similar to this are stored. Further, "i" is stored in the candidate character table of the pointer (0x0010) as shown in FIG.
Etc. are stored. In the same manner, candidate characters "a", "n" ... Are sequentially stored.

【００３１】また、パターンがジェスチャーパターンで
ある場合には図７に示すように番号順に領域を定め変換
ジェスチャー“Ａ”、無変換ジェスチャー“Ｂ”等を文
節テーブル１７に格納する（Ｓ６）。If the pattern is a gesture pattern, areas are defined in numerical order as shown in FIG. 7 and the conversion gesture "A", the non-conversion gesture "B", etc. are stored in the phrase table 17 (S6).

【００３２】この格納はつぎのうようにする。すなわ
ち、変換ジェスチャー“Ａ”を文節番号１としてその始
点終点位置（始点Ｘs1、Ｙs1、終点Ｘe1、Ｙe1）、ジェ
スチャーの種別“Ａ”を格納する。また、無変換ジェス
チャーを文節番号２としてその始点終点位置（始点Ｘs
2、Ｙs2、終点Ｘe2、Ｙe2）、ジェスチャーの種別
“Ｂ”を格納する。以下同様に他のジェスチャーを格納
する。This storage is performed as follows. That is, with the converted gesture "A" as the clause number 1, the start point end point position (start point Xs1, Ys1, end point Xe1, Ye1) and the gesture type "A" are stored. In addition, the unconverted gesture is set to the phrase number 2, and its start point end point position (start point Xs
2, Ys2, end point Xe2, Ye2) and the type of gesture “B” are stored. Similarly, other gestures are stored below.

【００３３】このようにして候補文字バッファ１６には
文字パターン“ていあんしたいけん”を変換ジェスチャ
ー“Ａ”の指示による文節“ていあんしたい”と無変換
ジェスチャー“Ｂ”の指示による文節“けん”…に区切
って順次格納する。In this way, the candidate character buffer 16 stores the character pattern "teananshitai" in the phrase "teansaitai" by the conversion gesture "A" and the phrase "ken" by the non-conversion gesture "B". It is divided into ... and stored sequentially.

【００３４】このようにして文字パターンを文節毎に候
補文字バッファ１６に順次格納すると、その格納情報が
パターン認識部１４から文字認識後処理部１９に送られ
る。文字認識後処理部１９がこの格納情報を受けると、
それを文節分かち処理部２０に送る。When the character patterns are sequentially stored in the candidate character buffer 16 for each phrase in this way, the stored information is sent from the pattern recognition section 14 to the character recognition post-processing section 19. When the character recognition post-processing unit 19 receives this stored information,
It is sent to the phrase segmentation processing unit 20.

【００３５】文節分かち処理部２０が格納情報を受ける
と文節テーブル１７のジェスチャー“Ａ”，“Ｂ”…を
参照して候補文字バッファ１６に格納している文字パタ
ーン番号１、２、３…を図８に示すように文節毎に読み
込み認識文節テーブル２２に格納する（Ｓ７）。この格
納は全ての文節が終了するまで行う。When the phrase segmentation processing unit 20 receives the storage information, it refers to the gestures "A", "B" ... In the phrase table 17 to retrieve the character pattern numbers 1, 2, 3 ... Stored in the candidate character buffer 16. As shown in FIG. 8, each phrase is stored in the read recognition phrase table 22 (S7). This storage is performed until all clauses are completed.

【００３６】このようにして認識文節テーブル２２に格
納した文節番号と文字パターン番号とを取り出し（Ｓ
８）、これらによる文字列候補１、２、３…ｍ…ｎを図
９に示すように後処理バッファ２３に格納する（Ｓ
９）。The phrase number and the character pattern number stored in the recognition phrase table 22 in this manner are extracted (S
8), the character string candidates 1, 2, 3, ... M ... N based on these are stored in the post-processing buffer 23 as shown in FIG. 9 (S).
9).

【００３７】この後処理バッファ２３に格納した文字列
候補１、２、３…ｍ…について認識候補用制御部２１の
制御によりその単語が単語辞書２５に格納したデータと
一致するか否かについての単語検索を行う（Ｓ１０）。With respect to the character string candidates 1, 2, 3 ... M stored in the post-processing buffer 23, whether or not the word matches the data stored in the word dictionary 25 under the control of the recognition candidate control unit 21. A word search is performed (S10).

【００３８】一致した単語が複数あるものについてはそ
の接続可能性、前後の接続可能性等について評価を行い
（Ｓ１１）、接続可能性がある文字列を認識候補バッフ
ァ２４に格納する（Ｓ１２）（Ｓ１３）。When there are a plurality of matching words, the connectability, the connectability before and after, and the like are evaluated (S11), and the connectable character string is stored in the recognition candidate buffer 24 (S12) (S12). S13).

【００３９】この認識候補バッファ２４に対する格納は
図１０に示すように文節番号をポインタにより指示し、
その指示した各ポインタを図１１に示すように候補番号
毎に評価、例えば、１００点、…、６０点、…等の点数
評価し、この結果、ポインタ０×０１００…等を図１
２、図１３に示すように単語“ていあん”を辞書アドレ
ス０×６ａ９０…等として格納する。このような格納は
未検索の候補文字列がなくなるまで行う（Ｓ１４）。Storage in the recognition candidate buffer 24 is performed by designating a clause number with a pointer as shown in FIG.
Each of the indicated pointers is evaluated for each candidate number as shown in FIG. 11, for example, a score of 100 points, ..., 60 points, etc. is evaluated, and as a result, the pointer 0 × 0100 ...
2, the word "teian" is stored as a dictionary address 0x6a90 ... Such storage is performed until there are no unsearched candidate character strings (S14).

【００４０】この格納が終わったら図１４に示すように
文字列候補１、２…の接続評価を行い高い順、すなわ
ち、１；“ていあんしたい”、２；“ていあんした
り”、３；“ていあんくたい”…等の順に配列をする。
また、選択すべき文字列候補がないときには認識候補を
文字列単位、すなわち“ていあんしたい”を“１；て、
２；７、３；１…”等のようにして選択する（Ｓ１
５）。When this storage is completed, the connection evaluation of the character string candidates 1, 2 ... Is performed as shown in FIG. 14, and the results are ranked in descending order, that is, 1; Arrange them in the order of "tetankutai" ... etc.
Also, when there is no character string candidate to be selected, the recognition candidate is set in character string units, that is, "I want to do it" is "1;
2; 7,3; 1 ... "etc. (S1
5).

【００４１】この配列された文字列候補１、２…は認識
候補用制御部２１の制御により認識候補バッファ２４に
格納し、その選択処理を行う（Ｓ１６）。この文字列候
補１に選択において候補がなければ他の文字列候補２、
３…等の選択を行う（Ｓ１７）（Ｓ１８）。The arranged character string candidates 1, 2, ... Are stored in the recognition candidate buffer 24 under the control of the recognition candidate control unit 21 and the selection processing is performed (S16). If there is no candidate for selection in this character string candidate 1, another character string candidate 2,
3 and so on are selected (S17) (S18).

【００４２】選択すべき候補があれば、つぎに、文節テ
ーブル１７を参照してジェスチャーパターン“Ａ”、
“Ｂ”…の種類の調査を行う（Ｓ１９）。If there is a candidate to be selected, then referring to the clause table 17, the gesture pattern "A",
The type "B" ... is investigated (S19).

【００４３】選択した認識候補が変換対象であれば文字
列を選択し（Ｓ２０）、（Ｓ２１）、これをを図１５に
示すように変換文節バッファ２８に格納する（Ｓ２
２）。この文節バッファ２８に格納した文字列候補に対
しては単語辞書２５からそのアドレスを取り出し漢字混
じりかな漢字変換部２６の変換処理によりかな漢字変換
を行う（Ｓ２３）（Ｓ２４）。この変換した変換候補を
候補バッファ２９に格納する（Ｓ２５）。この変換は未
処理単語がなくなるまで行う（Ｓ２６）。If the selected recognition candidate is a conversion target, a character string is selected (S20), (S21), and this is stored in the conversion clause buffer 28 as shown in FIG. 15 (S2).
2). For the character string candidates stored in the phrase buffer 28, the address is taken out from the word dictionary 25 and the kana-kanji conversion is performed by the conversion process of the kana-kanji kana-kanji conversion unit 26 (S23) (S24). The converted conversion candidates are stored in the candidate buffer 29 (S25). This conversion is performed until there are no unprocessed words (S26).

【００４４】選択した認識候補が変換対象でなければ直
接に漢字混じりかな漢字変換部２６に送り漢字混じりか
な漢字変換を行う（Ｓ２７）。この変換処理した変換候
補を候補バッファ２９に格納する（Ｓ２８）。If the selected recognition candidate is not the conversion target, it is directly sent to the kanji-mixed kana-kanji conversion unit 26 to perform kanji-mixed kana-kanji conversion (S27). The conversion candidates subjected to this conversion processing are stored in the candidate buffer 29 (S28).

【００４５】この候補バッファ２９に格納した文字列候
補が選択処理され（Ｓ２９）、この書式を所定のものに
整えて文書バッファ３０に格納する（Ｓ３０）。The character string candidates stored in the candidate buffer 29 are selected (S29), the format is adjusted to a predetermined format and stored in the document buffer 30 (S30).

【００４６】また、ジェスチャーが変換対象でなければ
書式を所定のものに整えてそのまま文書バッファ３０に
格納する（Ｓ３１）。このような処理で未処理文節があ
れば文字列パターンの番号の取り出し未処理文節がなく
なるまで行う（Ｓ３２）。If the gesture is not a conversion target, the format is adjusted to a predetermined format and stored in the document buffer 30 as it is (S31). If there is an unprocessed bunsetsu by such processing, the number of the character string pattern is taken out until there is no unprocessed bunsetsu (S32).

【００４７】未処理文節がなくなると（Ｓ３２）、表示
制御部３１、表示用バッファ３２等を介して画面表示部
３３に送り画面表示を行い文書作成処理を完了させる
（Ｓ３３）。When there are no unprocessed phrases (S32), the screen is sent to the screen display unit 33 via the display control unit 31, the display buffer 32, etc., and the screen is displayed to complete the document creation process (S33).

【００４８】１つの文節候補の作成が終了したらそれぞ
れのバッファ、テーブル等の初期化を行いつぎの文節候
補の作成を行う（Ｓ３４）。When the preparation of one phrase candidate is completed, the respective buffers, tables, etc. are initialized and the next phrase candidate is prepared (S34).

【００４９】このようにして入力文字データをジェスチ
ャーにより文節に区切り、この区切った文節を接続評価
しその優先度の高いものから文書を作成するようにした
から文字認識効率を著しく向上させることができる。In this way, the input character data is divided into clauses by gestures, the divided clauses are connected and evaluated, and a document is created from the one with the highest priority. Therefore, the character recognition efficiency can be significantly improved. .

【００５０】図１６、図１７、図１８、図１９は他の実
施例を示すものである。FIG. 16, FIG. 17, FIG. 18, and FIG. 19 show another embodiment.

【００５１】いま、入力部１１に図１６に示すように入
力データ“ていあんしたいけん”を入力する。この入力
データ“ていあん”を変換ジェスチュー“Ａ1 ”によ
り、同様に“したい”を変換ジェスチュー“Ａ2 ”によ
り、同様に“けん”を変換ジェスチューＡ3 により指定
する。Now, as shown in FIG. 16, the input data "TEEN-ANSIKAI-KEN" is input to the input section 11. This input data "teian" is designated by the conversion gesture "A1", "desired" is designated by the conversion gesture "A2", and "ken" is designated by the conversion gesture A3.

【００５２】これら入力データは図３…等に示したもの
と同様にパターン認識部１４、文字認識後処理部１９を
介して図１７に示すように変換文節バッファ２８に文字
候補として格納する。These input data are stored as character candidates in the conversion clause buffer 28 as shown in FIG. 17 via the pattern recognition section 14 and the character recognition post-processing section 19 as in the case shown in FIG.

【００５３】この変換文節バッファ２８には変換ジェス
チュー“Ａ1 ”、“Ａ2 “…等により区切られた単語文
節を始点位置、終点位置、辞書アドレスによって格納す
る。この辞書アドレスを検索し図１８に示すように漢字
混じりの身出しを取り出す。ここで第１要素“ていあ
ん”、“したい”がかな表記、第２要素“提／案”が登
録語である。この第２要素“／”には区切りが付加され
ている。In the conversion clause buffer 28, the word clauses delimited by the conversion gestures "A1", "A2" ... Are stored by the start point position, the end point position, and the dictionary address. This dictionary address is searched to take out a kanji mixed character as shown in FIG. Here, the first element “te-an” and “want to” are kana notations, and the second element “proposal / plan” is a registered word. A delimiter is added to the second element "/".

【００５４】このこの変換文節バッファ２８には格納し
た文字列を取り出し単語辞書のデータと比較し漢字混じ
りかな漢字変換部２６の変換処理により図１９に示すよ
うに簡単にかな漢字変換を行う。The character string stored in the conversion clause buffer 28 is extracted and compared with the data in the word dictionary, and the kana-kanji conversion is performed easily by the conversion processing of the kana-kanji kana-kanji conversion unit 26 as shown in FIG.

【００５５】このようにするとジェスチャー“Ａ1 ”、
“Ａ2 ”…等により入力データを文節に指定し、文字認
識後処理において単語辞書検索と単語間の接続評価を行
うので改めて辞書間の接続評価を行わなくても良い。そ
のため文字認識後処理の効率よく行うことができる。In this way, the gesture "A1",
Input data is designated as a phrase by "A2" ..., and word dictionary search and word-to-word connection evaluation are performed in character recognition post-processing, so connection evaluation between dictionaries need not be performed again. Therefore, post-character recognition processing can be efficiently performed.

【００５６】図２０、図２１、図２２はさらに他の実施
例を示すものである。20, 21, and 22 show still another embodiment.

【００５７】いま、図２０に示すように入力データ“や
までみかけたきいろのはな”を入力すると図２１に示す
ように“きいろの”の“い”が“り”と誤認識されるこ
とがある。Now, when the input data "Kironohana" which is seen as shown in FIG. 20 is inputted, the "I" of "Kirono" is mistakenly recognized as "Ri" as shown in FIG. There is.

【００５８】そこで変換ジェスチュー“Ａ1 ”により
“やまでみかけた”に、変換ジェスチュー“Ａ2 ”によ
り“きいろの”に、変換ジェスチュー“Ａ3 ”により
“はな”に文節指定して入力すると、図２２に示すよう
にな“い”を“り”と認識した“きり（霧）”、“ろ
（路）”“の”と言う接続評価が低い意味のとれない認
識候補の選択が少なくする。22. When the conversion gesture "A1" is used to specify "slightly seen", the conversion gesture "A2" is used to specify "yellow", and the conversion gesture "A3" is used to specify "hana". As shown in (3), the connection evaluations such as “cut (fog)”, “ro (road)”, and “no” that recognize “i” as “ri” are low, and the selection of meaningless recognition candidates is reduced.

【００５９】そのため“やまでみかけたきいろのはな”
を３文節指定することにより“山まで見掛けた黄色の
花”と正確に認識処理することができる。Therefore, "the yellow flower that I saw until now"
By specifying 3 clauses, it is possible to accurately recognize and process as "a yellow flower seen up to a mountain".

【００６０】図２３、図２４はさらに他の実施例を示す
ものである。23 and 24 show still another embodiment.

【００６１】いま、図２３に示すように入力データ“み
せをかいしゅうしよそおいをあらたにする”を入力する
と“しよ”が“は”と誤認識されることがある。Now, when inputting the input data "show me a new one" as shown in FIG. 23, "shiyo" may be erroneously recognized as "ha".

【００６２】そこで図２４に示すようにジェスチュー
“Ａ”により“し”と“よ”との間に文字切り出しのジ
ェスチュー“｜”を挿入し、“しよ”を“し｜よ”に文
節指定すると、“し”“よ”が正しく認識され“店を改
修し装いを新たにする”と認識処理をすることができ
る。Then, as shown in FIG. 24, a gesture "|" for character cutting is inserted between "shi" and "yo" by the gesture "A", and "shiyo" is designated as "shi | yo". Then, "shi" and "yo" are correctly recognized, and recognition processing can be performed as "refurbish the store and renew the appearance."

【００６３】[0063]

【発明の効果】本発明は、文字、ジェスチャー等の入力
データを入力する入力手段と、この入力手段が入力した
入力データを文字パターン、ジェスチャーパターン等と
して認識するパターン認識手段と、このパターン認識手
段が認識した文字パターンをジェスチャーパターンによ
り文節に分かつ文節分かち手段と、文節分かち手段が分
かった文字パターンを文節単位で単語検索を行うととも
にこれらの接続評価を行う文字認識後処理手段と、この
文字認識後処理手段が接続評価した文節の優先度に応じ
てその文節の読み出す認識候補制御手段とを備えたか
ら、入力データが文節単位で分かたれ文書が正確に作成
することができる。The present invention provides an input means for inputting input data such as characters and gestures, a pattern recognition means for recognizing the input data input by this input means as a character pattern, a gesture pattern, etc., and this pattern recognition means. A character recognition post-processing means that divides the recognized character patterns into bunsetsu using gesture patterns, and a character recognition post-processing means that performs a word search on a bunsetsu unit basis for the character patterns that the bunsetsu separation method has found and evaluates these connections. Since the post-processing means is provided with the recognition candidate control means for reading out the bunsetsu according to the priority of the bunsetsu evaluated for connection, the input data can be divided into bunsetsu units and a document can be accurately created.

【００６４】また、認識候補制御手段には文節をかな漢
字変換するかな漢字変換手段を備えたから文書の作成が
容易にできる。Further, since the recognition candidate control means is provided with kana-kanji conversion means for converting bunsetsu to kana-kanji, a document can be easily created.

【００６５】さらに、かな漢字変換手段には表示手段を
備えたから作成された文書を正確に把握することができ
る。Further, since the kana-kanji conversion means is provided with the display means, the document created can be accurately grasped.

【００６６】さらにまた、文字、ジェスチャー等の入力
データを入力し、この入力した入力データを文字パター
ン、ジェスチャーパターン等として認識し、この認識し
た文字パターンをジェスチャーパターンにより文節に分
かち、この分かった文字パターンを文節単位で単語検索
を行うとともにこれらの接続評価を行い、この接続評価
した文節の優先度に応じてその読み出をようにしたか
ら、入力データが文節単位で分かたれ文書が正確に作成
することができる。Furthermore, input data such as characters and gestures is inputted, the inputted input data is recognized as a character pattern, a gesture pattern, etc., and the recognized character pattern is divided into phrases by the gesture pattern. The pattern is searched for words in units of phrases and these connections are evaluated, and the connection is evaluated according to the priority of the phrases, so that the input data is divided in units of phrases and a document is created accurately. can do.

【００６７】さらに、文節の読み出し順にかな漢字変換
するようにしたから文書の作成が容易にできる。Further, since the kana-kanji conversion is performed in the reading order of the clauses, the document can be easily created.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明文書作成装置および文書作成方法の概要
を示すブロック図。FIG. 1 is a block diagram showing an outline of a document creating apparatus and a document creating method according to the present invention.

【図２】図１の動作を示すフローチャート図。FIG. 2 is a flowchart showing the operation of FIG.

【図３】入力データの一例を示す説明図。FIG. 3 is an explanatory diagram showing an example of input data.

【図４】候補文字バッファの格納例を示す説明図。FIG. 4 is an explanatory diagram showing a storage example of a candidate character buffer.

【図５】候補文字バッファの格納例を示す説明図。FIG. 5 is an explanatory diagram showing a storage example of a candidate character buffer.

【図６】候補文字バッファの格納例を示す説明図。FIG. 6 is an explanatory diagram showing a storage example of a candidate character buffer.

【図７】文節テーブルの格納例を示す説明図。FIG. 7 is an explanatory diagram showing a storage example of a clause table.

【図８】認識文字テーブルの格納例を示す説明図。FIG. 8 is an explanatory diagram showing a storage example of a recognized character table.

【図９】後処理バッファの格納例を示す説明図。FIG. 9 is an explanatory diagram showing a storage example of a post-processing buffer.

【図１０】認識候補文字バッファの格納例を示す説明
図。FIG. 10 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図１１】認識候補文字バッファの格納例を示す説明
図。FIG. 11 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図１２】認識候補文字バッファの格納例を示す説明
図。FIG. 12 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図１３】認識候補文字バッファの格納例を示す説明
図。FIG. 13 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図１４】認識候補文字バッファの格納例を示す説明
図。FIG. 14 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図１５】変換文節バッファの格納例を示す説明図。FIG. 15 is an explanatory diagram showing a storage example of a conversion clause buffer.

【図１６】入力データの一例を示す説明図。FIG. 16 is an explanatory diagram showing an example of input data.

【図１７】変換文節バッファの格納例を示す説明図。FIG. 17 is an explanatory diagram showing a storage example of a conversion clause buffer.

【図１８】単語辞書の格納例を示す説明図。FIG. 18 is an explanatory diagram showing a storage example of a word dictionary.

【図１９】認識候補文字バッファの格納例を示す説明
図。FIG. 19 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図２０】入力データの一例を示す説明図。FIG. 20 is an explanatory diagram showing an example of input data.

【図２１】変換文節バッファの格納例を示す説明図。FIG. 21 is an explanatory diagram showing a storage example of a conversion clause buffer.

【図２２】変換文節バッファの格納例を示す説明図。FIG. 22 is an explanatory diagram showing a storage example of a conversion clause buffer.

【図２３】認識候補文字バッファの格納例を示す説明
図。FIG. 23 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【図２４】認識候補文字バッファの格納例を示す説明
図。FIG. 24 is an explanatory diagram showing a storage example of a recognition candidate character buffer.

【符号の説明】[Explanation of symbols]

１０文書作成装置１１入力部１２入力制御部１３入力バッファ１４パターン認識部１５書式用制御部１６候補文字バッファ１７文節テーブル１８認識用辞書１９文字認識後処理部２０文節分かち処理部２１認識候補用制御部２２認識文節テーブル２３後処理用バッファ２４認識候補用バッファ２５単語辞書２６漢字混じりかな漢字変換部２７変換候補用制御部２８変換文節バッファ２９候補バッファ３０文書バッファ３１表示部 DESCRIPTION OF SYMBOLS 10 document creation device 11 input unit 12 input control unit 13 input buffer 14 pattern recognition unit 15 pattern control unit 16 candidate character buffer 17 phrase table 18 recognition dictionary 19 character recognition post-processing unit 20 phrase segmentation processing unit 21 recognition candidate control Part 22 Recognition clause table 23 Post-processing buffer 24 Recognition candidate buffer 25 Word dictionary 26 Kanji mixed kana-kanji conversion part 27 Conversion candidate control part 28 Conversion phrase buffer 29 Candidate buffer 30 Document buffer 31 Display part

───────────────────────────────────────────────────── フロントページの続き (72)発明者福永幸弘東京都港区新橋３丁目３番９号東芝エー・ブイ・イー株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Hiroyuki Fuku, Nagahiro 3-3-9 Shimbashi, Minato-ku, Tokyo Toshiba Abu E. Co., Ltd.

Claims

【特許請求の範囲】[Claims]

【請求項１】文字、ジェスチャー等の入力データを入力
する入力手段と、この入力手段が入力した入力データを文字パターン、ジ
ェスチャーパターン等として認識するパターン認識手段
と、このパターン認識手段が認識した文字パターンをジェス
チャーパターンにより文節に分かつ文節分かち手段と、文節分かち手段が分かった文字パターンを文節単位で単
語検索を行うとともにこれらの接続評価を行う文字認識
後処理手段と、この文字認識後処理手段が接続評価した文節の優先度に
応じてその文節の読み出す認識候補制御手段と、を備えたことを特徴とする文書作成装置。1. Input means for inputting input data such as characters and gestures, pattern recognition means for recognizing the input data input by this input means as a character pattern, gesture pattern, etc., and characters recognized by this pattern recognition means. The pattern segmentation means divides the pattern into gestures by means of gesture patterns, and the character recognition post-processing means for performing word search in phrase units for the character patterns found by the phrase division separation means and performing connection evaluation of these, and this character recognition post-processing means A document creation device, comprising: recognition candidate control means for reading out a phrase according to the priority of the phrase evaluated for connection.

【請求項２】認識候補制御手段には文節をかな漢字変換
するかな漢字変換手段を備えたことを特徴とする請求項
１に記載の文書作成装置。2. The document creating apparatus according to claim 1, wherein the recognition candidate control means is provided with kana-kanji conversion means for converting a phrase into kana-kanji conversion.

【請求項３】かな漢字変換手段には表示手段を備えたこ
とを特徴とする請求項１または２に記載の文書作成装
置。3. The document creating apparatus according to claim 1, wherein the kana-kanji conversion means includes a display means.

【請求項４】文字、ジェスチャー等の入力データを入力
し、この入力した入力データを文字パターン、ジェスチャー
パターン等として認識し、この認識した文字パターンをジェスチャーパターンによ
り文節に分かち、この分かった文字パターンを文節単位で単語検索を行う
とともにこれらの接続評価を行い、この接続評価した文節の優先度に応じて読み出す、ことを特徴とする文書作成方法。4. Inputting input data such as characters and gestures, recognizing the input input data as a character pattern, a gesture pattern, etc., dividing the recognized character pattern into clauses by the gesture pattern, and understanding the recognized character pattern. A document creation method characterized by performing a word search on a phrase-by-phrase basis, evaluating these connections, and reading them according to the priority of the clause for which this connection evaluation is performed.

【請求項５】文節の読み出しデータをかな漢字変換する
ことを特徴とする請求項４に記載の文書作成方法。5. The document creating method according to claim 4, wherein the read data of the phrase is converted into kana-kanji characters.