JPH0332825B2

JPH0332825B2 -

Info

Publication number: JPH0332825B2
Application number: JP58117453A
Authority: JP
Inventors: Mitsuo Matsumura
Original assignee: Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1983-06-29
Filing date: 1983-06-29
Publication date: 1991-05-14
Also published as: JPS608922A

Description

【発明の詳細な説明】〔発明の技術分野〕この発明は入力手段から入力された文章の読み
情報を漢字混り文に変換する日本語処理装置に関
する。DETAILED DESCRIPTION OF THE INVENTION [Technical Field of the Invention] The present invention relates to a Japanese language processing device that converts reading information of a sentence inputted from an input means into a sentence containing kanji.

〔発明の技術的背景とその問題点〕[Technical background of the invention and its problems]

一般に、この種の日本語処理装置では、入力手
段から入力された文章の読み情報、例えば仮名読
み情報を変換用辞書（単語辞書）を用いて漢字混
り文に変換する方式が適用されている。この方式
を適用する日本語処理装置では、文章の読み情報
を効率よく漢字混り文に変換できるように「漢字
指定」或いは「文節指定」による変換モード指定
が要求される場合が多い。「漢字指定）は文章の
読み情報のうち漢字に変換すべき単語の読み部分
を明確に指示するものである。これにより、指示
された読み部分から該当する単語辞書を引いて適
切な漢字に変換することができる。一方「文節指
定」は、漢字に変換すべき読み部分を明確に指示
する代りに、文章の読み情報を文章単位に区切つ
て入力するものである。これにより、文節分析を
行なつてこの文節中の漢字に変換すべき箇所の読
み部分を抽出し、該当する単語辞書を用いて適切
な漢字混り文に変換することができる。 Generally, this type of Japanese language processing device uses a method of converting reading information of a sentence inputted from an input means, such as kana reading information, into a sentence containing kanji using a conversion dictionary (word dictionary). . Japanese language processing devices that apply this method are often required to specify a conversion mode using ``kanji specification'' or ``phrase specification'' so that reading information of a sentence can be efficiently converted into a sentence containing kanji. "Kanji specification)" clearly indicates the pronunciation part of the word that should be converted to kanji from the reading information of the sentence.This allows the corresponding word dictionary to be looked up from the specified reading part and converted to the appropriate kanji. On the other hand, ``Bunsetsu specification'' allows reading information of a sentence to be input by dividing it into sentence units, instead of clearly indicating the reading part to be converted into kanji. Thereby, it is possible to perform a bunsetsu analysis, extract the pronunciation of the part of the clause that should be converted into kanji, and convert it into an appropriate kanji-containing sentence using the corresponding word dictionary.

このように、文章の読み情報を漢字混り文に変
換する場合、漢字に変換すべき箇所の読み情報を
用いて変換用辞書を引く必要がある。そこで従来
の日本語処理装置では、変換単位（変換処理単
位）の読み情報が入力されてから漢字に変換すべ
き読み情報に該当する単語辞書を引き、仮名漢字
変換などの変換処理が行なわれていた。言い換え
れば、変換単位の読み情報が完全に入力されるま
では変換作業は行なわれなかつた。このため入力
された読み情報に対する漢字混り文への変換速度
が遅い欠点があつた。 In this way, when converting the reading information of a sentence into a sentence containing kanji, it is necessary to look up a conversion dictionary using the reading information of the part to be converted into kanji. Therefore, in conventional Japanese language processing devices, after the reading information of a conversion unit (conversion processing unit) is input, a word dictionary corresponding to the reading information to be converted to kanji is looked up, and conversion processing such as kana-kanji conversion is performed. Ta. In other words, the conversion operation is not performed until the reading information of the conversion unit is completely input. For this reason, there was a drawback that the conversion speed of input reading information into sentences containing kanji was slow.

〔発明の目的〕[Purpose of the invention]

この発明は上記事情に鑑みてなされたものでそ
の目的は、文章の読み情報を漢字混り文に高速変
換できる日本語処理装置を提供することにある。 The present invention has been made in view of the above circumstances, and its purpose is to provide a Japanese language processing device that can quickly convert reading information of a sentence into a sentence containing kanji.

〔発明の概要〕[Summary of the invention]

この発明では、変換単位（変換処理単位）の読
み情報が入力されている途中で、当該読み情報の
一部を用いて外部記憶装置から候補となる単語辞
書をバツフアに先取りする構成としている。更に
この発明では、バツフアに先取りされた単語辞書
を上記変換単位の読み情報の入力終了時に用いる
ことにより、当該読み情報を漢字または漢字混り
文に変換する構成としている。 In this invention, while the reading information of a conversion unit (conversion processing unit) is being input, a part of the reading information is used to quickly retrieve candidate word dictionaries from an external storage device. Furthermore, the present invention is configured to convert the reading information into kanji or a sentence containing kanji by using the word dictionary prefetched in the buffer at the end of inputting the reading information of the conversion unit.

〔発明の実施例〕[Embodiments of the invention]

第１図はこの発明の一実施例に係る日本語処理
装置の構成を示す。符号１１で示される入力装
置、例えばキー入力装置には、文章の読み情報を
入力する各種文字キー、“漢字指定キー”、“文節
指定キー”、“固有名詞指定キー”および“ひらが
なキー”など（いずれも図示せず）が設けられて
いる。キー入力装置１１からのキー入力情報はキ
ー入力制御機構１２に供給される。この実施例装
置では、仮名漢字変換を必要とする文章の読み情
報をキー入力する際に、“漢字指定キー”、“文節
指定キー”、または“固有名詞指定キー”による
変換モード指定が要求される。例えば「先生は」
という文節をオペレータが入力したいものとす
る。漢字指定式の場合、オペレータは“漢字指定
キー”を押下した後、漢字「先生」の読み「せん
せい」を逐次入力し、次に“ひらがなキー”を押
してから「は」を入力する。一方、文節指定式
（分かち書き式）の場合には、オペレータは“文
節指定キー”を押下した後、文節の読み「せんせ
いは」を逐次入力する。 FIG. 1 shows the configuration of a Japanese language processing device according to an embodiment of the present invention. The input device denoted by numeral 11, for example, a key input device, includes various character keys for inputting reading information of sentences, "Kanji designation key", "Phrase designation key", "Proper noun designation key", "Hiragana key", etc. (none of which are shown). Key input information from the key input device 11 is supplied to a key input control mechanism 12 . In this example device, when key-inputting the reading information of a sentence that requires kana-kanji conversion, it is required to specify the conversion mode using a "kanji specification key," a "phrase specification key," or a "proper noun specification key." Ru. For example, "Teacher is"
Assume that the operator wants to input the following phrase. In the case of the kanji specification method, the operator presses the ``kanji specification key'', then sequentially inputs the reading ``sensei'' of the kanji ``teacher'', then presses the ``hiragana key'', and then inputs ``ha''. On the other hand, in the case of the phrase designation type (separation type), the operator presses the "phrase designation key" and then sequentially inputs the pronunciation of the phrase "Sensei wa".

今、“漢字指定キー”または“文節指定キー”
が押下された後のキー入力に対し、この実施例装
置では第２図に示すフローチヤートに従つて以下
の動作が行なわれる。キー入力制御機構１２は
“漢字指定キー”、“文節指定キー”、または“固有
名詞指定キー”が押下されたこと、すなわち仮名
漢字変換が必要となることをステツプＳ１で判定
すると、ステツプS2の処理を実行する。このス
テツプS2ではキー入力装置１１から入力される
文字（入力文字）が入力スタツク１３に積まれ
る。この実施例において、入力スタツク１３は主
メモリ（図示せず）の特定領域を用いて実現され
てい。入力制御機構１２“漢字指定キー”、“文節
指定キー”または“固有名詞指定キー”の押下を
判定し（ステツプS1）、キー入力装置１１からの
入力文字を入力スタツク１３に１文字積す（ステ
ツプS2）ごとに、ステツプＳ３〜Ｓ７を実行す
る。これらステツプS3〜S7は判定ステツプであ
り、ステツプS4〜S7は先行するステツプS3〜S6
での判定がNO判定の場合に実行される。ステツ
プS3，S4については後述する。 Now, “Kanji specification key” or “Phrase specification key”
In response to a key input after the key is pressed, the following operations are performed in this embodiment device according to the flowchart shown in FIG. When the key input control mechanism 12 determines in step S1 that the "kanji designation key", "bunsetsu designation key", or "proper noun designation key" has been pressed, that is, that kana-kanji conversion is required, it proceeds to step S2. Execute processing. In step S2, characters input from the key input device 11 (input characters) are stacked on the input stack 13. In this embodiment, input stack 13 is implemented using a specific area of main memory (not shown). The input control mechanism 12 determines whether the "Kanji designation key", "Bunsetsu designation key" or "proper noun designation key" is pressed (step S1), and stacks one character input from the key input device 11 on the input stack 13 ( Steps S3 to S7 are executed for each step S2). These steps S3-S7 are judgment steps, and steps S4-S7 are steps S3-S6 that preceded them.
Executed if the determination in is NO. Steps S3 and S4 will be described later.

ステツプS5は入力スタツク１３中に入力文字
がｍ文字、例えば２文字積されたか否かを判定す
るステツプである。ステツプS6は同じく入力ス
タツク１３中に入力文字がｎ文字、例えば３文字
積まれたか否かを判定するステツプである。ステ
ツプＳ７は例えば文節入力（仮名漢字変換処理単
位の読み情報の入力）が終了したか否かを判定す
るステツプである。このステツプS7での判定は、
“文節指定キー”、“漢字指定キー”、“改行キー”、
旬読点を示すキー、“カタカナキー”などひらが
な以外のキーが押下られたか否かによつて行なわ
れる。ステツプS7での判定がNO判定の場合に
は、再び上記ステツプS2が実行される。 Step S5 is a step for determining whether m input characters, for example, 2 characters, have been stacked in the input stack 13. Step S6 is also a step for determining whether n input characters, for example, three characters, have been stacked in the input stack 13. Step S7 is, for example, a step for determining whether phrase input (input of reading information for a unit of kana-kanji conversion processing) has been completed. The determination at step S7 is
“Bunsetsu specification key”, “Kanji specification key”, “line feed key”,
This is performed depending on whether a key other than hiragana, such as a key indicating a shun reading mark or a ``katakana key'' is pressed. If the determination at step S7 is NO, step S2 is executed again.

しかして、キー入力装置１１から「せん」まで
入力され、入力スタツク１３に文字列「せん」が
積まれたものとする。キー入力制御機構１２は、
入力文字が入力スタツク１３に２文字積まれたこ
とをステツプS5で判定すると、先読み制御機構
１４に対し、２文字の入力文字（この例では「せ
ん」）および品詞情報（“漢字指定キー”または
“文節指定キー”が押下られたこの例では一般名
詞を示す品詞情報）を付して先読み要求を発す
る。これにより先読み制御機構１４は、入力スタ
ツク１３中のｍ（＝２）文字目までを見出しとす
る単語辞書を先読みし、単語辞書バツフア１５に
格納する（ステツプS8）この先読み制御機構１
４の動作を更に具体的に説明する。 Assume that the character string "sen" is inputted from the key input device 11 and the character string "sen" is stacked on the input stack 13. The key input control mechanism 12 is
When it is determined in step S5 that two input characters have been stacked on the input stack 13, the prefetch control mechanism 14 sends two input characters ("sen" in this example) and part-of-speech information ("kanji designation key" or In this example, when the "clause specification key" is pressed, a pre-reading request is issued with part-of-speech information indicating a common noun. As a result, the prefetch control mechanism 14 prefetches the word dictionary whose headings are up to the mth (=2) characters in the input stack 13, and stores it in the word dictionary buffer 15 (step S8).
The operation of step 4 will be explained in more detail.

第３図は先読み制御機構１４の機能構成を示
す。同図において符号１０１で示される先読み要
求認識部はキー入力制御機構１２からの要求が先
読み要求、先読み訂正要求、または先読み取消し
要求のいずれであるかを判別する。この先読み要
求認識部１０１の判別結果に応じ先読み処理部１
０２、先読み訂正処理部１０３、または先読み取
消し処理部１０４のいずれかが起動される。キー
入力制御機構１２からの要求が先読み要求である
この例では、先読み処理部１０２が起動される。
先読み処理部１０２はキー入力制御機構１２から
の入力文字（「せん」）を見出し情報として、外部
記憶装置、例えばフロツピーデイスク装置１６に
格納されている単語辞書の辞書引きを以下に示す
如く実行する。 FIG. 3 shows the functional configuration of the look-ahead control mechanism 14. A prefetch request recognition unit indicated by reference numeral 101 in the figure determines whether the request from the key input control mechanism 12 is a prefetch request, a prefetch correction request, or a prefetch cancellation request. According to the determination result of this prefetch request recognition unit 101, the prefetch processing unit 1
02, either the prefetch correction processing unit 103 or the prefetch cancellation processing unit 104 is activated. In this example where the request from the key input control mechanism 12 is a prefetch request, the prefetch processing unit 102 is activated.
The prefetch processing unit 102 uses the input character ("sen") from the key input control mechanism 12 as heading information to perform a dictionary lookup of a word dictionary stored in an external storage device, for example, a floppy disk device 16, as shown below. do.

単語辞書は第４図に示すように単語辞書本体２
０１と、１段乃至３段辞書デイレクトリ（辞書索
引）２０２〜２０４を含む辞書デイレクトリ２０
５とで構成されている。１段デイレクトリ２０２
の番地は一般名詞を示す品詞情報と読み１文字目
のJISコードにより算出される。１段辞書デイレ
クトリ２０２の各種１文字の読みに対応する各番
地にはデイレクトリ情報niが格納されている。ni
は、niが「０」の場合、読み１文字の単語または
該当語なしを意味する。一方、niが「０」でない
場合、そのniは２段辞書デイレクトリ２０３への
ポインタ値を意味する。２段辞書デイレクトリ２
０３の番地は上記デイレクトリ情報niの値と、読
み２文字目のJISコードとから算出される。２段
辞書デイレクトリ２０３の各種２文字の読みに対
応する各番地にはデイレクトリ情報riが格納され
ている。riは、読み２文字分を見出しとする単語
（単語群）への単語辞書本体２０１内格納先頭ポ
インタ値、３段辞書デイレクトリ２０４へのポイ
ンタ値、または読み２文字分の該当語なしを示す
情報（riが「０」の場合）のいずれか一つの意味
を持つ。この例では、漢字２文字で語群を分割し
ても対応する語群が多い場合、その語群の中から
或る語を持つてくるのが非常に遅くなるため、語
群の多いものに関しては３段辞書デイレクトリ２
０４を設け、読み３文字でその語群を分割してい
る。３段辞書デイレクトリ２０４の番地は上記デ
イレクトリ情報riの値と、読み３文字目のJISコ
ードとから算出される。３段辞書デイレクトリ２
０４の各種３文字の読みに対応する各番地にはデ
イレクトリ情報Siが格納されている。Siは読み３
文字分を見出しとする単語（単語群）への単語辞
書本体２０１内格納先頭アドレスを示すポインタ
値または読み３文字分の該当語なしを示す情報
（Siが「０」の場合）のいずれかの意味を持つ。 The word dictionary consists of the word dictionary body 2 as shown in Figure 4.
Dictionary directory 20 including 01 and 1st to 3rd stage dictionary directories (dictionary indexes) 202 to 204.
It consists of 5. 1st stage directory 202
The address is calculated using part-of-speech information indicating the common noun and the JIS code of the first character. Directory information ni is stored at each address corresponding to the pronunciation of each character in the one-stage dictionary directory 202. ni
If ni is "0", it means a word with one character or no corresponding word. On the other hand, if ni is not "0", ni means a pointer value to the two-level dictionary directory 203. Two-stage dictionary directory 2
The address 03 is calculated from the value of the directory information ni and the JIS code of the second character. Directory information ri is stored at each address corresponding to the pronunciation of various two characters in the two-stage dictionary directory 203. ri is the value of the first pointer stored in the word dictionary main body 201 to a word (word group) with two pronunciations as a heading, the pointer value to the three-level dictionary directory 204, or information indicating that there is no corresponding word with two pronunciations. It has one of the following meanings (when ri is "0"). In this example, even if you divide a word group by two kanji characters, if there are many corresponding word groups, it will be very slow to get a certain word from that word group, so for a word group with many words, is a three-level dictionary directory 2
04, and the word group is divided into three readings. The address of the three-level dictionary directory 204 is calculated from the value of the directory information ri and the JIS code of the third character. 3-level dictionary directory 2
Directory information Si is stored at each address corresponding to the pronunciation of various three characters of 04. Si is read 3
Either a pointer value indicating the storage start address in the word dictionary main body 201 for a word (word group) whose heading is a character, or information indicating that there is no corresponding word for 3 characters (if Si is "0"). have meaning.

なお、１段乃至３段辞書デイレクトリ２０２〜
２０４は単語辞書本体２０１内の一般名詞に関す
る単語辞書部分に対する辞書索引である。単語辞
書本体２０１内に固有名詞に関する単語辞書部分
を有するこの例では、辞書デイレクトリ２０５に
は、当該固有名詞に関する単語辞書部分に対する
デイレクトリ（図示せず）も含まれている。この
デイレクトリ中の１段デイレクトリの番地は固有
名詞を示す品詞情報と読み１文字目のJISコード
により算出される。 Note that the 1st to 3rd level dictionary directories 202~
Reference numeral 204 is a dictionary index for a word dictionary portion related to common nouns in the word dictionary main body 201. In this example, where the word dictionary main body 201 includes a word dictionary section regarding proper nouns, the dictionary directory 205 also includes a directory (not shown) for the word dictionary section regarding the proper noun. The address of the first-level directory in this directory is calculated based on the part-of-speech information indicating the proper noun and the JIS code of the first character.

先読み処理部１０２は入力スタツク１３に積ま
れた２文字目までの入力文字（読み情報「せん」）
および品詞情報を用いて上述した辞書デイレクト
リ２０５を検索し、「せん」を見出しとする単語
群（単語）への単語辞書本体２０１内格納先頭ポ
インタ値を得る。次に先読み処理部１０２はこの
先頭ポインタ値を用いてフロツピーデイスク装置
１６をアクセスし、２文字「せん」を見出しとす
る単語群（単語辞書本体２０１内の該当辞書部
分）を単語辞書バツフア１５に先読みする。この
単語辞書バツフア１５は例えば主メモリ（図示せ
ず）の特定領域を用いて実現されている。この
際、先読み処理部１０２は（２文字を見出しとす
る）単語辞書の先読みが行なわれていることを示
す先読みフラグ（図示せず）をセツト（ON）す
る（ステツプS9）。また、先読み処理部１０２は
「せん」を見出しとする単語辞書本体２０１（の
該当辞書部分）の位置情報、品詞情報、および先
読みされていない「せん」を見出しとする単語辞
書本体２０１（の該当辞書部分）の位置情報を主
メモリの所定領域に格納する。なお、「せん」を
見出しとする単語辞書に先読みされない部分が生
じるのは、該当辞書部分の容量が単語辞書バツフ
ア１５の容量より大きいからである。したがつ
て、任意の文字列を見出しとする単語辞書の容量
が単語辞書バツフア１５の容量より小さければ、
先読みされていない部分の位置情報は必要なくな
る。 The look-ahead processing unit 102 reads input characters up to the second character (pronunciation information “sen”) stacked on the input stack 13.
The above-mentioned dictionary directory 205 is searched using the information and part-of-speech information to obtain the storage start pointer value in the word dictionary main body 201 to the word group (word) with "sen" as the heading. Next, the prefetch processing unit 102 accesses the floppy disk device 16 using this start pointer value, and stores the word group (the corresponding dictionary part in the word dictionary body 201) with the two characters "sen" as a heading in the word dictionary buffer 15. Read ahead. This word dictionary buffer 15 is realized using, for example, a specific area of a main memory (not shown). At this time, the prefetch processing unit 102 sets (ON) a prefetch flag (not shown) indicating that prefetching of the word dictionary (with two characters as a heading) is being performed (step S9). In addition, the prefetch processing unit 102 stores position information and part-of-speech information of the word dictionary main body 201 (the corresponding dictionary part) whose heading is "sen", and the corresponding part of the word dictionary main body 201 whose heading is "sen" which has not been prefetched. (dictionary part) is stored in a predetermined area of the main memory. It should be noted that the reason why some parts of the word dictionary with "sen" as the heading are not prefetched is because the capacity of the corresponding dictionary part is larger than the capacity of the word dictionary buffer 15. Therefore, if the capacity of a word dictionary with an arbitrary character string as a heading is smaller than the capacity of the word dictionary buffer 15,
The position information of the part that has not been prefetched is no longer necessary.

上述した単語辞書の先読み並びに先読みフラグ
のセツト処理が行なわれると、制御がキー入力制
御機構１２に戻る。キー入力制御機構１２は文節
入力が終了したか否かの判定（ステツプS7）を
行ない、この例のようにNO判定の場合には再び
前記ステツプS2が実行される。このステツプS2
でキー入力装置１１からの３文字目の入力文字
「せ」が入力スタツク１３に積まれ、ステツプS6
で入力文字が入力スタツク１３に３文字積まれた
ことが判定されたものとする。キー入力制御機構
１２は、先読み制御機構１４に対し、３文字の入
力文字（この例では「せんせ」）および品詞情報
を付して先読み要求を発する。これにより先読み
制御機構１４は、入力スタツク１３中のｎ（＝３）
文字目までを見出しとする単語辞書を先読みして
単語辞書バツフア１５に格納し（ステツプS10）、
先読みフラグをセツトする（ステツプS9）。この
ステツプS10の動作は、見出し文字が２文字から
３文字に増え（３段デイレクトリ２０４も検索さ
れ）る点を除き、ステツプS8とほぼ同様である
ので具体的な説明については省略する。 After the above-described pre-reading of the word dictionary and setting of the pre-reading flag are performed, control returns to the key input control mechanism 12. The key input control mechanism 12 determines whether or not the phrase input has been completed (step S7), and if the determination is NO as in this example, the step S2 is executed again. This step S2
In step S6, the third input character "se" from the key input device 11 is stacked on the input stack 13.
Assume that it is determined that three input characters have been stacked on the input stack 13. The key input control mechanism 12 issues a prefetch request to the prefetch control mechanism 14 with three input characters (in this example, "sensei") and part-of-speech information. As a result, the look-ahead control mechanism 14 selects n (=3) in the input stack 13.
A word dictionary with up to the first character as a heading is read in advance and stored in the word dictionary buffer 15 (step S10).
A read-ahead flag is set (step S9). The operation of step S10 is almost the same as step S8, except that the number of heading characters is increased from two to three (the three-level directory 204 is also searched), so a detailed explanation will be omitted.

このようにして「せん」および「せんせ」を見
出しとする単語辞書の先読みが行なわれ、、かつ
「せんせ」に後続する入力文字「いは」が入力ス
タツク１３に積まれた後、オペレータがひらがな
以外のキー例えば“文節指定キー”或いは“漢字
指定キー”を押下したものとする。この場合、キ
ー入力制御機構１２はステツプS7で文節入力終
了（仮名漢字変換処理単位の文字列入力の終了）
を判定する。これによりキー入力制御機構１２は
仮名漢字変換機構１７に対し仮名漢字変換要求を
発する。仮名漢字変換機構１７は入力スタツク１
３中の漢字に変換すべき入力文字列「せんせい」
と一致する単語を、単語辞書バツフア１５に先読
みされた単語辞書部分を検索することにより求
め、入力文字列「せんせいは」を漢字混り文に換
する（ステツプS11）。この例では、文節入力終
了時には「せん」および「せんせ」を見出しとす
る単語辞書部分が単語辞書バツフア１５に先読み
されているので、文節入力終了検出と同時に仮名
漢字変換処理が行なえる。したがつて、文節入力
終了検出により辞書引きを開始し、しかる後に仮
名漢字変換処理を行なう従来装置に比べ、仮名漢
字変換速度の高速化が図れる。なお、単語辞書バ
ツフア１５に先読みされていた単語辞書部分に該
当単語が存在しない場合には、先読みされていな
い部分に関する位置情報を用いてフロツピーデイ
スク装置１６をアクセスし、該当単語辞書部分を
単語辞書バツフア１５に読み込む処理が行なわれ
る。 In this way, the word dictionary with ``sen'' and ``sense'' as headings is prefetched, and after the input character ``iha'' following ``sense'' is stacked on the input stack 13, the operator inputs hiragana. It is assumed that a key other than the above is pressed, such as the "Phrase designation key" or the "Kanji character designation key". In this case, the key input control mechanism 12 ends the phrase input in step S7 (ends the character string input in units of kana-kanji conversion processing).
Determine. As a result, the key input control mechanism 12 issues a kana-kanji conversion request to the kana-kanji conversion mechanism 17. Kana-kanji conversion mechanism 17 is input stack 1
Input string "Sensei" to be converted to kanji in 3
A word matching the word is found by searching the word dictionary part read in advance by the word dictionary buffer 15, and the input character string "Sensei wa" is converted into a sentence containing kanji (step S11). In this example, since the word dictionary section with headings "sen" and "sense" is read in advance in the word dictionary buffer 15 when the phrase input ends, the kana-kanji conversion process can be performed at the same time as the phrase input end is detected. Therefore, the kana-to-kanji conversion speed can be increased compared to the conventional device which starts dictionary lookup upon detection of the end of phrase input and then performs the kana-to-kanji conversion process. Note that if the corresponding word does not exist in the word dictionary portion that has been prefetched in the word dictionary buffer 15, the floppy disk device 16 is accessed using the position information regarding the portion that has not been prefetched, and the corresponding word dictionary portion is saved as the word. A process of reading into the dictionary buffer 15 is performed.

仮名漢字変換機構１７の仮名漢字変換結果（こ
の例では「先生は」）は編集・校正機構１８に供
給される。編集・校正機構１８は、仮名漢字変換
された文字列により文書を作成する文書作成機
能、および編集・校正機能を有している。編集・
校正機構１８により新規に作成された漢字混り文
書、或いは編集・校正が施された漢字混り文書
は、表示制御機構１９の制御により表示装置２０
に表示される。また、上記文書はオペレータから
の指定に応じ、印刷制御機構によりプリンタから
印刷出力され、或いは保存機構により文書フアイ
ル（いずれも図示せず）に保存される。 The kana-kanji conversion result (in this example, "teacher wa") of the kana-kanji conversion mechanism 17 is supplied to the editing/proofreading mechanism 18. The editing/proofreading mechanism 18 has a document creation function for creating a document using a character string converted into kana/kanji, and an editing/proofreading function. edit·
A document containing kanji newly created by the proofreading mechanism 18 or a document containing kanji that has been edited and proofread is displayed on the display device 20 under the control of the display control mechanism 19.
will be displayed. Further, the above-mentioned document is printed out from the printer by the print control mechanism, or is saved in a document file (none of which is shown) by the storage mechanism, according to instructions from the operator.

ところで、文節を構成する文字列の入力過程
で、入力文字の取消し或いは訂正のキー入力操作
が行なわれることがある。この場合の動作につい
て、第２図に示したフローチヤート並びに新たに
第５，第６図に示すフローチヤートを適宜参照し
て説明する。キー入力制御機構１２はキー入力装
置１１からの入力文字を入力スタツク１３に積む
ステツプ（ステツプS2）を実行した後、入力ス
タツク１３中の入力文字（文字列）が取消された
か否かの判定を行なう（ステツプS3）。この判定
は次のようにして行なわれる。この実施例では、
表示装置２０の表面画面の一部が入力中の文字列
を表示する入力表示行として使用される。したが
つてこの入力表示行には表示制御機構１９の制御
により入力スタツク１３の内容が表示される。ま
た、文字入力中は、次の入力位置（次に入力され
る文字の表示位置）にカーソルが表示される。入
力文字を取消す場合、オペレータはカーソルを入
力表示行の先頭位置に移動し、新たな文字列の入
力指定に必要な“漢字指定キー”、“文節指定キ
ー”などの変換モード指定キー、或いは“ひらが
なキー”などのシフトキーを押下する。そこで、
この状態をステツプS3で検出することにより、
入力文字の取消しが行なわれたことが判定でき
る。 By the way, in the process of inputting character strings constituting a phrase, a key input operation for canceling or correcting input characters may be performed. The operation in this case will be explained with appropriate reference to the flowchart shown in FIG. 2 and new flowcharts shown in FIGS. 5 and 6. After executing the step (step S2) of stacking input characters from the key input device 11 on the input stack 13, the key input control mechanism 12 determines whether or not the input characters (character string) in the input stack 13 have been canceled. (Step S3). This determination is made as follows. In this example,
A part of the front screen of the display device 20 is used as an input display line for displaying the character string being input. Therefore, the contents of the input stack 13 are displayed on this input display line under the control of the display control mechanism 19. Furthermore, while inputting characters, a cursor is displayed at the next input position (display position of the next character to be input). To cancel input characters, the operator moves the cursor to the first position of the input display line and presses the conversion mode specification keys such as the “Kanji specification key” and “Phrase specification key” required to input a new character string, or “ Press the Shift key such as "Hiragana key". Therefore,
By detecting this state in step S3,
It can be determined that the input character has been canceled.

キー入力制御機構１２は、ステツプS3での判
定がYES判定の場合、先読み制御機構１４に対
し先読み取消要求を発する。これにより先読み制
御機構１４は以下に示す先読み取消し処理（ステ
ツプS12）を実行する。キー入力制御機構１２か
ら先読みに関する要求が発せられると、先読み制
御機構１４内の先読み要求認識部１０１は要求種
類の判別を行なう。しかして先読み要求認識部１
０１は先読みに関する要求が先読み取消し要求で
あることを判別すると、先読み取消し処理部１０
４を起動する。先読み取消し処理部１０４による
先読み取消し処理の具体的な手順が第５図に示さ
れている。先読み取消し処理部１０４は、まず先
読みフラグがセツト（ON）されているか否か、
すなわち取消された入力文字列に関する単語辞書
の先読みが行なわれたか否かの判定を行なう（ス
テツプS21）。このステツプS21での判定がYES判
定の場合、先読み取消し処理部１０４は単語辞書
バツフア１５をクリアし（ステツプS22）、先読
みフラグをリセツト（OFF）する（ステツプ
S23）。このステツプS23が終了すると制御はキー
入力制御機構１２に返され、第２図のフローチヤ
ートのステツプS1に戻る。これに対し、ステツ
プS21での判定がNO判定の場合、すなわち取消
された入力文字列に関する単語辞書の先読みが行
なわれていない場合、ステツプS22，S23をスキ
ツプして第２図のフローチヤートのステツプS1
に戻る。 If the determination at step S3 is YES, the key input control mechanism 12 issues a prefetch cancellation request to the prefetch control mechanism 14. As a result, the prefetch control mechanism 14 executes the prefetch cancellation process (step S12) described below. When a request for prefetching is issued from the key input control mechanism 12, the prefetching request recognition unit 101 in the prefetching control mechanism 14 determines the type of request. However, the prefetch request recognition unit 1
01, when it is determined that the request regarding prefetch is a prefetch cancellation request, the prefetch cancellation processing unit 10
Start 4. The specific procedure of the pre-read cancellation process by the pre-read cancellation processing unit 104 is shown in FIG. The prefetch cancellation processing unit 104 first determines whether or not the prefetch flag is set (ON).
That is, it is determined whether or not the word dictionary has been pre-fetched regarding the canceled input character string (step S21). If the determination in step S21 is YES, the pre-read deletion processing unit 104 clears the word dictionary buffer 15 (step S22) and resets (OFF) the pre-read flag (step S22).
S23). When this step S23 is completed, control is returned to the key input control mechanism 12, and the process returns to step S1 in the flowchart of FIG. On the other hand, if the determination in step S21 is NO, that is, if the word dictionary has not been prefetched for the canceled input character string, steps S22 and S23 are skipped and the steps in the flowchart of FIG. 2 are executed. S1
Return to

一方、ステツプS3での判定がNO判定の場合、
キー入力制御機構１２は入力スタツク１３中の入
力文字が訂正されたか否かの判定を行なう（ステ
ツプS4）。この実施例では、入力文字を訂正する
場合、オペレータは入力表示行に表示されている
入力文字列の訂正を必要とする文字位置にカーソ
ルを移動し、しかる後、訂正文字をキー入力す
る。そこで、この状態をステツプS4で検出する
ことにより、入力文字訂正が行なわれたことが判
定できる。キー入力制御機構１２は、ステツプ
S4での判定がYES判定の場合、先読み制御機構
１４に対し先読み訂正要求を発する。これにより
先読み制御機構１４は以下に示す先読み訂正処理
（ステツプS13）を実行する。 On the other hand, if the determination in step S3 is NO,
The key input control mechanism 12 determines whether the input character in the input stack 13 has been corrected (step S4). In this embodiment, when correcting an input character, the operator moves the cursor to the position of the input character string displayed on the input display line that requires correction, and then inputs the corrected character using the keypad. Therefore, by detecting this state in step S4, it can be determined that input character correction has been performed. The key input control mechanism 12
If the determination in S4 is YES, a prefetch correction request is issued to the prefetch control mechanism 14. As a result, the prefetch control mechanism 14 executes the prefetch correction process (step S13) described below.

キー入力制御機構１２から先読みに関する要求
が発せられると、先読み制御機構１４内の先読み
要求認識部１０１は要求種類の判別を行なう。し
かして先読み要求認識部１０１は先読みに関する
要求が先読み訂正要求であることを判別すると、
先読み訂正処理部１０３を起動する。先読み訂正
処理部１０３による先読み訂正処理の具体的な手
順が第６図に示されている。先読み訂正処理部１
０３は、まず先読みフラグがセツト（ON）され
ているか否か、すなわち該当入力文字列に関する
単語辞書の先読みが行なわれたか否かの判定を行
なう（ステツプS31）。このステツプS31での判定
がNO判定の場合、制御はキー入力制御機構１２
に返され、第２図のフローチヤートのステツプ
S7が実行される。これに対し、ステツプS31での
判定がYES判定の場合、先読み訂正処理部１０
３は入力スタツク１３に積まれた入力文字列のｍ
（＝２）文字目までの文字列に訂正があつたか否
かの判定を行なう（ステツプS32）。ステツプS32
での判定がYES判定の場合、先読み訂正処理部
１０３は（第２図のフローチヤートのステツプ
S8と同様に）入力スタツク１３中のｍ（＝２）文
字目までの文字列を見出しとする単語辞書を単語
辞書バツフア１５に格納する（ステツプS33）。
すなわち、ステツプS32での判定がYES判定の場
合、先読み訂正処理部１０３は先読み時の見出し
情報となつたｍ（＝２）文字の見出しが訂正され
たものと判断し、先読みされていたｍ（＝２）を
文字を見出しとする辞書部分を訂正後の見出しに
対応する辞書部分に変更する。 When a request for prefetching is issued from the key input control mechanism 12, the prefetching request recognition unit 101 in the prefetching control mechanism 14 determines the type of request. When the prefetch request recognition unit 101 determines that the prefetch-related request is a prefetch correction request,
The prefetch correction processing unit 103 is activated. The specific procedure of the prefetch correction processing by the prefetch correction processing unit 103 is shown in FIG. Prefetch correction processing unit 1
03, first, it is determined whether or not the prefetch flag is set (ON), that is, whether or not the word dictionary has been prefetched for the corresponding input character string (step S31). If the determination in step S31 is NO, control is performed by the key input control mechanism 12.
and the steps in the flowchart in Figure 2.
S7 is executed. On the other hand, if the determination in step S31 is YES, the look-ahead correction processing unit 10
3 is m of the input character string stacked on the input stack 13
(=2) It is determined whether or not the character string up to the 2nd character has been corrected (step S32). Step S32
If the determination is YES, the look-ahead correction processing unit 103 (steps in the flowchart in FIG.
(Similar to S8) A word dictionary whose headings are character strings up to m (=2) characters in the input stack 13 is stored in the word dictionary buffer 15 (step S33).
That is, if the determination in step S32 is YES, the prefetch correction processing unit 103 determines that the m (= 2) character heading that became the headline information at the time of prefetching has been corrected, and the prefetched m( =2) changes the dictionary portion whose heading is a character to the dictionary portion corresponding to the corrected heading.

次に先読み訂正処理部１０３は入力スタツク１
３中の入力文字がｎ（＝３）文字以上であるか否
かの判定を行なう（ステツプS34）。ステツプS34
での判定がYES判定の場合、先読み訂正処理部
１０３は（第２図のフローチヤートのステツプ
S10と同様に）入力スタツク１３中のｎ（＝３）
文字目までの文字列を見出しとする単語辞書を単
語辞書バツフア１５に格納する（ステツプS35）。
すなわちステツプS34での判定がYES判定の場
合、先読み訂正処理部１０３は先読み時の見出し
情報となつたｎ（＝３）文字の見出しが訂正され
たものと判断し、先読みされていたｎ（＝３）文
字を見出しとする辞書部分を訂正後の見出しに対
応する辞書部分に変更する。一方、ステツプS34
での判定がNO判定の場合には、ステツプS31で
の判定がNO判定の場合と同様に、第２図のフロ
ーチヤートのステツプS7が実行される。 Next, the look-ahead correction processing unit 103 inputs the input stack 1.
It is determined whether the number of input characters in 3 is greater than or equal to n (=3) characters (step S34). Step S34
If the determination is YES, the look-ahead correction processing unit 103 (steps in the flowchart in FIG.
n (=3) in input stack 13 (same as S10)
A word dictionary whose headings are the character strings up to the first character is stored in the word dictionary buffer 15 (step S35).
In other words, if the determination in step S34 is YES, the prefetch correction processing unit 103 determines that the heading of n (= 3) characters that became the heading information at the time of prefetching has been corrected, and the prefetched n (= 3) Change the dictionary portion that uses characters as headings to the dictionary portion that corresponds to the corrected headings. Meanwhile, step S34
If the determination at step S31 is NO, step S7 of the flowchart of FIG. 2 is executed, as in the case where the determination at step S31 is NO.

上記ステツプS32での判定がNO判定の場合、
先読み訂正処理部１０３は入力スタツク１３に積
まれた入力文字列のｎ（＝３）文字目までの文字
列（ｍ＋１文字目からｎ文字目までの文字列、ｍ
＝２，ｎ＝３のこの例では３文字目）に訂正があ
つたか否かの判定を行なう（ステツプS36）。ス
テツプS36での判定がYES判定の場合、先読み訂
正処理部１０３は前記ステツプS35を実行する。
すなわちステツプS36での判定がYES判定の場
合、先読み訂正処理部１０３は先読み時の見出し
情報となつたｎ（＝３）文字の見出し（この例で
は３文字の見出し中の３文字目の文字）が訂正さ
れたものと判断し、先読みされていたｎ（＝３）
文字を見出しとする辞書部を訂正後の見出しに対
応する辞書部分に変更する。 If the determination in step S32 above is NO,
The look-ahead correction processing unit 103 processes the character string up to the n (=3) character of the input character string stacked on the input stack 13 (the character string from the m+1 character to the n character, m
In this example, the third character (=2, n=3) is determined to be corrected (step S36). If the determination in step S36 is YES, the prefetch correction processing unit 103 executes step S35.
In other words, if the determination in step S36 is YES, the prefetch correction processing unit 103 uses the n (= 3) character heading (in this example, the third character in the three character heading) that became the headline information at the time of prefetching. was judged to have been corrected and read ahead n (=3)
The dictionary section with characters as headings is changed to the dictionary section corresponding to the corrected heading.

以上が先読み取消し処理および先読み訂正処理
である。これらの処理により、文字取消し或いは
訂正のキー入力操作が行なわれても単語辞書の先
読みが仮名漢字変換結果に悪影響を及ぼすことは
ない。 The above is the prefetch erasure process and the prefetch correction process. Through these processes, even if a key input operation for canceling or correcting a character is performed, the pre-reading of the word dictionary will not adversely affect the kana-kanji conversion result.

なお、キー入力制御機構１２、先読み制御機構
１４、仮名漢字変換機構１７、編集・校正機構１
８および表示制御機構１９はマイクロプロセツサ
に制御機能によつて実現されている。また、前記
実施例では仮名漢字変換方式の日本語処理装置に
ついて説明したが、この発明は他の変換方式、例
えばローマ字漢字変換方式の日本語処理装置など
にも実施できる。更に、この発明は、キー入力に
限らず例えば音声入力の日本語処理装置にも適用
できる。 In addition, a key input control mechanism 12, a read-ahead control mechanism 14, a kana-kanji conversion mechanism 17, an editing/proofreading mechanism 1
8 and display control mechanism 19 are realized by the control functions of a microprocessor. Further, in the embodiment described above, a Japanese language processing device using a Kana-Kanji conversion method has been described, but the present invention can also be implemented in a Japanese processing device using other conversion methods, such as a Japanese language processing device using a Romaji-Kanji conversion method. Furthermore, the present invention is applicable not only to key input but also to, for example, a voice input Japanese language processing device.

〔発明の効果〕〔Effect of the invention〕

以上詳述したようにこの発明によれば、文章の
読み情報を漢字混り文に高速変換できる。 As detailed above, according to the present invention, reading information of a sentence can be converted into a sentence containing kanji at high speed.

【図面の簡単な説明】[Brief explanation of drawings]

第１図はこの発明の一実施例に係る日本語処理
装置の構成を示すブロツク図、第２図は先読み処
理に関する動作を説明するためのフローチヤー
ト、第３図は第１図に示す先読み制御機構の機能
構成を示すブロツク図、第４図は単語辞書の概略
構成を示す模式図、第５図は先読み取消し処理手
順を示すフローチヤート、第６図は先読み訂正処
理手順を示すフローチヤートである。１１……キー入力装置、１２……キー入力制御
機構、１３……入力スタツク、１４……先読み制
御機構、１５……単語辞書バツフア、１６……フ
ロツピーデイスク装置（外部記憶装置）、１７…
…仮名漢字変換機構、２０……表示装置、２０１
……単語辞書本体、２０５……辞書デイレクト
リ。 FIG. 1 is a block diagram showing the configuration of a Japanese language processing device according to an embodiment of the present invention, FIG. 2 is a flowchart for explaining operations related to prefetch processing, and FIG. 3 is a prefetch control shown in FIG. 1. FIG. 4 is a block diagram showing the functional configuration of the mechanism, FIG. 4 is a schematic diagram showing the general structure of the word dictionary, FIG. 5 is a flowchart showing the pre-read erasure processing procedure, and FIG. 6 is a flow chart showing the pre-read correction processing procedure. . 11... Key input device, 12... Key input control mechanism, 13... Input stack, 14... Prefetch control mechanism, 15... Word dictionary buffer, 16... Floppy disk device (external storage device), 17...
...Kana-kanji conversion mechanism, 20...Display device, 201
...Word dictionary body, 205...Dictionary directory.

Claims

【特許請求の範囲】１入力手段から入力された文章の読み情報を漢
字混り文に変換する日本語処理装置において、各種単語がその読みの先頭からの何文字かを見
出しとして格納された単語辞書と、上記入力手段から変換単位の読み情報が入力さ
れている途中で、当該読み情報の先頭からｎ文字
目までを見出しとして用いて上記単語辞書を参照
し、同見出しを持つ単語の群を先読みする動作
を、予め定められた幾つかのｎの値について実行
する先読み手段と、この先読み手段によつて先読みされた単語の群
を格納するためのバツフアと、上記入力手段からの上記変換単位の読み情報の
入力終了時に、当該読み情報を上記バツフアに格
納された上記単語の群をもとに対応する漢字また
は漢字混り文に変換する変換手段と、を具備することを特徴とする日本語処理装置。[Scope of Claims] 1. In a Japanese language processing device that converts reading information of a sentence inputted from an input means into a sentence containing kanji, a word in which various words are stored with several characters from the beginning of the reading as headings. While the reading information of the conversion unit is being input from the dictionary and the input means, the word dictionary is referred to using the n-th character from the beginning of the reading information as a heading, and a group of words with the same heading is searched. a prefetching means for performing a prefetching operation for several predetermined values of n; a buffer for storing a group of words prefetched by the prefetching means; and the conversion unit from the input means. and converting means for converting the reading information into a corresponding kanji or a sentence containing kanji based on the group of words stored in the buffer upon completion of inputting the reading information. Word processing device.