JP2798931B2

JP2798931B2 - Chinese phonetic delimiter and phonetic kanji conversion

Info

Publication number: JP2798931B2
Application number: JP63105030A
Authority: JP
Inventors: 健楠井
Original assignee: 健楠井
Priority date: 1988-04-26
Filing date: 1988-04-26
Publication date: 1998-09-17
Anticipated expiration: 2013-09-17
Also published as: JPH01274272A

Description

【発明の詳細な説明】 3.1 産業上の利用分野本発明は、中国語ワードプロセッサに関する。更に詳
しくは、本発明は、中国語の音節の連なりで表わされた
語音の列を入力し、該語音列を語音ごとに区切る中国語
語音区切方式、およびその後音区切方式で区切られた語
音を漢字に変換する中国語語音漢字変換方式に関する。DETAILED DESCRIPTION OF THE INVENTION 3.1 Industrial Field of the Invention The present invention relates to a Chinese word processor. More specifically, the present invention relates to a method of inputting a sequence of speech sounds represented by a series of Chinese syllables and separating the speech sequence by speech, and a speech speech segmented by a speech segmentation method thereafter. The present invention relates to a Chinese-language sound-kanji conversion method for converting Chinese characters into Chinese characters.

3.2 従来の技術 3.2.1 日本語ワードプロセッサにおける仮名漢字変換
の主流な最長一致法による文の形態素分析中国語の文（以下、単に「中文」と略記する）を扱う
ワードプロセッサでは、ローマ字入力鍵盤の一つとして
中文Ｍ式鍵盤（日本電気（株）製）が知られている。中
国漢字音を中文Ｍ式鍵盤から入力するのは、日本語文
（日本）のなかの漢字音を日本語Ｍ式鍵盤から入力する
のと同じように、たいへん易しく覚えやすく、入力速度
も速い。3.2 Conventional Technology 3.2.1 Morphological Analysis of Sentences by Mainstream Longest Matching Method of Kana-Kanji Conversion in Japanese Word Processors A word processor that handles Chinese sentences (hereinafter simply abbreviated as “Chinese sentence”) uses a Roman character input keyboard. One known type is a Chinese M-type keyboard (manufactured by NEC Corporation). The input of Chinese kanji sounds from the Chinese M-type keyboard is very easy and easy to learn and the input speed is the same as the input of kanji sounds in Japanese sentences (Japan) from the Japanese M-type keyboard.

現在活躍している各社の日本語ワードプロセッサは、
仮名入力漢字変換方式が主流である。仮名漢字変換の単
位は、連文節あるいは文一括であり、変換の手法は、
文節を単位として、最長一致法により自立語と付属語と
を切りはなし、自立語と付属語の音の間の妥当と思わ
れる接続関係を参照しつつ辞書データを引いて、自立
語を漢字に、付属語を仮名に変換し、最も可能性の高
い漢字仮名文を発生させるものである。これを形態素分
析法という。Currently active Japanese word processors are:
The kana input kanji conversion method is the mainstream. The unit of kana-kanji conversion is a continuous phrase or sentence batch, and the conversion method is
Separate the independent words and adjuncts by the longest match method using the phrase as a unit, and refer to the dictionary data while referring to the connection that seems to be appropriate between the sounds of the independent words and adjunct words, and convert the independent words to kanji. , A supplementary word is converted to a kana, and a kanji kana sentence with the highest possibility is generated. This is called a morphological analysis method.

前述の中文Ｍ式鍵盤を開発する以前から考案されてい
た日本語用のＭ式入力法がある（特開昭56−149631）。
このＭ式入力法においては、入力の時点で漢字と仮名が
自動的に分離されるから、の処理が非常に簡単にな
り、このことがＭ式入力法における高いかな漢字変換の
成功率の大きな原因となっている。There is an M-expression input method for Japanese language which was invented before the development of the above-described Chinese-style M-type keyboard (Japanese Patent Application Laid-Open No. 56-149631).
In the M-expression input method, the processing is very easy because the kanji and kana are automatically separated at the time of input, which is a major cause of the high success rate of the kana-kanji conversion in the M-expression input method. It has become.

3.2.2 日本語と中国語のちがい中文入力においても、このような日本語ワードプロセ
ッサ流のローマ字漢字変換がうまく成立するだろうと、
日本人ならば誰でも考える。3.2.2 Differences between Japanese and Chinese Even in Chinese input, such a Roman word-to-kanji conversion like the one in a Japanese word processor would be successful.
Any Japanese think.

ところが、中文は日文とは構造がおおいに異なってお
り、日文ワードプロセッサにおける鋭利な武器である文
節の概念が存在せず、そのなかで自立語と付属語とを分
離することができない。However, the structure of the Chinese sentence differs greatly from that of the Japanese language, and there is no concept of a clause, which is a sharp weapon in the Japanese language word processor, and independence words and adjuncts cannot be separated.

（１）中国語は付属語がすくない第２図に中文の例とこの中文に対応する日文を示す。(1) Chinese has few supplementary words Fig. 2 shows an example of a Chinese sentence and a Japanese sentence corresponding to the Chinese sentence.

図においてを付して示す中文の語は付属語、日文においてを付して示す語は中文の付属語に対応する付属語、＿を
付して示す語は日文特有の付属語、無印は中文・日文と
も自立語である。これらの例から、極度に漢字を多用し
た日文において残った仮名のところが付属語であり中文
にはもともと付属語が少ないことが明瞭である。In the figure The words in Chinese shown with a Words attached with are attached words corresponding to attached words in Chinese, words attached with _ are attached words specific to Japanese, and unmarked are independent words in both Chinese and Japanese. From these examples, it is clear that the kana that remained in the Japanese sentence where the kanji was heavily used was the adjunct, and the Chinese sentence originally had few adjuncts.

（２）中国語には「文節の」概念がない次に第２図の（ｂ）の日文について文節区切りをする
と第３図（ａ）に示すようになる。ところが第２図
（ａ）の中文を、第３図（ａ）の日文の文節に似せて文
字のリズムを切ってみると第３図（ｂ）の如くになる。
第３図（ａ），（ｂ）から中文と日文とでは言語および
文の構造がずいぶんと違うことが、いっそうよく分か
る。(2) There is no concept of “bunsetsu” in Chinese Next, if the Japanese text shown in FIG. 2 (b) is segmented, the result is as shown in FIG. 3 (a). However, when the middle sentence of FIG. 2 (a) is resembled to the phrase of the Japanese sentence of FIG. 3 (a) and the rhythm of the characters is cut, the result becomes as shown in FIG. 3 (b).
3 (a) and 3 (b), it can be better understood that the language and the sentence structure are significantly different between the Chinese sentence and the Japanese sentence.

（３）中国語は一語の長さが短い。日本語の文では一語
の長さが長いさらに第２図（ａ）の中文を「語」に区切ってみると
第４図の如くに区切れる。(3) One word is short in Chinese. In the Japanese sentence, the length of one word is long. Further, when the middle sentence of FIG. 2 (a) is divided into "words", it is separated as shown in FIG.

中国語では１漢字をつねに１音節に読むことを考慮す
ると、中文では「語」の音節の長さは大抵１か２音節と
いう短いものであることがわかる。いっぽう日文では第
５図に示すように自立語・付属語とも、助詞以外には１
音節の語は希である。但し、図において仮名一字が１音
節またはカナ文字の１キーを示す。Considering that one kanji is always read in one syllable in Chinese, it is understood that the syllable length of a "word" in Chinese is usually as short as one or two syllables. On the other hand, in Japanese, as shown in Fig. 5, both independence and adjuncts are 1 other than particles.
Syllable words are rare. However, in the figure, one kana character indicates one syllable or one key of kana characters.

これに対し中文では第６図に示すようになる。但し、
カタカナ読みは、いちおうの近似音である。中国式ロー
マ字は、大文字が子音、それに続く小文字が母音で、そ
れらひと組で１音節となる。Ｍ式中文入力法では、子音
は右１打、母音は左１打、左右の２打で１音節を打つ。On the other hand, in Chinese, it is as shown in FIG. However,
Katakana reading is an approximate sound of Ichiou. In Chinese-style Roman alphabet, uppercase letters are consonants, followed by lowercase letters are vowels. In the M-type Chinese sentence input method, a consonant strikes one syllable with one right stroke, a vowel with one left strike, and two left and right strokes.

第５図、第６図の文例において、数字のところを除
き、日文は50音節、中文は23音節である。中文は日文に
くらべて、いかに簡潔であるかが分かる。この差は中文
にカナがなく日文にカナが割り増しされているところか
ら来ているのは明瞭である。In the example sentences in FIGS. 5 and 6, except for the numerals, the Japanese sentence has 50 syllables and the Chinese sentence has 23 syllables. You can see how simple Chinese is compared to Japanese. It is clear that this difference comes from the fact that the Chinese has no kana and the Japanese has the extra kana.

（４）中国語は言語学上の「孤立語」、日本語は「膠着
語」、最長一致法は膠着語向き言語学の分類によれば、中国語は「孤立語」の典型で
あって、各品詞は一切の語尾変化をせず、語と語の関係
を示す助詞に乏しく、文の意味は、もっぱら語順できま
る。(4) Chinese is a linguistic "isolated word", Japanese is a "sticky word", and the longest match method is a sticky word. According to the classification of linguistics, Chinese is a typical of "isolated word". Each part-of-speech does not change its inflection at all, has few particles indicating the relationship between words, and the meaning of a sentence can be determined solely in word order.

いっぽう日本語は「膠着語」の一種であって、語と語
の関係は自立語の後に助詞や接尾要素を次々に連ねてい
って表現する。日本語における「文節」は、大体「自立
語＋接尾要素」に相当するものである。日本語ワードプ
ロセッサの開発途上で、文節を単位とするカナ漢字変換
法が基本的発想となり、ひとつの文節のなかで自立語を
分離するのに「最長一致法」が有力な武器として使われ
ているのは必然であろう。On the other hand, Japanese is a kind of "sticky word", and the relationship between words is expressed by a series of particles and suffix elements after the independent words. A “bunsetsu” in Japanese is roughly equivalent to “independent word + suffix element”. During the development of a Japanese word processor, the kana-kanji conversion method using phrases as a unit became the basic idea, and the "longest match method" was used as a powerful weapon to separate independent words in a single phrase. It will be inevitable.

「こきゅうは」に対して、先頭から辞書を引いてい
き、「語」が辞書にあるかないかを見ていく。「こ」
（子、粉、濃…）、「こき」（濃き、扱き、古希…）、
「こきゅ」（ナシ）、「こきゅう」（呼吸、故宮…）、
「こきゅうは」（ナシ）。ここでいちばん長い語は「こ
きゅう」であり、残った「は」は助詞の「は」と一致す
る。ゆえに「こきょうは」は多分「呼吸は」または「故
宮は」であろう。日本語の例文を、文節を単位として、
このような発想の「最長一致法」によってカナ漢字変換
をすれば、呼吸と故宮という同音語の誤り以外は、見事
に正しい第７図の文が出てくる。The dictionary is searched from the beginning for "Kyokuha" to see if "words" are in the dictionary. "This"
(Child, powder, dark ...), "Koki" (dark, treated, old ...),
"Koki" (pear), "Koki" (breathing, Forbidden City ...),
"Kokyuha" (pear). Here, the longest word is "kokyu", and the remaining "ha" matches the particle "ha". Therefore, "kokyoha" may be "breathing" or "forbidden city". Example sentences in Japanese
If Kana-Kanji conversion is performed by the “longest matching method” based on such an idea, the sentence shown in FIG.

日本語ワードプロセッサにおける同音語の問題は、主
として「音読み」の漢字熟語の部分にあり、それらはほ
とんど名詞の自立語である。動詞や形容詞などの述語性
品詞の語幹は、たいてい「訓読み」であって、１音節の
もの以外では同音語が少ない。助詞や述語性品詞の接尾
要素（変化語尾）についても同音語はあまりない。ゆえ
に日本語文の漢字カナ表現に対して、「文節区切り」と
「最長一致法」はカナ漢字変換において最も適した処理
の方法であるといえる。The problem of homonyms in Japanese word processors is mainly in the kanji idiom part of "onomi", which are almost independent words of nouns. The stem of predicative parts of speech, such as verbs and adjectives, is usually "Kunyomi", and there are few homonyms except for one syllable. There are not many homophones in the suffix elements (variable endings) of particles and predicative parts of speech. Therefore, it can be said that the phrase separation and the longest match method are the most suitable processing methods in the kana-kanji conversion for the kanji-kana representation of the Japanese sentence.

3.3 発明が解決しようとする課題中文のローマ字（読み）漢字変換において、以上に述
べたような日本語におけるカナ漢字変換の方法論は、ど
の程度に有効に適用できるかを次に検討する。3.3 Problems to be Solved by the Invention In the conversion of Chinese (Roman) to Kanji (kanji) in Chinese, we next examine how effectively the above-described method of Kana-Kanji conversion in Japanese can be applied.

（１）文節とは異なる概念−中文における階層分析法と
文の構成要素− 階層分析法とは、文の各構成要素のあいだの階層関係
を表現するtree（木）を求め、文の構造から意味をあき
らかにする方法である。第８図（ａ）の例文においてＶ
印は「停頓（ポーズ）」でゆっくりと解説的に朗読する
時、ここで小休止をおいてもよい箇所である。ポーズを
最大限に置いた時、例文は自然に細分されて第８図
（ｂ）の如くなる。第８図（ｂ）のような区切り方が、
中文における最も自然な区切り方であり、中国語の言葉
の流れに沿った区切り方あるから、中国語ワードプロセ
ッサでは「ポーズ区切り」におうじて「変換キー」を入
れれば、変換処理システムの設計上はたいへん都合がよ
い。ただし、中国語を中国人一般が読む場合に、例文の
Ｖの箇所でかならず「変換キー」を入れるように強制す
る事は無理がある。日本語では「故宮はネ」、「一四〇
六年からネ」のように、「ネ」をつけて区切れるのが
「文節」であるから、文節の概念は誰にもたいへん明快
である。ところが中国語における「ポーズ区切り」は日
本語における文節よりも不確かな存在で、「ネ」に匹敵
する万能で便利な言葉は中国語にはない。ローマ字漢字
変換方式の中文ワードプロセッサは、オペレータがポー
ズを無視して句読点から次の句読点まで一挙に入力した
ようなときでも、自動的に文を語に区切りつつ、ローマ
字の読みを順々に漢字に変換していく能力を持たなけれ
ばならない。(1) Concepts different from phrases-Hierarchical analysis in Chinese sentences and the components of sentences-Hierarchical analysis is to find a tree that expresses the hierarchical relationship between the components of a sentence, It's a way to make sense. In the example sentence of FIG.
The mark is a place where you can take a short break here when reading slowly and commentatively with "pause". When the pose is set to the maximum, the example sentence is naturally subdivided as shown in FIG. 8 (b). The way of dividing as shown in FIG.
Because it is the most natural way of dividing in Chinese, and there is a way of dividing along the flow of Chinese words, in a Chinese word processor, if you insert a "conversion key" in the "pause delimiter", the conversion processing system design Very convenient. However, when Chinese is read by Chinese in general, it is impossible to force the user to insert a “conversion key” at the V in the example sentence. In Japanese, "bunsetsu" is separated by "ne" like "forbidden city" or "from 1146", so the concept of the phrase is very clear to anyone. . However, the "pause break" in Chinese is more uncertain than the phrase in Japanese, and there is no versatile and convenient language in Chinese equivalent to "ne". Even when the operator ignores pauses and enters from the punctuation mark to the next punctuation mark at a stroke, the Chinese sentence word processor that converts Roman characters into kanji characters automatically converts Roman characters into kanji in order. You must have the ability to convert.

（２）中国語における「語」の観念とその不確定性中国語は文法上では「孤立語」であり、語音上では歴
史的に「単音節語」である。単音節語というのは、基本
的には１語が１音節で表わされる言語であ。現在中国語
は社会の近代化に伴って２音節の語彙が急速に増えつつ
あるが、最も頻繁に使われる基本的な語には依然として
単音節語が多い。例文における従、年、到、有、多、
的、了などがそうである。（これらは日本語ではそれぞ
れより、年、まで、ある、あまり、の、だなどのよう
に、ほとんどがカナで書かれる機能語または付属語のた
ぐいである。）日本語のカナ音節の数は102個、中国の標準語の音節
の数は411個であり、日本語の４倍はあるものの、180万
部（音節）の統計によれば、中文における単音節語の率
は約48％である。なお２音節語は約50％、３・４音節語
は残りの２％で、５以上の音声をもつ語はゼロと考えて
よい。このように中国語には、単音節語の頻度がきわめ
て多い。このことが、中文のローマ字漢字変換におい
て、日本語処理にならって「最長一致法」をとったと
き、正変換率が高くならない大きな原因となっている。
いちばん困るのは、中国語に最長一致法を適用すると
「区切り違い」が頻発することである。(2) The idea of "word" in Chinese and its uncertainty Chinese is an "isolated word" in grammar, and historically a "monosyllable word" in speech. A monosyllabic word is basically a language in which one word is represented by one syllable. At present, the vocabulary of two syllables in Chinese is rapidly increasing with the modernization of society, but the most frequently used basic words are still many monosyllable words. Dependent, year, reaching, having, many,
This is the case with the target and the end. (In Japanese, these are function words or adjuncts, most of which are written in kana, such as year, year, some, too much, no, etc.) The number of Japanese kana syllables is According to statistics of 1.8 million copies (syllables), the rate of monosyllable words in Chinese is about 48%, although the number of syllables in Chinese is 102, and the number of syllables in Chinese is 411. is there. The two-syllable word is about 50%, the 3.4-syllable word is the remaining 2%, and the word having five or more voices can be considered as zero. As described above, the frequency of monosyllable words is extremely high in Chinese. This is a major reason why the conversion of Chinese characters into Roman characters and kanji does not increase the normal conversion rate when the "longest match method" is adopted following the Japanese processing.
The worst thing is that applying the longest match method to Chinese often causes "misplacement".

第９図（ａ）の例文について最長一致法による漢字変
換を試みると第９図（ｂ）の結果が得られる。変換の単
位は句読点間一括変換である。ただし数字は数字キーか
ら直接に入力する。If the kanji conversion of the example sentence of FIG. 9A is performed by the longest match method, the result of FIG. 9B is obtained. The unit of conversion is batch conversion between punctuation marks. However, numbers are entered directly from the numeric keys.

変換を誤まった語にを付して示したが、このように最長一致法では正解率65
％という低さである。原因は下に示すように「同音語の
絡み合いによる区切り違い」である。第10図では太線と
網掛けの字が最長一致法による漢字変換の経路を示し、
×印は誤った場所を示す。到現在己経有を導線在意経由
のように連続して間違ったのは、最長一致法が到を取ら
ずに、より長い導線を取ったからで、いわゆる「チョッ
キの最初のボタンをかけ違った」類の誤りである。典型
的な「区切りミス」である。Incorrectly converted words As shown in the figure, the longest match method has a correct answer rate of 65
%. The cause is, as shown below, "different separation due to intertwining of homophones". In FIG. 10, the bold line and the shaded characters show the path of kanji conversion by the longest match method,
Crosses indicate incorrect locations. The reason why the self-existence has been mistaken continuously as if it was a conductor intention is that the longest match method did not arrive and the longer conductor was taken, so-called `` I misplaced the first button of the vest '' It is a kind of mistake. This is a typical “separation error”.

以上に示したように日本語のカナ漢字変換に有効な最
長一致法の手法は、中国語に適用すると必らずしも有効
でない。そこで、本発明の目的は、中国語に適した音標
文字漢字変換方式を提供することにある。As described above, the longest matching method that is effective for Japanese Kana-Kanji conversion is not necessarily effective when applied to Chinese. Therefore, an object of the present invention is to provide a phonetic character kanji conversion method suitable for Chinese.

3.4 課題を解決するための手段前述の課題を解決するために本発明が提供する一手段
は、中国語の音節の連なりで表わされた語音の列を入力
語音列として入力し、該入力語音列を語音ごとに区切る
中国語語音区切方式であって、中国語の語音のうちで単
音節語音および双音節語音について中国語文に出現する
統計的頻度の対数値を絶対値を該語音の頻級として記憶
する辞書と、前記入力語音列の各単音節語音について前記辞書を検
索し、該単音節語音及びこの単音節語音の頻級を該辞書
から読み出すとともに、該単音節語音の直前に別の単音
節語音が入力されているときにはこれら両単音節語音で
なる双音節語音があるか否かを前記辞書で検索し、該辞
書に該双音節語音がある場合には該双音節語音およびこ
の双音節語音の頻級を該辞書から読み出す語音・頻級検
索手段と、該語音・頻級検索手段で読み出された前記単音節語
音、該単音節語音の頻級、前記双音節語音および該双音
節語音の頻級を記憶する第１の記憶手段と、前記語音・頻級検索手段で双音節語音が検索されなか
った単音節語音と前記直前入力単音節語音との間におけ
る前記入力語音列上の仮想の点を節点とし、直近の２つ
の該節点の間にある前記入力語音列の句を音頻句とする
とき、該音頻句に対応する前記単音節語音及び双音節語
音並びに該単音節語音の頻級及び該双音節語音の頻級を
前記第１の記憶手段から読み出す節点判断手段と、前記節点判断手段で読み出された前記単音節語音およ
び双音節語音並びに該単音節語音の頻級および該双音節
語音の頻級を受け、前記音頻句を語音の単位に区切る区
切り方を語音区切型として生成し、該語音区切型のうち
で最適の語音区切型を選んで出力する最適語音区切型生
成手段と、前記最適語音区切型生成手段で生成された前記語音区
切型を記憶する第２の記憶手段とを備え、前記最適語音区切型生成手段は、前記語音区切型にお
ける各語音の前記頻級の和を求め、該和が最少である語
音区切型を前記最適語音区切型とすることを特徴とす
る。3.4 Means for Solving the Problem One solution provided by the present invention for solving the above-mentioned problem is to input a sequence of speech sounds represented by a sequence of Chinese syllables as an input speech sequence, and This is a Chinese word separation method that divides the sequence into words, and the absolute value of the logarithmic value of the statistical frequency that appears in Chinese sentences for monosyllabic sounds and disyllabic sounds among Chinese sounds is calculated as the frequency of the sound. And a dictionary stored for each of the monosyllable words in the input word sequence.The monosyllable words and the frequency of the monosyllable words are read out from the dictionary. When a monosyllabic word is input, the dictionary is searched to determine whether or not there is a disyllabic word consisting of these monosyllabic words. The syllable word frequency from the dictionary Means for retrieving speech / frequency classifying means; storing the monosyllabic speech, the monosyllabic speech frequency, the disyllable speech and the dimorphic speech frequency read out by the speech / frequency search means. A first storage unit, and a virtual point on the input speech sequence between the monosyllabic speech for which disyllable speech is not searched for by the speech / frequency search unit and the immediately preceding input monosyllabic speech, as a node, When the phrase of the input syllable string between the two most recent nodes is a syllable phrase, the monosyllable speech and disyllable speech corresponding to the syllable, and the frequency of the monosyllable speech and the disyllable speech Node judgment means for reading the frequency of the monosyllabic and diphthong sounds read by the node judgment means, the frequency of the monosyllable words and the frequency of the disyllable words And how to divide the frequent phrases into units of words Optimum speech segmentation type generating means for generating as a speech segmentation type, selecting and outputting an optimal speech segmentation type from the speech segmentation types, and storing the speech segmentation type generated by the optimal speech segmentation type generating device. A second storage unit, wherein the optimal speech segmentation type generation unit obtains the sum of the frequent classes of each speech in the speech segmentation format, and determines the speech segmentation type having the smallest sum as the optimal speech segmentation type. It is characterized by doing.

前述の課題を解決するために本発明が提供する別の手
段は、中国語の音節の連なりで表わされた語音の列を入
力語音列として入力し、該入力語音列を語音ごとに区切
り、前記入力語音列から区切られた前記語音をそれぞれ
漢字に変換する中国語語音漢字変換方式であって、中国語の語音の見出しとして該語音の漢字を記憶する
とともに、中国語の語音のうちで単音節語音および双音
節語音について中国語文に出現する統計的頻度の対数値
の絶対値を該語音の頻級として記憶する辞書と、前記入力語音列の各単音節語音について前記辞書を検
索し、該単音節語音及びこの単音節語音の頻級を該辞書
から読み出すとともに、該単音節語音の直前に別の単音
節語音が入力されているときにはこれら両単音節語音で
なる双音節語音があるか否かを前記辞書で検索し、該辞
書に該双音節語音がある場合には該双音節語音およびこ
の双音節語音の頻級を該辞書から読み出す語音・頻級検
索手段と、該語音・頻級検索手段で読み出された前記単音節語
音、該単音節語音の頻級、前記双音節語音および該双音
節語音の頻級を記憶する第１の記憶手段と、前記語音・頻級検索手段で双音節語音が検索されなか
った単音節語音と前記直前入力単音節語音との間におけ
る前記入力語音列上の仮想の点を節点とし、直近の２つ
の該節点の間にある前記入力語音列の句を音頻句とする
とき、該音頻句に対応する前記単音節語音及び双音節語
音並びに該単音節語音の頻級及び該双音節語音の頻級を
前記第１の記憶手段から読み出す節点判断手段と、前記節点判断手段で読み出された前記単音節語音およ
び双音節語音並びに該単音節語音の頻級および該双音節
語音の頻級を受け、前記音頻句を語音の単位に区切る区
切り方を語音区切型として生成し、該語音区切型のうち
で最適の語音区切型を選んで出力する最適語音区切型生
成手段と、前記最適語音区切型生成手段で生成された前記語音区
切型を記憶する第２の記憶手段と、前記最適語音区切型で区切って示される各語音につい
て前記辞書を検索し、該語音を見出しとする漢字のうち
の１つを該辞書から読み出し、前記入力語音列に対応す
る漢字列を生成する漢字列生成手段とを備え前記最適語音区切型生成手段は、前記語音区切型にお
ける各語音の前記頻級の和を求め、該和が最少である語
音区切型を前記最適語音区切型とすることを特徴とす
る。Another means provided by the present invention to solve the above-mentioned problem is to input a sequence of speech sounds represented by a sequence of Chinese syllables as an input speech sequence, and to separate the input speech sequence for each speech, A Chinese word sound / kanji conversion method for converting each of the word sounds separated from the input word sound string into a kanji, wherein the kanji of the word sound is stored as a heading of the Chinese word sound, and a single one of the Chinese word sounds is stored. A dictionary that stores the absolute value of the logarithmic value of the statistical frequency that appears in Chinese sentences for syllable words and disyllable words as a frequency of the word, and searches the dictionary for each monosyllabic word of the input word string. The monosyllabic speech and the frequency of the monosyllabic speech are read out from the dictionary, and when another monosyllabic speech is input immediately before the monosyllabic speech, whether or not there is a disyllable speech composed of these monosyllabic speeches. In front of A search is performed using a dictionary, and if the disyllable word is present in the dictionary, the disyllable word and the frequency of the disyllable word are read out from the dictionary. First storage means for storing the read monosyllable speech, the frequency of the monosyllable speech, the disyllable speech and the frequency of the disyllable speech, and the disyllable speech by the speech / frequency search means. A virtual point on the input speech sequence between the single syllable speech not searched for and the immediately preceding input syllable speech is set as a node, and the phrase of the input speech sequence between the two closest nodes is referred to as a sound frequency. When the phrase is used, the node determining means for reading out the monosyllabic sound and disyllabic sound corresponding to the syllable phrase, the frequency of the monosyllable word and the frequency of the disyllable word from the first storage means, The monosyllabic speech and the bisyllable read out by the node determining means Receiving the vocal sounds and the frequency of the monosyllabic voices and the frequency of the disyllable voices, generating a delimiter that divides the vocal phrasal into voice units as a vocal voice segmentation type; An optimal utterance separation type generating means for selecting and outputting a type; a second storage means for storing the utterance separation type generated by the optimal utterance separation type generating means; Kanji string generation means for searching the dictionary for a word sound, reading one of the kanji having the word sound as a heading from the dictionary, and generating a kanji string corresponding to the input word sound string. The generation means calculates a sum of the frequency of each speech in the speech segmentation type, and sets the speech segmentation type having the smallest sum as the optimal speech segmentation type.

3.5 作用本発明は、音頻法を適用した語音区切り方式と、この
語音区切方式に基づき語音漢字変換を行う方式である。
音頻法は中文を語音に区切る方法であり、従来全く見ら
れない方法である。現在の時点で日本語の仮名漢字変換
に広く使われている文節の概念、最長一致法および文節
に対する形態素分析などの基本技術に比べ、本音頻法に
よる語音区切漢字変換は、まったく異質の発想に基づく
から、今まで日本語処置の技術に慣れて来た人には理解
し難い面がある。そこで、以下では例文を挙げて主に音
頻法につき詳しく説明する。3.5 Function The present invention is a speech separation method to which the frequent tone method is applied, and a speech to kanji conversion method based on the speech separation method.
The frequent method is a method of dividing a Chinese sentence into speech sounds, and is a method that has never been seen before. Compared with the basic concept of phrase kana-kanji conversion that is widely used at present, such as the concept of bunsetsu, the longest match method, and morphological analysis of bunsetsu, the phonetic delimiter kanji conversion using the phonetic frequent method is a completely different idea. Based on this, there are aspects that are difficult to understand for those who have been accustomed to Japanese treatment techniques. Therefore, in the following, mainly the tone frequency method will be described in detail using example sentences.

3.5.1 中文における語音と語音情報量第11図は中文における語音とその情報量の例を示す図
である。ここで「語音」とは漢字文における個々の単語
の「読み」という。個々の漢字の読みは「漢字音」であ
って語音ではない。ここで「語音」と「漢字音」の意義
を明らかにしておく。3.5.1 Speech and Speech Information in Chinese Sentence FIG. 11 is a diagram showing examples of speech and the information amount in Sentence. Here, the “sound” is the “reading” of each word in the kanji sentence. The reading of each kanji is a "kanji sound", not a speech sound. Here, the meaning of "sounds" and "kanji sounds" will be clarified.

漢字は表意文字であり、個々の漢字はそれぞれ固有の
義（意味）・音（よみ）・形の３方面の情報を持つ。漢
字の音は例外なしに１音節の音で、これを「字音」と呼
ぶ。中文は漢字だけを使用して書かれているから、見方
によっては中文の「よみ」は１音節字音の列といえる。
字音で読まれる例としてお経がある。お経では、意味よ
りも音の雰囲気を大切にするから、「字音よみ」でよい
のである。古典中国語においては、おおむね１つの漢字
は文法上でも「語」であったが、現代中文では２字語の
語彙が優勢である。但し、伝統の力は強く、中文には分
ち書きがなく、漢語辞典はまず漢字を引いてその漢字か
ら始まる語を引く方式（日本の漢和辞典に同じ）が支配
している。（中国には音引きの辞書はない）中国人の中
文朗読も、棒よみ字音よみの傾向がのこっている。Kanji is an ideographic character, and each kanji has its own three-way information of meaning (meaning), sound (reading), and shape. The kanji sound is a one-syllable sound without exception, and is called a "character". Since Chinese is written using only Chinese characters, depending on how it is viewed, it can be said that "Yomi" in Chinese is a sequence of one syllable characters.
There is sutra as an example that can be read in syllabary. In sutras, the value of the sound atmosphere is more important than the meaning. In classical Chinese, one kanji is generally a "word" in grammar, but in modern Chinese, the two-letter vocabulary is dominant. However, the power of tradition is strong, there is no break in Chinese, and the Kanji dictionary is dominated by the method of drawing kanji first and then drawing words starting with the kanji (same as Japanese Kanji dictionary). (China does not have a dictionary for sound-pulling.) Chinese reading aloud in Chinese also has a tendency to read sticky characters.

いっぽう、漢字からいちおう離れて、ローマ字などを
発音記号として利用し、音節を単位として中文の「よ
み」を書くこともできる。このときには、語のわかち書
きは不可欠となる。On the other hand, it is possible to write Chinese characters "yomi" in syllable units, using Roman letters etc. as phonetic symbols, far from Kanji. In this case, word-writing is indispensable.

わかち書きされた個々の語ごとの音節列を「語音」と
呼ぶ。語音には、音節の数によって単音節語音・双音節
語音・３音節語音・４音節語音などがある。The syllable sequence for each word written in each word is called a "sound". Depending on the number of syllables, there are monosyllabic sounds, disyllabic sounds, three-syllabic sounds, and four-syllable sounds.

文が漢字だけで書かれていても、語音を意識して朗読
するのが中文の合理的な読み方である。Even if the sentence is written only in kanji, it is a reasonable way to read Chinese sentence while consciously reading it.

本発明の眼目は、「語音」で入力された中文の音節列
を処理し、自動語区切を行ったのち、はじめて語ごとの
漢字変換を行うというアイデアにある。もっとも、本発
明の方式では、入力された音節を自動的に語音に区切る
から、音節を入力する者が字音と認識して入力しても語
音として入力した場合と同様に処理される。すなわち、
本発明の方式で入力として受け付ける情報は、音節列で
あれば足り、語音であっても字音であっても差し支えな
い。The gist of the present invention lies in the idea that, after processing a syllable string of a Chinese sentence input as a "sound" and performing automatic word segmentation, kanji conversion is performed for each word for the first time. However, in the method of the present invention, the input syllables are automatically divided into speech sounds, so that even if the person who inputs the syllables recognizes and inputs the syllable, it is processed in the same way as when the syllable is input as a speech sound. That is,
The information received as input in the method of the present invention is sufficient if it is a syllable string, and may be a word sound or a letter sound.

本発明では、中文が音節ごとに順に入力されてくるこ
とを前提にしており、第11図の（１）は入力される音の
順を示す番号である。In the present invention, it is assumed that the Chinese sentence is sequentially input for each syllable, and (1) in FIG. 11 is a number indicating the order of input sounds.

図の（２）は14字の中文例文を示し、この例文は「此
の式の中文入力システムは許多（あたま）の利点を具有
している」という意味である。同図（３）は図（２）の
漢字音を示す。ただしローマ字の綴りかたは中国国定の
「音（ピンイン）」方式による。１漢字はかならず１
音節に対応している。以後、漢字からまったく離れて、
読みだけに注目すれば、（３）は14個の音節からなる音
節列となる。(2) in the figure shows a 14-character Chinese sentence example sentence, which means that "the Chinese sentence input system of this type has the advantage of the head". FIG. 3 (3) shows the kanji sound of FIG. 2 (2). However, the spelling of Roman characters is based on the Chinese national "sound (pinyin)" method. One Kanji is always one
Supports syllables. Since then, completely apart from Kanji,
Paying attention only to reading, (3) is a syllable string composed of 14 syllables.

（３）に対して番号１から順々に１音節の語を調べて
いくと、（８）のように、すべての音節に単音節の語が
実在する。ゆえに（５）のように１から14までの単音節
語音記号ａ〜ｎに内容が存在し、それらはそれぞれzhe,
shi,zhong,…,dianであることになる。中文では、すべ
ての音節には対応する単音節語音が存在するという原則
がある。２以上の音節の語（多音節語）の一部として使
用されるだけで、それ自身は単音節語音にならない音節
は、中文には存在しない。なお、図において、単音節語
音をRSで示しているが、ＲはRead（読み）を表わし、Ｓ
はSingle（単数）を表わしている。When one syllable word is examined sequentially from number 1 for (3), as shown in (8), a single syllable word actually exists in all syllables. Therefore, as shown in (5), the contents exist in monosyllabic syllables a to n from 1 to 14, which are zhe,
shi, zhong,…, dian. In Chinese, there is a principle that every syllable has a corresponding monosyllabic sound. Syllables that are only used as part of two or more syllable words (polysyllable words) and do not themselves become monosyllable words do not exist in Chinese. In the figure, the monosyllable words are indicated by RS, but R represents Read, and S
Represents Single (singular).

（６）の頻率は中文における単音節語音の出現確率で
あり、数値は総計約180万字の文献調査から得たもので
ある。たとえばshiという「読み」の単音節語音の出現
確率ps（ｐはprobabilityを表わす）は ps＝159.04×10^-4＝1.59％であって、この語音は平均63字に１回のわりで出てくる
ことがわかる。The frequency in (6) is the probability of occurrence of monosyllable words in Chinese sentences, and the numerical values are obtained from a literature survey of about 1.8 million characters in total. For example, the appearance probability ps (p indicates the probability) of a monosyllable word sound of “reading” called shi is ps = 159.04 × 10 ^-4 = 1.59%, and this word sound appears once every 63 characters on average. You can see it coming.

（７）は単音節語音の情報量頻級ISを示す（以下では
単に頻級、IS又は頻級ISと略記する）。IS（ＩはInform
ationを表わす）は IS＝−log₂ps のように定義される。対数の底は、ここでは２とする
が、理論上は１を越える正の実数なら何でもよい。ISの
値は小数点以下を切り捨てて整数化し、級の呼びつけを
つけて段階的に表現する。中文において最大頻度の単音
節語音はde（「的、地、得」の語音。それぞれ「〜の、
〜な、〜して」の意味）であり、deについてはps＝0.04
72、IS＝４級である。本例ではISの最小値は18級とし
た。すなわち、ps＜2^-18（＝0.000004）のpsの語音のIS
はすべて18級に簡略化した。ISを15段階に規準化したの
である。ISが小さければ語音の出現率は大きい。ISが１
級少なくなれば語音の出現確率は２倍になり、情報量は
1/2になる。(7) indicates the information level of the monosyllabic speech information level IS (hereinafter simply referred to as frequency, IS or frequency IS). IS (I is Inform
) is defined as IS = −log ₂ ps. The base of the logarithm is 2 here, but theoretically, any positive real number exceeding 1 can be used. The value of IS is rounded down to the nearest whole number, converted to an integer, and expressed stepwise with class designation. The most frequent monosyllable words in Chinese are de ("target, ground, gain".
"Meaning", and for de, ps = 0.04
72, IS = 4th grade. In this example, the minimum value of IS was set to 18th grade. That is, the IS of the speech sound of ps of ps <2 ⁻¹⁸ (= 0.000004)
Are all simplified to class 18. IS was standardized into 15 levels. If IS is small, the appearance rate of speech sounds is large. IS is 1
If the class is reduced, the appearance probability of speech sounds will be doubled, and the amount of information will be
1/2.

（８）の単音節語彙は、（５）の単音節記号の内容で
あるそれぞれの単音節語音を持つ実在の辞書に搭載して
いる）単音節語を、漢字表現で羅列したものである。羅
列の順序は統計的頻度の順である。たとえば語音shiに
対しては、是（〜である）が最大頻度、使（〜に〜させ
る）、時（〜したとき）がそれに次ぐ。例文中の式（森
田式などの式）は11番目の頻度である。The monosyllabic vocabulary of (8) is a list of monosyllabic words in a Kanji expression, which are included in an actual dictionary having the respective monosyllable words that are the contents of the monosyllabic symbols of (5). The order of the list is the order of statistical frequency. For example, for the phonetic sound shi, 是 (is 最大) is the highest frequency, 使 (させる is made to 〜), and time (〜 is made) is second to it. The expression (expression such as Morita expression) in the example sentence has the eleventh frequency.

中文においては単音節語音の同音語彙は例にあるよう
に相当多い。しかし同音語のなかで、頻度が上位の少数
の語が語音頻度の大部分を占める場合が多い。たとえば
語音shiに対しては、頻度１位の「是」だけで占有率76
％、２位の「使」までで82％、３位の「時」まで87％の
占有率になる。In the Chinese sentence, the homophonic vocabulary of monosyllable words is quite large as shown in the example. However, among the homophones, a small number of words having the highest frequency often occupy most of the speech frequency. For example, for the word sound shi, the occupancy rate is 76
The occupancy rate is 82% for the second-ranked "age" and 87% for the third-ranked "hour".

（９）〜（13）は、双音節語音（２つの音節からなる
語の音、即ち２つの漢字で表わせれる語の音）について
の同様のデータ例を示す。ただし双音節語音の情報量頻
度ID（ＤはDoubleを示す）は、その語音の頻度の統計総
音節数に対する比をpdとしたとき ID＝−log₂（２×pd）とし、ISの場合と同様に15段階に規準化する（双音節語
音の情報量頻級は以下では単に頻級、ID又は頻級IDと略
記する）。(9) to (13) show similar data examples of disyllable word sounds (sounds of words composed of two syllables, that is, sounds of words represented by two kanji). However, the information frequency ID of a disyllable word sound (D indicates Double) is defined as ID = −log ₂ (2 × pd) when the ratio of the frequency of the word sound to the total number of statistical syllables is pd. Similarly, normalization is performed in 15 steps (hereinafter, the information level of disyllable speech is simply abbreviated as frequency, ID, or frequency ID).

（９）〜（13）に見えるように、例文において双音節
記号hi、jk、lmには音節情報がない。これはtongju、yo
uxu、duoyouという双音節語音が中国語には元来なく、
これらの音の２音節語彙がないからである。双音語節音
にはID＝18または18に近いほとんど使われない語音が多
数存在する。との反面、women（我々）とかzhuyi（主
義、注意）などID＝７に達する高頻度語音も存在する。
双音節語音における同音語の数は、単音節語音にくらべ
ると一般に少ない。As can be seen from (9) to (13), in the example sentence, the disyllable symbols hi, jk, and lm do not have syllable information. This is tongju, yo
The syllable words uxu and duoyou are not native to Chinese,
This is because there is no two-syllable vocabulary of these sounds. There are many rarely used speech sounds near ID = 18 or 18 in diphthong syllables. On the other hand, there are also high-frequency speech sounds that reach ID = 7, such as women (we) and zhuyi (principle, attention).
The number of homonyms in disyllable speech is generally smaller than monosyllable speech.

３または４音節の語音は中文においては、平均すれば
きわめて希である。同時にこれらには同音語彙がほとん
どない。これらの語音は単音節語音および双音節語音と
干渉しあって、音頻法による語音区切を混乱させること
がほとんどない、干渉がないというのは、たとえば音節
列abcdeにおいて３音節語音cdeと３個の双音節語音ab,b
cおよびdeが存在するとき、区切a/bc/deはほとんどある
得ず、ab/cdeがほとんど眞になることをいう。ゆえに音
頻法の実行に際して、３または４音の語音が語音列中に
検出されたときには、これらの音頻区切処理から切り離
し、これらに対して独立に語音漢字変換を行ない、これ
らを除外した単音節語音と双音節語音の２類の語音だけ
で構成される語音列の部分に限って音頻区切処理（音頻
法による語音区切処理）をすることによって、自動音頻
区切の処理アルゴリズムを大幅に単純化することができ
る。Three or four syllable words are very rare in Chinese in average. At the same time, they have little homonymous vocabulary. These speech sounds interfere with monosyllabic speech sounds and disyllable speech sounds, and hardly confuse the syllable separation by the vowel method. The absence of interference means that the three syllable speech sounds cde and 3 Disyllable words ab, b
When c and de are present, the delimiter a / bc / de is almost impossible and ab / cde is almost true. Therefore, when three or four phonemes are detected in the phoneme sequence during the execution of the vocal frequent method, these vowels are separated from the vocal utterance separation processing, and the syllable kanji conversion is performed independently on these vocal syllables. To greatly simplify the processing algorithm for automatic frequent phonetic segmentation by performing frequent phonetic segmentation processing (speech sound segmentation processing using the frequent phonetic method) only for parts of the phonetic sequence composed of only two types of phonetic sounds, namely, and disyllable speech sounds. Can be.

本発明では、以上の理由により、単音節語音と双音節
語音だけによって構成される語音列に対象を限定して、
音頻法による語音区切を行う。In the present invention, for the above reasons, the target is limited to a phonetic sequence composed only of monosyllabic sounds and disyllable sounds,
Performs phonetic delimitation using the tone frequency method.

3.5.2 中文音節列の構造モデル第12図に漢字を捨象して構成した中文音節列のモデル
を示す。モデルは第11図の例文による。3.5.2 Structural model of Chinese syllable strings Figure 12 shows a model of Chinese syllable strings composed of Chinese characters. The model is based on the example sentence in FIG.

第12図の（１）は文例のａからｎまでの14音節のシス
テムである。音節単位の入力は点から始まり、点で
終わる。このモデルでは点の次には句読点が入力され
ることを想定している。FIG. 12 (1) shows a system of 14 syllables from a to n in the sentence example. Syllabic input starts at the point and ends at the point. This model assumes that punctuation is entered after the point.

点から点までを結ぶ直線軸は入力作業の時間経過
を示す「時間」軸である。それぞれの点は音節の入力直
前の時点を表している。隣りあわせの２点の間に、入力
された単音節語音記号ａ〜ｎがそれぞれ書き込まれる。
軸上の経路は、単音節語音だけを選んで通る径路で、第
12図の（２）に示すように、これを「軸路」と称する。The linear axis connecting the points is a “time” axis indicating the time lapse of the input operation. Each point represents a point immediately before the input of a syllable. The input monosyllabic phonetic symbols a to n are respectively written between two adjacent points.
The path on the axis is the path through which only single syllable speech is selected.
As shown in (2) of FIG. 12, this is referred to as “axis path”.

音列中の双音節語音が存在する時には、同図の（２）
に示すように、隣り合う２個の単音をまたが、それらの
両外側の２点を短絡するバイパス（傍路）を引き、その
上に該当する双音節語音記号を置く。一本の軸路と多く
の傍路によってシステム図は鎖状の形となる。このよう
に中文語音列を語音接続関係図としてモデル化したダイ
ヤグラムを「語音ネットワーク」と名付ける。When there are disyllable words in the sequence, (2) in FIG.
As shown in (1), two adjacent single tones are drawn, and a bypass (shorting path) that short-circuits the two points on both outer sides is drawn, and the corresponding disyllable phonetic symbol is placed thereon. With one axis and many sideways, the system diagram will be chained. A diagram in which the Chinese language sound sequence is modeled as a sound connection relation diagram in this way is named “speech network”.

原文の文頭に対応する音列の開始点、句読点、数字ロ
ーマ字など非変換の字や符号の単独または列の直前直後
の「点」は、音頻法の実行に際して外部条件により音列
を区切るものであるので、これらを「外部区切点」と呼
ぶ。第12図の例では、点と点、が「外部区切点」
である。The starting point, punctuation mark, non-converted character or code such as numeral Roman characters, or the `` dot '' immediately before and after the string, which corresponds to the beginning of the original sentence, separates the sound string according to external conditions when executing the frequent method. As such, they are called "external breakpoints". In the example of Fig. 12, the points are "external breakpoints".
It is.

その点の直前直後の両単音節をつないで得られる双音
節語音が存在しないとき、その点を節点という。その点
をまたぐ双音節語音の存在不存在にかかわらず、その点
の直前または直後に３または４音節の語音が存在してい
るときも、その点は「節点」である。また「外部区切
点」もまた「節点」である。If there is no disyllable speech obtained by connecting both monosyllables immediately before and after the point, the point is called a node. Regardless of the presence or absence of a disyllable word sound that straddles that point, a point is also a “node” when there are three or four syllable words immediately before or immediately after that point. The “outer breakpoint” is also a “node”.

隣りあわせの２個の節点間の軸路と傍路上の単音節お
よび双音節の全語音によって成立する語音列を「音頻
句」と名付ける。A speech sequence formed by all speech of monosyllables and disyllables on the axis path and by-path between two adjacent nodes is referred to as a “phonetic phrase”.

隣りあわせの２節点の間が３または４音節語音の場
合、それらの節点の間を「多音句」と名付ける。When a three or four-syllable word sound is present between two adjacent nodes, the space between those nodes is named "polyphonic".

隣りあわせの２節点が共に「外部区切点」で、その間
にローマ字・数字などの非変換の字または符号の一個あ
るいは列だけがあるとき、それらの節点の間を「非変換
句」と名付ける。When two adjacent nodes are both "external breakpoints" and there is only one or a line of non-conversion characters or codes such as Roman characters and numbers between them, a "non-conversion phrase" is named between those nodes.

隣りあわせの２個の「外部区切点」の間を「文」と称
する。「文」はふつう「音頻句」および「多音句」から
なっている。第11図と第12図に例示する文は、４個の
「音頻句」によって構成されている。A space between two adjacent “outer break points” is referred to as a “sentence”. A "sentence" usually consists of a "phonetic phrase" and a "polyphonic phrase". The sentence illustrated in FIGS. 11 and 12 is composed of four “phonetic phrases”.

音頻法は個々の「音頻句」だけを対象として、その音
列における最適語音区切を求める方法であり、本発明の
語音漢字変換方式ではその音頻法により区切られたそれ
ぞれの語音に対して漢字変換を行なう。The frequent call method is a method of finding the optimal utterance separation in the sound sequence for only individual frequent words, and the kanji conversion method of the present invention converts kanji for each sound separated by the frequent call method. Perform

3.5.3 音頻ネットワーク第12図に示すような音頻句の中の単音節記号および双
音節記号のそれぞれに、頻級をつけた形式を「音頻ネッ
トワーク」という。第12図の（３）は第11図（２）の例
文に対する音頻ネットワークである。音頻ネットワーク
は、多数のイベントを矢を持つアクティビティで連結し
て、おのおののアクティビティの時間的順序と接続関係
を示したPERTのプロジェクト・ネットワークに、パタン
のうえでは類似している。しかし音頻法は、むしろ最適
経路発見のために用いられるDP（ダイナミック・プログ
ラミング）の一種と見た方がよい。3.5.3 Phonetic Network A form in which each of the monosyllabic symbols and disyllable symbols in the vowel phrases as shown in Fig. 12 is assigned a frequent class is called a "phonetic network". (3) of FIG. 12 is a phonetic network for the example sentence of FIG. 11 (2). The tone network is similar in pattern to PERT's project network, which links multiple events with activities with arrows and shows the chronological order and connectivity of each activity. However, it is better to view the frequent call method as a kind of dynamic programming (DP) used for finding the optimal path.

単音節語音と双音節語音だけによって構成される音頻
ネットワークは、第12図で見てきたようにきわめて単純
なパタンを示す。それは軸路上に等間隔に並ぶ単音節記
号と互いに１音節周期ずれて軸路の上下に並ぶ傍路上の
双音節記号によって連続的につくられるパタンである。
第12図の（３）の表現は、同図（４）のような２段のブ
ロック積み形式に直して表現したほうが描き易くわかり
易い。音頻ネットワークは、以下、第12図の（４）のパ
タンによって記述することにする。A phonic network composed of only monosyllabic sounds and disyllable sounds shows a very simple pattern as seen in FIG. It is a pattern continuously formed by monosyllable symbols arranged at equal intervals on the axis path and disyllable symbols on the sideways arranged vertically above and below the axis path with a shift of one syllable cycle from each other.
The expression (3) in FIG. 12 is easier to draw and understand if it is expressed in a two-stage block stacking format as shown in FIG. 12 (4). The tone frequency network will be described below using the pattern (4) in FIG.

3.5.4 最小音頻径路と最適語音区切の理論最小音頻経路と、その経路上の個々の語音の頻級の総
和IT（以後ITを頻級和と称する）が最小になる経路を指
す。第12図の（４）における太線が例文の最小音頻経路
である。この例文の音頻句１では、太線の音頻経路では
頻級和ITは52であり、他のどの音頻経路の頻級和ITより
も小さい。最小音頻経路に沿う語音区切を行ない、さら
に各語音の同音語中の最多頻度の漢字語に各語音を変換
すれば、第12図の（５）のように、ほとんど真に近い結
果が得られる。この例では、ｂのshiに対する変換だけ
が正しい「式」ではなく、「是」となり、同音語ミスを
起こしている。語音区切は完全に正しくなされているこ
とに注意すべきである。3.5.4 Theory of Minimum Tone Path and Optimal Word Separation The term refers to the path with the minimum number of frequent paths and the sum IT of the frequent classes of individual speech sounds on the path (hereinafter, IT is referred to as the frequent class sum). The thick line in (4) of FIG. 12 is the minimum frequent phonetic path of the example sentence. In the frequent phrase 1 of this example sentence, the frequent class sum IT of the bold line frequent route is 52, which is smaller than the frequent class IT of any other frequent route. If speech separation along the minimum frequent phonetic path is performed, and each speech is converted to the most frequent kanji word in the same phoneme of each speech, a result that is almost true can be obtained as shown in (5) of FIG. . In this example, only the conversion of b to shi is not a correct “expression”, but “ze”, resulting in a homophone mistake. It should be noted that the phonetic delimiters are completely correct.

後に説明する第16図は本発明の方式により216音節に
わたる語音を漢字に変換した例を示す図であるが、本発
明で採用する音頻法によれば、最適語音区切の経路が真
の語区切にほとんど近い事が本図から了解されるであろ
う。FIG. 16 to be described later is a diagram showing an example in which a word sound over 216 syllables is converted into a kanji according to the method of the present invention. It will be understood from this figure that it is almost close to.

本発明は「最小音頻経路に沿う語音区切は、最も真に
近い語音区切である」という法則に基づいている。この
法則は本発明者の発見に係わるものである。この法則が
中文に対してよく合うことは、第16図の216音節に及ぶ
長い例文に対する漢字変換の結果から証明されている。The present invention is based on the rule that "speech separation along the minimum frequent phonetic path is the closest speech separation". This rule relates to the discovery of the present inventor. The fact that this rule fits well with Chinese sentences is proved by the results of kanji conversion for long example sentences of 216 syllables in FIG.

この法則が成立する理由は次のように考えられる。 The reason that this rule is satisfied is considered as follows.

（１）中文には411個の単音節語音があるが、各語音の
統計的使用頻度は、けっして一様ではなく、非常に偏っ
た分布をしている。(1) The Chinese sentence has 411 monosyllable words, but the statistical use frequency of each word is not uniform and has a very uneven distribution.

（２）中文における双音節語音は単純計算最大限の411²
≒16万種はなく、本発明におけるように声調（中国語に
特有の音節ごとの４種のアクセント）を無視したとき３
万種、その中でも主要なものは一万種をはるかに下まわ
る程度であり、双音節語音において結び付く２個の音節
の相性はずいぶん制限されている。実存する双音節各語
音の統計的使用頻度は個々の語音によって非常に偏った
分布をしている。(2) The maximum number of simple syllable words in Chinese is 411 ²
≒ There are no 160,000 varieties, and when ignoring tone (4 accents per syllable peculiar to Chinese) as in the present invention, 3
There are all varieties, of which the major ones are far below 10,000, and the compatibility of the two syllables that are linked in a disyllabic speech is quite limited. The statistical frequency of use of existing disyllables has a very skewed distribution depending on the individual sounds.

（３）３音節４音節の多音節語音になると、音節組合わ
せの特殊性と実在語音の頻度の偏りは、さらに大きくな
る。(3) In the case of a three-syllable, four-syllable multi-syllable word sound, the specificity of the syllable combination and the bias in the frequency of the actual word sound are further increased.

（４）語音の時系列すなわち「句」においては、それを
構成する語音の連続の具合は決して無秩序ではなく、文
法や語脈の影響をうけ、自然言語特有の秩序と偏りを持
つ。(4) In the time series of speech sounds, that is, "phrases", the continuity of the speech sounds that compose the speech is not disorderly, but is affected by grammar and language, and has an order and bias peculiar to natural language.

（５）以上をまとめれば「個々の音節が単音節・双音節などの語音を成立させ
るとき、さらにそれら語音が順次に接続されて句や文を
なすとき、その成立や接続には自然言語固有の秩序があ
る。」その結果、「各語音の情報量の総和が最小になるような、各語音
の音節構造と語音の区切と語音の接続が、自然言語にお
いては自ずから成立しているに違いない。」この法則は「自然言語における語音情報エントロピー
最小の法則」とも呼ぶべきである。(5) Summarizing the above, "When individual syllables form speech sounds such as monosyllables and disyllables, and when those speech sounds are connected sequentially to form phrases and sentences, the formation and connection are natural language-specific. As a result, the difference is that "the syllable structure of each speech, the segmentation of the speech, and the connection of the speech so that the sum of the information content of each speech is minimized, is naturally established in natural language. There is no. "This rule should be called" the law of minimum speech information entropy in natural language. "

中文に対しては、この法則は非常によく合う。本発明
の自動語音区切方式はこの法則を応用して、音節を単位
として入力された中文の個々の「音頻句」に対して、い
わゆる変換キーの操作を省略して、音頻法により自動語
音区切を実行する方式である。また本発明の語音漢字変
換方式は、その自動語音区切方式により中文を最も真に
近く個々の語音に区切り、区切られて生成された各語音
を漢字に変換する方式である。For Chinese, this rule fits very well. The automatic speech separation method of the present invention applies this rule, omitting the operation of a so-called conversion key for each “phonetic phrase” of a Chinese sentence input in units of syllables, and automatic speech separation according to the phonetic method. Is a method for executing. Further, the speech-kanji conversion method of the present invention is a method in which the middle sentence is divided into individual speech sounds closest to the true language by the automatic speech-sound separation method, and each speech sound generated by the division is converted into a kanji.

3.6 実施例次に、実施例を挙げて本発明を詳しく説明する。なお
本発明の核心は、単音節語音と双音節語音だけからなる
「音頻句」に対する逐次自動語音区切にあるので、３音
節および４音節の語音に対する処理について簡潔に述べ
る。3.6 Examples Next, the present invention will be described in detail with reference to examples. The core of the present invention is the automatic syllable segmentation of "syllable phrases" consisting of only monosyllabic speech and disyllable speech, so that the processing for speech of three syllables and four syllables will be briefly described.

3.6.1 音頻語音区切処理の構成例第１図は「音頻句」に対し語音区切を施し、この語音
区切により該音頻区を区切ることにより生成された語音
を漢字に変換する本発明の一実施例（語音漢字変換方
式）の構成を示すブロック図である。また第13図は第１
図のなかの辞書以外の記憶手段において必要なメモリの
記憶内容を示す表である。この第１図実施例により本発
明による語音区切方式および語音漢字変換方式を具体的
に説明する。3.6.1 Configuration example of phonetic word sound separation processing Fig. 1 shows an embodiment of the present invention in which "phonetic phrases" are subjected to phonetic word separation, and the phonetic sounds generated by separating the phonetic word segments into kanji words are converted into kanji. It is a block diagram which shows the structure of an example (word sound Chinese character conversion system). Fig. 13 shows the first
3 is a table showing storage contents of a memory required in storage means other than the dictionary in the figure. The embodiment of FIG. 1 will be used to specifically explain the speech separation system and the speech-kanji conversion system according to the present invention.

第１図における各手段ブロックの機能は以下の通りで
ある。The function of each means block in FIG. 1 is as follows.

1:辞書（１）音節列によって表現された語音見出し（２）各語音の頻級（３）各語音ごとの同音の漢字語彙（漢字コード表
現）のデータを記憶させてある記憶手段である。1: Dictionary (1) Speech headings expressed by syllable strings (2) Frequency of each speech sound (3) Kanji vocabulary (Kanji code expression) data of the same sound for each speech sound.

2:語音・頻級の検索手段音節入力手段（キーボード）11から音節が入力される
都度、２は１の辞書から以下のデータを検索し、３へ送
る。2: Retrieval means for speech and frequent classes Each time a syllable is input from the syllable input means (keyboard) 11, 2 retrieves the following data from 1 dictionary and sends it to 3.

（１）その音節の単音節語音RS_n （２）RS_nの頻級IS_n （３）ｎが２以上のとき、RS_n-1とRS_nとを結合して得
られる２音節の双音節語音RD_nの有無（但し、RD_nがない
場合、RD_n無の情報は４の節点判断手段に送られ、３へ
は送られない）。(1) The monosyllabic word sound RS _{n of the} syllable (2) When the frequency class IS _{n of} RS _n is (3) n is 2 or more, two syllable disyllables obtained by combining RS _n-1 and RS _n the presence or absence of speech RD _n (however, if there is no RD _n, information RD _n free is sent to the node determining means 4, not sent to 3).

（４）RD_nがあった場合にはその内容と頻級ID_n （５）ｎが３以上のときその音節に終わる３音節語音
RT_nの有無。あった場合にはRT_nの内容（６）ｎが４以上のときその音節に終わる４音節語音
RQ_nの有無。あった場合にはRQ_nの内容 RT_nまたはRQ_nは多音節語音データであって、無の情報
は節点判断手段４に送られ、有の時にはRT_nまたはRQ_nの
語音が記憶手段３に送られる。RQ_nが有のときにはRT_n,R
D_n,RS_n,ID_n,IS_nは３に送られない。RT_nが有のときにはR
D_n,RS_n,ID_n,IS_nは３に送られない。(4) If RD _n is present, its contents and frequency ID _n (5) Three syllable words that end with the syllable when n is 3 or more
Presence or absence of RT _n . If present, the contents of RT _n (6) Four syllable words that end with the syllable when n is 4 or more
Presence or absence of RQ _n . There was contents RT _n or RQ _n of RQ _n when a multi-syllable speech data, information-free is sent to the node determination means 4, when the chromatic has speech of RT _n or RQ _n in the storage means 3 Sent. RT _n , R when RQ _n is present
D _n , RS _n , ID _n and IS _n are not sent to 3. R if RT _n
D _n , RS _n , ID _n and IS _n are not sent to 3.

3:単音節語音（RS_n）・単音節語音の頻級（IS_n）・双音
節語音（RD_n）・双音節語音の頻級（ID_n）・３音節語音
（RT_n）・４音節語音（RQ_n）の記憶手段 4:節点判断手段「音頻句」が終わりになる節点を判断する。3: Monosyllable speech (RS _n ), monosyllable speech frequency (IS _n ), disyllable speech (RD _n ), disyllable speech frequency (ID _n ), 3-syllable speech (RT _n ), 4-syllable Speech (RQ _n ) storage means 4: Node judgment means Judgment of the node where the “phonetic phrase” ends.

（１）語音・頻級検索手段２によってRT_nまたはRQ_nが
検索されていないとき（１）RDnが無のとき:RS_nの直前の点は節点と判断
し、音節入力番号を１に戻し、RS_nとIS_nを記憶手段３の
RS₁とIS₁に入れ直す。(1) When the speech-Shikikyu search means 2 RT _n or RQ _n is not retrieved (1) when RDn is free: it determines that node points of the previous RS _n, returns the syllable input number 1 , RS _n and IS _n in storage means 3
Replace with RS ₁ and IS ₁ .

（２）RD_nが有のとき：記憶手段３からRS_n,IS_n,RD_n
およびID_nの情報を最適区切型生成手段５に送る。(2) When RD _n is present: RS _n , IS _n , RD _n from storage means 3
And the information of ID _n are sent to the optimal demarcation type generating means 5.

（２）語音・頻級検索手段２によってRQ_nまたはRT_nが
検索されたとき（１）RQ_nが存在するときｎ−４に外部区切りがあ
るとして処理し、またRT_nが存在するときはｎ−３に外
部区切りがあるものとして処理する。(2) when the speech-Shikikyu search means 2 RQ _n or RT _n is retrieved (1) was treated as being external delimiting the n-4 when RQ _n exists and when the RT _n exists Processing is performed assuming that n-3 has an external delimiter.

5:最適語音区切型生成手段「音頻句」を対象にして、処理中の音頻句の先頭から
最近に入力された音節までの音節列に対する語音区切型
を生成し、区切型記憶手段６に送る。最適語音区切型生
成手段５の主機能は最適語音区切型を生成して漢字列変
換手段７に送ることであるが、生成の方法については次
節で述べる。5: Optimal phrasal segmentation type generating means For syllable phrases, generate a syllable segmentation type for a syllable string from the beginning of the syllable being processed to the most recently input syllable, and send it to the segmentation type storage means 6. . The main function of the optimum word-sound separating type generating means 5 is to generate an optimum word-sound separating type and send it to the kanji string conversion means 7. The generating method will be described in the next section.

6:語音区切型記憶手段 7:漢字列生成手段最適語音区切型生成手段５から最適語音区切型のデー
タを受け取り、区切られた各語音を見出しとして辞書１
の漢字語彙を順次に検索し、同音語の中から現在最も眞
に近いと判断される漢字語を選定し、それを順次につら
ねて漢字語列を出力する。その漢字語列は、漢字語列記
憶手段８及び文書記憶手段12に送られる。6: Speech separation type storage means 7: Kanji character string generation means Receiving the data of the optimum speech separation type from the optimum speech separation type generation means 5, the dictionary 1 with each separated speech as a heading
Kanji vocabulary is sequentially searched, and a kanji word that is determined to be closest to the current one is selected from the homophones, and the kanji words are sequentially output to output a kanji word string. The kanji word string is sent to the kanji word string storage means 8 and the document storage means 12.

8:漢字列記憶手段 3.6.2 最適語音区切の生成第１図における最適語音区切型生成手段５の機能と作
用について詳しく述べる。8: Kanji string storage means 3.6.2 Generation of optimal word-segment delimiter The function and operation of the optimum word-sound delimiter generation means 5 in FIG. 1 will be described in detail.

第14図に、音節入力番号ｎに対する、ｎ音節句におい
て可能な区切型のすべてを示す。ただし、ｎ＝１〜８に
ついて例示する。FIG. 14 shows all possible delimiters for n syllable phrases for syllable input number n. However, n = 1 to 8 will be exemplified.

図において、音節はabc等のローマ字で示す。「／」
は語音の区切である。語音区切の型の右側に２進数で表
現したのは、区切型の数値表現T_nである。T_nの意味は例
示すれば区切型 ab/cd/e T₅ 10 10 １ T_nの最上位桁の１は、ローマ字表現の音節列の開始点を
示し、必ずこれを置く。以下、０は音節間に区切のない
ことを、１は区切のあることを示す。ローマ字表現の最
後に「／」がないのは、次の音節が入力されるときまで
最後の音節が単音節語音の音節か双音節語音の第一音節
かこの時点では不定なので「／」がつけられないのであ
る。In the figure, syllables are shown in Roman letters such as abc. "/"
Is a speech separator. It is to that expressed by a binary number to the right type of speech separated a delimiter type numeric representation T _n. The meaning of T _n is, for example, delimited type ab / cd / e T ₅ 10 10 1 The first digit of T _n indicates the starting point of a syllable string in Roman alphabet, and it is always placed. Hereinafter, 0 indicates that there is no break between syllables, and 1 indicates that there is break. The absence of "/" at the end of the Roman alphabet expression means that the last syllable is a syllable of a monosyllabic word or the first syllable of a disyllable word until the next syllable is input. It cannot be done.

T_nを「区切変数」と呼ぶ。T_nによって「音頻句におけ
る区切の型一般を簡潔に表現することができる。T _n is called a “separation variable”. By T _n , "the general type of delimiters in vocal phrases can be expressed simply.

第14図の最下欄に「数」とあるのは、各ｎに対する可
能な区切型の数である。可能な区切型の数をN_nとすれば
N_n＝N_n-1＋N_n-2の法則がある。すなわちN_nはｎの増加に
伴なって急速に増えていき、音節の順次入力に際して、
ｎが小さい間は区切型の数は少ない。区切型の各々につ
いてそれを構成する各語音の頻級を合算して、その区切
型の頻級和IT_nを計算し、すべての型のIT_nを比較し、そ
のなかからIT_nが最小の型を最適区切型変数Topt_nとして
発見することができる。The "numerals" at the bottom of FIG. 14 are the numbers of possible delimiting types for each n. If the number of possible delimiters is N _n
There are law _{_{N n = N n-1 +}} N n-2. That is, N _n increases rapidly with the increase of n.
While n is small, the number of partition types is small. By summing the modal class of each word sounds constituting the for each separator type, calculates the Shikikyu sum IT _n of the separator type, compares the IT _n of all types, IT _n is the smallest from among the it can be found a type as optimum separator variable Topt _n.

ｎに対してそのｎに可能なすべての区切型を生成する
一般的な方法は次のごとくである。The general method for generating all possible partition types for n for n is as follows.

（１）音頻句には単音節語音と双音節語音しか含まれて
いないことに注目すれば、T_nの各型の末尾に新たに双音
節語音を追加した型の集合に、T_n+1の各型の末尾に新た
に単音節語音を追加した型の集合を加えれば、それはT
_n+2の型の集合になる。第15図にT₄がT₂とT₃とから生成
される例を図示してある。(1) If the sound Shikiku noted that it contains only a single syllable speech and the bi-syllable speech, to the set of the type to add a new bi-syllable speech at the end of each type of T _{_n,} T _n _{+ 1} If we add a set of types with a new monosyllabic word added to the end of each type, we get T
_{It is} a set of _{n + 2} types. T ₄ in FIG. 15 there is shown an example that is generated from the T ₂ and T ₃ Prefecture.

（２）上記の方法に従えば、ｎ＝１の型ａと、ｎ＝２の
型abおよびa/bから出発して、ｎが１増えるたびに、新
しいｎに対する型を見出し、ｎ−１とｎ−２の型のITの
値におのおの追加さるべき単音節語音あるいは双音節語
音の頻級を加算すれば、ｎのすべてについて、可能なす
べての型とおのおのIT_nとを順次に求めることができ
る。(2) According to the above method, starting from the type a with n = 1 and the types ab and a / b with n = 2, each time n increases by 1, a new type for n is found, and n−1 By adding the frequency of monosyllabic words or disyllable words to be added to the value of IT of type n-2 and the value of IT of type n-2, it is possible to obtain all possible types and ITn for all _n in order. Can be.

第15図は第14図の全部の区切変数T_nについて頻級和IT
_nを計算し、おのおのｎについてIT_nの最小の値ITopt
_nと、対応する型Topt_nを求めた結果を示す。本図におい
て太字がTopt_nとITopt_nである。FIG. 15 shows a frequency sum IT for all the delimiting variables T _{n in} FIG.
calculate _n and for each _n the minimum value ITopt of IT _n
_n and the result of finding the corresponding type Topt _n are shown. In this figure, bold letters are Topt _n and ITopt _n .

第15図において、最下欄に各Topt_nにしたがって漢字
列変換手段７で語音漢字変換を実行した結果を示す。ｎ
＝３においては、最小のIT₃の型が唯一に決まっていな
い。もしｎ＝３が音頻句が閉じているときには、解が決
まらない。このときには、（１）末尾が単音語音または双音語音のどちらかである
方を眞と定める。In Figure 15, shows the result of executing the speech kanji conversion in kanji sequence converting section 7 in the bottom section in accordance with each Topt _n. n
At = 3, the minimum IT ₃ type is not uniquely determined. If n = 3 indicates that the phrasal phrase is closed, no solution is determined. At this time, (1) it is determined that the end of which is either a monophone sound or a diphthong sound is true.

（２）区切の数が多い方または少ない方の一方の真とす
る。(2) Either one of the larger or smaller number of divisions is true.

のひとつまたは両方の条件によって解をきめるものとす
る。The solution shall be determined by one or both conditions.

3.7 発明の効果 3.7.1 音頻式語音最適区切と漢字変換の成績例第16図に、やや長い例文に対する音頻法による語音最
適区切と漢字変換の結果を示す。この例文において、漢
字変換を必要とする音節は219個、そのなかから人名の
３音節を除いた216個の音節が音頻法の対象となる。3.7 Effects of the Invention 3.7.1 Examples of the results of optimal phonetic word separation and kanji conversion Figure 16 shows the results of optimal phonetic sound separation and kanji conversion using the phonetic method for somewhat longer example sentences. In this example sentence, 219 syllables requiring kanji conversion are used, and 216 syllables excluding the three syllables of the person's name are subjected to the syllable frequency method.

例文は（１）から（12）までの12段に分けて記述され
ている。記号等の定義は以下の通り。The example sentences are described in 12 columns from (1) to (12). The definitions of symbols etc. are as follows.

１）各段の１行目は例文、２行目は中国国定ローマ字
（音：ピンイン）の「読み」、３行目は節点と句の状
況を示す。４〜６行は第12図（５）の形式による音頻構
造を示す。ただし音頻処理の対象となる単音節および双
音節の各語音は、ａまたab等の単音節語音記号を用いず
に、その語音を持つ同音語中の最多頻度の漢字語で表現
してある。a,ab等には、本来ローマ字の音節で書くか又
は同音漢字語の全部を書くべきかであるが、記載の場所
の制限から第16図では漢字語で代表して示した。1) The first line of each column is an example sentence, the second line is "reading" of Chinese national Roman alphabet (sound: Pinyin), and the third line is the situation of nodes and phrases. Lines 4 to 6 show the tone frequency structure in the format of FIG. 12 (5). However, each syllable and disyllable word sound to be subjected to vocal frequency processing is represented by the most frequent kanji word in the same syllable having that word sound without using a single syllable word symbol such as a or ab. In a, ab, etc., it should be originally written in Roman syllables or all of the same phonetic kanji words. However, in FIG. 16, they are represented by kanji words in FIG.

２）３行目において、「Ｖ」は節点、「−」は音頻句、
「▽」は外部区切の節点、「多」は多音句、「非」は非
変換句、「人」は人名扱いの句を指す。2) In the third line, "V" is a node, "-" is a frequent phrase,
“▽” indicates an external delimiter node, “many” indicates a polyphonic phrase, “non” indicates a non-conversion phrase, and “person” indicates a phrase treated as a person name.

３）４〜６行の音頻ネットワークの説明において、太い
実線は最適語音句区切経路である。（６）段目に２箇
所、９段目に１箇所、太い点線の経路が太い実線の経路
と共存している。これはこれらの音頻処理において、IT
が最小となる型が２個出たため、3.6.2において既に述
べたところに従い、それらの音頻句の末尾の語音が双音
節語音の型を眞として太い実線の経路を描き、単音節語
音の型と偽として太い点線の経路を描いたものてある。3) In the description of the phonological network in lines 4 to 6, the bold solid line is the optimal phrase separation path. (6) The path indicated by the thick dotted line coexists with the path indicated by the thick solid line in two places in the tier and one place in the ninth tier. This means that in these tone frequency processing, IT
As described in 3.6.2, the words at the end of these vocal phrases draw a thick solid line path with the type of disyllable words as true, and the type of monosyllabic words as described in 3.6.2. And a thick dotted line drawn as false.

４）下線は多音節語音を示す。この例文の多音節語音は
６箇所に現れ、これらは全て３音節語音である。４音節
語音は出てこない。4) Underlines indicate polysyllabic sounds. The polysyllabic sounds in this example sentence appear in six places, all of which are three syllable sounds. Four syllable words do not come out.

５）語音の下の小字体の数値は頻級でさる。頻級の数値
は北京語学院編「現代漢語頻率詞典1986年版」から計算
した。単語の語彙は中国文字改革委員会の「漢語音詞
匯1963年版」を参考にした。後者にあって前者にない双
音節語音の頻級は一律に18級とした。前者後者の双方に
ないが、辞書システム上に必要と判断される常用の双音
節語（地名の北京や広州の地名も含む）の語音の頻級
は、双音節の前後の単音節の頻級の数値の和から１を減
じた数値を採用した。5) The value of the small font under the phonetic sound is frequent. Frequent class numbers were calculated from the book “Modern Chinese Frequently-Looking Words 1986”, edited by Beijing Language Institute. The vocabulary of the words was based on the Chinese Character Reform Commission's "Kanji Phonetic Hui 1963 Edition". The frequency of disyllable words in the latter but not in the former was uniformly set at 18. Neither the former nor the latter, but the frequency of common disyllable words (including the place names of Beijing and Guangzhou) that are deemed necessary on the dictionary system is the frequency of monosyllables before and after the disyllable The value obtained by subtracting 1 from the sum of the above values was adopted.

６）７行目の「△」は同音語ミスを示す。双音節語の同
音語ミスにおいては、ひとつの音節だけが正しく変換さ
れていても、２字文のミスとした。処理の単位が「字」
ではなく「語」だからである。正解は△の下に太字で示
した。6) “△” on the seventh line indicates a homophone miss. In the case of a disyllable homonym, a two-letter sentence was mistaken even if only one syllable was correctly converted. The processing unit is "character"
It is not a word but a word. Correct answers are shown in bold below △.

７）は区切ミスである。全例文のなかで区切ミスは９段目に
おける３音節にわたる１件しかない。このことは、音頻
法を応用した語音区切が変換キーをまったく必要としな
いにもかかわらず、一語の入力ごとに人の判断によって
いちいち「変換キー」を入れていくいわゆる「人力熟語
変換」区切に比べて、ほとんどこれに近い区切能力を持
っていることを示している。7) Is a separation error. In all of the example sentences, there is only one mistake in the ninth row that spans three syllables. This means that, despite the fact that the phonetic delimiter applying the tone frequency method does not require a conversion key at all, a "conversion key" is inserted one by one by the judgment of a person for each word input. This indicates that the person has a separation ability that is almost similar to this.

3.7.2 音頻法以外の中文漢字変換方式の効果第17図は、第16図の例文に対する漢字変換の正変換率
についての分析を示す。第17図において、（１）は例文
の全部、（２）は例文における語区切、（３）は音頻法
による漢字変換の正変換率をそれぞれ示す。また、
（４）と（５）は参考対象として、最長一致法漢字変換
と単漢字変換のそれぞれの正変換率を示す。3.7.2 Effect of Chinese-to-Kanji Conversion Method Other Than Tone Method FIG. 17 shows an analysis of the conversion rate of the kanji conversion to the example sentence of FIG. In FIG. 17, (1) shows the whole example sentence, (2) shows word segmentation in the example sentence, and (3) shows the normal conversion rate of kanji conversion by the phonetic method. Also,
(4) and (5) show the respective normal conversion rates of the longest match kanji conversion and single kanji conversion as reference objects.

（３）と（４）の本文において、太字は語を単位とし
た字の誤りである。（５）においては太字は字の誤りで
ある。（３）〜（５）の右の欄は誤変換の内容を示す。
その評価は以下の通りである。In the texts of (3) and (4), bold characters are errors in characters in units of words. In (5), the bold characters are typographical errors. The columns on the right of (3) to (5) show the contents of the erroneous conversion.
The evaluation is as follows.

１）本図（５）の単漢字変換は中文入力法として、実用
性がまったくない。原変換率（漢字頻度学習機能がない
ときの変換率）は37.0％と低い。最近使用先頭配列辞書
式の学習機能をつけ、誤字が出るごとに候補字のなかか
ら選択修正再入力していく場合のシミュレーションをし
てみると、正変換率は54.6％に上昇するが、これでも変
換性能はきわめて低い。単漢字変換は中文入力には日本
語入力においてと同じく、効果の点で無意味である。1) The single kanji conversion shown in FIG. 5 (5) has no practical use as a Chinese sentence input method. The original conversion rate (conversion rate without the kanji frequency learning function) is as low as 37.0%. A simulation of a case where a learning function of recently used top array dictionary was attached and a character was selected and corrected and re-entered from among candidate characters every time an erroneous character appeared, showed that the normal conversion rate rose to 54.6%. But the conversion performance is extremely low. The single kanji conversion is meaningless for Chinese sentence in terms of the effect as in Japanese input.

２）単語単位の入力区切入力では区切ミスが当然に０に
なるから正変換率は高く、（３）の例では正変換率は9
5.8％である。しかしこの方式は、この例文の入力のた
めに区切変換キーを107回も打たなければならない欠点
がある。2) The input conversion rate is high in the input delimitation input in word units because the separation error is naturally 0, and the input conversion rate is 9 in the example of (3).
5.8%. However, this method has a disadvantage in that the delimiter conversion key must be pressed 107 times to input the example sentence.

日本語入力における文節単位の漢字変換とは異なり、
中文は分ち書きのない漢字だけの羅列であるから、中文
入力においては、単語変換キーを打つ肉体労働の回数よ
りも、どこで語を切るべきかという判断の精神労働の回
数の方が、オペレータの誤操作と疲労を生む原因として
重大である。その点で現在音入力では最も普遍的なこ
の方式は労働能率の面から不十分な効果しか持たないと
判断できる。Unlike the kanji conversion for each phrase in Japanese input,
Because Chinese sentence is a list of only kanji characters without breaks, in Chinese sentence input, the number of mental labor in determining where to break the word is more than the number of physical labor hitting the word conversion key. This is a serious cause of operation error and fatigue. In this respect, it can be judged that this method, which is the most universal in sound input at present, has an insufficient effect in terms of labor efficiency.

３）また、（４）の最長一致式自動区切漢字変換は、人
力による語区切方式の上記の欠点をカバーすべく試みら
れた方式で、現在日本語入力に広く用いられ非常に成功
しているものの中文に対する応用である。しかし、中文
の構造が日文とは相当に異なるために正変換率が高くな
らず、例文では誤字率17.6％にも達し、しかも相当量の
再区切操作を免れない。誤字のほとんどが区切ミスに起
因するものである。区切ミス修正はオペレータに対して
非常な手間と精神的負担をかけるので、正しい語区切が
なされた後の同音語ミスにくらべれば、悪質なミスであ
る。区切ミスが多い点で、最長一致式自動区切漢字変換
は中文入力に対しては、なお不十分な効果しかない。3) In addition, the longest match automatic delimiter kanji conversion of (4) is a method that has been attempted to cover the above-mentioned drawbacks of the manual word delimitation method, and is currently widely used in Japanese input and has been very successful. It is an application to the Chinese sentence of things. However, since the structure of the Chinese sentence is significantly different from that of the Japanese sentence, the correct conversion rate does not increase. In the example sentence, the typographical error rate reaches 17.6%, and a considerable amount of re-segmentation operation is inevitable. Most of the typographical errors are caused by mistakes in separation. Correction of a segmentation error puts a great deal of effort and mental burden on the operator, and is a vicious mistake when compared to a homophone error after a correct word segmentation is performed. The longest-matching automatic delimiter kanji conversion still has an inadequate effect on Chinese sentence input in that there are many delimiter errors.

3.7.3 音頻式自動語音区切漢字変換の眼目と効果１）オペレータは単語の判断をまったく気にしないで読
む通りに音音節キーを打っていけば、打鍵に追従して
ほとんど真に近い語区切の漢字語列が自動的に出力され
てくる。3.7.3 Eyes and Effects of Tone-Type Automatic Word Separation Kanji Conversion 1) If the operator hits the syllable key as he or she reads without worrying about the judgment of words at all, the word separation almost follows the keystroke. Is automatically output.

２）音頻式自動区切の効果のより、区切ミスは希にしか
発生せず、変換ミスはほとんど同音ミスに限られる。ゆ
えにオペレータはさしたる手間をかけずにミスを修正で
きる。2) Due to the effect of the frequent type automatic segmentation, segmentation errors rarely occur, and conversion errors are almost always limited to the same tone errors. Therefore, the operator can correct the mistake without much trouble.

３）区切ミスが希にした起こらないから、語音から漢字
に変換する際に、連続した２語の間に成立する文法的な
関係を満たす同音語だけに変換することによって、同音
語ミスを減少させ、漢字正変換率をさらに挙げることが
可能である。最長一致方のように区切ミスが多い方式で
は、区切ミスのまま文法処理を加えることになるので、
文法処理の効果は期待できない。3) Since punctuation errors rarely occur, when converting from phonetic sounds to kanji, by converting only to homophones that satisfy the grammatical relationship established between two consecutive words, homophone errors are reduced. It is possible to further increase the kanji normal conversion rate. In a method with many delimitation errors such as the longest match method, grammatical processing will be added with the delimitation error as it is,
The effect of grammar processing cannot be expected.

４）以上をまとめれば、本発明は、（１）変換キーの操作がまったく不要で、漢字の「読
み」の通りに入力し（２）入力に追従して真に近い語区切を自動的に行な
い、その区切にしたがって個々の語ごとに自動的に漢字
に変換する機能を持つことにより（３）オペレータの負担を軽減し疲労を減少させ入力作
業の生産性を向上させる効果を持つ中文語音漢字変換方式を提供するものである。4) Summarizing the above, the present invention provides: (1) No conversion key operation is required at all, and the input is performed as the kanji “read”. (3) It has the effect of reducing the burden on the operator, reducing fatigue and improving the productivity of input work by having the function of automatically converting each word to kanji according to the division. It provides a conversion method.

【図面の簡単な説明】[Brief description of the drawings]

第１図は本発明の一実施例の構成を示すブロック図であ
る。第２図（ａ）は中文の例を示す図、第２図（ｂ）はその
中文に対応する日文を示す図、第３図（ａ）は第２図
（ｂ）の日文に文節区切を入れてなる文節区切日文を示
す図、第３図（ｂ）は本図（ａ）の日文の文節区切りに
似せたリズムで区切った中文の例を示す図、第４図は第
２図（ａ）の中文を語で区切った語区切中文を示す図、
第５図は第３図（ａ）の日文の各音節を仮名で表現した
仮名文字音節区切日文を示す図、第６図は第４図の中文
の各音節を中国式ローマ字で表現したローマ字音節区切
中文を示す図、第７図は第２図（ｂ）の日文を音節単位
で入力して最長一致法により文節に区切り、漢字に変換
して得られる日文の例を示す図、第８図（ａ）は中文に
おける階層分析法の概念を示す図、第８図（ｂ）は朗読
の際に小休止する位置で区切る方式の中文の区切り方を
例示する図、第９図は第２図（ａ）と同じ中文の例を示
す図、第９図（ｂ）は本図（ａ）の中文を音節で入力し
て最長一致法により漢字に変換して得た漢字文を示す
図、第10図は第９図の音節漢字変換の概念を示す図であ
る。第11図は中文における語音と情報量の例を示す図、第12
図（１）〜（４）は第11図（２）の中文における漢字を
捨象して構成した中文音節列の音頻ネットワークを示す
図、第12図（５）は本図（４）の音頻ネットワークにお
ける最小音頻経路の語音を最多頻度の漢字に変換して得
た漢字文を示す図、第13図は第１図実施例における記憶
手段に記憶されるデータを示す図、第14図はｎ音節句に
おいて可能な区切の型を示す図、第15図は第14図の区切
変数T_nに関する頻級和IT_nを示す図、第16図は本発明で
用いる音頻法により中文を漢字に変換して得た漢字文の
例を示す図、第17図は第16図の例文に対する漢字変換の
正解率を示す図である。１……辞書、２……語音・頻級検索手段、３……単音節
語音との頻級、双音節語音とその頻級、３音節語音およ
び多音節語音を記憶する手段、４……節点判断手段、５
……最適語音区切型生成手段、６……語音区切型記憶手
段、７……漢字例生成手段、８……漢字列記憶手段。FIG. 1 is a block diagram showing the configuration of one embodiment of the present invention. FIG. 2 (a) is a diagram showing an example of a Chinese sentence, FIG. 2 (b) is a diagram showing a Japanese sentence corresponding to the Chinese sentence, and FIG. 3 (a) is a segmentation of the Japanese sentence of FIG. 2 (b). FIG. 3 (b) shows an example of a Chinese sentence separated by a rhythm resembling the phrase break of the Japanese sentence in FIG. 3 (a), and FIG. 4 (b) shows FIG. 2 (a). ) Is a diagram showing word-separated Chinese sentence in which Chinese sentence is separated by words.
FIG. 5 is a diagram showing kana character syllable delimited sentences in which each syllable of the Japanese sentence of FIG. 3 (a) is represented by a kana, and FIG. 6 is a Roman syllable in which each syllable of the Chinese sentence of FIG. FIG. 7 shows an example of a Japanese sentence obtained by inputting the Japanese sentence of FIG. 2 (b) in units of syllables, separating the Japanese sentence of FIG. (A) is a diagram showing the concept of the hierarchical analysis method in the Chinese sentence, FIG. 8 (b) is a diagram showing an example of how to divide a Chinese sentence at the position of a short pause in reading, and FIG. 9 is FIG. FIG. 9B is a diagram showing an example of the same Chinese sentence as FIG. 9A, and FIG. 9B is a diagram showing a Chinese character sentence obtained by inputting the Chinese sentence of FIG. FIG. 10 is a diagram showing the concept of the syllable-kanji conversion of FIG. FIG. 11 is a diagram showing an example of speech sounds and the amount of information in a Chinese sentence, and FIG.
FIGS. 1 (1) to 4 (4) are diagrams showing the frequent phonetic network of the Chinese syllable sequence formed by omitting the kanji in the Chinese sentence of FIG. 11 (2), and FIG. 12 (5) is the phonetic network of FIG. 4 (4). FIG. 13 is a diagram showing a kanji sentence obtained by converting the vocabulary sound of the smallest frequent path into the most frequent kanji in FIG. 13, FIG. 13 is a diagram showing data stored in the storage means in the embodiment of FIG. 1, and FIG. shows a delimiter types possible in Boys, FIG. 15 converts figure, FIG. 16 Chinese by the sound frequent method used in the present invention in Chinese characters indicating the Shikikyu sum IT _n relates separated variable T _n in FIG. 14 FIG. 17 is a diagram showing an example of the kanji sentence obtained in FIG. 17, and FIG. 17 is a diagram showing the correct answer rate of kanji conversion for the example sentence of FIG. 1 ... Dictionary 2 ... Voice / Frequency Retrieval Means 3 ... Frequency with Monosyllabic Words, Disyllable Words and Their Frequent Levels, Means to Store Three-Syllable Words and Polysyllabic Words, 4 ... Nodes Judgment means, 5
···················································································

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】中国語の音節の連なりで表わされた語音の
列を入力語音列として入力し、該入力語音列を語音ごと
に区切る中国語語音区切方式において、中国語の語音のうちで単音節語音および双音節語音につ
いて中国語文に出現する統計的頻度の対数値の絶対値を
該語音の頻級として記憶する辞書と、前記入力語音列の各単音節語音について前記辞書を検索
し、該単音節語音及びこの単音節語音の頻級を該辞書か
ら読み出すとともに、該単音節語音の直前に別の単音節
語音が入力されているときにはこれら両単音節語音でな
る双音節語音があるか否かを前記辞書で検索し、該辞書
に該双音節語音がある場合には該双音節語音およびこの
双音節語音の頻級を該辞書から読み出す語音・頻級検索
手段と、該語音・頻級検索手段で読み出された前記単音節語音、
該単音節語音の頻級、前記双音節語音および該双音節語
音の頻級を記憶する第１の記憶手段と、前記語音・頻級検索手段で双音節語音が検索されなかっ
た単音節語音と前記直前入力単音節語音との間における
前記入力語音列上の仮想の点を節点とし、直近の２つの
該節点の間にある前記入力語音列の句を音頻句とすると
き、該音頻句に対応する前記単音節語音及び双音節語音
並びに該単音節語音の頻級及び該双音節語音の頻級を前
記第１の記憶手段から読み出す節点判断手段と、前記節点判断手段で読み出された前記単音節語音および
双音節語音並びに該単音節語音の頻級および該双音節語
音の頻級を受け、前記音頻句を語音の単位に区切る区切
り方を語音区切型として生成し、該語音区切型のうちで
最適の語音区切型を選んで出力する最適語音区切型生成
手段と、前記最適語音区切型生成手段で生成された前記語音区切
型を記憶する第２の記憶手段とを備え、前記最適語音区切型生成手段は、前記語音区切型におけ
る各語音の前記頻級の和を求め、該和が最少である語音
区切型を前記最適語音区切型とすることを特徴とする中
国語語音区切方式。1. A method for inputting a sequence of speech sounds represented by a series of Chinese syllables as an input speech sequence, and separating the input speech sequence for each speech sound. A dictionary that stores the absolute value of the logarithmic value of the statistical frequency that appears in Chinese sentences for monosyllable words and disyllable words as a frequency of the word, and searches the dictionary for each monosyllable word in the input word sequence. The monosyllabic speech and the frequency of the monosyllabic speech are read from the dictionary, and when another monosyllabic speech is input immediately before the monosyllabic speech, is there a disyllable speech composed of both monosyllabic speeches? Whether or not the disyllable word is present in the dictionary, and if the disyllable word is present in the dictionary, a word / frequency search means for reading out the disyllable word and the frequency of the disyllable word from the dictionary; Before being read by class search means Monosyllabic speech,
First storage means for storing the frequency of the monosyllable speech, the disyllable speech and the frequency of the disyllable speech; and the monosyllable speech for which the disyllable speech was not retrieved by the speech / frequency search means. When a virtual point on the input speech sequence between the immediately preceding input monosyllable speech is a node, and the phrase of the input speech sequence between the two most recent nodes is a vocal phrase, A node determining means for reading out the corresponding monosyllabic sounds and disyllable words, a frequency of the monosyllable words and a frequency of the disyllable words from the first storage means, and the node read by the node determining means Receiving the monosyllable speech and disyllable speech, and the frequency of the monosyllable speech and the frequency of the disyllable speech, generating a method of dividing the vocal phrasal into speech units as a speech segmentation type, Select the best speech separation type A speech separation unit, and a second storage unit that stores the speech separation unit generated by the optimal speech separation unit, wherein the optimal speech separation unit generates each speech in the speech separation unit. A speech sum delimiter of which the sum is the smallest, and the speech delimiter having the smallest sum is the optimal speech delimiter.

【請求項２】中国語の音節の連らなりで表わされた語音
の列を入力語音列として入力し、該入力語音列を語音ご
とに区切り、前記入力語音列から区切られた前記語音を
それぞれ漢字に変換する中国語語音漢字変換方式におい
て、中国語の語音を見出しとして該語音の漢字を記憶すると
ともに、中国語を語音のうちで単音節語音および双音節
語音について中国語文に出現する統計的頻度の対数値の
絶対値を該語音の頻級として記憶する辞書と、前記入力語音列の各単音節語音について前記辞書を検索
し、該単音節語音及びこの単音節語音の頻級を該辞書か
ら読み出すとともに、該単音節語音の直前に別の単音節
語音が入力されているときにはこれら両単音節語音でな
る双音節語音があるか否かを前記辞書で検索し、該辞書
に該双音節語音がある場合には該双音節語音およびこの
双音節語音の頻級を該辞書から読み出す語音・頻級検索
手段と、該語音・頻級検索手段で読み出された前記単音節語音、
該単音節語音の頻級、前記双音節語音および該双音節語
音の頻級を記憶する第１の記憶手段と、前記語音・頻級検索手段で双音節語音が検索されなかっ
た単音節語音と前記直前入力単音節語音との間における
前記入力語音列上の仮想の点を節点とし、直近の２つの
該節点の間にある前記入力語音列の句を音頻句とすると
き、該音頻句に対応する前記単音節語音及び双音節語音
並びに該単音節語音の頻級及び該双音節語音の頻級を前
記第１の記憶手段から読み出す節点判断手段と、前記節点判断手段で読み出された前記単音節語音および
双音節語音並びに該単音節語音の頻級および該双音節語
音の頻級を受け、前記音頻句を語音の単位に区切る区切
り方を語音区切型として生成し、該語音区切型のうちで
最適の語音区切型を選んで出力する最適語音区切型生成
手段と、前記最適語音区切型生成手段で生成された前記語音区切
型を記憶する第２の記憶手段と、前記最適語音区切型で区切って示される各語音について
前記辞書を検索し、該語音を見出しとする漢字のうちの
１つを該辞書から読み出し、前記入力語音列に対応する
漢字列を生成する漢字列生成手段とを備え前記最適語音区切型生成手段は、前記語音区切型におけ
る各語音の前記頻級の和を求め、該和が最少である語音
区切型を前記最適語音区切型とすることを特徴とする中
国語語音漢字変換方式。2. A speech sequence represented by a series of Chinese syllables is input as an input speech sequence, and the input speech sequence is divided for each speech, and the speech separated from the input speech sequence is input. In the Chinese word sound kanji conversion method of converting each to a kanji, a kanji of a Chinese sound is stored with a Chinese sound as a heading, and statistics appearing in a Chinese sentence for monosyllabic sounds and disyllable sounds among Chinese sounds. A dictionary that stores the absolute value of the logarithmic value of the target frequency as the frequency of the speech, and the dictionary is searched for each monosyllable speech of the input speech sequence. In addition to reading from the dictionary, when another monosyllabic word is input immediately before the monosyllabic word, the dictionary is searched to determine whether or not there is a disyllable word composed of these monosyllabic words. Syllable words And speech-Shikikyu search means for reading out from the dictionary specification the frequent grade bi syllables sound and the bi-syllable word sound when the monosyllable speech read by word or sound-Shikikyu search means,
First storage means for storing the frequency of the monosyllable speech, the disyllable speech and the frequency of the disyllable speech; and the monosyllable speech for which the disyllable speech was not retrieved by the speech / frequency search means. When a virtual point on the input speech sequence between the immediately preceding input monosyllable speech is a node, and the phrase of the input speech sequence between the two most recent nodes is a vocal phrase, A node determining means for reading out the corresponding monosyllabic sounds and disyllable words, a frequency of the monosyllable words and a frequency of the disyllable words from the first storage means, and the node read by the node determining means Receiving the monosyllable speech and disyllable speech and the frequency of the monosyllable speech and the frequency of the disyllable speech, generating a method of dividing the vocal phrasal into speech units as a speech segmentation type, Select the best speech separation type Speech separation type generating means; second storage means for storing the speech separation type generated by the optimum speech separation type generating means; and searching the dictionary for each speech separated by the optimum speech separation type. Kanji string generation means for reading one of the kanji having the word sound as a heading from the dictionary, and generating a kanji string corresponding to the input word sound string. A sum of the frequent classes of each speech in the pattern, and the speech segmentation type having the smallest sum is defined as the optimal speech segmentation type.