JPH1125093A

JPH1125093A - Machine translation device and computer-readable recording medium recording machine translation processing program

Info

Publication number: JPH1125093A
Application number: JP9180907A
Authority: JP
Inventors: Seiko Yoshiyama; 盛子吉山
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1997-07-07
Filing date: 1997-07-07
Publication date: 1999-01-29

Abstract

PROBLEM TO BE SOLVED: To make it possible to correctly translate an input sentence including continuous character strings. SOLUTION: A morpheme analysis part divides a sentence inputted from an input part 10 into words by referring to a user dictionary 61 and a translation dictionary 57, and a syntax analysis part 53 analyzes modifications or the like and forms the tree structure of the input sentence. A syntax conversion part 54 converts the tree structure into the structure of a sentence to be translated, a syntax formation part 55 forms the tree structure of the sentence to be translated, a morpheme formation part 56 adjusts a change in a word form or the like, and an output part 20 outputs a translation result. A word including a continuous character string such as numerals is substituted by a determined symbol and registered, and when the word is retrieved, the character string substituted by the word is stored and an English translation is substituted for the character string. When the character string is numerals, information of a singular or plural is also stored and correct translation is formed.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、第一言語を第二言
語に翻訳するための機械翻訳装置、及び機械翻訳処理プ
ログラムを記録したコンピュータ読み取り可能な記録媒
体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device for translating a first language into a second language, and a computer-readable recording medium storing a machine translation processing program.

【０００２】[0002]

【従来の技術】従来の機械翻訳装置、及びその機械翻訳
装置を動作させるためのプログラムを記録した記録媒体
においては、例えば、「１０勝する」（サ変動詞）、
「１０勝」（サ変名詞）、「５連敗する」（サ変動
詞）、「５連敗」（サ変名詞）のような数字を含むサ変
動詞やサ変名詞は、所謂「定型文」として登録するか、
ユーザー辞書に個別に登録することによって対応してい
た。「定型文」に登録する場合は、数字の部分を「＊」
等の予め定められた記号に置き換えて登録するようにし
ている。従って、「＊勝する」を「ｗｉｎ＊ｇａｍ
ｅ」と登録すれば、「１勝する」は「ｗｉｎ１ｇａ
ｍｅ」のように翻訳される。また、ユーザー辞書におい
て、「１勝する」を見出し語にして、訳を「ｗｉｎ１
ｇａｍｅ」として登録すれば、「彼は１勝した」は、
「Ｈｅｗｏｎ１ｇａｍｅ．」と翻訳される。2. Description of the Related Art In a conventional machine translation apparatus and a recording medium in which a program for operating the machine translation apparatus is recorded, for example, "10 wins" (sa variation),
Sa vars and sa nouns containing numbers such as "10 wins" (sa noun), "5 consecutive loses" (sa var), and "5 consecutive loses" (sa var) can be registered as so-called "fixed sentences"
This has been addressed by registering them individually in the user dictionary. When registering in the "fixed text", replace the number with "*"
And the like, and are registered in place of predetermined symbols. Therefore, "* win" is replaced with "win * gam"
e "," 1 win "is" win 1 ga "
translated as "me". In the user dictionary, “1 win” is used as a headword, and the translation is “win 1”.
If you register as "game,""He won one"
Translated as "He won 1 game."

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、定型文
というのは、「形が決まった文」であって、入力文が登
録された形と同じ場合にだけ参照されるものである。そ
のため、「＊勝する」で登録された定型文は、入力文が
「１勝する」であれば引けるが、「１勝した」や「彼は
１勝する」だと引けてこない。また、＊の部分を実際の
入力文に現れた数字に置き換えるだけなので、「１０勝
する」は、正しくは「ｗｉｎ１０ｇａｍｅｓ」であ
るのに、「ｗｉｎ１０ｇａｍｅ」になってしまう。
これではいろいろなバリエーションを持つ入力文には到
底対応できない。また、ユーザー辞書に登録する場合で
も、「勝」や「勝する」の直前にはどんな数字が来ても
いいのであるから、それを全ての数字に対して個別に登
録するのは不可能である。もちろん、機械翻訳装置に予
め搭載される翻訳辞書もしくは翻訳プログラムで対応さ
れていれば、ユーザーが定型文に登録したり、ユーザー
辞書に登録する必要はない。しかし、数字等の連続性の
ある文字列を含む単語の中でユーザーが必要とする単語
全てが対応されているとは言い難い。However, a fixed sentence is a "sentence having a fixed form" and is referred to only when an input sentence is the same as a registered form. Therefore, the fixed phrase registered with "* wins" can be closed if the input sentence is "1 win", but cannot be closed if "it has won" or "he wins 1". Also, since the * part is simply replaced with the number that appears in the actual input sentence, “win 10 games” is correctly “win 10 games” but “win 10 games”.
This cannot handle input sentences with various variations at all. Also, even when registering in the user dictionary, since any number can come just before “win” or “win”, it is impossible to register it individually for all numbers. is there. Of course, if the translation is performed by a translation dictionary or a translation program installed in the machine translation apparatus in advance, the user does not need to register a fixed phrase or a user dictionary. However, it is difficult to say that all words required by the user among words including a continuous character string such as a number are supported.

【０００４】本発明は上述した問題点を解決するために
なされたものであり、数字のように連続性のある文字列
を含む単語であって翻訳辞書に登録されていない単語を
ユーザー辞書に登録できるようにしたものであり、連続
性のある文字列をいちいち別個のユーザー辞書に登録す
るのではなく、一つの見出し語でどんな連続性のある文
字列が入ってきても対応可能にし、その辞書を検索し
て、訳語を正しく翻訳文に反映させることができる機械
翻訳装置及び、機械翻訳処理プログラムを記録したコン
ピュータ読み取り可能な記録媒体を提供することを目的
としている。SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problem. A word containing a continuous character string such as a number and not registered in a translation dictionary is registered in a user dictionary. Rather than registering continuous character strings in separate user dictionaries, it is possible to respond to any continuous character string that comes in with one headword. It is an object of the present invention to provide a machine translation apparatus capable of retrieving and translating a translated word into a translated sentence correctly, and a computer-readable recording medium recording a machine translation processing program.

【０００５】[0005]

【課題を解決するための手段】この目的を達成するため
に、本発明の請求項１に記載の機械翻訳装置は、入力手
段によって入力された第一言語を単語に分割するための
形態素解析手段と、前記形態素解析手段によって分割さ
れた第一言語の単語の係り受け等を解析するための構文
解析手段と、前記構文解析手段によって解析された第一
言語について翻訳辞書を参照しながら目的の第二言語に
翻訳するための翻訳生成手段とを備えたものを対象とし
て、特に、連続性のある文字列を含む単語を予め定めら
れた記号によって記憶手段に登録することができる登録
手段を備えたことを特徴とするものである。In order to achieve this object, a machine translation apparatus according to a first aspect of the present invention comprises a morphological analysis unit for dividing a first language input by an input unit into words. Syntactic analysis means for analyzing the dependency of the words of the first language divided by the morphological analysis means, and a first language analyzed by the syntax analysis means for the first language analyzed by referring to a translation dictionary In particular, the present invention has a registration means capable of registering a word including a continuous character string in a storage means by a predetermined symbol, with a translation generation means for translating into two languages. It is characterized by the following.

【０００６】従って、この発明によれば、前記登録手段
によって、連続性のある文字列を含む単語を予め定めら
れた記号によって記憶手段に登録することができる。例
えば、「１０勝」、「１０勝する」のような単語は、
「１０」の部分を記号化して一つの見出し語として前記
記憶手段に登録しておく。このように単語を記号によっ
て登録しておくことにより、前記入力手段によって入力
された第一言語を前記形態素解析手段によって正確に単
語に分割することができ、以後の前記構文解析手段によ
る構文解析、及び前記翻訳生成手段による前記第二言語
への翻訳生成を的確に実行して正確な翻訳結果を得るこ
とができる。Therefore, according to the present invention, a word including a continuous character string can be registered in the storage means by a predetermined symbol by the registration means. For example, words such as "10 wins" and "10 wins"
The part "10" is symbolized and registered in the storage means as one headword. By registering a word as a symbol in this way, the first language input by the input unit can be accurately divided into words by the morphological analysis unit. In addition, it is possible to accurately execute the translation generation into the second language by the translation generation unit and obtain an accurate translation result.

【０００７】また、請求項２に記載の機械翻訳装置は、
前記記憶手段に登録された連続性のある文字列を含む単
語を検索する検索手段と、前記記号に置き換えられた連
続性のある文字列を記憶する文字列記憶手段とを備えた
ことを特徴とする。従って、入力文の中に連続性のある
文字列を含む単語が出現した場合に、前記記憶手段に登
録された単語を前記検索手段によって検索することによ
り、例えば、「１０勝」が、「１０」と「勝」に切れて
しまったり、「１０勝する」が、「１０」と「勝する」
に切れて「勝する」が未知語（辞書に登録がない単語）
となるようなことがないようにする。また、予め定めら
れた記号に置き換えられた連続性のある文字列の情報を
前記文字列記憶手段に記憶することによって、その文字
列を翻訳結果に反映させることができる。[0007] The machine translation device according to claim 2 is
A search unit that searches for a word including a continuous character string registered in the storage unit; and a character string storage unit that stores a continuous character string replaced with the symbol. I do. Therefore, when a word including a character string having continuity appears in the input sentence, the word registered in the storage unit is searched by the search unit. "And" Win ", or" Win 10 ", but" 10 "and" Win "
Unknown word (word not registered in the dictionary)
So that there is no such thing as Further, by storing information of a character string having continuity replaced with a predetermined symbol in the character string storage means, the character string can be reflected in a translation result.

【０００８】また、請求項３に記載の機械翻訳装置は、
前記構文解析手段が、前記記憶手段等に登録された情報
に基づいて、連続性のある文字列を含む翻訳に必要な情
報をツリー構造に反映させるツリー構造変換手段を備え
ることを特徴とするものである。従って、前記形態素解
析手段によって得られた単語を句にまとめ上げたり、主
語や目的語等の係り先を決める等して、入力文をその内
部構造であるツリー構造に変換することがでる。このよ
うに、ツリー構造に変換しておくことにより、最終的
に、正しい翻訳結果を得ることができる。[0008] The machine translation device according to claim 3 is
Wherein the parsing means comprises a tree structure conversion means for reflecting information necessary for translation including a continuous character string in a tree structure based on the information registered in the storage means and the like. It is. Therefore, the input sentence can be converted into a tree structure, which is the internal structure of the input sentence, by combining words obtained by the morphological analysis means into phrases, determining the destination of a subject, an object, and the like. By converting the data into a tree structure, a correct translation result can be finally obtained.

【０００９】また、請求項４に記載の機械翻訳装置は、
前記記憶手段が、ユーザーによって使用される外部記憶
装置であることを特徴とするものである。従って、ユー
ザーによって使用される前記外部記憶装置に、連続性の
ある文字列を含む単語を予め定められた記号によって記
憶しておけば、これを、機械翻訳する度に有効に使用す
ることができる。The machine translation device according to claim 4 is
The storage means is an external storage device used by a user. Therefore, if words including a continuous character string are stored in the external storage device used by the user with predetermined symbols, the words can be effectively used every time machine translation is performed. .

【００１０】また、請求項５に記載の機械翻訳装置は、
前記翻訳生成手段によって生成された第二言語の翻訳文
を出力する出力手段をさらに備えたことを特徴とするも
のである。従って、前記翻訳生成手段によって生成され
た第二言語の翻訳文を前記出力手段によって出力するこ
とにより、翻訳した第二言語を直ちに確認することがで
き、一連の翻訳処理を効率的に実施することができる。Further, the machine translation device according to claim 5 is
An output unit for outputting the second language translation sentence generated by the translation generation unit is further provided. Therefore, by outputting the translated sentence of the second language generated by the translation generating means by the output means, the translated second language can be immediately confirmed, and a series of translation processes can be efficiently performed. Can be.

【００１１】また、請求項６に記載の機械翻訳処理プロ
グラムを記録したコンピュータ読み取り可能な記録媒体
は、入力手段によって入力された第一言語を単語に分割
するための形態素解析プログラムと、前記形態素解析プ
ログラムによって分割された第一言語の単語の係り受け
等を解析するための構文解析プログラムと、前記構文解
析プログラムによって解析された第一言語について翻訳
辞書を参照しながら目的の第二言語に翻訳するための翻
訳生成プログラムとを記録したものを対象として、特
に、連続性のある文字列を含む単語を予め定められた記
号によって記憶手段に登録することができる登録プログ
ラムを備えたことを特徴とするものである。A computer-readable recording medium storing the machine translation processing program according to claim 6, further comprising: a morphological analysis program for dividing a first language input by an input means into words; A parsing program for analyzing the dependency of words of the first language divided by the program, and translating the first language analyzed by the parsing program into a target second language with reference to a translation dictionary And a registration program capable of registering a word including a continuous character string in a storage means by a predetermined symbol. Things.

【００１２】従って、前記記録媒体を用いてプログラム
を実行することにより、前記登録プログラムによって、
連続性のある文字列を含む単語を予め定められた記号に
よって記憶手段に登録することができる。そして、この
ように単語を記号によって登録しておくことにより、前
記入力手段により入力された第一言語を形態素解析によ
って正確に単語に分割することができ、以後の前記構文
解析、及び前記第二言語への翻訳生成を的確に実行して
正確な翻訳結果を得ることができる。Therefore, by executing the program using the recording medium, the registration program allows
A word including a continuous character string can be registered in the storage unit using a predetermined symbol. By registering the word as a symbol in this way, the first language input by the input unit can be accurately divided into words by morphological analysis, and the subsequent syntax analysis and the second Accurate translation results can be obtained by accurately executing translation generation into a language.

【００１３】また、請求項７に記載の機械翻訳処理プロ
グラムを記録したコンピュータ読み取り可能な記録媒体
は、前記記憶手段に登録された連続性のある文字列を含
む単語を検索する検索プログラムと、前記記号に置き換
えられた連続性のある文字列を記憶する文字列記憶プロ
グラムとを備えたことを特徴とするものである。従っ
て、その記録媒体を用いてプログラムを実行することに
より、入力文の中に連続性のある文字列を含む単語が出
現した場合に、前記記憶手段に登録された単語を的確に
検索することができる。また、予め定められた記号に置
き換えられた連続性のある文字列の情報を記憶しておく
ことにより、その文字列を翻訳結果に反映させることが
できる。Further, a computer readable recording medium recording the machine translation processing program according to claim 7 is a computer readable recording medium for retrieving a word including a continuous character string registered in the storage means, A character string storage program for storing a continuous character string replaced with a symbol. Therefore, by executing the program using the recording medium, when a word including a continuous character string appears in the input sentence, it is possible to accurately search for the word registered in the storage unit. it can. Further, by storing information of a character string having continuity replaced with a predetermined symbol, the character string can be reflected in a translation result.

【００１４】また、請求項８に記載の機械翻訳処理プロ
グラムを記録したコンピュータ読み取り可能な記録媒体
は、前記構文解析プログラムが、前記記憶手段等に登録
された情報に基づいて、連続性のある文字列を含む翻訳
に必要な情報をツリー構造に反映させるツリー構造変換
プログラムを備えることを特徴とするものである。従っ
て、その記録媒体を用いてプログラムを実行することに
より、形態素解析によって得られた単語を句にまとめ上
げたり、主語や目的語等の係り先を決める等して、入力
文をその内部構造であるツリー構造に変換することがで
きる。このように、ツリー構造に変換しておくことによ
り、最終的に、正しい翻訳結果を得ることができる。According to another aspect of the present invention, there is provided a computer-readable recording medium on which the machine translation processing program is recorded, wherein the syntactic analysis program is configured to execute continuous character processing based on information registered in the storage means or the like. A tree structure conversion program for reflecting information necessary for translation including a column in a tree structure is provided. Therefore, by executing a program using the recording medium, the words obtained by the morphological analysis are grouped into phrases, and the relation between the subject and the object is determined. It can be converted to a tree structure. By converting the data into a tree structure, a correct translation result can be finally obtained.

【００１５】[0015]

【発明の実施の形態】以下に、本発明の機械翻訳装置及
び機械翻訳処理プログラムを記録したコンピュータ読み
取り可能な記録媒体を具体化した実施の形態について、
第一言語を日本語、第二言語を英語とした場合を例にと
って説明する。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of a machine translation apparatus and a computer readable recording medium on which a machine translation processing program is recorded according to the present invention will be described below.
The case where the first language is Japanese and the second language is English will be described as an example.

【００１６】さらに、連続性のある文字列として、ここ
では数字を例に挙げて説明するが、連続性のある文字列
であれば数字に限らず、例えば、アルファベット等でも
構わない。[0016] Furthermore, as a continuous character string, a number will be described as an example here. However, a continuous character string is not limited to a number, and may be, for example, an alphabet.

【００１７】本実施の形態の機械翻訳装置の構成を図１
に示す。この図に示すように、キーボード等からなる日
本語を入力するための入力部１０と、ＣＲＴ（陰極線
管）に表示したり、プリンタから印刷するための出力手
段としての出力部２０は、装置全体を制御するための中
央処理装置であるＣＰＵ３０に接続されている。FIG. 1 shows the configuration of the machine translation apparatus according to the present embodiment.
Shown in As shown in FIG. 1, an input unit 10 such as a keyboard for inputting Japanese, and an output unit 20 as an output unit for displaying on a CRT (cathode ray tube) or printing from a printer are provided as a whole apparatus Is connected to a CPU 30 which is a central processing unit for controlling the CPU.

【００１８】ＲＡＭ４０はＣＰＵ３０に接続され、入力
された日本語文を記憶するための入力バッファ領域４１
と、翻訳結果の英語文を記憶するための出力バッファ領
域４２に分かれている。The RAM 40 is connected to the CPU 30 and has an input buffer area 41 for storing the input Japanese sentence.
And an output buffer area 42 for storing an English sentence as a translation result.

【００１９】プログラムや辞書等を格納するＲＯＭ５０
は、ＣＰＵ３０と接続され、その中には、図４のフロー
チャートで示すプログラムが記憶されており、日本語文
を英語文に翻訳する翻訳プログラム５１と、翻訳プログ
ラム５１によって翻訳を実行する際に参照される翻訳辞
書５７とが格納されている。翻訳プログラム５１は、日
本語文を形態素解析するための形態素解析部５２と、日
本語文の係り受け等の解析をする構文解析部５３と、解
析された日本語文を英語の構造に変換する構文変換部５
４と、変換された英語の構造から英語を生成する構文生
成部５５と、語尾変化等の調整を行う形態素生成部５６
とに分かれている。ROM 50 for storing programs, dictionaries, etc.
Is connected to the CPU 30, in which a program shown in the flowchart of FIG. 4 is stored, which is referred to when a translation program 51 for translating a Japanese sentence into an English sentence is executed by the translation program 51. The translation dictionary 57 is stored. The translation program 51 includes a morphological analysis unit 52 for morphologically analyzing a Japanese sentence, a syntax analysis unit 53 for analyzing dependency of the Japanese sentence, and a syntax conversion unit for converting the analyzed Japanese sentence into an English structure. 5
4, a syntax generation unit 55 that generates English from the converted English structure, and a morpheme generation unit 56 that adjusts inflection and the like.
And divided into

【００２０】ＣＰＵ３０には、外部記憶装置（ＨＤＤ）
６０が接続されており、ユーザーが任意に作成するユー
ザー辞書６１が格納されている。The CPU 30 has an external storage device (HDD)
60 is connected, and stores a user dictionary 61 arbitrarily created by the user.

【００２１】図４は、本実施の形態の動作を示すフロー
チャートであり、一般的な翻訳の処理の流れを説明して
いる。FIG. 4 is a flowchart showing the operation of the present embodiment, and explains the flow of a general translation process.

【００２２】先ず、ステップ１（以下、Ｓ１と略称す
る。他のステップも同様）において、キーボード等から
日本語文を入力する。入力された日本語は翻訳にかける
ために入力バッファ領域４１に記憶される（Ｓ２）。こ
こに記憶された入力文は形態素解析部５２において翻訳
辞書５７とユーザー辞書６１を参照して該当する単語が
あるか否かを調べる形態素解析の処理が行われる（Ｓ
３）。通常、形態素解析ではユーザー辞書があれば翻訳
辞書に優先して検索される。一般的に、与えられた文の
構造を把握するためには、先ず、文を単語の並びとして
認識し、各単語を品詞等の属性に対応づける処理が必要
である。英語等では文が単語に分かち書きされるので単
語と単語の境界を認定する必要はないが、日本語や中国
語では通常分かち書きはされないので、単語の境界を認
定する必要がある。そして、文の中に出現した単語が活
用変化や語尾変化をしている場合には、その活用形を認
識して、原形に戻す操作も必要である。単語の並びには
一定の規則があり、入力された文字列をこの規則に従っ
て翻訳辞書５７やユーザー辞書６１等の辞書を使用しな
がら分割していく処理が「形態素解析」である。First, in step 1 (hereinafter abbreviated as S1; other steps are the same), a Japanese sentence is input from a keyboard or the like. The input Japanese is stored in the input buffer area 41 for translation (S2). The input sentence stored here is subjected to morphological analysis processing in which the morphological analysis unit 52 refers to the translation dictionary 57 and the user dictionary 61 to check whether there is a corresponding word (S).
3). Usually, in the morphological analysis, if there is a user dictionary, it is searched prior to the translation dictionary. Generally, in order to grasp the structure of a given sentence, it is necessary to first recognize the sentence as a sequence of words and to associate each word with an attribute such as a part of speech. In English and the like, sentences are separated into words, so there is no need to determine the boundaries between words. However, in Japanese and Chinese, usually, words are not separated, so it is necessary to recognize the boundaries between words. When a word that appears in a sentence changes inflection or ending, it is necessary to recognize the inflected form and return to the original form. The word sequence has a certain rule, and a process of dividing an input character string using a dictionary such as the translation dictionary 57 or the user dictionary 61 according to the rule is “morphological analysis”.

【００２３】本実施の形態の機械翻訳装置、及び機械翻
訳処理プログラム記録したコンピュータ読み取り可能な
記録媒体は、形態素解析における単語認定の方式とし
て、一致する候補の内、最長のものを優先させる「最長
一致法」を採用している。この方法によると、辞書の日
本語見出しとして登録可能な最大文字数分の文字列を文
頭から切り出し、それを検索キーワードとしてユーザー
辞書６１及び翻訳辞書５７を検索しに行く。そこで検索
されない場合は最大文字数より１文字少ない文字数分の
文字列を文頭から切り出し、それを検索キーワードにし
て検索する。もし、該当する単語が検索されれば、その
単語を確定して、次の文字列から再度最大文字数分を切
り出して検索を続ける。候補の中で一致するものがあれ
ば最長のものが検索される。例えば「理事国」は「理
事」という名詞と「国」という名詞に形態素解析される
が、もし「理事国」という名詞があればその文字数の方
が多いのでこの名詞が引けることになる。The machine translation apparatus and the computer-readable recording medium on which the machine translation processing program is recorded according to the present embodiment employs the longest one among matching candidates as a word recognition method in morphological analysis. The "concordance method" is adopted. According to this method, a character string corresponding to the maximum number of characters that can be registered as a Japanese heading in the dictionary is cut out from the beginning of the sentence, and the user dictionary 61 and the translation dictionary 57 are searched using the extracted character strings as search keywords. Therefore, if the character string is not searched, a character string of one character less than the maximum number of characters is cut out from the beginning of the sentence, and the character string is searched for as a search keyword. If the corresponding word is searched, the word is determined, the maximum number of characters is cut out again from the next character string, and the search is continued. If there is a match among the candidates, the longest one is searched. For example, "member of the country" is morphologically analyzed into the noun "member of the board" and the noun of "country", but if there is a noun "member of the country", the number of characters is larger, so this noun can be subtracted.

【００２４】形態素解析では、ユーザー辞書があれば、
この辞書が優先的に検索される（Ｓ４）。そのユーザー
辞書の中に該当する単語があるか否かを調べ（Ｓ５）、
ユーザー辞書６１に該当する単語がない場合は（Ｓ５：
Ｎｏの場合）、装置に予め搭載された翻訳辞書５７を検
索する（Ｓ６）。翻訳辞書５７にも該当する単語がない
場合（Ｓ７：Ｎｏの場合）、未知語（辞書に登録がない
単語）として処理される（Ｓ８）。ユーザー辞書６１に
該当する単語があるか（Ｓ５：Ｙｅｓの場合）、あるい
は、ユーザー辞書６１に該当する単語がなくても翻訳辞
書５７に該当する単語がある場合は（Ｓ７：Ｙｅｓの場
合）、次に、前後の単語の接続チェックを行う（Ｓ
９）。その接続チェックは、英語の場合、冠詞の後ろに
は’ａｎｄ’等の接続詞は来ないが、名詞や’ｔｈｅ
ｂｅａｕｔｉｆｕｌｇｉｒｌ’のように形容詞は来て
もよいというように、前後の単語の接続をチェックする
ものである。In the morphological analysis, if there is a user dictionary,
This dictionary is preferentially searched (S4). It checks whether there is a corresponding word in the user dictionary (S5),
If there is no corresponding word in the user dictionary 61 (S5:
If No), the translation dictionary 57 installed in the apparatus is searched (S6). If there is no corresponding word in the translation dictionary 57 (S7: No), it is processed as an unknown word (word not registered in the dictionary) (S8). If there is a corresponding word in the user dictionary 61 (S5: Yes), or if there is a corresponding word in the translation dictionary 57 without a corresponding word in the user dictionary 61 (S7: Yes), Next, a connection check of the preceding and following words is performed (S
9). In the case of English, a connection such as' and 'does not come after an article in English, but nouns or' the
It checks the connection of preceding and following words, such as an adjective may appear as in beafulful girl '.

【００２５】そして、ユーザー辞書から該当する単語が
検索されても接続チェックではじかれると（Ｓ１０：Ｎ
ｏの場合）、未知語として処理される（Ｓ８）。接続も
よければ（Ｓ１０：Ｙｅｓの場合）、入力文は単語に分
割されて、検索した単語の辞書情報が各々の単語に付与
される（Ｓ１１）。以上で形態素解析処理が終了する
（Ｓ１２）。次に、構文解析部５３において、形態素解
析によって得られた単語の並びを解析プログラムによっ
て解析して、単語を句にまとめ上げたり、主語や目的語
等の係り先を決める等して、入力文の内部構造であるツ
リー構造を作る（Ｓ１３）。Then, even if the corresponding word is searched from the user dictionary, the word is rejected by the connection check (S10: N
In the case of o), it is processed as an unknown word (S8). If the connection is good (S10: Yes), the input sentence is divided into words, and dictionary information of the searched words is added to each word (S11). Thus, the morphological analysis processing ends (S12). Next, in the parsing unit 53, the sequence of words obtained by the morphological analysis is analyzed by the analysis program, and the words are grouped into phrases, and the destination of the subject or object is determined. (S13).

【００２６】次に、Ｓ１４において、構文の変換を行
う。ここでは、解析された結果に基づいて、ユーザー辞
書６１及び翻訳辞書５７を参照しながら、各々の単語に
英語の訳を当てると同時に、翻訳対象言語である英語の
ツリー構造に変換する。また、英文に反映させるため
に、解析で付与された日本語入力文の情報を英語の情報
に付け替える。次に、Ｓ１５において構文生成を行う。
ここでは、翻訳辞書５７を参照しながら、構文変換部で
作られたツリー構造をもとにして、正しい英文を出すた
めのツリー構造にする。また、英語の情報も各々のノー
ドに付与する。次に、Ｓ１６において、形態素生成を行
って語尾変化等の調整をして、Ｓ１７で翻訳結果の英文
をＣＲＴに表示したり、プリンタから出力する。Next, in S14, syntax conversion is performed. Here, based on the analysis result, each word is assigned an English translation while referring to the user dictionary 61 and the translation dictionary 57, and is converted into an English tree structure as a translation target language. In addition, the information of the Japanese input sentence given by the analysis is replaced with English information so that the sentence is reflected in the English sentence. Next, syntax generation is performed in S15.
Here, a tree structure for outputting a correct English sentence is created based on the tree structure created by the syntax conversion unit while referring to the translation dictionary 57. English information is also given to each node. Next, in S16, morpheme generation is performed to adjust the inflection and the like, and in S17, the English sentence as a translation result is displayed on a CRT or output from a printer.

【００２７】尚、本実施の形態の機械翻訳装置は、連続
性のある文字列を含む単語を予め定められた記号を用い
てユーザー辞書に登録するプログラムと、前記ユーザー
辞書に登録された連続性のある文字列を含む単語を検索
するプログラムと、記号に置き換えられた連続性のある
文字列を記憶するプログラムと、連続性のある文字列を
含む訳語の情報をツリー構造に反映させるプログラムが
ＲＯＭに予め格納されたものであるが、本発明は必ずし
もこれに限定されるものではない。例えば、これらのプ
ログラムをフロッピーディスクやＣＤ−ＲＯＭ等に格納
したものを読み取り装置により読み取ってインストール
させて動作させることができる。また、有線もしくは無
線回線を使用して外部情報処理装置からプログラムを読
み込んで動作させることができる。この場合、前記フロ
ッピーディスクやＣＤ−ＲＯＭや、外部情報処理装置の
当該プログラムを格納したメモリが本発明の記録媒体を
構成することになる。The machine translation apparatus according to the present embodiment includes a program for registering a word including a continuous character string in a user dictionary using a predetermined symbol, and a program for registering a word including the continuous character string in the user dictionary. A program for searching for a word containing a character string having a continuation, a program for storing a character string having continuity replaced with a symbol, and a program for reflecting information of a translation including a character string having continuity in a tree structure are stored in a ROM. Is stored in advance, but the present invention is not necessarily limited to this. For example, a program in which these programs are stored in a floppy disk, CD-ROM, or the like can be read by a reading device, installed, and operated. In addition, a program can be read and operated from an external information processing device using a wired or wireless line. In this case, the floppy disk, the CD-ROM, or the memory of the external information processing device that stores the program constitutes the recording medium of the present invention.

【００２８】図２には「＊勝」を見出し語としたサ変名
詞のユーザー辞書６１の辞書情報が示されている。「１
０勝」のように、見出し語の中に数字を含む単語は、数
字の部分をシステムで予め決められた記号に置き換えて
登録する。ここでは「＊」を使用している。従って、見
出し語の日本語は「＊勝」であり、訳語の英語は「＊ｗ
ｉｎｎｉｎｇｇａｍｅ」である。主名詞というのは訳
語の英語の中で複数形の時に複数変化する名詞のことで
ある。この主名詞を「ｇａｍｅ」にすることによって、
それが可算名詞（数えられる名詞のことで、複数形の場
合には複数変化する単語）であれば、「ｇａｍｅｓ」の
ように複数変化する。意味情報には見出し語の日本語の
意味を入力する。「〜勝」は動作などの行為を表すので
「行為」が選ばれている。FIG. 2 shows the dictionary information of the user dictionary 61 of the sa-variable noun with the headword "* win". "1
A word including a number in the headword such as "0 win" is registered by replacing the number portion with a symbol predetermined by the system. Here, “*” is used. Therefore, the headword Japanese is “* sho” and the translated English is “* w
inning game ". The main noun is a noun that changes plurally in the plural form in the translated English. By making this main noun "game",
If it is a countable noun (a countable noun, a word that changes plurally in the plural form), it changes plurally like “games”. For the semantic information, enter the Japanese meaning of the headword. Since "~ win" represents an action such as an action, "action" is selected.

【００２９】この意味情報には、この他に「人・組織」
「動植物」「部品」等のカテゴリーがある。英語の冠詞
には、訳語の英語にいつも決まった冠詞が付く場合の冠
詞を指定する。「〜ｗｉｎｎｉｎｇｇａｍｅ」につ
いては、特に冠詞を必要としないので「付けない」が選
択されている。可算・不可算には、訳語の名詞が数えら
れるか、数えられないかを指定する。「〜ｗｉｎｎｉ
ｎｇｇａｍｅ」については、数えることができて、複
数形の場合は「〜ｗｉｎｎｉｎｇｇａｍｅｓ」にな
るので「可算」が選択されている。複数形のタイプに
は、訳語の英語が規則活用するか、不規則活用するかを
指定する。この場合は「ｓ」が付くだけなので規則活用
であり、「規則」が選ばれている。例えば、「ｍａｎ」
の複数形は「ｍｅｎ」になるように不規則活用する単語
であれば「不規則」を選択して、複数形のところに「ｍ
ｅｎ」と入れる。The semantic information includes "person / organization"
There are categories such as "animals and plants" and "parts". The English article specifies an article in the case where a translated English article always has a fixed article. As for "~ winning game", "not attached" is selected because an article is not particularly required. For countable and uncountable, specify whether the noun of the translated word is counted or not. "~ Winni
"ng game" can be counted, and in the case of plural forms, it becomes "~ winning games", so "countable" is selected. In the plural type, it is specified whether the translated word is to be used regularly or irregularly. In this case, since only "s" is added, it is rule utilization, and "rule" is selected. For example, "man"
If the plural form of a word is irregularly used so as to be “men”, select “irregular” and place “m” in the plural form.
en ".

【００３０】図３には、「＊勝する」を見出し語とした
サ変動詞の、ユーザー辞書６１の辞書情報が示されてい
る。「１０勝する」のように、見出し語の中に数字を含
む単語は、「＊勝」と同様に数字の部分をシステムで予
め設定された記号に置き換えて登録する。ここでは
「＊」を使用している。従って、見出し語の日本語は
「＊勝する」である。例えば、「１０勝する」について
は、「ｗｉｎ１０ｇａｍｅｓ」と出したいので、本
実施の形態の機械翻訳装置では、訳語の英語の部分には
動詞の部分だけを入力する。従って、「ｗｉｎ」が入力
されている。主動詞というのは訳語の英語の中で活用変
化する動詞のことである。３人称単数の時には「ｗｉｎ
ｓ」、過去形及び過去分詞の時には「ｗｏｎ」、現在分
詞形の時には「ｗｉｎｎｉｎｇ」と活用変化するので、
訳語の英語と同様に「ｗｉｎ」が入力されている。FIG. 3 shows the dictionary information of the user dictionary 61 of the sa variant with the headword "* win". A word such as “Win 10” that includes a number in the headword is registered by replacing the number part with a symbol preset in the system, like “* Win”. Here, “*” is used. Therefore, the headword Japanese is “* wins”. For example, for “10 wins”, the user wants to output “win 10 games”, so in the machine translation apparatus of the present embodiment, only the verb part is input in the English part of the translated word. Therefore, “win” has been input. The main verb is a verb that changes in the translated English. In the third person singular, "win
s "," won "for past tense and past participle, and" winning "for present participle,
"Win" is input as in the case of the English translation.

【００３１】また、修飾語の部分には主動詞を修飾する
単語を入れる。例えば、「考慮する」という単語につい
ては、「ｔａｋｅ〜ｉｎｔｏｃｏｎｓｉｄｅｒａ
ｔｉｏｎ」と訳したいので、英語の部分には「ｔａｋ
ｅ」と入れて、修飾語に「ｉｎｔｏｃｏｎｓｉｄｅｒ
ａｔｉｏｎ」と入れればよい。「＊勝する」の場合は、
何も入れない。目的語には訳語の英語が目的語として取
る単語を入れる。「＊勝する」の場合は、「〜ｇａｍ
ｅｓ」を「ｗｉｎ」の目的語に取りたいので、数字が入
るところを「＊」に置き換えて「＊ｇａｍｅ」と入力
されている。主名詞、英語の冠詞、可算・不可算、複数
形のタイプ、複数形はそれぞれ「＊勝」で説明したとお
りである。In the modifier part, a word that modifies the main verb is entered. For example, for the word "consider", "take ~ into considera"
I want to translate it as "tion"
e "and the modifier" into consider
ation ". In the case of "* win",
Do not put anything. The object word includes a word which the translated word takes as an object word. In the case of "* win", "~ gam
Since we want to take "es" as the object of "win", "* game" is entered by replacing the place where the number is entered with "*". The main noun, English article, countable / uncountable, plural type, plural form are as described in "* win".

【００３２】以下に、以上のように構成された機械翻訳
装置による翻訳動作について説明する。Hereinafter, the translation operation of the machine translation apparatus configured as described above will be described.

【００３３】先ず、キーボード等から日本語文が入力さ
れると、翻訳辞書５７及びユーザー辞書６１を参照して
形態素解析部５２で単語に分割し、それぞれの単語につ
いて参照した辞書情報を付与する。その後、構文解析部
５３で単語を句にまとめ上げ、翻訳辞書５７及びユーザ
ー辞書６１を参照しながら係り受け解析等の解析を行
い、入力文をツリー構造に変換すると同時に、翻訳に必
要な情報をツリーの節（以下ノードと呼ぶ）に付ける。
ツリー構造というのは、入力文の内部構造を木の形に置
き換えたものであり、通常、機械翻訳ではツリー構造を
別のツリー構造に書き換えたり、ノードに種々の情報を
付けることによって処理するように作られている。First, when a Japanese sentence is input from a keyboard or the like, the morphological analysis unit 52 divides the sentence into words with reference to the translation dictionary 57 and the user dictionary 61, and adds dictionary information referred to for each word. Thereafter, the syntactic analysis unit 53 compiles the words into phrases, performs a dependency analysis and the like with reference to the translation dictionary 57 and the user dictionary 61, converts the input sentence into a tree structure, and simultaneously converts information necessary for translation into a tree structure. It is attached to a node (hereinafter referred to as a node) of the tree.
A tree structure is a structure in which the internal structure of an input sentence is replaced with a tree shape.In general, machine translation is performed by rewriting the tree structure with another tree structure or by adding various information to nodes. Made in.

【００３４】次に、構文変換部５４で、構文解析部で解
析された結果に基づいて、翻訳辞書５７及びユーザー辞
書６１を参照しながら各々の単語には訳を付与し、英語
を生成するためのツリー構造に変換する。構文生成部５
５では構文変換部で作ったツリー構造を、翻訳辞書５７
を参照しながら英文生成規則に基づいて英語として正し
いツリー構造にし、英語を生成するために必要な情報を
ツリーのノードに付与する。最後に形態素生成部５６に
おいて、語尾変化等の調整を行い、翻訳結果の英語文を
生成してＣＲＴ等に表示するか、プリンターから印刷す
る。Next, based on the result analyzed by the syntax analysis section, the syntax conversion section 54 adds a translation to each word while referring to the translation dictionary 57 and the user dictionary 61 to generate English. Into a tree structure. Syntax generator 5
5, the tree structure created by the syntax converter is translated into a translation dictionary 57.
To make a tree structure correct as English based on the English sentence generation rules, and add information necessary for generating English to the nodes of the tree. Finally, the morpheme generation unit 56 adjusts the inflection and the like, generates an English sentence as a translation result, and displays it on a CRT or the like, or prints it from a printer.

【００３５】図５は本実施の形態の形態素解析処理を示
すフローチャートであり、特に「１０勝」という入力文
を形態素解析するプロセスについて説明している。本実
施の形態の「＊」を使ったユーザー辞書では「＊」に入
る数字の文字列は１０文字までという制限になってい
る。また、「＊」の数字以外の部分は１０文字までとい
う制限を設けている。勿論、この制限はどのようにでも
変更可能である。FIG. 5 is a flowchart showing the morphological analysis processing of the present embodiment, and particularly describes a process of morphologically analyzing an input sentence "10 wins". In the user dictionary using “*” according to the present embodiment, the character string of the number entering “*” is limited to 10 characters. In addition, a portion other than the numeral “*” is limited to 10 characters. Of course, this limit can be changed in any way.

【００３６】そこで、先ず、Ｓ２０において、合計２０
文字分の文字列を切り出し位置から切り出す。最初は文
頭から２０文字切り出すことになる。その文字列を検索
キーワードにして、ユーザー辞書６１を検索する（Ｓ２
１）。そして、その中に該当する単語があるか否かを調
べる（Ｓ２２）。該当する単語がなければ（Ｓ２２：Ｎ
ｏの場合）、次に、翻訳辞書５７を検索する（Ｓ２
３）。そこで、該当する単語があるか否かを調べ（Ｓ２
４）、該当する単語が見つからなければ（Ｓ２４：Ｎｏ
の場合）、切り出し位置はそのままで、２０文字から１
文字減らした１９文字を文頭から切り出してそれを検索
キーワードにセットする（Ｓ２５）。その検索キーワー
ドによって辞書検索を継続するが、該当する単語が見つ
からず、文字を１文字ずつ減らしていった結果、もう検
索キーワードにセットする文字列がなくなった場合（Ｓ
２６：Ｎｏの場合）、今度は切り出し位置を一文字右に
ずらして、そこを新たな切り出し位置として２０文字分
を切り出して検索キーワードにセットする（Ｓ２７）。
そして、辞書検索を続ける。Therefore, first, in S20, a total of 20
Cuts out a character string from the cutout position. First, 20 characters are cut from the beginning of the sentence. The user dictionary 61 is searched using the character string as a search keyword (S2).
1). Then, it is checked whether or not there is a corresponding word (S22). If there is no corresponding word (S22: N
Then, the translation dictionary 57 is searched (S2).
3). Then, it is checked whether there is a corresponding word (S2).
4) If no corresponding word is found (S24: No)
), The cutout position is not changed, and 20 characters to 1
Nineteen characters are cut out from the beginning of the sentence and set as search keywords (S25). If the dictionary search is continued by the search keyword, but the corresponding word is not found, and the characters are reduced one by one, there is no more character string to be set as the search keyword (S
26: No), this time, the cut-out position is shifted to the right by one character, and this is set as a new cut-out position, and 20 characters are cut out and set as a search keyword (S27).
Then, the dictionary search is continued.

【００３７】切り出し位置がまだ文末でなければ（Ｓ２
８：Ｎｏの場合）、引き続き続けられる。切り出し位置
が文末に来た場合（Ｓ２８：Ｙｅｓの場合）、全ての入
力文字列が形態素解析されたので未知語処理をして（Ｓ
２９）、辞書に登録がなかった文字列を未知語としてま
とめる。「１０勝」の場合、切り出し位置は文頭のまま
で、そこから３文字分が切り出されてユーザー辞書５７
が検索されると、「＊勝」が引けてくる。このように、
ユーザー辞書５７に該当する単語が見つかった場合に接
続チェックも良ければそこでその単語が確定される（Ｓ
３０）。If the cutout position is not the end of the sentence (S2
8: No) and continue. When the cutout position is at the end of the sentence (S28: Yes), the unknown word processing is performed because all the input character strings have been morphologically analyzed (S28).
29) Combine character strings not registered in the dictionary as unknown words. In the case of "10 wins", the cut-out position remains at the beginning of the sentence, and three characters are cut out therefrom, and the user dictionary 57
Is found, "* win" is closed. in this way,
When a word corresponding to the user dictionary 57 is found, if the connection check is good, the word is determined there (S
30).

【００３８】次に、このように確定された単語が「＊」
で始まるか否かを調べ（Ｓ３１）、「＊」で始まれば
（Ｓ３１：Ｙｅｓの場合）、「＊」に当たる数字を
「＊」（アスタリスク）情報として記憶する（Ｓ３
２）。「１０勝」の場合は「１０」が「＊」情報に記憶
される。日本語見出しが「＊」から始まらなければ（Ｓ
３１：Ｎｏの場合）、確定された単語の次に切り出し位
置を変更して、そこから再度２０文字を切り出して（Ｓ
３３）、検索を継続する。また、翻訳辞書５７に該当す
る単語が見つかった場合に、接続チェックも良ければ同
様にその単語を確定する（Ｓ３４）。Next, the word thus determined is changed to "*".
Is checked (S31). If it starts with "*" (S31: Yes), the number corresponding to "*" is stored as "*" (asterisk) information (S3).
2). In the case of “10 wins”, “10” is stored in the “*” information. If the Japanese heading does not start with "*" (S
31: No), the cut-out position is changed after the determined word, and 20 characters are cut out therefrom again (S
33), continue the search. When a word corresponding to the translation dictionary 57 is found, if the connection check is good, the word is similarly determined (S34).

【００３９】そして、確定された単語の次に切り出し位
置を変更して、そこから２０文字を切り出して（Ｓ３
３）、検索を継続する。以上のように、形態素解析され
た結果はツリー構造に展開されて、各単語には検索して
きた辞書の辞書情報が付与される。Then, the cutout position is changed next to the determined word, and 20 characters are cut out therefrom (S3).
3) Continue searching. As described above, the result of the morphological analysis is expanded into a tree structure, and dictionary information of the searched dictionary is added to each word.

【００４０】図６は、「１０勝」という入力文が形態素
解析部５２で形態素解析されツリー構造に展開された結
果を表している。入力文が「１０勝」であるので、ｒｏ
ｏｔの下に「１０勝」という名詞が下がっている。ｒｏ
ｏｔというのは、入力文のツリー構造の最上部にいつも
付く仮のノードである。このノードは、内部的なツリー
構造を下げるのに形態素解析から形態素生成までの翻訳
プロセスを通して必要なものであるが、最終的に英文を
得るときには落とされる。ｒｏｏｔのノードに下がる
「１０勝」の名詞には、ユーザー辞書６１の「＊勝」の
情報が全て付与されている。つまり、日本語見出しとし
て「＊勝」、日本語品詞として「名詞」、意味情報とし
て「行為」、訳の英語として「＊ｗｉｎｎｉｎｇｇ
ａｍｅ」、その内の主名詞が「ｇａｍｅ」、可算・不可
算の情報として英語のタイプに可算を表す「Ｃ」、英語
の冠詞として「付けない」を表す「ＮＯＮＥ」、複数形
が規則活用することを表すものとして「活用」に「ＲＥ
ＧＵＬＡＲ」である。FIG. 6 shows a result obtained by morphologically analyzing the input sentence “10 wins” by the morphological analysis unit 52 and expanding it into a tree structure. Since the input sentence is "10 wins", ro
The noun “10 wins” has dropped below ot. ro
ot is a temporary node that is always attached to the top of the tree structure of the input sentence. This node is necessary through the translation process from morphological analysis to morphological generation to lower the internal tree structure, but is dropped when the English sentence is finally obtained. Nouns of “10 wins” descending to the root node are all given information of “* win” in the user dictionary 61. In other words, “* win” as the Japanese headline, “noun” as the Japanese part of speech, “act” as the semantic information, and “* winning g” as the translated English
ame ", its main noun is" game ", countable / uncountable information is" C "which represents countable in English type," NONE "which means" not attached "as English article, plural form is rule utilization "RE" to indicate that
GULAR ".

【００４１】この他に、日本語ＩＤとして「９」が付与
されている。日本語ＩＤというのは、見出し語の品詞細
分類として付けられた識別番号である。ユーザー辞書で
検索された単語には品詞に応じて特有の日本語ＩＤが付
与される。ここでは仮に名詞には「９」を、動詞で自動
詞なら「１１」を、動詞で他動詞なら「１２」を付与す
ることとする。「＊勝」は名詞であるので「９」が付
く。ユーザー辞書では日本語見出しとは異なる品詞の英
語を訳として当てることは許されていないので、「＊
勝」が名詞なら、「＊ｗｉｎｎｉｎｇｇａｍｅ」も
名詞であり、英語品詞に「名詞」が入っている。さら
に、ユーザー辞書では日本語ＩＤと同じ英語ＩＤが入る
ことになっているので、英語ＩＤにも「９」が入ってい
る。装置に予め搭載されている翻訳辞書５７では日本語
見出しの品詞と異なる品詞の英語を訳として当てること
が許されているので、その英語に応じた品詞と細分類の
ＩＤが付く。さらに、「＊情報」として「１０」が付与
されている。In addition, "9" is assigned as a Japanese ID. The Japanese ID is an identification number assigned as a sub-speech classification of the headword. A unique Japanese ID is assigned to a word searched in the user dictionary according to the part of speech. Here, it is assumed that “9” is assigned to a noun, “11” is assigned to a verb and an intransitive verb, and “12” is assigned to a verb and a transitive verb. Since “* win” is a noun, “9” is added. In the user dictionary, it is not allowed to assign the part of speech English different from the Japanese headline as a translation.
If “Katsu” is a noun, “* winning game” is also a noun, and “Noun” is included in the English part of speech. Furthermore, since the same English ID as the Japanese ID is entered in the user dictionary, "9" is also entered in the English ID. In the translation dictionary 57 pre-installed in the apparatus, it is permitted to assign a part of speech different from the part of speech of the Japanese heading as an English translation. Furthermore, “10” is given as “* information”.

【００４２】図７は、本実施の形態の構文解析処理を示
すフローチャートであって、特に、「１０勝」という入
力文を構文解析するためのプロセスについて説明してい
る。先ず、日本語見出しが「＊」で始まるノードが形態
素解析結果のツリー構造にあるか否かを調べる（Ｓ５
０）。ある単語の日本語見出しが「＊」で始まっていた
場合（Ｓ５０：Ｙｅｓの場合）、次に、日本語品詞が名
詞か否かを調べる（Ｓ５１）。名詞であれば（Ｓ５１：
Ｙｅｓの場合）、英語の「＊」を「＊」情報に記憶され
ている数字に置き換える（Ｓ５２）。また、「＊」情報
に記憶されている数字が「０」か「１」かを調べ（Ｓ５
３）、「０」か「１」であれば（Ｓ５３：Ｙｅｓの場
合）、数情報として単数を示す「Ｓ」を付与する（Ｓ５
４）。「０」でも「１」でもなければ（Ｓ５３：Ｎｏの
場合）、数情報として複数を示す「Ｐ」を付与する（Ｓ
５５）。FIG. 7 is a flowchart showing a syntax analysis process according to the present embodiment, and particularly describes a process for syntax analysis of an input sentence "10 wins". First, it is checked whether or not the node whose Japanese heading starts with “*” is in the tree structure of the morphological analysis result (S5).
0). If the Japanese heading of a certain word starts with "*" (S50: Yes), it is checked whether the Japanese part of speech is a noun (S51). If it is a noun (S51:
If “Yes”, the English “*” is replaced with the number stored in the “*” information (S52). Further, it is determined whether the number stored in the “*” information is “0” or “1” (S5).
3) If "0" or "1" (S53: Yes), "S" indicating a singular number is added as numerical information (S5).
4). If it is neither “0” nor “1” (S53: No), “P” indicating a plurality is given as numerical information (S53).
55).

【００４３】そして、「１０勝」という名詞は日本語見
出しが「＊」で始まっており、日本語品詞が名詞である
ため、英語の「＊ｗｉｎｎｉｎｇｇａｍｅ」の
「＊」が「＊」情報に記憶されている「１０」に置き換
えられて「１０ｗｉｎｎｉｎｇｇａｍｅ」となる。さ
らに、「＊」情報に記憶されている数字が「１０」であ
るので、数情報として、「Ｐ」が付与される。この後、
係り受け等の構文解析処理が行われる（Ｓ６１）。「１
０勝」の場合は、これだけで１文を構成しているので、
その他の構文解析処理は行われない。The noun "10 wins" starts with "*" in the Japanese heading and the Japanese part of speech is a noun, so "*" of the English "* winning game" is replaced with "*" information. The stored “10” is replaced with “10 winninggame”. Further, since the number stored in the “*” information is “10”, “P” is added as numerical information. After this,
Syntax analysis processing such as dependency is performed (S61). "1
In the case of "0 wins", this alone constitutes one sentence,
No other parsing processing is performed.

【００４４】また、「＊」で始まる日本語見出しが名詞
でなければ（Ｓ５１：Ｎｏの場合）、次に目的語が
「＊」で始まるか否かを調べる（Ｓ５６）。目的語も
「＊」で始まっていなければ（Ｓ５６：Ｎｏの場合）、
「＊」を用いたユーザー辞書が検索された場合の構文解
析部での一連の処理は不要になるのでＳ６１のその他の
構文解析処理に移る。目的語が「＊」で始まっていれば
（Ｓ５６：Ｙｅｓの場合）、目的語の「＊」を「＊」情
報に記憶されている数字に置き換える（Ｓ５７）。If the Japanese heading starting with "*" is not a noun (S51: No), it is checked whether the object starts with "*" (S56). If the object does not start with "*" (S56: No),
When a user dictionary using “*” is searched, a series of processes in the syntax analysis unit is not necessary, and thus the flow proceeds to other syntax analysis processes in S61. If the object starts with "*" (S56: Yes), the object "*" is replaced with a number stored in the "*" information (S57).

【００４５】そして、「＊」情報に記憶されている数字
が「０」か「１」かを調べ（Ｓ５８）、「０」か「１」
であれば（Ｓ５８：Ｙｅｓの場合）、目的語の数情報と
して単数を示す「Ｓ」を付与する（Ｓ５９）。「０」で
も「１」でもなければ（Ｓ５８：Ｎｏの場合）、数情報
として複数を示す「Ｐ」を付与する（Ｓ６０）。「１０
勝する」という動詞は日本語見出しが「＊」で始まって
おり、日本語品詞が動詞であって、目的語が「＊ｇａ
ｍｅ」のように「＊」で始まっているので、その「＊」
が「＊」情報に記憶されている「１０」に置き換えられ
て「１０ｇａｍｅ」となる。さらに、「＊」情報に記
憶されている数字が「１０」であるので、数情報とし
て、「Ｐ」が付与される。この後、係り受け等の構文解
析処理が行われる（Ｓ６１）。Then, it is determined whether the number stored in the "*" information is "0" or "1" (S58), and "0" or "1" is determined.
If (S58: Yes), "S" indicating a singular number is added as object number information (S59). If it is neither “0” nor “1” (S58: No), “P” indicating a plurality is given as numerical information (S60). "10
The verb “Win” starts with a Japanese heading “*”, the Japanese part of speech is a verb, and the object is “* ga”.
Since it starts with "*" like "me", the "*"
Is replaced with “10” stored in the “*” information to become “10 game”. Further, since the number stored in the “*” information is “10”, “P” is added as numerical information. Thereafter, syntax analysis processing such as dependency is performed (S61).

【００４６】図８は、「１０勝」という入力文が構文解
析部５３で構文解析されツリー構造に展開された結果を
表している。入力文が「１０勝」だけであるので、図６
の形態素解析結果のツリー構造と同じであるが、「１０
勝」の名詞に付与された情報が異なる。つまり、英語の
「＊ｗｉｎｎｉｎｇｇａｍｅ」は「１０ｗｉｎｎ
ｉｎｇｇａｍｅ」になり、数情報として、複数を表す
「Ｐ」が付与されている。このツリー構造が構文変換部
５４に渡されて構文変換処理され、その後に、構文生成
部５５に渡されて構文生成処理される。その後、形態素
生成部５６において、語尾変化等の活用変化の処理が行
われるが、ここで「１０勝」は数情報として「Ｐ」が付
与されており、主名詞の英語である「ｇａｍｅ」が可算
で、活用がＲＥＧＵＬＡＲ（規則活用）であるため、英
語の「１０ｗｉｎｎｉｎｇｇａｍｅ」は「１０ｗ
ｉｎｎｉｎｇｇａｍｅｓ」となる。また、英語の冠詞
は付けないと指定されているので冠詞は生成されず、
「１０勝」は「１０ｗｉｎｎｉｎｇｇａｍｅｓ」が
翻訳結果として出力される。FIG. 8 shows the result of parsing the input sentence "10 wins" by the parsing unit 53 and expanding it into a tree structure. Since the input sentence is only "10 wins", FIG.
Is the same as the tree structure of the morphological analysis result of
The information given to the noun "Katsu" is different. In other words, “* winning game” in English is “10 winn
ing game ", and" P "representing a plurality is given as numerical information. This tree structure is passed to the syntax conversion unit 54 for syntax conversion processing, and then passed to the syntax generation unit 55 for syntax generation processing. Thereafter, the morpheme generation unit 56 performs a process of utilizing changes such as inflections, where "10 wins" are given "P" as numerical information, and the main noun English "game" is used. Because it is countable and the utilization is REGULAR (rule utilization), the English “10 winning game” is changed to “10 w
inning games ". In addition, no article is generated because English articles are not specified,
As for "10 wins", "10 winning games" is output as a translation result.

【００４７】次に「彼は１０勝した」を入力文とした翻
訳プロセスについて図４を使って説明する。Next, a translation process in which "he has won 10" is described with reference to FIG.

【００４８】先ず、Ｓ１において、「彼は１０勝した」
が入力されると、次に、Ｓ２において、入力バッファ領
域４１にその入力文が記憶される。そして、形態素解析
部５２に送られて（Ｓ３）、ユーザー辞書６１を検索し
（Ｓ４）、そこで、「＊勝する」という単語が引け、翻
訳辞書を検索して（Ｓ６）、「彼」という名詞が検索さ
れる。前後の単語との接続もよければ、それらの単語が
確定され、それぞれの単語に辞書情報が付与される（Ｓ
１１）。図９は入力文の「彼は１０勝した」が形態素解
析された結果のツリー構造を表している。この図が示す
ように「彼は１０勝した」は、「彼」（名詞）と、
「は」（助詞）と、「＊勝する」（動詞）と、「た」
（助動詞）に分割される。First, in S1, "he has won 10"
Is input, the input sentence is stored in the input buffer area 41 in S2. Then, the sentence is sent to the morphological analysis unit 52 (S3), and the user dictionary 61 is searched (S4). The word "* wins" is closed, and the translation dictionary is searched (S6). Nouns are searched. If the connection with the preceding and succeeding words is good, those words are determined and dictionary information is added to each word (S
11). FIG. 9 shows a tree structure as a result of morphological analysis of the input sentence “He has won 10”. As this figure shows, "He won 10" means "He" (noun)
"Ha" (particle), "* win" (verb) and "ta"
(Auxiliary verb).

【００４９】そして、それぞれの単語については、この
図９に示すような辞書情報が付与される。「１０勝し
た」から「＊勝する」という単語が確定される方法は、
「１０勝」という入力文の形態素解析のところで説明し
た通りである。「１０勝する」は、「＊勝する」が引け
て日本語見出しが「＊勝する」になり、日本語品詞は
「動詞」、日本語ＩＤには「１２」が付いている。目的
語に何かの値が入っている場合は他動詞であるという意
味なので、他動詞の日本語ＩＤである「１２」が付与さ
れる。また、訳の英語には「ｗｉｎ」が、英語の品詞に
は「動詞」が、英語のＩＤには日本語ＩＤと同じ「１
２」が、主動詞には「ｗｉｎ」、主動詞の活用のタイプ
は不規則活用なので「ＩＲＲＥＧＵＬＡＲ」、過去形と
過去分詞形には「ｗｏｎ」、現在分詞形には「子音重
ね」、目的語には「＊ｇａｍｅ」、目的語の主名詞に
は「ｇａｍｅ」、目的語の冠詞には「付けない」を表す
「ＮＯＮＥ」が、目的語のタイプには可算名詞であるこ
とを示す「Ｃ」が、目的語の活用には規則活用であるこ
とを示す「ＲＥＧＵＬＡＲ」が付いている。Then, dictionary information as shown in FIG. 9 is added to each word. The method to determine the word "* win" from "10 wins"
This is as described for the morphological analysis of the input sentence “10 wins”. In “10 wins”, “* wins” is closed and the Japanese headline becomes “* wins”, the Japanese part of speech is “verb”, and the Japanese ID is “12”. If any value is included in the object, it means that the object is a transitive verb, and thus the transitive verb Japanese ID “12” is assigned. In addition, "win" is used for translated English, "verb" is used for English part of speech, and "1" is the same as Japanese ID for English ID.
"2", "win" for the main verb, "IRREGULAR" because the type of inflection of the main verb is irregular, "won" for the past tense and the past participle, "consonant overlap" for the present participle, the purpose The word "* game", the object noun "game", the object article "NONE" indicating "not attached", and the object type "countable noun""C" is attached with "REGULAR" indicating that the use of the object is rule use.

【００５０】さらに「１０勝」の形態素解析の処理で説
明したように、「１０勝する」から「＊勝する」が確定
した時に＊には「１０」が入ることが認識されて、
「＊」情報として「１０」が入る。また、「＊」情報に
記憶されている数字が「０」でも「１」でもないので、
数情報として複数形を示す「Ｐ」が付く。Further, as described in the morphological analysis process of “10 wins”, when “* win” is determined from “10 wins”, it is recognized that “10” is entered in *, and
"10" is entered as "*" information. Also, since the number stored in the “*” information is neither “0” nor “1”,
"P" indicating a plurality is attached as numerical information.

【００５１】形態素解析が終了すると（Ｓ１２）、その
後、Ｓ１３において構文解析が行われる。構文解析では
単語を句にまとめ、主語や目的語等の係り受けを決める
等の構文解析を行う。When the morphological analysis is completed (S12), a syntax analysis is performed in S13. In syntactic analysis, words are combined into phrases, and syntactic analysis such as determining the dependency of a subject or object is performed.

【００５２】図１０は、構文解析された結果のツリー構
造を示している。入力文が「彼は１０勝した」であるの
で、ｒｏｏｔの下には入力文の主動詞となる「１０勝す
る」が下がっている。FIG. 10 shows a tree structure as a result of parsing. Since the input sentence is “He has won 10”, “10 wins” which is the main verb of the input sentence is below “root”.

【００５３】主動詞の「１０勝する」には、日本語見出
し「＊勝する」と、日本語品詞「動詞」と、日本語ＩＤ
「１２」と、訳語の英語として「ｗｉｎ」と、英語品詞
「動詞」と、英語ＩＤ「１２」と、’ｗｉｎ’が不規則
活用することを示す「ＩＲＲＥＧＵＬＡＲ」と、過去形
の「ｗｏｎ」、過去分詞形の「ｗｏｎ」、現在分詞形が
「子音重ね」という辞書情報の他に、構文解析処理で付
けられた入力文の主動詞の時制（この場合は「過去」）
が付いている。主動詞の下には「１０勝する」の主語に
解析された「彼」のノードと、目的語のノードが下がっ
ている。The main verb "10 wins" includes a Japanese heading "* wins", a Japanese part of speech "verb", and a Japanese ID.
"12", the translated word "win", the English part of speech "verb", the English ID "12", "IRREGULAR" indicating that "win" uses irregularly, and the past tense "won" , The past participle "won" and the present participle "consonant overlap" in addition to the dictionary information, the tense of the main verb of the input sentence attached in the parsing process (in this case, "past")
With. Below the main verb, the node of "he" analyzed to the subject of "win 10" and the node of the object are down.

【００５４】また、「彼」には、日本語見出しとして
「彼」、日本語品詞として「名詞」、日本語ＩＤとして
「１」の他に、「１０勝する」に対して「彼」が主語で
あることを示す日本語の格に「主語」、助詞が「は」だ
ったことを示す日本語助詞に「は」が付与されている。
「１０勝する」は自動詞であるので、目的語を持つユー
ザー辞書が引けてこなければ主動詞の下には主語のノー
ドしか下がらない。しかし、形態素解析によって訳に目
的語を持つ「＊勝する」が引けてきて、その目的語が
「＊ｇａｍｅ」であるので、構文解析で主語の他に目
的語のノードも生成する。In addition to "he" as a Japanese headline, "noun" as a Japanese part of speech, "1" as a Japanese ID, "he" for "10 wins""Subject" is assigned to the Japanese case indicating the subject, and "ha" is assigned to the Japanese particle indicating that the particle is "ha".
Since "Win 10" is an intransitive verb, only the subject node falls below the main verb if the user dictionary having the object cannot be closed. However, "* wins" having an object in translation is closed by morphological analysis, and the object is "* game". Therefore, a node of the object is generated in addition to the subject by the syntax analysis.

【００５５】この目的語のノードは、もともと日本語の
入力文にあったものではないので日本語見出しは付いて
いないことを示す「ＮＩＬ」が付与されている。形態素
解析で「＊」の部分に「１０」が入ることが分かるの
で、訳の英語の「＊ｇａｍｅ」が「１０ｇａｍｅ」
に書き換えられる。英語の品詞には「名詞」、英語ＩＤ
には名詞であるので「９」、「＊ｇａｍｅ」の主名詞
は「ｇａｍｅ」であるので、主名詞として「ｇａｍ
ｅ」、「ｇａｍｅ」は可算名詞であるので英語のタイプ
として「Ｃ」、その英語には冠詞は付かないことを示す
「ＮＯＮＥ」、「ｇａｍｅ」は規則活用であるので活用
には「ＲＥＧＵＬＡＲ」を付けている。また、＊には
「１０」が入ることを示す＊情報「１０」と、＊情報の
中の数字が０でも１でもないので数情報として複数形を
表す「Ｐ」が付いている。さらに日本語の格として「目
的語」が入っている。Since the object node is not originally in the input sentence in Japanese, "NIL" indicating that no Japanese heading is attached is added. The morphological analysis shows that "10" is included in the "*" part, so the translated English "* game" becomes "10 game".
Is rewritten as "Noun" in English part of speech, English ID
Is a noun, and the main noun of “9” and “* game” is “game”, so the main noun is “gam”.
Since "e" and "game" are countable nouns, the type of English is "C", and "NONE" indicates that the English article does not have an article. "game" is rule-based, so use "REGULLAR". Is attached. In addition, * has * information "10" indicating that "10" is included therein, and "P" representing a plural form as numerical information because the number in the * information is neither 0 nor 1. In addition, "object" is included as a Japanese case.

【００５６】このように、構文解析されたツリー構造は
構文変換部５４に渡されてＳ１４において構文変換処理
される。ここで、主動詞の日本語ＩＤが１１か１２であ
った場合にユーザー辞書の動詞が検索されたことを示す
ものであるので、構文変換部５４において日本語ＩＤが
１１なら主語のノードに、日本語ＩＤが１２なら主語と
目的語のノードにそれぞれ英語の格を付ける。つまり、
主語であれば英語の格に「主語」を、目的語であれば英
語の格に「目的語」を入れる。また、入力文の日本語の
時制が「過去」であったが、英語でも過去形で出したい
ので、英語の時制にも「過去」を入れる。As described above, the parsed tree structure is passed to the syntax conversion unit 54 and subjected to syntax conversion processing in S14. Here, if the Japanese ID of the main verb is 11 or 12, it indicates that the verb of the user dictionary has been searched. If the Japanese ID is 12, the subject and the object nodes are assigned English cases. That is,
If it is the subject, put the "subject" in the English case, and if it is the object, put the "object" in the English case. Also, although the Japanese tense of the input sentence was "past", we want to put it in the past tense in English, so we also put "past" in the English tense.

【００５７】その後に、Ｓ１５において構文生成処理さ
れる。さらに、形態素生成部５６において語尾変化等の
形態素生成処理が行われるが（Ｓ１６）、ここで、目的
語の「１０ｇａｍｅ」には数情報として「Ｐ」が付与
されており、主名詞の英語である「ｇａｍｅ」が可算
で、活用がＲＥＧＵＬＡＲ（規則活用）であるため、英
語の「１０ｇａｍｅ」は「１０ｇａｍｅｓ」とな
る。また、英語の冠詞は付けないと指定されているので
冠詞は生成されず、さらに英語の時制も「過去」になっ
ているので過去形の「ｗｏｎ」が選ばれて「Ｈｅｗｏ
ｎ１０ｇａｍｅｓ．」が翻訳結果として出力される
（Ｓ１７）。Thereafter, a syntax generation process is performed in S15. Further, morpheme generation processing such as inflection is performed in the morpheme generation unit 56 (S16). Here, “P” is added as numerical information to the object “10 game”, and the main noun English "Game" is countable, and the utilization is REGULAR (rule utilization), so "10 games" in English becomes "10 games". Also, it is specified that no English article is attached, so no article is generated, and since the English tense is also "past", the past tense "won" is selected and "He wo
n 10 games. Is output as a translation result (S17).

【００５８】[0058]

【発明の効果】以上説明したことから明らかなように、
本発明の請求項１に記載の機械翻訳装置によれば、数字
のような連続性のある文字列は見出しを変えて個別に記
憶手段に登録しなくても、予め定められた記号を用いて
一つの見出し語で登録することができる。As is apparent from the above description,
According to the machine translation device of the first aspect of the present invention, a character string having continuity such as a number can be determined by using a predetermined symbol without changing the heading and registering the character string individually in the storage unit. You can register with one headword.

【００５９】また、請求項２に記載の機械翻訳装置によ
れば、入力文の中に数字のような連続性のある文字列を
含む単語が出現した場合に、形態素解析において前記記
憶手段の見出し語を正しく検索することができる。According to a second aspect of the present invention, when a word including a continuous character string such as a number appears in an input sentence, a heading of the storage means is obtained by morphological analysis. Words can be searched correctly.

【００６０】また、請求項３に記載の機械翻訳装置によ
れば、どんな文字列がその記号に置き換わっているのか
を記憶し、その情報によって、例えば、数字であれば単
数形か複数形かの情報を付与し、さらに目的語の場合は
目的語のノードにそれらの情報を付与するというよう
に、数字のような連続性のある文字列を含む訳語の情報
をツリー構造に正しく展開することができる。これによ
って、入力文にどんな文字列が入ってきても正しい翻訳
結果を出すことができる。According to the machine translation device of the present invention, what character string is replaced with the symbol is stored, and, based on the information, for example, whether a numeral is singular or plural is used. It is possible to correctly expand the translated word information that includes a continuous character string such as a number into a tree structure, such as adding information and, in the case of an object, adding that information to the object node. it can. As a result, a correct translation result can be obtained no matter what character string enters the input sentence.

【００６１】また、請求項４に記載の機械翻訳装置によ
れば、ユーザーによって使用される前記外部記憶装置
に、連続性のある文字列を含む単語を予め定められた記
号によって記憶しておけば、これを機械翻訳する度に有
効に使用することができる。According to the machine translation device of the present invention, a word including a continuous character string can be stored in the external storage device used by the user by a predetermined symbol. , And can be effectively used every time machine translation is performed.

【００６２】また、請求項５に記載の機械翻訳装置によ
れば、前記翻訳生成手段によって生成された第二言語の
翻訳文を前記出力手段によって出力することにより、翻
訳した第二言語を直ちに確認することができ、一連の翻
訳処理を効率的に実施することができる。According to the machine translation device of the fifth aspect, the translated sentence of the second language generated by the translation generating means is output by the output means, so that the translated second language is immediately confirmed. And a series of translation processes can be efficiently performed.

【００６３】また、請求項６に記載の機械翻訳処理プロ
グラムを記録したコンピュータ読み取り可能な記録媒体
によれば、各プログラムをフロッピーディスクやＣＤ−
ＲＯＭ等の様々な媒体の中から機械翻訳装置に適した記
録媒体に記録して提供することができる。そして、この
記録媒体を用いてプログラムを実行することにより、数
字のような連続性のある文字列は見出しを変えて個別に
記憶手段に登録しておかなくても、予め定められた記号
を用いて一つの見出し語で登録することができる。According to a computer-readable recording medium on which the machine translation processing program according to claim 6 is recorded, each program is stored on a floppy disk or CD-ROM.
It can be provided by being recorded on a recording medium suitable for a machine translation device from various media such as a ROM. Then, by executing a program using this recording medium, a character string having continuity such as a number can use a predetermined symbol without having to change the heading and register it individually in the storage means. You can register with one headword.

【００６４】また、請求項７に記載の機械翻訳処理プロ
グラムを記録したコンピュータ読み取り可能な記録媒体
によれば、各プログラムをフロッピーディスクやＣＤ−
ＲＯＭ等の様々な媒体の中から機械翻訳装置に適した記
録媒体に記録して提供することができる。そして、この
記録媒体を用いてプログラムを実行することにより、入
力文の中に数字のような連続性のある文字列を含む単語
が出現した場合に、形態素解析において前記記憶手段の
見出し語を正しく検索することができる。According to a computer-readable recording medium on which the machine translation processing program according to claim 7 is recorded, each program can be stored on a floppy disk or CD-ROM.
It can be provided by being recorded on a recording medium suitable for a machine translation device from various media such as a ROM. Then, by executing a program using this recording medium, when a word including a continuous character string such as a number appears in the input sentence, the headword of the storage means is correctly corrected in the morphological analysis. Can be searched.

【００６５】さらに、請求項８に記載の機械翻訳処理プ
ログラムを記録したコンピュータ読み取り可能な記録媒
体によれば、各プログラムをフロッピーディスクやＣＤ
−ＲＯＭ等の様々な媒体の中から機械翻訳装置に適した
記録媒体に記録して提供することができる。そして、こ
の記録媒体を用いてプログラムを実行することにより、
どんな文字列がその記号に置き換わっているのかを記憶
し、その情報によって、例えば、数字であれば単数形か
複数形かの情報を付与し、さらに、目的語の場合は目的
語のノードにそれらの情報を付与するというように、数
字のような連続性のある文字列を含む訳語の情報をツリ
ー構造に正しく展開することができる。これによって、
入力文にどんな文字列が入ってきても正しい翻訳結果を
出すことができる。Further, according to a computer-readable recording medium on which the machine translation processing program according to claim 8 is recorded, each program can be stored on a floppy disk or a CD.
-It can be provided by being recorded on a recording medium suitable for a machine translation device from various media such as a ROM. Then, by executing a program using this recording medium,
It memorizes what character string is replaced by the symbol, and adds information such as singular or plural if it is a number, and adds it to the object node if it is an object. Thus, information on a translated word including a continuous character string such as a number can be correctly expanded in a tree structure. by this,
No matter what character string comes in the input sentence, you can get the correct translation result.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の実施の形態の機械翻訳装置の構成を示
すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a machine translation device according to an embodiment of the present invention.

【図２】「＊勝」のユーザー辞書の辞書情報を示す図で
ある。FIG. 2 is a diagram showing dictionary information of a user dictionary of “* win”.

【図３】「＊勝する」のユーザー辞書の辞書情報を示す
図である。FIG. 3 is a diagram showing dictionary information of a user dictionary “* wins”;

【図４】本実施の形態の翻訳処理を示すフローチャート
である。FIG. 4 is a flowchart showing a translation process according to the embodiment.

【図５】本実施の形態の形態素解析処理を示すフローチ
ャートである。FIG. 5 is a flowchart illustrating a morphological analysis process according to the present embodiment.

【図６】形態素解析の処理が終わった後のツリー構造を
示す図である。FIG. 6 is a diagram showing a tree structure after a morphological analysis process is completed.

【図７】本実施の形態の構文解析処理を示すフローチャ
ートである。FIG. 7 is a flowchart illustrating a syntax analysis process according to the present embodiment.

【図８】構文解析の処理が終わった後のツリー構造を示
す図である。FIG. 8 is a diagram showing a tree structure after a syntax analysis process is completed.

【図９】形態素解析の処理が終わった後のツリー構造を
示す図である。FIG. 9 is a diagram showing a tree structure after a morphological analysis process is completed.

【図１０】構文解析の処理が終わった後のツリー構造を
示す図である。FIG. 10 is a diagram showing a tree structure after a syntax analysis process is completed.

【符号の説明】[Explanation of symbols]

１０入力部２０出力部３０ＣＰＵ５０ＲＯＭ５２形態素解析部５３構文解析部５４構文変換部５５構文生成部５６形態素生成部５７翻訳辞書６１ユーザー辞書 Reference Signs List 10 input unit 20 output unit 30 CPU 50 ROM 52 morphological analysis unit 53 syntax analysis unit 54 syntax conversion unit 55 syntax generation unit 56 morpheme generation unit 57 translation dictionary 61 user dictionary

Claims

【特許請求の範囲】[Claims]

【請求項１】入力手段によって入力された第一言語を
単語に分割するための形態素解析手段と、前記形態素解
析手段によって分割された第一言語の単語の係り受け等
を解析するための構文解析手段と、前記構文解析手段に
よって解析された第一言語について翻訳辞書を参照しな
がら目的の第二言語に翻訳するための翻訳生成手段とを
備えた機械翻訳装置において、連続性のある文字列を含む単語を予め定められた記号に
よって記憶手段に登録することができる登録手段を備え
たことを特徴とする機械翻訳装置。1. A morphological analysis means for dividing a first language input by an input means into words, and a syntax analysis for analyzing a dependency of a word of the first language divided by the morphological analysis means. Means and a translation generating means for translating the first language analyzed by the parsing means into a target second language while referring to a translation dictionary. A machine translation device comprising a registration unit capable of registering a word to be included in a storage unit using a predetermined symbol.

【請求項２】前記記憶手段に登録された連続性のある
文字列を含む単語を検索する検索手段と、前記記号に置き換えられた連続性のある文字列を記憶す
る文字列記憶手段とを備えたことを特徴とする請求項１
に記載の機械翻訳装置。2. A search means for searching for a word including a continuous character string registered in the storage means, and a character string storage means for storing a continuous character string replaced with the symbol. 2. The method according to claim 1, wherein
A machine translation device according to claim 1.

【請求項３】前記構文解析手段は、前記記憶手段等に
登録された情報に基づいて、連続性のある文字列を含む
翻訳に必要な情報をツリー構造に反映させるツリー構造
変換手段を備えることを特徴とする請求項１もしくは２
に記載の機械翻訳装置。3. The method according to claim 1, wherein the syntax analysis unit includes a tree structure conversion unit that reflects information necessary for translation including a continuous character string in a tree structure based on the information registered in the storage unit and the like. 3. The method according to claim 1, wherein
A machine translation device according to claim 1.

【請求項４】前記記憶手段は、ユーザーによって使用
される外部記憶装置であることを特徴とする請求項１乃
至３のいずれかに記載の機械翻訳装置。4. The machine translation apparatus according to claim 1, wherein the storage unit is an external storage device used by a user.

【請求項５】前記翻訳生成手段によって生成された第
二言語の翻訳文を出力する出力手段を備えたことを特徴
とする請求項１乃至４のいずれかに記載の機械翻訳装
置。5. The machine translation apparatus according to claim 1, further comprising an output unit that outputs a translation sentence of the second language generated by the translation generation unit.

【請求項６】入力手段によって入力された第一言語を
単語に分割するための形態素解析プログラムと、前記形
態素解析プログラムによって分割された第一言語の単語
の係り受け等を解析するための構文解析プログラムと、
前記構文解析プログラムによって解析された第一言語に
ついて翻訳辞書を参照しながら目的の第二言語に翻訳す
るための翻訳生成プログラムとを記録したコンピュータ
読み取り可能な記録媒体において、連続性のある文字列を含む単語を予め定められた記号に
よって記憶手段に登録することができる登録プログラム
を備えたことを特徴とする機械翻訳処理プログラムを記
録したコンピュータ読み取り可能な記録媒体。6. A morphological analysis program for dividing the first language input by the input means into words, and a syntax analysis for analyzing the dependency of the words of the first language divided by the morphological analysis program. Program and
A computer-readable recording medium that records a translation generation program for translating the first language analyzed by the syntax analysis program into a target second language while referring to a translation dictionary, wherein a character string having continuity is A computer-readable recording medium recording a machine translation processing program, comprising a registration program capable of registering a word including the character by a predetermined symbol in a storage unit.

【請求項７】前記記憶手段に登録された連続性のある
文字列を含む単語を検索する検索プログラムと、前記記号に置き換えられた連続性のある文字列を記憶す
る文字列記憶プログラムとを備えたことを特徴とする請
求項６に記載の機械翻訳処理プログラムを記録したコン
ピュータ読み取り可能な記録媒体。7. A search program for searching for a word including a continuous character string registered in the storage means, and a character string storage program for storing a continuous character string replaced with the symbol. A computer-readable recording medium recording the machine translation processing program according to claim 6.

【請求項８】前記構文解析プログラムは、前記記憶手
段等に登録された情報に基づいて、連続性のある文字列
を含む翻訳に必要な情報をツリー構造に反映させるツリ
ー構造変換プログラムを備えることを特徴とする請求項
６もしくは７に記載の機械翻訳処理プログラムを記録し
たコンピュータ読み取り可能な記録媒体。8. The syntax analysis program according to claim 1, further comprising a tree structure conversion program for reflecting information necessary for translation including a continuous character string in a tree structure based on information registered in the storage unit or the like. A computer-readable recording medium recording the machine translation processing program according to claim 6.