JP2001101186A

JP2001101186A - Machine translating device

Info

Publication number: JP2001101186A
Application number: JP27773099A
Authority: JP
Inventors: Tokuji Ikeno; 篤司池野
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1999-09-30
Filing date: 1999-09-30
Publication date: 2001-04-13

Abstract

PROBLEM TO BE SOLVED: To realize a machine translating device which does not have to store a large amount of data to select translated words. SOLUTION: When a translated word selecting part 2 receives a plurality of translated word candidates for a plurality of words from a translating part 1, the part 2 outputs translated word candidates to be retrieved as a retrieval request to a retrieval instructing part 3. The part 3 generates a query on the basis of the translated work candidates and instructs a retrieving part 4 to perform retrieval. The part 4 accesses a document file group 4b on a network and performs retrieval. When the part 3 receives retrieval results from the part 4, the part 3 outputs them to the part 2. The part 2 narrows down to one translated word from the plurality of translated word candidates and transfers it to the part 1 on the basis of prescribed decision reference on the basis of the retrieval results.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、第１言語（原言
語）を、第２言語（目的言語）に翻訳するのに際して、
複数の訳語候補が存在するときに訳語間の共起関係を参
照して訳語を決定することで、適切に翻訳を行う機械翻
訳装置に関するものである。The present invention relates to a method for translating a first language (source language) into a second language (target language).
The present invention relates to a machine translation device that appropriately translates by determining a translated word by referring to a co-occurrence relationship between translated words when a plurality of translated word candidates exist.

【０００２】[0002]

【従来の技術】従来技術として次のような文献に示され
ているものがあった。１．佐藤理史「ＭＢＴ１：実例に基づく訳語選択」
人工知能学会誌Ｖｏｌ．６，Ｎｏ．４（Ｊｕｌ．
1991）２．野美山浩「目的言語の知識を用いた訳語選択とそ
の学習性」情報処理学会研究報告ＮＬ８６−８（199
1）３．特開平６−２５１０５５号公報「機械翻訳方式」2. Description of the Related Art There is a prior art disclosed in the following literature. 1. Masashi Sato "MBT1: Selection of translation words based on actual examples"
Journal of the Japanese Society for Artificial Intelligence Vol. 6, No. 4 (Jul.
1991) 2. Hiroshi Nomiyama "Translation word selection using knowledge of target language and its learning ability" Information Processing Society of Japan Research Report NL86-8 (199
1) 3. Japanese Unexamined Patent Publication No. Hei 6-251555, "Machine Translation Method"

【０００３】ある言語の表現を他の言語に翻訳する機械
翻訳装置の技術的な課題はいくつかあるが、その中の一
つが訳語選択である。これは、原言語のある単語の翻訳
に際して訳語の候補が複数ある場合に、最も適当な訳語
を選択することである。[0003] There are several technical problems with a machine translator for translating an expression in a certain language into another language, one of which is translation selection. This is to select the most appropriate translation when there are a plurality of translation candidates when translating a certain word in the source language.

【０００４】この問題に対しては、上記文献１や２に示
すように、目的言語での訳語と訳語の関係の強さを見て
訳語の選択をすることである。また、文献３では、関係
の強さを統計的に求めておくのではなく、目的言語の表
現をテキストとして蓄えておき、各単語の訳語を組み合
わせた翻訳候補が蓄えてあったテキスト中の表現とマッ
チした場合に正解とする手法が提案されている。[0004] To solve this problem, as shown in the above-mentioned documents 1 and 2, the translation is selected based on the strength of the relationship between the translation in the target language. In Reference 3, instead of statistically calculating the strength of the relationship, the expression of the target language is stored as text, and the expression in the text in which the translation candidate combining the translation of each word is stored. A method has been proposed in which when a match is made, the correct answer is obtained.

【０００５】[0005]

【発明が解決しようとする課題】上記文献１と２に示さ
れている手法は、大量の文書データを用意して統計的な
処理を行うことにより、尤もらしい訳語の選択を行う、
というものである。The techniques disclosed in the above-mentioned documents 1 and 2 select a likely word by preparing a large amount of document data and performing statistical processing.
That is.

【０００６】しかしながら、一般に、統計処理によって
精度の高い結果を得るためには、個人のパーソナルコン
ピュータには保持できないほどの膨大なデータ（この場
合テキストデータ）が必要となるが、上記の手法では、
そのような大量のデータを入手する手段およびアクセス
する手段については全く考慮されていない。However, in general, in order to obtain highly accurate results by statistical processing, a huge amount of data (in this case, text data) that cannot be stored in an individual personal computer is required.
No means of obtaining and accessing such large amounts of data is considered.

【０００７】また、文献３の手法は、統計処理は行わな
いものの、大量のテキストを蓄えておかなければならな
いため、上述の手法と同様の問題点を抱えていた。Further, the method of Reference 3 has a problem similar to that of the above-mentioned method, since a large amount of text must be stored, though statistical processing is not performed.

【０００８】[0008]

【課題を解決するための手段】本発明は、前述の課題を
解決するため次の構成を採用する。〈構成１〉原言語文を目的言語文に翻訳する際、複数の
語に対する複数の訳語候補を出力する翻訳部と、翻訳部
から複数の訳語候補を受け取った場合は、複数の訳語候
補による検索要求を行い、その検索要求に対する検索結
果を受け取った場合は、検索結果に基づき複数の訳語候
補を一つに絞り込んで翻訳部に出力する訳語選択部と、
訳語選択部から検索要求を受け取った場合は、複数の訳
語候補に基づいてクエリを生成してクエリによる検索指
示を出力し、その検索指示に対する検索結果を受け取っ
た場合は、これを訳語選択部に検索結果として出力する
検索指示部と、検索指示部より検索指示を受けてネット
ワーク上に存在する文書ファイル群にアクセスし、クエ
リに基づく検索結果を取得し、これを検索指示部に出力
する検索部とを備えたことを特徴とする機械翻訳装置。The present invention employs the following structure to solve the above-mentioned problems. <Structure 1> When translating a source language sentence into a target language sentence, a translation unit that outputs a plurality of translation word candidates for a plurality of words, and, when a plurality of translation word candidates are received from the translation unit, a search using the plurality of translation word candidates Requesting, and when a search result for the search request is received, a translation word selection unit that narrows down a plurality of translation word candidates to one based on the search result and outputs it to the translation unit;
When a search request is received from the translation term selection unit, a query is generated based on a plurality of translation term candidates, a search instruction is output from the query, and when a search result for the search instruction is received, this is sent to the translation word selection unit. A search instructing unit that outputs as a search result, and a search unit that receives a search instruction from the search instructing unit, accesses a document file group existing on the network, obtains a search result based on a query, and outputs this to the search instructing unit A machine translation device comprising:

【０００９】〈構成２〉構成１に記載の機械翻訳装置に
おいて、検索指示部は、それぞれの語の訳語候補を組み
合わせてクエリを生成するクエリ生成部と、検索部が行
う検索処理の負荷状況を判断して、検索部に検索要求を
行うアクセス制御部とからなることを特徴とする機械翻
訳装置。<Structure 2> In the machine translation device according to Structure 1, the search instruction unit includes a query generation unit configured to generate a query by combining translated word candidates of each word, and a load state of a search process performed by the search unit. A machine translation device, comprising: an access control unit that determines and sends a search request to a search unit.

【００１０】〈構成３〉構成１または２に記載の機械翻
訳装置において、訳語選択部は、複数の訳語候補から絞
り込むために、複数の語においてどの語との組み合わせ
を判断基準とするかを決定し、これらの語の複数の訳語
候補を検索指示部に渡す対象語抽出部と、対象語抽出部
が渡した語による検索指示部からの全ての検索結果が揃
うまで検索結果を保持しておき、揃った時点でデータを
出力する検索結果保持部と、検索結果保持部よりデータ
を受け取った場合、予め決められた判定基準を適用し、
複数の訳語候補から絞り込みを行う結果判定部とからな
ることを特徴とする機械翻訳装置。<Structure 3> In the machine translation apparatus according to structure 1 or 2, the translation word selection unit determines which combination of a plurality of words is used as a criterion in order to narrow down from a plurality of translation word candidates. Then, the search result is held until all the search results from the search instruction unit based on the word passed by the target word extraction unit and the target word extraction unit that pass a plurality of translation word candidates of these words to the search instruction unit are obtained. , A search result holding unit that outputs data at the same time, and when receiving data from the search result holding unit, applies a predetermined determination criterion,
A machine translation apparatus comprising: a result determination unit that narrows down a plurality of translation word candidates.

【００１１】〈構成４〉原言語文を目的言語文に翻訳す
る際、複数の語に対する複数の訳語候補を出力する翻訳
部と、翻訳部から複数の訳語候補を受け取った場合は、
複数の訳語候補による検索要求を行い、その検索要求に
対する検索結果を受け取った場合は、検索結果に基づき
複数の訳語候補を一つに絞り込んで翻訳部に出力する訳
語選択部と、訳語選択部において過去に複数の訳語候補
から一つに絞り込まれた訳語を、複数の訳語候補と関連
付けて保持する判定結果データベースと、訳語選択部か
ら検索要求を受け取った場合は、検索要求の複数の訳語
候補が判定結果データベースに存在するかを判定し、存
在した場合は、その複数の訳語候補から絞り込まれた訳
語を検索結果として翻訳部に出力すると共に、存在しな
い場合は複数の訳語候補による検索要求を行う判定結果
検索部と、判定結果検索部から検索要求を受け取った場
合は、複数の訳語候補に基づいてクエリを生成してクエ
リによる検索指示を出力し、その検索指示に対する検索
結果を受け取った場合は、これを訳語選択部に検索結果
として出力する検索指示部と、検索指示部より検索指示
を受けてネットワーク上に存在する文書ファイル群にア
クセスし、クエリに基づく検索結果を取得し、これを検
索指示部に出力する検索部とを備えたことを特徴とする
機械翻訳装置。<Structure 4> When translating a source language sentence into a target language sentence, when a translation unit that outputs a plurality of translation word candidates for a plurality of words and a plurality of translation word candidates are received from the translation unit,
When a search request is made with a plurality of translation word candidates and a search result corresponding to the search request is received, a translation word selection unit that narrows down the plurality of translation word candidates to one based on the search result and outputs it to the translation unit, and a translation word selection unit. When a search request is received from a translation word selection unit and a determination result database that holds translations narrowed down to one from a plurality of translation word candidates in the past in association with the plurality of translation word candidates, the plurality of translation word candidates of the search request are It is determined whether or not it exists in the determination result database. If there is, a translation word narrowed down from the plurality of translation word candidates is output to the translation unit as a search result, and if not, a search request is made with the plurality of translation word candidates. A determination result search unit and, when receiving a search request from the determination result search unit, generating a query based on a plurality of translation word candidates and performing a search instruction by the query When a search result is received and a search result corresponding to the search instruction is received, a search instruction unit that outputs the search result to the translation selection unit as a search result, and a search instruction from the search instruction unit to access a document file group existing on the network. A search unit that obtains a search result based on the query and outputs the result to a search instruction unit.

【００１２】〈構成５〉原言語文を目的言語文に翻訳す
る際、複数の語に対する複数の訳語候補を出力する翻訳
部と、翻訳部から複数の訳語候補を受け取った場合は、
複数の訳語候補による検索要求を行い、その検索要求に
対する検索結果を受け取った場合は、検索結果に基づき
訳語候補を一つに絞り込んで翻訳部に出力する訳語選択
部と、訳語選択部から検索要求を受け取った場合は、複
数の訳語候補に基づいてクエリを生成してクエリによる
検索指示を出力し、その検索指示に対する検索結果を受
け取った場合は、これを訳語選択部に検索結果として出
力する検索指示部と、訳語選択部に出力した過去の検索
結果を保持する検索結果データベースと、検索指示部よ
り検索指示を受けてネットワーク上に存在する文書ファ
イル群にアクセスし、クエリに基づく検索結果を取得
し、これを検索指示部に出力する検索部とを備え、検索
指示部は、訳語選択部から検索要求を受け取った場合、
先ず、検索結果データベースをアクセスし、検索結果デ
ータベース中に検索要求に一致するクエリがあるかを調
べ、あった場合は、そのクエリの検索結果を訳語選択部
に出力するよう構成されていることを特徴とする機械翻
訳装置。<Structure 5> When a source language sentence is translated into a target language sentence, a translation unit that outputs a plurality of translation word candidates for a plurality of words, and when a plurality of translation word candidates are received from the translation unit,
When a search request is made based on a plurality of translation word candidates and a search result for the search request is received, a translation word selection unit that narrows down the translation word candidates to one based on the search result and outputs it to the translation unit, and a search request from the translation word selection unit. Is received, a query is generated based on a plurality of translation word candidates, a search instruction is output by the query, and if a search result for the search instruction is received, this is output to the translation word selection unit as a search result. An instructing unit, a search result database holding past search results output to the translation word selecting unit, and receiving a search instruction from the search instructing unit, accessing a group of document files existing on the network, and acquiring a search result based on the query And a search unit that outputs the search request to the search instruction unit. When the search instruction unit receives the search request from the translation word selection unit,
First, the search result database is accessed, and it is checked whether there is a query that matches the search request in the search result database. If there is, a search result of the query is output to the translation word selection unit. A featured machine translation device.

【００１３】[0013]

【発明の実施の形態】以下、本発明の実施の形態を具体
例を用いて詳細に説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below in detail with reference to specific examples.

【００１４】《具体例１》〈構成〉図１は、本発明の機械翻訳装置の具体例１を示
す構成図である。図示のように本具体例の機械翻訳装置
は、翻訳部１と、訳語選択部２と、検索指示部３と、検
索部４とからなる。Embodiment 1 <Configuration> FIG. 1 is a configuration diagram showing Embodiment 1 of a machine translation apparatus according to the present invention. As shown in the figure, the machine translation apparatus of this specific example includes a translation unit 1, a translated word selection unit 2, a search instruction unit 3, and a search unit 4.

【００１５】翻訳部１は、ある言語の文を入力として受
け取り、それに相当する他の言語の文を出力する機能を
有している。訳語選択部２は、翻訳部１から複数の訳語
候補を持つ翻訳途中の文を受け取り、検索指示部３に必
要なデータを送って検索結果を受け取り、その結果によ
って複数の訳語候補を一つに絞り、翻訳部１に訳語を一
つに絞った状態の文を返す、という機能を有している。The translation unit 1 has a function of receiving a sentence in a certain language as input and outputting a corresponding sentence in another language. The translated word selection unit 2 receives a partially translated sentence having a plurality of translated word candidates from the translation unit 1, sends necessary data to the search instruction unit 3, receives a search result, and combines the plurality of translated word candidates into one according to the result. It has a function of returning a sentence in which the translated words are narrowed down to one to the translating unit 1.

【００１６】検索指示部３は、訳語選択部２から検索対
象となる候補語を受け取り、クエリを生成して検索部４
に検索要求を出す機能を有している。検索部４は、検索
指示部３から検索要求を受け取って検索を行い、検索結
果を検索指示部３に返す機能を有している。The search instruction unit 3 receives candidate words to be searched from the translated word selection unit 2, generates a query, and generates a query.
Has a function of issuing a search request. The search unit 4 has a function of receiving a search request from the search instruction unit 3, performing a search, and returning a search result to the search instruction unit 3.

【００１７】翻訳部１は、入力部１ａと、解析部１ｂ
と、変換部１ｃと、生成部１ｄと、出力部１ｅとからな
る。The translation unit 1 includes an input unit 1a and an analysis unit 1b.
, A conversion unit 1c, a generation unit 1d, and an output unit 1e.

【００１８】入力部１ａは、原文の入力を受け付け、解
析部１ｂに渡す機能部である。解析部１ｂは、原文の構
造を解析する（係り受け関係を明らかにする）機能部で
ある。変換部１ｃは、解析された原文の構造に応じて原
文の語句を用いて訳文の構造を形作る機能を有してい
る。The input unit 1a is a functional unit that receives an input of an original sentence and passes it to the analysis unit 1b. The analysis unit 1b is a functional unit that analyzes the structure of the original text (clarifies the dependency relationship). The conversion unit 1c has a function of forming the structure of the translated sentence using the words and phrases of the original sentence according to the analyzed structure of the original sentence.

【００１９】生成部１ｄは、変換された目的言語の語句
を用いて訳文の構造を形作るものである。訳語が複数の
候補を持つ場合、この生成部１ｄにより、訳語選択部２
との情報受け渡しを行い、一つに絞り込む機能を有して
いる。出力部１ｅは、生成部１ｄで生成された訳文を受
け取り出力するものである。The generator 1d forms the structure of the translated sentence using the converted words in the target language. When the translated word has a plurality of candidates, the translated word selecting unit 2
And has a function to narrow down to one. The output unit 1e receives and outputs the translated sentence generated by the generation unit 1d.

【００２０】訳語選択部２は、対象語抽出部２ａと、検
索結果保持部２ｂと、結果判定部２ｃからなる。The translated word selection unit 2 includes a target word extraction unit 2a, a search result holding unit 2b, and a result determination unit 2c.

【００２１】対象語抽出部２ａは、生成部１ｄから複数
の訳語候補を持つ翻訳途中の文を受け取り、複数候補を
持つ語と、それを一つに絞るために適当な語を抽出して
候補語とし、検索指示部３の後述するクエリ生成部３ａ
と、検索結果保持部２ｂに渡す機能を有している。The target word extracting unit 2a receives a sentence in the middle of translation having a plurality of translation word candidates from the generation unit 1d, and extracts a word having a plurality of candidates and an appropriate word for narrowing the words to one. And a query generation unit 3a to be described later of the search instruction unit 3.
And a function of passing the search result to the search result holding unit 2b.

【００２２】検索結果保持部２ｂは、対象語抽出部２ａ
から受け取った候補語データを元に検索結果を保持する
ためのテーブルを作成し、検索指示部３の後述するアク
セス制御部３ｂから検索結果のデータを受け取ってテー
ブルのデータを完成させ、結果判定部２ｃにそのテーブ
ルデータを渡す機能を有している。結果判定部２ｃは、
検索結果保持部２ｂからデータを受け取り、そのデータ
を用いて複数ある訳語候補を一つに絞った訳文を生成部
１ｄに返す機能を有している。The search result holding unit 2b includes a target word extracting unit 2a
Creates a table for holding the search result based on the candidate word data received from the server, receives the search result data from an access control unit 3b of the search instruction unit 3, which will be described later, and completes the data of the table. 2c has a function of passing the table data. The result determination unit 2c
It has a function of receiving data from the search result holding unit 2b and using the data to return a translated sentence in which a plurality of translated word candidates are reduced to one to the generating unit 1d.

【００２３】検索指示部３は、上述したようにクエリ生
成部３ａとアクセス制御部３ｂとからなる。クエリ生成
部３ａは、対象語抽出部２ａから受け取った候補語から
（全ての）組み合わせを作り、クエリを生成して検索部
４の検索エンジン４ａに渡す機能を有している。The search instruction unit 3 includes the query generation unit 3a and the access control unit 3b as described above. The query generation unit 3a has a function of creating (all) combinations from the candidate words received from the target word extraction unit 2a, generating a query, and passing it to the search engine 4a of the search unit 4.

【００２４】アクセス制御部３ｂは、クエリ生成部３ａ
からクエリを受け取り、（ネットワークおよび）検索エ
ンジン４ａに過大な負荷をかけないように、状態をモニ
タリングしながら検索エンジン４ａに検索を要求する機
能を有している。また、アクセス制御部３ｂは、利用す
る検索エンジン４ａの種類によってクエリを整形する機
能を有している。同時に、アクセス制御部３ｂは、検索
エンジン４ａから検索結果を受け取り、検索結果保持部
２ｂに渡す機能を有している。The access control unit 3b includes a query generation unit 3a
And has a function of requesting the search engine 4a to search while monitoring the status so as not to put an excessive load on the (network and) the search engine 4a. The access control unit 3b has a function of shaping a query according to the type of the search engine 4a to be used. At the same time, the access control unit 3b has a function of receiving a search result from the search engine 4a and passing it to the search result holding unit 2b.

【００２５】検索部４は、検索エンジン４ａと文書ファ
イル群４ｂとからなる。検索エンジン４ａは、アクセス
制御部３ｂから受け取った検索要求に基づいて、ネット
ワークで接続された文書ファイル群４ｂから該当する文
書あるいは関連データを検索し、検索結果をアクセス制
御部３ｂに返す機能を有している。また、文書ファイル
群４ｂは、ネットワークを介してアクセスできる文書フ
ァイルの集まりを示している。The search section 4 includes a search engine 4a and a document file group 4b. The search engine 4a has a function of searching for a corresponding document or related data from the document file group 4b connected via the network based on the search request received from the access control unit 3b, and returning a search result to the access control unit 3b. are doing. The document file group 4b indicates a group of document files that can be accessed via a network.

【００２６】尚、この検索部４は、例えば、文献：
（社）日本電子工業振興協会「自然言語処理システムの
動向に関する調査報告書」（1997年4月），ｐｐ．142-1
56等に示されているように、既知の構成である。The search unit 4 is, for example, a document:
Japan Electronic Industry Development Association, "Survey Report on Trends in Natural Language Processing Systems" (April 1997), pp. 142-1
This is a known configuration as shown in 56 and the like.

【００２７】また、上記翻訳部１、訳語選択部２、検索
指示部３はパーソナルコンピュータ等で構成され、それ
ぞれの処理部は、各処理に対応したソフトウェアとこれ
を実行するプロセッサやメモリおよび補助記憶装置等で
その機能が実現されている。The translation unit 1, the translation word selection unit 2, and the search instruction unit 3 are constituted by a personal computer or the like, and each processing unit is composed of software corresponding to each process and a processor, a memory, and an auxiliary storage for executing the software. The function is realized by a device or the like.

【００２８】〈動作〉先ず、翻訳部１の生成部１ｄに入
力されるまでの動作は従来と同様である。即ち、ある原
文が入力部１ａに入力されると、解析部１ｂはこれを解
析し（係り受け関係を解析し）、変換部１ｃは、これを
次のような形で生成部１ｄに渡す。<Operation> First, the operation up to the input to the generation unit 1d of the translation unit 1 is the same as the conventional one. That is, when a certain original sentence is input to the input unit 1a, the analysis unit 1b analyzes it (analyzes the dependency relationship), and the conversion unit 1c passes it to the generation unit 1d in the following manner.

【００２９】that be a hot/salty/hard/difficult/tou
gh work/labor/task/job/business.ここで、“/”は複
数候補を列記するための記号である。尚、“be”のノー
ドが示すように、活用する語は全て原形で表示されるも
のとする。尚、文や句を構成する各単語（形態素）をこ
こではノードと呼ぶことにする。[0029] that be a hot / salty / hard / difficult / tou
gh work / labor / task / job / business. Here, “/” is a symbol for listing a plurality of candidates. Note that, as indicated by the “be” node, all the words to be used are displayed in their original forms. Here, each word (morpheme) constituting a sentence or phrase is called a node here.

【００３０】または、主語−述語関係や修飾関係が明ら
かにされた（構造を持った）形式で受け取ってもよい。
図２は、このような主語−述語関係や修飾関係が明らか
にされた形式の説明図である。Alternatively, it may be received in a form (having a structure) in which the subject-predicate relation and the modification relation are clarified.
FIG. 2 is an explanatory diagram of a format in which such a subject-predicate relationship and a modification relationship are clarified.

【００３１】また、語順などが原言語の状態を保持した
まま渡される場合もあるが、その場合、生成部１ｄの内
部で目的言語での正しい語順になるように調整されるも
のとする。In some cases, the word order or the like is passed while maintaining the state of the source language. In such a case, it is assumed that adjustment is made within the generation unit 1d so that the word order is correct in the target language.

【００３２】生成部１ｄにおいて、上述したような入力
に複数候補が列記されている（“/”を含むノードが存
在する）場合に、従来の装置とは異なる動作をする。こ
こでは、入力に複数候補が列記されている場合に、複数
候補が一つに絞られて生成部１ｄに戻されるまでの各部
の動作を順に説明する。When a plurality of candidates are listed in the input as described above (there is a node including "/"), the generator 1d operates differently from the conventional apparatus. Here, when a plurality of candidates are listed in the input, the operation of each unit until the plurality of candidates is narrowed down to one and returned to the generation unit 1d will be sequentially described.

【００３３】先ず、対象語抽出部２ａの動作を説明す
る。図３は、対象語抽出部２ａの動作フローチャートで
ある。文や句を構成する各単語（形態素）をここではノ
ードと呼ぶことにする。尚、ここで句とは複数語からな
る文の構成要素（例えば「名詞句」「前置詞句」等）で
あり、例えば英語であれば主語−動詞（−目的語）のよ
うな構造を持たないものを指す。一方、文とは、主語−
動詞（−目的語）のような一連の構造を持ってそれだけ
で完結するものを指す。First, the operation of the target word extraction unit 2a will be described. FIG. 3 is an operation flowchart of the target word extraction unit 2a. Here, each word (morpheme) constituting a sentence or phrase is referred to as a node. Here, a phrase is a component of a sentence composed of a plurality of words (for example, “noun phrase”, “preposition phrase”, etc.), and for example, English does not have a structure like a subject-verb (−object). Points to something. On the other hand, a sentence is a subject-
It refers to something that has a series of structures such as a verb (-object) and is completed by itself.

【００３４】対象語抽出部２ａは、受け取ったノード列
の最初のノードを処理対象としてセットし（ステップＳ
３０１）、そのノードが複数候補を持っているかどうか
をチェックする（ステップＳ３０２）。複数候補を持っ
ている場合、そのノードの複数候補と組み合わせて相性
を見て絞り込むための語（組み合わせ語）を抽出する
（ステップＳ３０３）。The target word extracting unit 2a sets the first node of the received node string as a processing target (step S).
301), it is checked whether the node has a plurality of candidates (step S302). If there is a plurality of candidates, a word (combination word) to be narrowed down in view of the compatibility in combination with the plurality of candidates of the node is extracted (step S303).

【００３５】この抽出の最も簡単な方法は、隣接するノ
ードを組み合わせ語とする方法であるが、図２のように
係り受け関係が明らかな場合、複数候補を持ったノード
と係り受け関係のある語を組み合わせ語とすることもで
きる。また、前接、後接の２語、あるいは、どちらかの
方向に２語、というように、組み合わせ語を１語ではな
く２語以上に設定することも可能である。The simplest method of this extraction is to use adjacent nodes as combination words. However, when the dependency relation is clear as shown in FIG. 2, there is a dependency relation with a node having a plurality of candidates. Words can be combined words. In addition, it is also possible to set the combination word to two or more words instead of one word, such as two words in front and rear, or two words in either direction.

【００３６】尚、組み合わせ語となるノード自身も複数
候補を持っていてもよい。It should be noted that the node serving as a combination word itself may have a plurality of candidates.

【００３７】次に、セットされている複数候補を持った
ノード（セット語）と、組み合わせ語と、位置情報など
をクエリ生成部３ａに送付する（ステップＳ３０４）。
ここで、位置情報とは、セット語と組み合わせ語の位置
関係（セット語の後ろに隣接して組み合わせ語がある、
とか、セット語を目的語とする動詞が組み合わせ語であ
る、等）を示す情報である。そして、この情報を元にク
エリ生成部３ａはクエリを生成する。Next, a node (set word) having a plurality of candidates that have been set, combination words, position information, and the like are sent to the query generation unit 3a (step S304).
Here, the positional information refers to a positional relationship between the set word and the combination word (the combination word is adjacent to the set word,
Or a verb having a set word as an object is a combination word, etc.). Then, the query generator 3a generates a query based on this information.

【００３８】クエリ生成部３ａに送付されるデータ（情
報）形式は次の通りである。［送付語数，セット語，組み合わせ語１，組み合わせ語
２，…，語順，位置情報１，位置情報２，…］The data (information) format sent to the query generation unit 3a is as follows. [Number of words sent, set words, combination words 1, combination words 2, ..., word order, location information 1, location information 2, ...]

【００３９】例えば、入力が図２に示すような単語列で
あった場合は以下のようなデータが渡される。例１：［3,hot/salty/hard/difficult/tough,be,work/labor/t
ask/job/business,2-1-3,前方，後接］For example, when the input is a word string as shown in FIG. 2, the following data is passed. Example 1: [3, hot / salty / hard / difficult / tough, be, work / labor / t
ask / job / business, 2-1-3, forward and follow-up]

【００４０】この例は、セット語“hot/salty/hard/dif
ficult/tough”に対して、組み合わせ語として“be”と
“work/labor/task/job/business”を使い、その出現の
仕方は、２番目の要素（つまり、組み合わせ語１）、１
番目の要素（つまりセット語）、３番目の要素（つまり
組み合わせ語２）、の順である。In this example, the set word “hot / salty / hard / dif
For "ficult / tough", "be" and "work / labor / task / job / business" are used as a combination word, and the appearance of the combination word is the second element (that is, combination word 1), 1
The third element (ie, set word 2) and the third element (ie, combination word 2).

【００４１】この例で、「前方」が意味するところは、
セット語と組み合わせ語１との位置関係は、「セット語
の前方（直前でなくてもよい）に組み合わせ語１が出現
する」である。「後接」が意味するところは、セット語
と組み合わせ語２との位置関係は、「セット語の後ろに
組み合わせ語２が接している」である。In this example, "forward" means:
The positional relationship between the set word and the combination word 1 is “the combination word 1 appears before (not necessarily immediately before) the set word”. The meaning of the "postfix" means that the positional relationship between the set word and the combination word 2 is "the combination word 2 is in contact with the set word."

【００４２】説明を簡単にするため、以降では上記例１
から組み合わせ語１の“be”を除いた例２を参照して説
明を行う。例２：［2,hot/salty/hard/difficult/tough,work/labor/task
/job/business,1-2,後接］For the sake of simplicity, the following Example 1
This will be described with reference to Example 2 in which “be” of the combination word 1 is removed from. Example 2: [2, hot / salty / hard / difficult / tough, work / labor / task
/ job / business, 1-2, subsequent]

【００４３】次に、セットされている複数候補を持った
ノードと組み合わせ語とを、検索結果保持部２ｂに送付
する（ステップＳ３０５）。Next, the node having the plurality of candidates set and the combination word are sent to the search result holding unit 2b (step S305).

【００４４】ステップＳ３０２において、複数候補を持
っていない場合、入力ノード列の次のノードがあるかど
うかをチェックし（ステップＳ３０６）、次のノードが
あれば、それを処理対象としてセットして（ステップＳ
３０７）、ステップＳ３０２の複数候補チェックに戻
る。ステップＳ３０６において、次のノードがない場
合、一連の処理を終了する。In step S302, if there is not a plurality of candidates, it is checked whether there is a next node in the input node sequence (step S306). If there is a next node, it is set as a processing target (step S306). Step S
307), and returns to the multiple candidate check in step S302. If there is no next node in step S306, a series of processing ends.

【００４５】次に、クエリ生成部３ａの動作を説明す
る。図４は、クエリ生成部３ａの動作を示すフローチャ
ートである。先ず、対象語抽出部２ａから受け取ったデ
ータのうち、セット語と組み合わせ語をリストに格納す
る（ステップＳ４０１）。複数候補が存在する語は、一
つずつにばらしてリストに格納する。図５は、セット語
・組み合わせ語のリストの説明図である。Next, the operation of the query generator 3a will be described. FIG. 4 is a flowchart illustrating the operation of the query generation unit 3a. First, among the data received from the target word extraction unit 2a, set words and combination words are stored in a list (step S401). Words in which a plurality of candidates exist are separated into one word and stored in the list. FIG. 5 is an explanatory diagram of a list of set words and combination words.

【００４６】そして、各リストの要素を用いて全ての組
み合わせを作成し、クエリ用リストに格納する（ステッ
プＳ４０２）。図６は、クエリ用リストの説明図であ
る。Then, all combinations are created using the elements of each list and stored in the query list (step S402). FIG. 6 is an explanatory diagram of a query list.

【００４７】次に、クエリ用リストが空かどうかチェッ
クする（ステップＳ４０３）。クエリ用リストが空では
なかった場合は、先ず、クエリ用リストから要素を一つ
取り出してクエリ作成ターゲットとする（ステップＳ４
０４）。Next, it is checked whether the query list is empty (step S403). If the query list is not empty, first, one element is extracted from the query list and set as a query creation target (step S4).
04).

【００４８】次に、セット語・組み合わせ語の語順情報
に応じて、クエリのベースとなるスロットを配置する
（ステップＳ４０５）。例２のようなデータの場合、語
順情報は“1-2”なので、＄１＋＄２のようにスロットを配置する。尚、＄１、＄２の部分が
スロットであり、＄の直後の数字は何番目の要素がその
スロットに入るかを示している。この例では、前のスロ
ットには１番目の要素が、後ろのスロットには２番目の
要素が入ることになる。Next, a slot serving as a query base is arranged according to the word order information of the set word / combination word (step S405). In the case of the data as in Example 2, since the word order information is "1-2", the slots are arranged as {1 + # 2}. Note that the portions of $ 1 and $ 2 are slots, and the number immediately after $ indicates the number of the element that goes into that slot. In this example, the first element is contained in the preceding slot, and the second element is contained in the subsequent slot.

【００４９】そして、位置情報を参照して、関係情報を
付与する（ステップＳ４０６）。位置情報が「前方」
「後方」のように直前や直後を意味しない場合は、例え
ば、＄１＋ＮＥＡＲ＋＄２のように、近接して存在するという意味の関係子“ＮＥ
ＡＲ”をスロットの間に挿入することにより関係情報を
付与する。Then, related information is given with reference to the position information (step S406). Location information is "forward"
When it does not mean immediately before or immediately after, such as “behind”, for example, the relational element “NE” meaning that it exists close to each other, such as $ 1 + NEAR + $ 2
AR "is inserted between the slots to give the related information.

【００５０】位置情報が「前接」「後接」のように直前
や直後に限定している場合、例えば、 “＄１＋＄２” のように、スロットの前後をダブルクォートで囲むこと
により関係情報を付与する。When the position information is limited to immediately before or immediately after, such as “front contact” and “back contact”, for example, by enclosing the front and back of the slot with double quotes such as “$ 1 + $ 2”, Is given.

【００５１】以上の手順の後、クエリ（のベースとなる
文字列）を、アクセス制御部３ｂに送付し（ステップＳ
４０７）、ステップＳ４０３に戻る。ステップＳ４０３
で、クエリ用リストが空であった場合は、一連の処理を
終了する。After the above procedure, the query (the character string serving as the base) is sent to the access control unit 3b (step S).
407), and returns to step S403. Step S403
If the query list is empty, a series of processing ends.

【００５２】次に、アクセス制御部３ｂの動作を説明す
る。尚、クエリ生成部３ａからアクセス制御部３ｂに送
られてきたクエリ（ベースとなる文字列）は、スタック
に積まれているものとする。Next, the operation of the access control unit 3b will be described. The query (base character string) sent from the query generation unit 3a to the access control unit 3b is assumed to be stacked on the stack.

【００５３】図７は、アクセス制御部３ｂの動作フロー
チャートである。先ず、スタックからクエリを一つ取り
出す（ステップＳ７０１）。もし、ステップＳ７０１で
スタックが空でクエリが取り出せない場合は、そのまま
スタックにクエリが積まれるまで一時中断する。FIG. 7 is an operation flowchart of the access control unit 3b. First, one query is taken out of the stack (step S701). If the query is not fetched because the stack is empty in step S701, the process is temporarily suspended until the query is stacked on the stack.

【００５４】次に、クエリを渡す検索エンジン４ａの負
荷を調べる（ステップＳ７０２）。負荷が高い場合は、
間をおいてステップＳ７０２で再び負荷チェックを繰り
返す。ステップＳ７０２において、検索エンジン４ａの
負荷が高くない場合は、データ検索ルーチンにクエリを
渡してそのクエリに関する処理を委ね（ステップＳ７０
３）、ステップＳ７０１に戻って処理を繰り返す。アク
セス制御部３ｂは、装置全体が停止するまで動作を続け
る。Next, the load of the search engine 4a that passes the query is checked (step S702). If the load is high,
After a while, the load check is repeated again in step S702. If it is determined in step S702 that the load on the search engine 4a is not high, the query is passed to the data search routine and the process related to the query is entrusted to the data search routine (step S70).
3) Return to step S701 and repeat the process. The access control unit 3b continues to operate until the entire device stops.

【００５５】次に、上記ステップＳ７０３のデータ検索
ルーチンの処理を説明する。図８は、データ検索ルーチ
ンの動作フローチャートである。先ず、クエリ（のベー
スとなる文字列）を、実際の検索を行う検索エンジン４
ａに適した形式に整形し（ステップＳ８０１）、検索エ
ンジン４ａに渡す（ステップＳ８０２）。Next, the processing of the data search routine in step S703 will be described. FIG. 8 is an operation flowchart of the data search routine. First, a search engine 4 that performs an actual search of a query (a character string serving as a base)
It is shaped into a format suitable for a (step S801), and passed to the search engine 4a (step S802).

【００５６】このルーチンは検索エンジン４ａから結果
が返ってくるまで待ち状態に入り、検索結果を受け取っ
たら（ステップＳ８０３）、検索結果保持部２ｂに結果
を送付する（ステップＳ８０４）。This routine enters a wait state until a result is returned from the search engine 4a, and upon receiving a search result (step S803), sends the result to the search result holding unit 2b (step S804).

【００５７】検索エンジン４ａに要求する結果は、クエ
リにマッチした句の数、あるいは、マッチしたページ数
とする。また、検索結果保持部２ｂに送付する内容は、
クエリのベースとなる文字列（あるいは整形済みのクエ
リ）と、クエリにマッチした句あるいはページの数であ
る。The result requested from the search engine 4a is the number of phrases matching the query or the number of matching pages. The contents to be sent to the search result holding unit 2b are:
The string (or formatted query) that the query is based on, and the number of phrases or pages that matched the query.

【００５８】例えば、クエリのベースとなる文字列が、 “hard＋work” であったとする。そして、この場合、ある検索エンジン
を利用する場合は、一例として、 query?pg=aq&what=web&fmt=n&q=%22hard+work%22 のようなクエリに整形されるとする。For example, assume that the character string serving as the base of the query is “hard + work”. Then, in this case, when a certain search engine is used, for example, it is assumed that the query is formed into a query such as query? Pg = aq & what = web & fmt = n & q =% 22hard + work% 22.

【００５９】尚、上記の形式の意味は次の通りである。
“query”は起動されるコマンド名で、“?”以降に示さ
れているのは、コマンドに渡される引数名と値を“=”
でペアにして記述したものである。引数と値のペアを複
数記述する場合は、“&”で連結する。この例では、“p
g”という引数には、Advanced Query（詳細検索）を要
求する“aq”という値を入れている。The meaning of the above format is as follows.
“Query” is the name of the command to be invoked, and the characters after “?” Are “=” for the argument name and value passed to the command.
Is described in pairs. When describing multiple argument-value pairs, concatenate them with "&". In this example, “p
The argument “g” contains a value “aq” requesting an advanced query.

【００６０】“what”は検索対象を何にするかを指定す
るもので、この場合web pageを検索したいので、“we
b”と記述する。また、検索結果として欲しいのはペー
ジ数だけなので、フォーマットを意味する引数“fmt”
には検索したページの細かい情報の表示を抑制するため
ＮＯの意味で“n”を値として与える。"What" specifies what to search. In this case, since we want to search the web page, "we"
b ”. Since the only search result is the number of pages, the argument“ fmt ”which means the format
In order to suppress the display of detailed information of the searched page, “n” is given as a value in the meaning of NO.

【００６１】最後に、実際に検索する文字列を与えるた
めの引数“q”に「クエリのベースとなる文字列」を与
えるが、この場合の検索エンジン４ａでのダブルクォー
トの処理の都合上、「クエリのベースとなる文字列」の
先頭と末尾を“%22”に変換する。Lastly, a "character string serving as a query base" is given to the argument "q" for giving a character string to be actually searched. However, due to the double quote processing in the search engine 4a in this case, " Convert the beginning and end of the "string to be the base of the query" to "% 22".

【００６２】以上のような形に整形されて、検索エンジ
ン４ａに送り込まれる。尚、検索エンジン４ａの種類が
異なれば、記述できる引数（および値）の種類・名前が
異なるため、その検索エンジン４ａに合わせて、クエリ
の整形を行う。The data is shaped in the manner described above and sent to the search engine 4a. If the type of the search engine 4a is different, the types and names of the arguments (and values) that can be described are different, so that the query is shaped according to the search engine 4a.

【００６３】次に、検索結果保持部２ｂおよび結果判定
部２ｃの動作を説明する。図９は、検索結果保持部２ｂ
および結果判定部２ｃの動作フローチャートである。検
索結果保持部２ｂは、対象語抽出部２ａからセット語・
組み合わせ語を受け取る（ステップＳ９０１）。セット
語・組み合わせ語とも、複数候補を持つ場合は一つずつ
にばらして、全ての組み合わせの結果を保持できるよう
に結果保持テーブルを作成しておき（ステップＳ９０
２）、アクセス制御部３ｂから検索結果が送付されてく
るのを待つ。Next, the operation of the search result holding unit 2b and the result judgment unit 2c will be described. FIG. 9 shows the search result holding unit 2b.
6 is an operation flowchart of a result determination unit 2c. The search result holding unit 2b receives a set word / word from the target word extraction unit 2a.
The combination word is received (step S901). If both the set word and the combination word have a plurality of candidates, they are separated into one and a result holding table is created so as to hold the results of all the combinations (step S90).
2) Wait for the search result to be sent from the access control unit 3b.

【００６４】検索結果を受け取ったら、クエリの要素を
見て、当該クエリの位置に結果の数値を埋める（ステッ
プＳ９０３）。次に、テーブルが完成したか否かのチェ
ックを行い（ステップＳ９０４）、完成していなければ
再び検索結果を待つ状態に入り、ステップＳ９０３で次
の結果を受け取り、処理を繰り返す。When the search result is received, the value of the result is embedded in the position of the query by looking at the elements of the query (step S903). Next, it is checked whether or not the table is completed (step S904). If the table is not completed, the process waits for a search result again, receives the next result in step S903, and repeats the process.

【００６５】次に、テーブルが完成したか否かのチェッ
クを行い（ステップＳ９０４）、完成していなければ再
び検索結果を待つ状態に入り、ステップＳ９０３で次の
結果を受け取り、処理を繰り返す。Next, it is checked whether or not the table is completed (step S904). If the table is not completed, the process waits for a search result again, receives the next result in step S903, and repeats the process.

【００６６】ステップＳ９０４で、テーブルの数値が全
て埋められて完成した場合には、結果判定部２ｃにおい
てテーブル中の値を参照して最適な組み合わせを判定結
果として選択する（ステップＳ９０５）。If it is determined in step S904 that the table has been filled with all the numerical values, the result determination section 2c refers to the values in the table and selects an optimal combination as a determination result (step S905).

【００６７】この選択として、最も単純な方法はテーブ
ル中の最大値を取る方法であるが、何らかの重みをつけ
た計算式をテーブル中の数値に適用した後に、最適な組
み合わせを選択してもよい。例えば、各単語の単体での
検索マッチ数を別途収集しておき、組み合わせの検索結
果の数値を単体での数値で割ることによって正規化す
る、等の方法も考えられる。As this selection, the simplest method is to take the maximum value in the table. However, after applying some weighted calculation formula to the numerical value in the table, the optimum combination may be selected. . For example, a method of separately collecting the number of search matches of each word alone and normalizing the result by dividing the numerical value of the search result of the combination by the numerical value of each unit may be considered.

【００６８】図１０は、検索結果保持テーブルの完成例
を示す説明図である。図示のように、例えば、テーブル
中の最大値を取る組み合わせを選択するとすれば、hard
とworkの組み合わせが303,434で最大値であるので、セ
ット語はhardを、組み合わせ語はworkを選択することに
なる。FIG. 10 is an explanatory diagram showing a completed example of the search result holding table. As shown in the figure, for example, if a combination that takes the maximum value in the table is selected, hard
Since the combination of and work is 303,434, which is the maximum value, the set word is hard and the combination word is work.

【００６９】最後に、選択された組み合わせを生成部１
ｄに送付して（ステップＳ９０６）、一連の処理を終了
する。Finally, the selected combination is generated by the generation unit 1
d (step S906), and the series of processing ends.

【００７０】尚、文書ファイル群４ｂは、予め別の手段
で収集・蓄積された文書を処理した特別なフォーマット
のデータ（インデックスされている等）であってもよ
い。The document file group 4b may be data in a special format (indexed or the like) obtained by processing documents collected and stored by another means in advance.

【００７１】また、検索部４（あるいは検索エンジン４
ａ）を複数用意して、複数のエンジンにクエリを渡す方
式であってもよい。The search unit 4 (or the search engine 4)
A method may be adopted in which a plurality of a) are prepared and a query is passed to a plurality of engines.

【００７２】更に、上記具体例１では、生成部１ｄから
翻訳途中のデータを受け渡しする形式になっているが、
変換部１ｃからデータを受け渡しして処理する形態であ
ってもよい。尚、この場合の変換部１ｃから訳語選択部
２への検索要求および訳語選択部２からの訳語選択結果
の基本的な動作は、上述した生成部１ｄからの場合と同
様である。即ち、このような構成では、変換部１ｃにお
いて、訳語を一つに絞り込んで生成部１ｄに渡すことに
なる。Further, in the specific example 1, the data in the middle of translation is transferred from the generating unit 1d.
The data may be transferred from the conversion unit 1c and processed. In this case, the basic operation of the search request from the conversion unit 1c to the translated word selection unit 2 and the translated word selection result from the translated word selection unit 2 are the same as those from the above-described generation unit 1d. That is, in such a configuration, the translation unit 1c narrows down the translated word to one and passes it to the generation unit 1d.

【００７３】〈効果〉以上のように具体例１によれば、
訳語選択部２および検索指示部３を設け、翻訳部１から
の複数の訳語候補を検索部４での検索結果を利用して一
つに絞り込むようにしたので、ネットワーク上に分散す
る文書データの統計情報を利用して翻訳品質を向上させ
ることができ、かつ、機械翻訳装置として訳語候補を絞
り込むための大量のデータを保持しておく必要がないと
いう効果がある。<Effects> According to the specific example 1 as described above,
A translation word selection unit 2 and a search instruction unit 3 are provided, and a plurality of translation word candidates from the translation unit 1 are narrowed down to one using the search result of the search unit 4. The translation quality can be improved by using the statistical information, and there is an effect that it is not necessary for the machine translation device to hold a large amount of data for narrowing down translation word candidates.

【００７４】《具体例２》〈構成〉図１１は、具体例２の構成図である。具体例２
の機械翻訳装置は、具体例１における翻訳部１、訳語選
択部２、検索指示部３、検索部４に加えて、判定結果検
索部５と、判定結果データベース６からなる。<< Example 2 >><Structure> FIG. 11 is a diagram showing the structure of Example 2. Example 2
The machine translation device includes a translation unit 1, a translated word selection unit 2, a retrieval instruction unit 3, and a retrieval unit 4 in Specific Example 1, a decision result retrieval unit 5, and a decision result database 6.

【００７５】判定結果検索部５は、対象語抽出部２ａか
ら候補語を受け取り、その候補語が判定結果データベー
ス６にあるかどうかを確認し、あればその結果を生成部
１ｄに送り、なければ候補語をクエリ生成部３ａに渡す
機能を有している。The judgment result search unit 5 receives the candidate word from the target word extraction unit 2a, checks whether the candidate word is in the judgment result database 6, and sends the result to the generation unit 1d if there is one. It has a function of passing candidate words to the query generation unit 3a.

【００７６】判定結果データベース６は、結果判定部２
ｃから判定結果を受け取り、データとして保持する機能
を有している。即ち、判定結果データベース６は、訳語
選択部２において過去に複数の訳語候補から一つに絞り
込まれた訳語を、その複数の訳語候補と関連付けて保持
するためのデータベースである。The judgment result database 6 stores the result judgment unit 2
c has a function of receiving a determination result from the terminal c and holding it as data. That is, the determination result database 6 is a database for holding the translated word previously narrowed down from the plurality of translated word candidates to one by the translated word selection unit 2 in association with the plurality of translated word candidates.

【００７７】他の各構成は具体例１と同様であるため、
その説明は省略する。Since the other components are the same as those in the first embodiment,
The description is omitted.

【００７８】〈動作〉翻訳部１全体の動作および検索部
４の動作については、既知の動作と同様であるため、こ
こでの説明は省略する。また、訳語選択部２および検索
指示部３の基本的な動作は具体例１と同様であるが、次
の点が具体例１とは異なっている。<Operation> The operation of the whole translation unit 1 and the operation of the retrieval unit 4 are the same as those of the known operation, and a description thereof will be omitted. The basic operations of the translation word selection unit 2 and the search instruction unit 3 are the same as those of the first embodiment, but the following points are different from the first embodiment.

【００７９】●結果判定部２ｃの結果は、生成部１ｄへ
送られるだけでなく、判定結果データベース６にも送ら
れて保持される。 ●対象語抽出部２ａから送られる候補語は、直接クエリ
生成部３ａに行くのではなく、判定結果検索部５を経由
する。The result of the result judging unit 2c is sent not only to the generating unit 1d but also sent to and held in the judgment result database 6. -The candidate words sent from the target word extraction unit 2a do not go directly to the query generation unit 3a, but pass through the determination result search unit 5.

【００８０】例えば、判定結果データベース６には次の
ようなデータ形式で保存されているとする。 hot/salty/hard/difficult/tough,work/labor/task/job
/business,hard,workFor example, it is assumed that the determination result database 6 is stored in the following data format. hot / salty / hard / difficult / tough, work / labor / task / job
/ business, hard, work

【００８１】この例は、“hot/salty/hard/difficult/t
ough”と“work/labor/task/job/business”の組み合わ
せがあったときに、“hard”と“work”が選択されたこ
とを示している。In this example, “hot / salty / hard / difficult / t
When there is a combination of “ough” and “work / labor / task / job / business”, it indicates that “hard” and “work” are selected.

【００８２】次に、判定結果検索部５の動作を説明す
る。図１２は、判定結果検索部５の動作フローチャート
である。先ず、対象語抽出部２ａから候補語を受け取る
（ステップＳ１２０１）。次に、そのデータが判定結果
データベース６に存在するか否かをチェックし（ステッ
プＳ１２０２）、存在すれば、判定結果データベース６
から引き出した判定結果を生成部１ｄに送付して（ステ
ップＳ１２０３）、一連の処理を終了する。Next, the operation of the determination result search section 5 will be described. FIG. 12 is an operation flowchart of the determination result search unit 5. First, a candidate word is received from the target word extraction unit 2a (step S1201). Next, it is checked whether or not the data exists in the determination result database 6 (step S1202).
Is sent to the generation unit 1d (step S1203), and a series of processing ends.

【００８３】ステップＳ１２０２で候補語のデータが判
定結果データベース６に存在しなかった場合、（この場
合は通常の処理を行わなければならないため）対象語抽
出部２ａから受け取った候補語を、そのままクエリ生成
部３ａに転送して（ステップＳ１２０４）、一連の処理
を終了する。If the candidate word data does not exist in the determination result database 6 in step S1202, the candidate word received from the target word extraction unit 2a is directly used as a query (since normal processing must be performed in this case). The data is transferred to the generation unit 3a (step S1204), and a series of processing ends.

【００８４】〈効果〉以上のように、具体例２によれ
ば、具体例１の構成に加えて、判定結果データベース６
と判定結果検索部５とを備え、判定結果データベース６
で過去の判定結果を保持し、訳語選択部２から検索要求
があり、その検索要求の訳語候補が判定結果データベー
ス６中の訳語候補と一致した場合はその訳語を検索結果
として返すようにしたので、具体例１の効果に加えて次
のような効果がある。<Effects> As described above, according to the specific example 2, in addition to the configuration of the specific example 1, the determination result database 6
And a determination result search unit 5, and a determination result database 6
Holds the past judgment result, and when there is a search request from the translation word selection unit 2 and the translation word candidate of the search request matches the translation word candidate in the judgment result database 6, the translation word is returned as the search result. There are the following effects in addition to the effects of the first embodiment.

【００８５】一度、判定を行った結果を再利用すること
ができるため、ネットワークに余計な負荷をかけること
なく、効率よく翻訳品質を向上させることができる。ま
た、判定結果データベース６に保持するのは、結果判定
部２ｃで判定されたデータのみであるため、判定結果デ
ータベース６の記憶容量もさほど大容量でなくても実現
が可能であるという効果がある。Since the result of the determination once can be reused, the translation quality can be efficiently improved without imposing an extra load on the network. Also, since only the data determined by the result determination unit 2c is stored in the determination result database 6, there is an effect that the determination result database 6 can be realized even if the storage capacity is not so large. .

【００８６】《具体例３》〈構成〉図１３は、具体例３の構成図である。具体例３
の機械翻訳装置は、具体例１における翻訳部１、訳語選
択部２、検索指示部３、検索部４に加えて、検索結果デ
ータベース７からなる。<< Embodiment 3 >><Structure> FIG. Example 3
The machine translation apparatus includes a search result database 7 in addition to the translation unit 1, the translation word selection unit 2, the search instruction unit 3, and the search unit 4 in the specific example 1.

【００８７】検索結果データベース７は、アクセス制御
部３ｃから検索結果を受け取り、データとして保持して
おく機能を有している。即ち、アクセス制御部３ｃから
検索結果として検索結果保持部２ｂに出力するデータを
保持するデータベースである。アクセス制御部３ｃは、
クエリ生成部３ａからクエリを受け取った場合、先ず、
検索結果データベース７をアクセスし、検索結果データ
ベース７中に一致するクエリがあるかを調べ、あった場
合は、そのクエリの検索結果を訳語選択部２の検索結果
保持部２ｂに出力するよう構成されている。The search result database 7 has a function of receiving a search result from the access control unit 3c and holding it as data. That is, it is a database that holds data output from the access control unit 3c to the search result holding unit 2b as a search result. The access control unit 3c
When a query is received from the query generation unit 3a, first,
The search result database 7 is accessed to check whether there is a matching query in the search result database 7, and if there is, the search result of the query is output to the search result holding unit 2 b of the translation word selecting unit 2. ing.

【００８８】他の各構成は具体例１と同様であるため、
ここでの説明を省略する。The other components are the same as those in the first embodiment.
The description here is omitted.

【００８９】〈動作〉翻訳部１全体の動作および検索部
４の動作については、既知の動作と同様であるため、こ
こでの説明を省略する。また、訳語選択部２および検索
指示部３の基本的な動作は具体例１と同様であるため、
その説明は省略する。但し、アクセス制御部３ｃの動作
のみ具体例１と異なっているため、その動作を説明す
る。<Operation> The operation of the whole translation unit 1 and the operation of the search unit 4 are the same as those of the known operation, and a description thereof will be omitted. Also, the basic operations of the translation word selection unit 2 and the search instruction unit 3 are the same as those in the first embodiment,
The description is omitted. However, only the operation of the access control unit 3c is different from that of the specific example 1, and the operation will be described.

【００９０】図１４は、具体例３におけるアクセス制御
部３ｃの動作フローチャートである。先ず、スタックか
らクエリを一つ取り出す（ステップＳ１４０１）。尚、
この処理は具体例１におけるステップＳ７０１と同様で
ある。FIG. 14 is an operation flowchart of the access control unit 3c in the third embodiment. First, one query is taken out of the stack (step S1401). still,
This process is the same as step S701 in the first embodiment.

【００９１】次に、検索結果データベース７に当該のク
エリ（と結果）が存在するかどうかを調べる（ステップ
Ｓ１４０２）。存在する場合は、検索部４の検索エンジ
ン４ａにはアクセスせず、検索結果データベース７から
引き出した検索結果を検索結果保持部２ｂに送付して
（ステップＳ１４０５）、ステップＳ１４０１に戻り、
次のクエリを取り出す。Next, it is checked whether or not the query (and the result) exists in the search result database 7 (step S1402). If there is, the search engine 4a of the search unit 4 is not accessed, and the search result extracted from the search result database 7 is sent to the search result holding unit 2b (step S1405), and the process returns to step S1401.
Retrieve the next query.

【００９２】ステップＳ１４０２で、検索結果データベ
ース７に存在しない場合は、検索エンジン４ａの負荷を
チェックする（ステップＳ１４０３）。尚、この処理は
具体例１におけるステップＳ７０２と同様である。If it does not exist in the search result database 7 in step S1402, the load on the search engine 4a is checked (step S1403). This processing is the same as step S702 in the first embodiment.

【００９３】ステップＳ１４０３で、負荷が高ければ、
負荷が低くなるまでステップＳ１４０３を繰り返す。一
方、負荷が高くない場合は、データ検索ルーチンにクエ
リを渡してそのクエリに関する処理を委ね（ステップＳ
１４０４）、ステップＳ１４０１に戻って処理を繰り返
す。尚、ステップＳ１４０４の処理は具体例１のステッ
プＳ７０３にほぼ等しいが、ルーチンの内容が若干異な
る。At step S1403, if the load is high,
Step S1403 is repeated until the load decreases. On the other hand, if the load is not high, the query is passed to the data search routine and the processing related to the query is entrusted (step S).
1404), returning to step S1401 and repeating the processing. Note that the processing in step S1404 is almost the same as step S703 in the first embodiment, but the contents of the routine are slightly different.

【００９４】図１５は、上記ステップＳ１４０４のデー
タ検索ルーチンの動作フローチャートである。ステップ
Ｓ１５０１からステップＳ１５０４までの動作は、図８
のステップＳ８０１からステップＳ８０３と同様である
ため、その説明は省略する。FIG. 15 is a flowchart showing the operation of the data search routine in step S1404. The operations from step S1501 to step S1504 are the same as those in FIG.
Since steps S801 to S803 are the same as those in step S801, description thereof is omitted.

【００９５】アクセス制御部３ｃは、ステップＳ１５０
４で検索結果保持部２ｂに結果を送付した後、検索結果
データベース７にクエリと結果を格納して（ステップＳ
１５０５）、一連の処理を終了する。検索結果データベ
ース７に格納されるデータは、クエリ（のベースになる
文字列）と検索エンジン４ａから返ってきた数値（クエ
リのマッチした句の数あるいはページ数）からなる。The access control unit 3c determines in step S150
After sending the result to the search result holding unit 2b in step 4, the query and the result are stored in the search result database 7 (step S4).
1505), a series of processing ends. The data stored in the search result database 7 includes (a character string serving as a base for) the query and a numerical value (the number of phrases or pages matching the query) returned from the search engine 4a.

【００９６】〈効果〉以上のように具体例３によれば、
検索部４での検索結果を保持する検索結果データベース
７を設けたので、１度検索を行った結果を再利用するこ
とができるため、具体例２と同様にネットワークに余計
な負荷をかけることなく、効率よく翻訳品質を向上させ
ることができる。また、機械翻訳装置として保持するの
は検索結果であるため、例えば、結果判定部２ｃにおい
て訳語選択の判定基準を変えたいといった場合であって
も、検索結果を有効に利用することができる。<Effects> As described above, according to the third embodiment,
Since the search result database 7 that holds the search results in the search unit 4 is provided, the result of the search performed once can be reused, so that no extra load is applied to the network as in the specific example 2. The translation quality can be efficiently improved. Further, since the search result is stored as the machine translation device, the search result can be effectively used even when, for example, the result determination unit 2c wants to change the criterion for selecting the translated word.

【００９７】《利用形態》上記各具体例では、原言語を
日本語、目的言語を英語とした場合を説明したが、逆で
あってもよい。更に、自然言語は、英語や日本語だけで
なく他の言語であってもよい。<< Usage Form >> In each of the above specific examples, the case where the source language is Japanese and the target language is English has been described. Further, the natural language may be not only English or Japanese but also other languages.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の機械翻訳装置の具体例１を示す構成図
である。FIG. 1 is a configuration diagram showing a specific example 1 of a machine translation device of the present invention.

【図２】係り受け関係が明らかな形式の入力例を示す説
明図である。FIG. 2 is an explanatory diagram showing an input example of a format in which a dependency relationship is clear;

【図３】対象語抽出部の動作フローチャートである。FIG. 3 is an operation flowchart of a target word extraction unit.

【図４】クエリ生成部の動作を示すフローチャートであ
る。FIG. 4 is a flowchart illustrating an operation of a query generation unit.

【図５】セット語・組み合わせ語のリストの説明図であ
る。FIG. 5 is an explanatory diagram of a list of set words and combination words.

【図６】クエリ用リストの説明図である。FIG. 6 is an explanatory diagram of a query list.

【図７】アクセス制御部の動作フローチャートである。FIG. 7 is an operation flowchart of an access control unit.

【図８】データ検索ルーチンの動作フローチャートであ
る。FIG. 8 is an operation flowchart of a data search routine.

【図９】検索結果保持部および結果判定部の動作フロー
チャートである。FIG. 9 is an operation flowchart of a search result holding unit and a result determination unit.

【図１０】検索結果保持テーブルの完成例を示す説明図
である。FIG. 10 is an explanatory diagram showing a completed example of a search result holding table.

【図１１】本発明の機械翻訳装置の具体例２の構成図で
ある。FIG. 11 is a configuration diagram of a specific example 2 of the machine translation device of the present invention.

【図１２】判定結果検索部の動作フローチャートであ
る。FIG. 12 is an operation flowchart of a determination result search unit.

【図１３】本発明の機械翻訳装置の具体例３の構成図で
ある。FIG. 13 is a configuration diagram of Example 3 of the machine translation device of the present invention.

【図１４】具体例３におけるアクセス制御部の動作フロ
ーチャートである。FIG. 14 is an operation flowchart of an access control unit in a specific example 3.

【図１５】データ検索ルーチンの動作フローチャートで
ある。FIG. 15 is an operation flowchart of a data search routine.

【符号の説明】[Explanation of symbols]

１翻訳部１ｄ生成部２訳語選択部２ａ対象語抽出部２ｂ検索結果保持部２ｃ結果判定部３検索指示部３ａクエリ生成部３ｂ，３ｃアクセス制御部４検索部 Reference Signs List 1 translation unit 1d generation unit 2 translated word selection unit 2a target word extraction unit 2b search result holding unit 2c result determination unit 3 search instruction unit 3a query generation unit 3b, 3c access control unit 4 search unit

Claims

【特許請求の範囲】[Claims]

【請求項１】原言語文を目的言語文に翻訳する際、複
数の語に対する複数の訳語候補を出力する翻訳部と、前記翻訳部から前記複数の訳語候補を受け取った場合
は、当該複数の訳語候補による検索要求を行い、その検
索要求に対する検索結果を受け取った場合は、当該検索
結果に基づき前記複数の訳語候補を一つに絞り込んで前
記翻訳部に出力する訳語選択部と、前記訳語選択部から前記検索要求を受け取った場合は、
前記複数の訳語候補に基づいてクエリを生成して当該ク
エリによる検索指示を出力し、その検索指示に対する検
索結果を受け取った場合は、これを前記訳語選択部に検
索結果として出力する検索指示部と、前記検索指示部より検索指示を受けてネットワーク上に
存在する文書ファイル群にアクセスし、当該クエリに基
づく検索結果を取得し、これを検索指示部に出力する検
索部とを備えたことを特徴とする機械翻訳装置。When translating a source language sentence into a target language sentence, a translation unit that outputs a plurality of translation word candidates for a plurality of words, and, when the plurality of translation word candidates are received from the translation unit, the plurality of translation word candidates. A search request based on the translation word candidate, and when a search result for the search request is received, a translation word selection unit that narrows down the plurality of translation word candidates to one based on the search result and outputs the narrowed translation word candidate to the translation unit; If the search request is received from the department,
A search instruction unit that generates a query based on the plurality of translation word candidates, outputs a search instruction based on the query, and, when receiving a search result for the search instruction, outputs the search result to the translation word selection unit as a search result. A search unit that receives a search instruction from the search instruction unit, accesses a document file group existing on a network, acquires a search result based on the query, and outputs the result to the search instruction unit. Machine translation device.

【請求項２】請求項１に記載の機械翻訳装置におい
て、検索指示部は、それぞれの語の訳語候補を組み合わせてクエリを生成す
るクエリ生成部と、検索部が行う検索処理の負荷状況を判断して、当該検索
部に検索要求を行うアクセス制御部とからなることを特
徴とする機械翻訳装置。2. The machine translation device according to claim 1, wherein the search instruction unit determines a load state of a search process performed by the search unit, the query generation unit generating a query by combining the translation candidates of each word. A machine translation device comprising: an access control unit for making a search request to the search unit.

【請求項３】請求項１または２に記載の機械翻訳装置
において、訳語選択部は、複数の訳語候補から絞り込むために、複数の語において
どの語との組み合わせを判断基準とするかを決定し、こ
れらの語の複数の訳語候補を検索指示部に渡す対象語抽
出部と、前記対象語抽出部が渡した語による前記検索指示部から
の全ての検索結果が揃うまで当該検索結果を保持してお
き、揃った時点でデータを出力する検索結果保持部と、前記検索結果保持部よりデータを受け取った場合、予め
決められた判定基準を適用し、複数の訳語候補から絞り
込みを行う結果判定部とからなることを特徴とする機械
翻訳装置。3. The machine translation device according to claim 1, wherein the translation word selection unit determines which combination of the plurality of words is to be used as a criterion in order to narrow down the plurality of translation word candidates. A target word extraction unit that passes a plurality of translation word candidates of these words to a search instruction unit, and holds the search results until all search results from the search instruction unit with the words passed by the target word extraction unit are collected. In addition, a search result holding unit that outputs data at the same time, and a result determination unit that, when receiving data from the search result holding unit, applies a predetermined determination criterion and narrows down from a plurality of translation word candidates. A machine translation device, comprising:

【請求項４】原言語文を目的言語文に翻訳する際、複
数の語に対する複数の訳語候補を出力する翻訳部と、前記翻訳部から前記複数の訳語候補を受け取った場合
は、当該複数の訳語候補による検索要求を行い、その検
索要求に対する検索結果を受け取った場合は、当該検索
結果に基づき前記複数の訳語候補を一つに絞り込んで前
記翻訳部に出力する訳語選択部と、前記訳語選択部において過去に前記複数の訳語候補から
一つに絞り込まれた訳語を、当該複数の訳語候補と関連
付けて保持する判定結果データベースと、前記訳語選択部から前記検索要求を受け取った場合は、
当該検索要求の複数の訳語候補が前記判定結果データベ
ースに存在するかを判定し、存在した場合は、その複数
の訳語候補から絞り込まれた訳語を検索結果として前記
翻訳部に出力すると共に、存在しない場合は前記複数の
訳語候補による検索要求を行う判定結果検索部と、前記判定結果検索部から検索要求を受け取った場合は、
前記複数の訳語候補に基づいてクエリを生成して当該ク
エリによる検索指示を出力し、その検索指示に対する検
索結果を受け取った場合は、これを前記訳語選択部に検
索結果として出力する検索指示部と、前記検索指示部より検索指示を受けてネットワーク上に
存在する文書ファイル群にアクセスし、当該クエリに基
づく検索結果を取得し、これを検索指示部に出力する検
索部とを備えたことを特徴とする機械翻訳装置。4. When translating a source language sentence into a target language sentence, a translation unit that outputs a plurality of translation word candidates for a plurality of words, and, when the plurality of translation word candidates are received from the translation unit, the plurality of translation word candidates. A search request based on the translation word candidate, and when a search result for the search request is received, a translation word selection unit that narrows down the plurality of translation word candidates to one based on the search result and outputs the narrowed translation word candidate to the translation unit; In the past, the translation word narrowed down to one from the plurality of translation word candidates in the past, a determination result database that holds in association with the plurality of translation word candidates, and when the search request is received from the translation word selection unit,
It is determined whether or not a plurality of translation word candidates of the search request exist in the determination result database, and if there is, a translation word narrowed down from the plurality of translation word candidates is output to the translation unit as a search result and does not exist. In the case, a judgment result search unit for making a search request by the plurality of translation word candidates, and when a search request is received from the judgment result search unit,
A search instruction unit that generates a query based on the plurality of translation word candidates, outputs a search instruction based on the query, and, when receiving a search result for the search instruction, outputs the search result to the translation word selection unit as a search result. A search unit that receives a search instruction from the search instruction unit, accesses a document file group existing on a network, acquires a search result based on the query, and outputs the result to the search instruction unit. Machine translation device.

【請求項５】原言語文を目的言語文に翻訳する際、複
数の語に対する複数の訳語候補を出力する翻訳部と、前記翻訳部から前記複数の訳語候補を受け取った場合
は、当該複数の訳語候補による検索要求を行い、その検
索要求に対する検索結果を受け取った場合は、当該検索
結果に基づき訳語候補を一つに絞り込んで前記翻訳部に
出力する訳語選択部と、前記訳語選択部から前記検索要求を受け取った場合は、
前記複数の訳語候補に基づいてクエリを生成して当該ク
エリによる検索指示を出力し、その検索指示に対する検
索結果を受け取った場合は、これを前記訳語選択部に検
索結果として出力する検索指示部と、前記訳語選択部に出力した過去の検索結果を保持する検
索結果データベースと、前記検索指示部より検索指示を受けてネットワーク上に
存在する文書ファイル群にアクセスし、当該クエリに基
づく検索結果を取得し、これを検索指示部に出力する検
索部とを備え、前記検索指示部は、前記訳語選択部から検索要求を受け
取った場合、先ず、前記検索結果データベースをアクセ
スし、当該検索結果データベース中に前記検索要求に一
致するクエリがあるかを調べ、あった場合は、そのクエ
リの検索結果を前記訳語選択部に出力するよう構成され
ていることを特徴とする機械翻訳装置。5. When translating a source language sentence into a target language sentence, a translation unit that outputs a plurality of translation word candidates for a plurality of words, and, when the plurality of translation word candidates are received from the translation unit, the plurality of translation word candidates. When performing a search request based on the translation word candidate and receiving a search result corresponding to the search request, a translation word selection unit that narrows down the translation word candidates to one based on the search result and outputs it to the translation unit, If you receive a search request,
A search instruction unit that generates a query based on the plurality of translation word candidates, outputs a search instruction based on the query, and, when receiving a search result for the search instruction, outputs the search result to the translation word selection unit as a search result. A search result database holding past search results output to the translated word selection unit, and a document file group existing on a network in response to a search instruction from the search instruction unit, and obtaining a search result based on the query And a search unit for outputting the search result to a search instruction unit. When the search instruction unit receives a search request from the translation word selection unit, the search instruction unit first accesses the search result database, and in the search result database, A search is performed to determine whether there is a query that matches the search request, and if so, a search result of the query is output to the translation selection unit. Machine translation apparatus characterized by being.