JP3313810B2

JP3313810B2 - Aspect processing device

Info

Publication number: JP3313810B2
Application number: JP09295393A
Authority: JP
Inventors: 俊桔郭
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1993-04-20
Filing date: 1993-04-20
Publication date: 2002-08-12
Anticipated expiration: 2017-08-12
Also published as: JPH06309352A; TW231351B

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、多国言語翻訳装置及び
方法に関し、特にアスペクトの多義性を解決し、適用可
能なアスペクト処理方法及びその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a multilingual translation apparatus and method, and more particularly to an aspect processing method and apparatus capable of solving the ambiguity of aspects and applicable.

【０００２】[0002]

【従来の技術】本発明に関する技術や技術上の問題点に
ついては、あまり広くは知られていなため、本発明に直
接関係する従来の技術を説明するに先立ち、本発明に間
接的に関係する部分をも含めて、機械翻訳そのものにつ
きその概略の説明をする。まず、ここにいう機械翻訳と
は、電子計算機という電気、電子的なもの、そしてハー
ドでなくソフト的なものをもその構成要素とする自動翻
訳機による自動翻訳という概念である。2. Description of the Related Art The techniques and technical problems relating to the present invention are not widely known, so prior to describing the prior art directly related to the present invention, the techniques related to the present invention are indirectly related to the present invention. The outline of machine translation itself, including its parts, is explained. First, the machine translation referred to here is a concept of automatic translation by an electronic translator, which is an electronic computer, an electronic translator, and an automatic translator that includes not only hardware but also software.

【０００３】そして、我国（日本）におけるこの技術に
ついての文献としては、例えば１．「機械翻訳」、共立出版、野村浩郷、田中穂積著２．「機械翻訳」、オーム社、電子情報通信学会編牧
野武則著３．「情報処理学会自然言語研究会資料63-2,61-2 」
（アスペクト処理方面）４．「日本語動詞のアスペクト」、むぎ書房、金田一春
彦著５．「機械翻訳サミット」、オーム社、長真真他編集がある。[0003] References to this technology in Japan (Japan) include, for example, 1. "Machine Translation", Kyoritsu Shuppan, Hirogo Nomura, Hozumi Tanaka 2. "Machine Translation", edited by Ohmsha, IEICE Takenori Makino 3. "Information Processing Society of Japan, Natural Language Study Group Material 63-2,61-2"
(For aspect processing) 4. Aspects of Japanese Verbs, Mugi Shobo, Kazuharu Kaneda "Machine Translation Summit", edited by Ohmsha, Makoto Nagashima and others.

【０００４】さて、機械翻訳の一般的方法であるが、以
下のようなものである。基本的には「直接」、「変
換」、「中間言語」等の方式に分けられる。このうち、
「中間言語」方式は目下実験中であり、「直接」方式で
は意味、構文が似る言葉（例えば、同じアルタイ語族の
中でも、環日本海言語を構成するとされる日本語と韓国
語）の間の翻訳だけに応じられる状態であり、一般には
「変換」方式による。この「変換」方式を採用している
機械翻訳システムでは、その内容は大よそ解析、変換及
び生成の三段階に分けられる。解析とは、他の言語への
翻訳の対象となる原始言語の意味及び構文構造について
その内容を解析することである。変換とは、自然言語間
のギャップを調整するものである。生成とは、目的言語
の構文構造に基づき訳文（目的言語）を作ることであ
る。A general method of machine translation is as follows. Basically, it is classified into methods such as "direct", "conversion", and "intermediate language". this house,
The "intermediate language" method is currently under experimentation, and the "direct" method has a similar meaning and syntax (for example, Japanese and Korean, which are considered to constitute the Pan-Japan Sea language even in the same Altai language). This is a state where only translation can be performed, and is generally based on a “conversion” method. In a machine translation system that employs this "conversion" method, the contents are roughly divided into three stages: analysis, conversion, and generation. The analysis is to analyze the contents of the meaning and syntactic structure of the source language to be translated into another language. Transformation adjusts the gap between natural languages. Generation refers to creating a translated sentence (target language) based on the syntax structure of the target language.

【０００５】さて解析処理において、英語や、ラテン語
等のヨーロッパ語族ではスペースにより単語と単語を区
別可能であるため、単語の識別についての問題は少な
い。この一方、たとえば、中国語族、アルタイ語族など
の単語間にこれを区別する記号や空所がない、いわゆる
べた書きをなす言語では、単漢字か単語かの語の切り出
し（形態素解析）が大切な課題となる。このため普通は
これらのべた書き言語に対して最長一致法を採用し、形
態素ごとに切り出し、この上で解析する。その際の主語
述語の認識は、各言語とも名詞は述語にならず、動詞は
主語にならず、更に各言語とも例えば主語になりえる単
語は「Ｉｃｈ」、「Ｄｕ」、「Ｉ」、「他」、「私」等
おおよそ定まっていること、また、主語や述語の文中で
の位置は各言語によっておおよそ定まっていること、更
にまた日本語においては「は」、「が」等の特定の助詞
がつくことが多いことや、特に会話（話し言葉）を文章
化した台詞等の短い文章では主語が省略されることがあ
ること等各言語固有の特徴も利用される。結局、文章の
主語、述語の認識を含む文章の構造、文章の示す時や動
作の内容に注意しつつ辞書を引き双方の言語の単語を対
応させ、その際双方の言語固有の特徴にも注意を払うと
いう人が行う翻訳作業と基本的には異ならない。In the analysis process, words can be distinguished from each other by spaces in European languages such as English and Latin, so that there is little problem with word identification. On the other hand, for example, in a so-called solid writing language where there is no symbol or space between words such as Chinese or Altai, it is important to extract words (single kanji or words) (morphological analysis). Will be an issue. For this reason, the longest match method is usually adopted for these solid-state languages, cut out for each morpheme, and then analyzed. In this case, the recognition of the subject predicate is such that nouns do not become predicates in any language, verbs do not become the subject, and words that can be the subject in each language are, for example, "Ich", "Du", "I", " Others, "I", etc. are roughly fixed, and the position of the subject or predicate in the sentence is roughly fixed by each language. In Japanese, "ha", "ga", etc. The features unique to each language are also used, such as the fact that particles are often attached, and that the subject is sometimes omitted in short sentences such as dialogue in which conversation (spoken language) is written. In the end, look up the dictionary while paying attention to the subject of the sentence, the structure of the sentence including the recognition of the predicate, the time when the sentence is shown and the contents of the action, and match the words in both languages, paying attention to the characteristics unique to both languages It is not fundamentally different from the translation work performed by people who pay

【０００６】なお、ここに「自然言語」とは、スワヒリ
語、アラブ語、マレー語、ドイツ語、日本語、中国語等
人々が通常若しくはその日常の生活に現実にそして普通
に使用している言語をいい、記号論理学で使用されてい
るメタ言語、電子計算機で使用されるプログラム言語や
機械語（マシン語）はもとより、エスペラント、ヴォラ
ピック、イド等の人工的な国際補助語は包含されない。[0006] Here, "natural language" is used by people such as Swahili, Arabic, Malay, German, Japanese, Chinese and the like in a normal or daily life. A language, which does not include meta-languages used in logistic logic, programming languages and machine languages (machine languages) used in computers, and artificial international auxiliary words such as Esperanto, Volapic, and Id.

【０００７】また、「最長一致法」とは、あらかじめ辞
書に格納されている単語と文章中の文字とを比較して文
章中の単語を切り出すものであるが、この場合第１に音
節若しくは字数等の多い単語を優先し、第２に前方にあ
る単語を優先するものである。従って、中国語ワードプ
ロセッサー、日本語ワードプロセッサー等における入力
された音節文字や音素記号からの漢字変換に採用されて
いる技術である。ただし、これらのいわば基本的な技術
は既に各方面で実用化若しくは試用され、現に本願出願
人の成果も前掲の「機械翻訳サミット」１９５頁に紹介
されている状態であるため、これ以上の説明は省略す
る。[0007] The "longest match method" is to compare words stored in a dictionary in advance with characters in a sentence to cut out words in the sentence. The second is to give priority to a word having a large number of words, and second to giving priority to a word located ahead. Therefore, this is a technique employed for converting Chinese characters from syllable characters or phoneme symbols input in a Chinese word processor, a Japanese word processor, or the like. However, these so-called basic technologies have already been put to practical use or used in various fields, and the results of the applicant of the present application are already introduced in the above-mentioned “Machine Translation Summit” on page 195. Is omitted.

【０００８】ところで、機械翻訳には多くの問題点があ
る。例えば、一番基本となる辞書の作成一つとっても日
本語、英語、中国語を比較した場合に、各国語特有の活
用、用語法、多義性（言葉をかえて言うならば不確定
性）等があり、このため各言語の単語が一対一に対応し
ていない。固有の特徴として日本語では敬語が特に有名
である。もちやん、ドイツ語における「Ｄｕ」と「Ｓｉ
ｅ」の使い分け、英語における「ｓｉｒ」等各国語とも
多少はあるが、日本語は特にはなはだしい。これを後に
本発明の対象とするアスペクトの説明に使用する「死」
という語を例にとって示すならば、人が死んだ場合、天
皇なら「崩御」、皇族や貴族なら「薨去」、その他の人
なら「逝去」であり、身内の者のことを言うときには
「死去」等である。また、助詞や助動詞は敬語に伴って
使用が変化するが、これは英語の前置詞、中国語の介詞
等にはないか、あってもわずかである。By the way, there are many problems in machine translation. For example, even if one of the most basic dictionaries is created, when comparing Japanese, English, and Chinese, the unique use of each language, terminology, polysemy (uncertainty if you change words), etc. Therefore, words in each language do not correspond one-to-one. Honorifics are particularly famous in Japanese as a unique feature. Mochiyan, "Du" and "Si" in German
Although there are some differences between the use of "e" and various languages such as "sir" in English, Japanese is particularly remarkable. This will be used later in the description of the aspect covered by the present invention.
For example, if a person dies, the emperor is `` death '', the royal family and nobles are `` death '', and others are `` death '', and `` death '' when talking about relatives And so on. In addition, the use of particles and auxiliary verbs changes with honorifics, but this is not found in English prepositions, Chinese interjections, etc., or is very little.

【０００９】次に、各言語は多少の多義性若しくは不確
定性あるいは不正確性を有している。日本語の例でいう
なら、人はいったん死んだら絶対に生き返らないため、
「死」という語は時間的には過去の事象、動作のみあら
わすはずであるが、「彼はとうとう死んでしまっ
た。」、「彼はとっくに死んでしまった。」、「彼は今
死んだ。」等、完了、過去完了、現在的な使用方法もな
される。また、動作のみならず「死んでいる。」という
過去の動作の結果としての現在の状態をも表す。更に、
「九死に一生を得る」、「瀕死の重傷」、「決死隊」、
「死語」等における「死」は「人の死」とは別の意味を
有するようになっている。例えば「死語」の「死」は、
「現在使用されていない、何等の意味を有さない若しく
は何の役にも立たない」という意味である。これが「来
る」、「行く」等の取り消しや逆動作やその一時中止の
動作が可能、また動作そのものに継続性がある単語にな
ると、更に複雑となる。更にまた、各言語とも多少とも
非論理性を有する。日本語では「水を沸かす」と言わず
にその結果たる「湯」を使用して「湯を沸かす」と言う
等である。これらの特異性、非論理性の存在は中国語、
英語でも同様である。品詞一つとっても、英語の前置詞
や冠詞の存在、名詞や動詞の単数、複数の使いわけ、中
国語の前置詞、後置詞、否定詞、指定詞の存在、それ以
前に多数の漢字の存在、また各国語固有の活用やその不
規則性等である。しかも、かかる不規則性は各言語とも
日常使用される単語に多い。Second, each language has some ambiguity, uncertainty or inaccuracy. In the case of Japanese, people never die once they die,
The term "death" should only refer to past events and actions in time, but "he has finally died.", "He has long died.", "He has died now." ., Etc., completed, past completed, and current usage. It also represents the current state as a result of a past operation such as "dead" as well as the operation. Furthermore,
"Get a lifetime at nine deaths", "Death serious injury", "Death squad",
“Death” in “dead language” and the like has a different meaning from “death of a person”. For example, "death" of "dead language"
It means "not currently used, meaningless or useless." This becomes even more complicated when it becomes possible to cancel or reverse the operation of "coming" or "going", or to suspend the operation, or to make the operation itself continuous. Furthermore, each language has some illogicality. In Japanese, instead of saying "boil water", use the resulting "hot water" and say "boil water." The existence of these peculiarities and illogicalities is Chinese,
The same goes for English. For a part of speech, the presence of English prepositions and articles, singular and plural nouns and verbs, the presence of Chinese prepositions, postpositions, negatives, and nouns, the presence of many Chinese characters before that, It is the use unique to each language and its irregularity. Moreover, such irregularities are common in words used everyday in each language.

【００１０】従って、現在のところこれらを完全に反映
した上で機械翻訳をなすのは困難であり、これらの困難
性のうち重要な点の解決を図ることによりいかに経済的
に質のよい翻訳文を得られるようにするか、あるいはい
かに経済的に多国言語の翻訳に使用しえるか等が技術的
課題となっており、また質のよい翻訳文を得るために
は、最終的には人が見直し、チェックするのが実情であ
る。Therefore, at present, it is difficult to perform machine translation while completely reflecting these, and by solving the important points among these difficulties, it is possible to obtain an economically high quality translated text. Is a technical issue, such as how to obtain such translations or how it can be used economically for translating into multiple languages. It is the actual situation to review and check.

【００１１】従って、また本明細書で後に示す本発明の
技術的特徴を説明するために例示若しくは使用する中国
語、英語、日本語の翻訳文若しくは対照文も厳密には不
正確性若しくは人が通常使用しない用語法がありえる点
は御了解願いたい。以上のもとで、本発明が直接関係す
るアスペクト（aspect）についての従来の技術について
説明する。[0011] Accordingly, the Chinese, English, and Japanese translations or contrasts illustrated or used to explain the technical features of the present invention, which will be described later in this specification, are also strictly inaccurate or subject to inaccuracies. Please understand that there may be terminology that is not usually used. Based on the above, a description will be given of a conventional technique regarding an aspect directly related to the present invention.

【００１２】本発明は、アスペクト（aspect）処理の方
法そしてそのための装置に関するものである。以下、ア
スペクトとこれに密接関係するテンス（tense)について
説明する。そして、これは各国語に共通し、また各国語
が使用されるに際して重要な意味内容を有し、また役割
を担うため機械翻訳でも重要視されているものである。The present invention relates to a method of aspect processing and an apparatus therefor. Hereinafter, the aspect and the tense closely related thereto will be described. This is common to each language, has important meanings when each language is used, and is regarded as important in machine translation because it plays a role.

【００１３】自然言語においては、事件が行われる際の
主語、主体、事件とその述語、動作等との時間関係を表
示する方法には、テンス（tense)とアスペクト（aspct)
の二つがある。テンスとは、事件が発生する時間と話す
時間との相対関係を指すものである。例えば、現在形、
過去形、未来形などがある。テンスは、語族や言語毎に
異なる手段で表わされる。例えば、下記の例に示すよう
に、英語と日本語は主に動詞の語尾の変化により、中国
語では、「昨日」という時間を表す副詞とされる「昨
日」のみでテンスを表わす。In a natural language, a method of displaying a time relationship between a subject, a subject, a case and its predicates, actions, and the like when the case is performed includes a tense and an aspect (aspct).
There are two. Tense refers to the relative relationship between the time at which an incident occurs and the time at which it is spoken. For example, the present tense,
There are past and future forms. Tense is represented by different means for each family and language. For example, as shown in the following example, in English and Japanese, the tense is represented only by “advertisement” which is an adverb indicating the time “yes yesterday” in Chinese mainly due to the change in the ending of the verb.

【００１４】なお、ここに動詞とは、本発明に係るアス
ペクト処理という面からいった場合には、事件、主体、
主語等の動作や発生をあらわす単語という意味であり、
言語学、国語学あるいは英語学等上の動詞とは必ずしも
一致しない場合がある。また、同じく副詞とは、「alre
ady 」、「もう」、「まだ」等同じく動詞をアスペクト
の面から修飾する、あるいは事件等の生じた時間やその
事件の継続状態を解説する単語という意味であり、これ
また必ずしも言語学等上の副詞とは一致しない場合があ
る。なお、このことは英語における日本語にはない助動
詞の存在等各国語の品詞の内容の相違、英語で動詞「ha
ve（持つ）」が本来の持つという動作の他に現在完了を
表す記号的に使用される等各国語固有の使用方法、「デ
ートする」等英語や日本語で名詞が動詞的に使用される
こと、各国語とも「昨日」等名詞が動作のなされた時間
をあらわすため副詞的な作用を有すること等のためある
程度さけられないものである。Here, the verb means the case, the subject, and the subject in the aspect of the aspect processing according to the present invention.
It means a word that expresses the action or occurrence of the subject, etc.
Verbs in linguistics, linguistics or English may not always match. Also, the adverb is "alre
ady "," already "," yet ", etc. are words that modify the verb in terms of aspects, or that describe the time when the incident occurred or the continuation of the incident. May not match the adverb of. This is due to differences in the content of parts of speech in various languages, such as the presence of auxiliary verbs not found in Japanese in English, and the verb "ha
In addition to the action that "ve (has)" has, it is used symbolically to indicate the current completion, such as the unique usage of each language, such as "date", the noun is used verbally in English and Japanese such as "date" In each language, the noun such as “Yesterday” cannot be avoided to some extent because it has an adverbial action to indicate the time at which the action was performed.

【００１５】以下に、日本語、中国語、英語でこのテン
スを具体的に示す。（例１）過去形の表現日本語：彼は昨日学校へ行った。中国語：他昨日去学校。英語：Ｈe went to school yesterday. 次に、アスペクトについて説明する。Hereinafter, this tense will be specifically shown in Japanese, Chinese, and English. (Example 1) Past tense expression Japanese: He went to school yesterday. Chinese: I left school yesterday. English: He went to school yesterday. Next, I will explain aspects.

【００１６】アスペクトとは、事件、主体、主語の発生
時間、動作時間に基づいて、その進行の如何、状況を表
わすものである。例えば、完了、継続、反復などがあ
る。一般的に、自然言語は独自のアスペクト記号及び関
連共起（co-occurred 、動詞、アスペクト記号以外でア
スペクトの多義性を解決するのに役立つもの。）、形態
素（morphology、中国語では複数の漢字の組み合わせで
まとまった意味を持ちうる最小の単位。英語、日本語で
は単語に相当する。例えば、副詞「昨日」、「yesterda
y 」等）によりアスペクトを表わす。但し、アスペクト
記号はその意味に以下のような多義性があり、ひいては
不確定性が生じるもととなる。このため、処理対象の文
のアスペクト意味を正確に解析するためには、文中のア
スペクト意味に影響を及ぼす他の要素をも同時に考える
必要がある。なお、本明細書においては、一つの文のア
スペクトを、文アスペクト（sentential aspect)と定義
する。Aspects represent the progress and the situation of an incident, subject, subject occurrence time, and operation time based on the operation time. For example, completion, continuation, repetition, and the like. In general, natural language has its own aspect symbols and related co-occurrences (other than co-occurred, verbs, and aspect symbols that help resolve the ambiguity of aspects), morphemes, and multiple Chinese characters in Chinese. The smallest unit that can have a coherent meaning in the combination of. In English and Japanese, it is equivalent to a word. For example, the adverbs "yesterday", "yesterda
y ”, etc.). However, the aspect symbol has the following polysemy in its meaning, and eventually causes uncertainty. For this reason, in order to accurately analyze the aspect meaning of the sentence to be processed, it is necessary to simultaneously consider other factors that affect the aspect meaning in the sentence. In this specification, an aspect of one sentence is defined as a sentence aspect.

【００１７】以下に、日本語、中国語、英語でこのアス
ペクトを具体的に示す。（例２）日本語のアスペクト記号：「ている」可能アスペクト意味：進行、状態、反復、経験など日本語の例文文アスペクト（１）雨が降っている。（進行）（２）彼は毎日本を読んでいる。（反復）（３）彼は死んでいる。（状態）なお、日本語では、この他のアスペクト記号として「て
くる」等がある。Hereinafter, this aspect is specifically shown in Japanese, Chinese, and English. (Example 2) Japanese aspect symbol: “Yes” Possible aspect meaning: progress, state, repetition, experience, etc. Japanese example sentences Sentence aspect (1) It is raining. (Progress) (2) He reads books every day. (Repeat) (3) He is dead. (State) In the Japanese language, there is "come" as another aspect symbol.

【００１８】中国語のアスペクト記号：″了″ 可能アスペクト意味：完成、終了、経験など中国語の例文文アスペクト（１）他讀了書。（完成）（２）他死了。（終了）（３）他己經看了書。（経験）なお、中国語では、この他のアスペクト記号として
「過」等がある。Aspect symbol in Chinese: "OK" Possible aspect meaning: completion, completion, experience, etc. Example sentences in Chinese Sentence aspect (1) Other reading book. (Completed) (2) Others died. (End) (3) The other person's consent letter. (Experience) In Chinese, there is "over" as another aspect symbol.

【００１９】英語のアスペクト記号：″have＋Ｐ．
Ｐ．″ 可能アスペクト意味：完成、経験など英語の例文文アスペクト（１）Ｉ have eaten my lunch. （完成）（２）Ｉ have been to Japan twice.（経験）なお、英語では、この他のアスペクト記号として「be＋
Ｖ-ing」がある。English aspect symbol: "have + P.
P. ″ Possible aspect meaning: completion, experience, etc. Example sentences in English Sentence aspect (1) I have eaten my lunch. (Complete) (2) I have been to Japan twice. (Experience) In English, other aspect symbols "Be +
V-ing ".

【００２０】したがって、機械翻訳においては、翻訳の
対象とする自然言語たる原始言語のアスペクト記号の意
味の多義性を適切に解決しないと、翻訳結果が悪くな
る。極簡単な具体例を示すならば、上記中国語例文中
「他死了」は、既述のように「死」という簡単な動作、
事件であるにもかかわらず文そのものとしては（状態）
と（完了）のアスペクトを持ちえ、日本語に翻訳すると
「彼は死んだ。」と「彼は死んでしまっている。」の２
通りが考えられる。このため、この程度の短い例文１つ
のみではさして問題がないが、多少複雑な動作内容かつ
もう少し長い文章となるとその意味内容に種々の解釈が
生じえることとなり、そう簡単にはいかなくなる。とこ
ろで、例えば、日本情報処理学会の自然言語処理４８−
４（１９８５）に掲載されているように、従来日本語の
アスペクト意味は、図１０に示すように状態、継続、動
作、意志、結果、反復、起動、完了の八つの素性（基本
要素，primitive feature)に分けられている。また、ア
スペクト素性の組み合わせにより、日本語の動詞は図９
の動詞アスペクト分類辞書に示すように８種類に分類さ
れている。なお、ここにアスペクト素性の前に付された
「＋」はこの素性がある、「−」はこの素性がないこと
を表す。例えば、図９の動詞「ある」、「できる」に対
応するＡ「＋state,＋cont」は状態と継続との二素性を
持っていることを表す。なお、ここに、「＋」及び
「−」は結合演算でのアスペクトの増減に用いられる。
また、各アスペクト記号のアスペクト素性の組み合わせ
は図１１のアスペクト意味辞書のように整理され、語句
のアスペクト素性の組み合わせの間の結合演算規則及び
アスペクト解釈規則も定められている。ここに、演算ル
ールは、｛副詞があればそれに応じてアスペクト素性
の付加、消去を行う。２つのアスペクト素性の束を演
算し、付加、消去を行う。前記を左（文章の先とい
う意味である。従って日本語、中国語では方位、位置、
方向という面からは上や左もありえる。）から右（文章
の後）へくり返し行うことにより、原則としてアスペク
ト素性を消去するものであり、この後消去されていない
アスペクト素性を１つの束としてまとめて解釈する。｝
というものである。この際の解釈ルールを図１３に示
す。なお、これらについては、日本情報処理学会の「自
然言語処理48-4」（1985年３月27日）、野垣内出他二
名共著「動詞意味素性の付加による日本語文アスペク
トの解析および翻訳について」等にて記載され、また説
明されているいわば公知のものであるため、ここでこれ
以上の説明は省略する。また目的言語のアスペクト記号
を生成するために、翻訳言語間の差異を検討し、参照資
料としてのアスペクト変換テーブルも製作されている。
その従来例の一としては、日本情報処理学会の「自然言
語処理48-4」の「はじめに」に掲載されている日英機械
翻訳システム（方法）を用いるものである。図８は該シ
ステムのアスペクト処理流れを示すものである。以下本
図をもとにこれを説明する。例えば、図１７に示すよう
な日本語解析処理を行った中間形式の解析木について、
まず図９の動詞分類辞書を参照し、動詞「読む」のアス
ペクト素性の組み合わせＡ「−state,＋act,＋cont, ＋
compl 」（−状態，＋動作，＋継続，＋完了）を取り出
す（Ｓ１０）。なお、これらの動詞アスペクト素性は図
１０に示すように定義されている。続けて、図１１のア
スペクト意味辞書を参照し、一文の左（最初）から右
（最後）までのそれぞれのアスペクト記号のアスペクト
素性の組み合わせＢ［ｉ］，（ここに、ｉ＝１，…，ｎ
と可能な数だけ採りえる。）、本文ではＢ［１］＝「て
いる」＝「＋state,＋cont」（＋状態，＋継続）を取り
出す（Ｓ２０）。動詞以外にアスペクト意味に影響を及
ぼし得る形態素、例えば、副詞などがあるか否かを判断
する（Ｓ３０）。もし、動詞以外に、アスペクト意味に
影響し得る形態素Ｃがないと判断すると（Ｓ３０）、動
詞アスペクト素性組み合わせＡを各アスペクト記号素性
組み合わせＢと結合ルールにのっとっての演算をする
（Ｓ６０）。ただし、本文では、「すでに」という他形
態素があるためこれが検出されることとなる（Ｓ３
０）。このため、図６に示すような他形態素アスペクト
意味辞書を参照し、「すでに」のアスペクト意味の基本
素性組み合わせＣｉ，（ここに、ｉ＝１，…，ｍ）
を取り出す（Ｓ４０）。上記取り出されたアスペクト意
味の基本素性組み合わせＣｉ，（ｉ＝１，... ，
ｍ）をそれに対応する動詞アスペクト素性組み合わせＡ
と、または、各アスペクト記号素性組み合わせＢｉ，
（ｉ＝１，... ，ｎ）と結合ルールにのっとっての演算
を行ってから（Ｓ５０）、動詞アスペクト素性組み合わ
せＡを各アスペクト記号素性組み合わせＢと結合ルール
の演算をする（Ｓ６０）。上記の結合ルールにのっとっ
ての演算を行った結果を、図１３の解釈規則辞書に示す
解釈ルールと照合演算をなしたならば、この文のアスペ
クト意味が得られる（Ｓ７０）。最後に、図１４の変換
規則辞書に示す変換規則を参照し、目的言語のアスペク
ト記号を決めてから、アスペクトの処理を終える（Ｓ８
０）。この結果、「He has already read that book.」
という英文が得られる。他の例文「今日も雪が降り続い
ている。」では、上述した演算方法及び参照辞書によ
り、図１２に示す演算過程を経て、反復状態アスペクト
と解釈されるようになる。その結果、現在進行完成式と
判定され、この文章の英語の機械翻訳の結果は「Now,it
has been snowing.」となる。ただし、勿論厳密には英
国人、米国人は「Now 」と「has been snowing」とが一
の文章に並存するような文章構成とすることはありえな
いため、最終的には人手により適切な英訳がなされるこ
ととなるが、これについては既に説明したとおりであ
る。Therefore, in machine translation, the translation result becomes poor unless the polysemy of the meaning of the aspect symbol of the source language, which is the natural language to be translated, is properly resolved. To show a very simple specific example, "other dying" in the above Chinese example sentence is a simple operation of "death" as described above,
Despite being an incident, the sentence itself is (state)
And (Complete) aspects, and when translated into Japanese, "He is dead." And "He is dead."
The street is conceivable. For this reason, there is no problem with only one short example sentence of such a degree, but if the sentence is somewhat complicated and the sentence is a little longer, various interpretations may occur in the meaning of the sentence, which is not easy. By the way, for example, natural language processing 48-
As described in FIG. 4 (1985), the conventional Japanese aspect meanings are the eight features (basic element, primitive) of state, continuation, action, will, result, repetition, activation, and completion, as shown in FIG. feature). In addition, Japanese verbs are combined as shown in FIG.
Are classified into eight types as shown in the verb aspect classification dictionary. Here, “+” added before the aspect feature indicates that the feature exists, and “−” indicates that the feature does not exist. For example, A “+ state, + cont” corresponding to the verbs “a” and “available” in FIG. 9 indicates that the verb has two features of state and continuation. Here, "+" and "-" are used to increase or decrease the aspect in the combining operation.
Also, combinations of aspect features of each aspect symbol are organized as in the aspect semantic dictionary of FIG. 11, and rules for combining operations and aspects interpretation rules between combinations of the aspect features of words and phrases are defined. Here, as for the operation rule, if there is a ｛adverb, the aspect feature is added or deleted according to the ｛adverb. A bunch of two aspect features is calculated and added and deleted. The above is the left (meaning the beginning of the sentence. Therefore, in Japanese and Chinese, the direction, position,
From the aspect of direction, it can be up or left. ) To the right (after the sentence), in principle, erases the aspect features. After that, the non-erased aspect features are interpreted collectively as one bundle. ｝
That is. FIG. 13 shows the interpretation rules at this time. These are described in "The Natural Language Processing 48-4" of the Information Processing Society of Japan (March 27, 1985) and by Izumi Nogakiuchi et al. And so on, and so to speak, are well-known, so that further description is omitted here. In addition, in order to generate an aspect symbol of a target language, differences between translated languages are examined, and an aspect conversion table as a reference material is produced.
As one of the conventional examples, a Japanese-English machine translation system (method) described in “Introduction” of “Natural Language Processing 48-4” of the Information Processing Society of Japan is used. FIG. 8 shows an aspect processing flow of the system. This will be described below with reference to FIG. For example, for a parse tree in an intermediate format that has undergone Japanese parsing as shown in FIG.
First, referring to the verb classification dictionary of FIG. 9, the combination A of the aspect features of the verb “read” “−state, + act, + cont, +
compl "(-state, + operation, + continuation, + complete) is extracted (S10). Note that these verb aspect features are defined as shown in FIG. Subsequently, referring to the aspect semantic dictionary of FIG. 11, the combination B [i], (where i = 1,..., Of the aspect features of the respective aspect symbols from the left (first) to the right (last) of one sentence. n
And as many as possible. ), B [1] = “Yes” = “+ state, + cont” (+ state, + continuation) in the text (S20). It is determined whether there is a morpheme other than the verb that can affect the aspect meaning, for example, an adverb (S30). If it is determined that there is no morpheme C that can affect the aspect meaning other than the verb (S30), the verb aspect feature combination A and each aspect symbol feature combination B are operated according to the combination rule (S60). However, in the text, since there is another morpheme “already”, this is detected (S3).
0). Therefore, referring to the other morphological aspect semantic dictionary as shown in FIG. 6, the basic feature combination C i, (where i = 1,..., M) of the aspect meaning “already”
Is taken out (S40). The extracted basic feature combinations C i, (i = 1,...,
m) is the corresponding verb aspect feature combination A
Or each aspect symbol feature combination Bi,
(I = 1,..., N) and the calculation according to the combination rule (S50), and then the verb aspect feature combination A is combined with each aspect symbol feature combination B to calculate the combination rule (S60). If the result of the operation according to the above combination rule is compared with the interpretation rule shown in the interpretation rule dictionary of FIG. 13, the aspect meaning of this sentence is obtained (S70). Finally, referring to the conversion rules shown in the conversion rule dictionary of FIG. 14, the aspect symbol of the target language is determined, and then the processing of the aspect is completed (S8).
0). As a result, "He has already read that book."
Is obtained. The other example sentence “Snow continues to fall today” is interpreted as an iterative state aspect through the calculation process shown in FIG. 12 by the above-described calculation method and reference dictionary. As a result, it is determined that the ceremony is in progress, and the result of the English machine translation of this sentence is "Now, it
has been snowing. " However, strictly speaking, English and Americans cannot form sentences in which "Now" and "has been snowing" coexist in one sentence. This is as described above.

【００２１】[0021]

【発明が解決しようとする課題】しかしながら、上記の
従来例のアスペクト処理方法に不可欠のアスペクト組み
合わせの結合演算規則、解釈規則、及び変換規則等は全
て処理対象の言語に依存する。このため、一旦機械翻訳
システムの処理対象言語が変更されると、これらの関連
規則もそれに応じて変更する必要がある。更に、原始言
語をその翻訳を目指す言語たる目的言語に変換するとき
には、日本語の動詞には英語の動詞と異なり過去形がな
い等もとの言語に存在する関連規則をそのまま利用する
ことはできないことが多いので、改めて解析しなおさな
ければならない。またこれが、機械翻訳の開発効率が向
上しない一因でもある。更にまた、共通のアスペクトを
認知できる表現が欠けているので、ｎ国語用の機械翻訳
装置の場合には、中国語の各方言間、ポルトガル語とス
ペイン語等非常に類縁関係の近い言語を除き、原則とし
てアスペクト素性辞書、動詞アスペクト分類辞書、アス
ペクト記号辞書、他形態素アスペクト辞書などを各ｎ個
と、ｎ（ｎ−１）個の結合演算規則、及び変換規則を準
備しなければならず、高価、不経済、非効率的となる。However, the combination operation rules, interpretation rules, conversion rules, and the like of the aspect combinations that are indispensable for the above-described conventional aspect processing method all depend on the language to be processed. Therefore, once the language to be processed by the machine translation system is changed, it is necessary to change these related rules accordingly. Furthermore, when translating a source language into a target language, which is the language for which it is intended to be translated, Japanese verbs, unlike English verbs, do not have the past tense and cannot use the related rules that exist in the original language as they are In many cases, the analysis must be performed again. This is also one of the reasons why the development efficiency of machine translation is not improved. Furthermore, since there is no expression that can recognize common aspects, in the case of a machine translator for n languages, except for languages that are very closely related, such as between Chinese dialects, Portuguese and Spanish, etc. In principle, each of n aspect feature dictionaries, verb aspect classification dictionaries, aspect symbol dictionaries, other morpheme aspect dictionaries, and the like, and n (n-1) combination operation rules and conversion rules must be prepared. It is expensive, uneconomical and inefficient.

【００２２】本発明は、以上の課題に鑑み、各国語共通
のアスペクト意味表現形式を見つけ、各言語毎に必要と
する各種の辞書及び規則を最低限にして、多国言語機械
翻訳を大幅に効率を向上させるアスペクト処理方法及び
その装置を提供することを目的としてなされたものであ
る。In view of the above problems, the present invention finds an aspect semantic expression format common to various languages, minimizes various dictionaries and rules required for each language, and greatly improves multilingual machine translation. The object of the present invention is to provide an aspect processing method and an apparatus for improving the aspect ratio.

【００２３】[0023]

【課題を解決するための手段】上記目的を達成するため
に、請求項１の発明においては、目的言語のアスペクト
記号を求めるために、原始言語の文アスペクト情報に基
づいて、原始言語の動詞アスペクト分類、アスペクト記
号並びに「状況」、「解釈」及び「共起要素」の形式表
現による可能アスペクト意味を登録（あらかじめの格納
による永久記憶）している原始言語アスペクト記号辞書
部と、目的言語の動詞アスペクト分類、アスペクト記号
並びに「状況」、「解釈」及び「共起要素」の形式表現
による可能アスペクト意味解釈を登録している目的言語
アスペクト記号辞書部と、動詞及びアスペクト記号のほ
かにアスペクト意味に影響し得る形態素、品詞（単語を
意義、作用、形態から分類した構文上の意味。例えば日
本語における名詞、動詞、形容詞、形容動詞、副詞等）
及び「可能解釈ｉ」（ここに、ｉ＝１，２，３，…と可
能な解釈の数だけ採りえる整数）の形式表現による可能
アスペクト解釈を登録している原始言語他形態素アスペ
クト辞書部と、原始言語の変換条件のアスペクト意味組
み合わせと目的言語のアスペクト意味組み合わせとの対
で「状況」、「解釈」及び「共起要素」の形式表現によ
るアスペクト意味変換規則を登録しているアスペクト意
味変換辞書部と、解析木における動詞ノードの動詞アス
ペクト分類及びアスペクト記号を検索要素として、前記
原始言語アスペクト記号辞書部内を参照し、すべての可
能な「状況」、「解釈」及び「共起要素」アスペクト意
味組み合わせを取り出す原始言語アスペクト記号情報抽
出部と、動詞及びアスペクト記号以外の形態素、品詞を
検索要素として、前記原始言語他形態素アスペクト辞書
部内を参照し可能アスペクト意味制限の組み合わせを取
り出すアスペクト意味制限条件抽出部と、上記取り出さ
れたアスペクト意味制限条件に基づいて同一化手法によ
り上記の可能アスペクト意味の組み合わせの中からあり
えないアスペクト意味を削除する同一化部と、同一化さ
れた可能アスペクト意味組み合わせを検索要素として前
記アスペクト意味変換辞書部内を参照し目的言語のアス
ペクト意味組み合わせを獲得する変換部と、目的言語の
動詞分類及び上記のアスペクト意味組み合わせを検索要
素として前記目的言語アスペクト記号辞書部内を参照
し、対応する目的言語アスペクト記号及び共起要素を獲
得する目的言語アスペクト記号抽出部とを有することを
特徴とするアスペクト処理装置としている。To achieve the above object, according to the first aspect of the present invention, in order to obtain an aspect symbol of a target language, a verb aspect of a source language is determined based on sentence aspect information of the source language. A source language aspect symbol dictionary unit that registers (permanently stores in advance by storing) possible aspect meanings by classification, aspect symbols, and formal expressions of "situation", "interpretation" and "co-occurrence element", and verbs in the target language A target language aspect symbol dictionary part that registers the possible aspect semantic interpretation by the formal expression of the aspect classification, aspect symbol, and "situation", "interpretation", and "co-occurrence element", and the aspect meaning in addition to the verb and aspect symbol Influential morphemes and parts of speech (syntactic meaning of words classified according to meaning, action, or form; for example, nouns in Japanese Verb, adjective, adjective, adverb, etc.)
A source language / other morphological aspect dictionary unit that registers possible aspect interpretations in a formal expression of “possible interpretation i” (where i = 1, 2, 3,... Aspect semantic conversion in which aspect semantic conversion rules based on the formal expressions of "situation", "interpretation" and "co-occurrence element" are registered as a pair of the aspect semantic combination of the conversion conditions of the source language and the aspect semantics of the target language The dictionary part and the verb aspect classification and the aspect symbol of the verb node in the parse tree are referred to in the source language aspect symbol dictionary part as search elements, and all possible "situation", "interpretation" and "co-occurrence element" aspects are referred to. A source language aspect symbol information extraction unit that extracts semantic combinations, and morphemes and parts of speech other than verbs and aspect symbols as search elements, An aspect semantic restriction condition extraction unit that extracts combinations of possible aspect semantic restrictions by referring to the source language other morphological aspect dictionary unit, and a combination of the above possible aspect semantics by the sameization method based on the extracted aspect semantic restriction conditions An identification unit that removes an impossible aspect meaning from the inside; a conversion unit that obtains an aspect meaning combination of the target language by referring to the aspect meaning conversion dictionary unit as a search element using the identified possible aspect meaning combination; A target language aspect symbol extraction unit that refers to the target language aspect symbol dictionary unit as a search element using the verb classification and the aspect meaning combination as a search element, and obtains a corresponding target language aspect symbol and a co-occurrence element. As an aspect processing device That.

【００２４】[0024]

【作用】本発明に係るアスペクト処理装置は上記構成に
よって、原始言語から目的言語への機械翻訳に際してア
スペクト処理部を介して原始言語のアスペクトを目的言
語のアスペクトに変換する処理を行う。具体的には、以
下の作用をなす。原始言語アスペクト記号辞書部には、
目的言語のアスペクト記号を求めるために、原始言語の
文アスペクト情報に基づいて、原始言語の動詞アスペク
ト分類、アスペクト記号並びに「状況」、「解釈」及び
「共起要素」の形式表現による可能アスペクト意味が登
録されている。The aspect processing apparatus according to the present invention performs processing for converting an aspect of a source language into an aspect of a target language via an aspect processing unit at the time of machine translation from a source language to a target language. Specifically, the following operation is performed. In the source language aspect symbol dictionary part,
To determine the aspect symbol of the target language, based on the sentence aspect information of the source language, the verb aspect classification of the source language and the possible aspect meaning by the formal expression of "situation", "interpretation" and "co-occurrence element" Is registered.

【００２５】目的言語アスペクト記号辞書部には、目的
言語の動詞アスペクト分類、アスペクト記号並びに「状
況」、「解釈」及び「共起要素」の形式表現による可能
アスペクト意味解釈が登録されている。なお、「状
況」、「解釈」及び「共起要素」の具体的内容について
は実施例にて説明する。原始言語他形態素アスペクト辞
書部には、動詞及びアスペクト記号のほかにアスペクト
意味に影響し得る形態素、品詞及び「可能解釈ｉ」（こ
こに、ｉ＝１，２，３，…と可能な数だけ採りえる。例
えば「可能解釈１」、「可能解釈２」、…）の形式表現
による可能アスペクト解釈が登録されている。In the target language aspect symbol dictionary part, the verb aspect classification of the target language, the aspect symbol, and the possible aspect semantic interpretation by the formal expression of "situation", "interpretation" and "co-occurrence element" are registered. The specific contents of “situation”, “interpretation”, and “co-occurrence element” will be described in Examples. In the source language and other morphological aspect dictionary section, in addition to verbs and aspect symbols, morphemes, parts of speech, and “possible interpretations i” (where i = 1, 2, 3,... For example, possible aspect interpretations in the form of “possible interpretation 1”, “possible interpretation 2”,...) Are registered.

【００２６】アスペクト意味変換辞書部には、原始言語
の変換条件のアスペクト意味組み合わせと目的言語のア
スペクト意味組み合わせとが対で「状況」、「解釈」及
び「共起要素」の形式表現によるアスペクト意味変換規
則が登録されている。原始言語アスペクト記号情報抽出
部が、解析木における動詞ノードの動詞アスペクト分類
及びアスペクト記号を検索要素として、前記原始言語ア
スペクト記号辞書部内を参照し、すべての可能な「状
況」、「解釈」及び「共起要素」アスペクト意味組み合
わせを取り出す。In the aspect meaning conversion dictionary unit, the aspect meaning combination of the source language conversion condition and the aspect meaning combination of the target language are paired, and the aspect meaning is expressed in the form of "situation", "interpretation" and "co-occurrence element". Conversion rules are registered. The source language aspect symbol information extraction unit refers to the source language aspect symbol dictionary unit as a search element using the verb aspect classification and the aspect symbol of the verb node in the parse tree, and all possible “situations”, “interpretations” and “ Extract the co-occurrence element "aspect semantic combination.

【００２７】アスペクト意味制限条件抽出部が、動詞及
びアスペクト記号以外の形態素、品詞を検索要素とし
て、前記原始言語他形態素アスペクト辞書部内を参照し
可能アスペクト意味制限の組み合わせを取り出す。同一
化部が、上記取り出されたアスペクト意味制限条件に基
づいて同一化手法により上記の可能アスペクト意味の組
み合わせの中からありえないアスペクト意味を削除す
る。The aspect semantic restriction condition extracting unit extracts a combination of possible aspect semantic restrictions by referring to the source language and other morphological aspect dictionary unit using morphemes and parts of speech other than verbs and aspect symbols as search elements. The identification unit deletes an impossible aspect meaning from the combinations of the possible aspect meanings by the identification method based on the extracted aspect meaning restriction condition.

【００２８】目的言語アスペクト記号抽出部が、同一化
された可能アスペクト意味組み合わせを検索要素として
前記アスペクト意味変換辞書部内を参照し目的言語のア
スペクト意味組み合わせを獲得する変換部と、目的言語
の動詞分類及び上記アスペクト意味組み合わせを検索要
素として前記目的言語アスペクト記号辞書部内を参照
し、対応する目的言語アスペクト記号及び共起要素を獲
得する。A target language aspect symbol extraction unit that obtains an aspect semantic combination of the target language by referring to the aspect semantic conversion dictionary unit using the identified possible aspect semantic combinations as search elements, and a verb classification of the target language Then, the target language aspect symbol dictionary and the co-occurrence element are obtained by referring to the target language aspect symbol dictionary unit as a search element using the aspect semantic combination.

【００２９】[0029]

【実施例】本発明を実施例に基づいて説明するに先立
ち、本発明の立脚する法則について説明する。Ｍr.
Ｃ．Ｓmith（Ｃ．スミス氏）がＬinguistics and Ｐhi
losophy （言語学及び哲学（意訳））の論文集（１９８
６年２月）により提案した「Ａ speaker-based approac
h to aspect 」（話す者からのアスペクトへの提言（意
訳））から分かるように、どの言語でも自然言語のアス
ペクトの意味は、図１５に示す状況（situation ) によ
り分類できる。かつ、各種類の状況の性質は、基本的に
は完了相（perfectivity) 及び未完了相（imperfectivi
ty）の二種類しかを持っていない。しかし、アスペクト
の意味をこのように分類すれば、細かく分類していない
ので、アスペクトを認知処理するときに、アスペクトの
意味の多義性が発生し易い。アスペクトの多義性を解決
するために、Ｍr.Ｂ．Ｃomire （Ｂ．コミレ氏）がケン
ブリッジ大学の論文集（１９７６年）により提案した
「Aspect：Ａn introduction to the study of aspect
and related problem 」（アスペクトとこれに関連する
問題への探究への紹介（意訳））は上記の完了及び未完
了の性質について、解釈という概念を導入の上で、図１
６に示すように完了という性質を完了（perfective）、
終了（telic)及び経験（experience）というふうに、未
完了という性質を試し（delimitative）、開始（incept
ive ) 、反復（repeative)というふうにもっと細かく分
類した。（そして、これが特許請求の範囲に使用した
「状況」の意味、内容である。）すると、上記図１５に
示すアスペクトの状況分類の状況及び図１６に示す本来
は日本語、英語用のものを中国語にも適用しうるよう補
足した結果である細分化された解釈を日本語、英語、中
国語等語族の類縁関係を大きく越える（ウラル−アルタ
イ−古アジア語族系、インド−ヨーロッパ語族系、中国
語族）各言語の、そしてその他の各自然言語の文のアス
ペクトの共通表現にすることができるようになった。そ
して、ここにいう解釈が特許請求の範囲にいう解釈であ
る。本発明は、更に文の共起要素（動詞及びアスペクト
記号の他に文のアスペクト意味を解釈し、また目的言語
のアスペクト記号を生成する際にはその参考となるも
の。）例えば、副詞、その他助詞等によりアスペクト意
味の多義性を解決できることを発見したので、この共起
要素の有無をも考慮して上記のアスペクト意味の共通表
現に加えたものである。また本発明は、多数のアスペク
ト記号を連用して多数のアスペクト意味の組み合わせが
生じたときに、これに対応して、言語間のアスペクト表
現の差異を参照するためのアスペクト変換辞書を準備す
ることにより各言語共通のアスペクト表現を採用したた
め、アスペクト意味変換辞書の方向性（対象とする言
語）がなくなる。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Before describing the present invention based on embodiments, the rules on which the present invention is based will be described. Mr.
C. Smith (C. Smith) Linguistics and Phi
losophy (linguistics and philosophy)
"A speaker-based approac" proposed by
As can be seen from "h to aspect" (a proposal (translation) from a speaker to an aspect), the meaning of an aspect of a natural language in any language can be classified by the situation shown in FIG. In addition, the nature of each type of situation basically depends on the perfectness phase and the incomplete phase (imperfectivi).
ty). However, if the meanings of the aspects are classified in this way, they are not categorized in detail, so that when the aspects are recognized and processed, polysemy of the meanings of the aspects is likely to occur. In order to resolve the ambiguity of the aspect, Mr.B. "Aspect: An introduction to the study of aspect" proposed by Comire (B. Comire) in a collection of dissertations at Cambridge University (1976).
and related problem ”(introduction to the search for aspects and related problems) is based on the concept of interpretation of the above-mentioned completed and uncompleted properties, and introduces the concept of interpretation.
Complete the property of completion as shown in 6 (perfective),
Try incomplete properties, such as end (telic) and experience (delimitative), start (incept
ive) and repetitive. (And this is the meaning and content of “situation” used in the claims.) Then, the situation classification of the aspect shown in FIG. 15 and the situation originally for Japanese and English shown in FIG. Subdivided interpretations, which are the results of supplementing to be applicable to Chinese, far exceed the similarities of Japanese, English, Chinese, etc. (Ural-Altai-Pale Asian languages, Indo-European languages, (Chinese) It became possible to have a common representation of sentence aspects in each language and in each of the other natural languages. And the interpretation here is the interpretation in the claims. The present invention further provides co-occurrence elements of a sentence (interpretation of the aspect meaning of a sentence in addition to verbs and aspect symbols, and reference when generating an aspect symbol of a target language), such as adverbs, etc. Since it was discovered that the ambiguity of the aspect meaning can be resolved by using particles, etc., it is added to the common expression of the aspect meaning described above in consideration of the presence or absence of this co-occurrence element. Further, according to the present invention, when a large number of combinations of aspect meanings are generated by continuously using a large number of aspect symbols, an aspect conversion dictionary for referring to a difference in aspect expression between languages is prepared. As a result, the aspect expression common to each language is adopted, so that the direction (target language) of the aspect meaning conversion dictionary is lost.

【００３０】以下、以上のことを日本文を主に、例文を
参照しつつ説明する。今、従来の技術で示した例文「彼
はすでに本を読んでいる。」を例にとるならば、「読
む」と「ている」だけからならば、文のアスペクトとし
ては「進行」、「反復」、「状態」、「反復状態」等が
ありえるが、副詞「すでに」があるため、「反復状
態」、「進行」、「反復」は除外される。このため、副
詞「すでに」が文のアスペクト内容の理解に大きな役割
を担うのがわかる。Hereinafter, the above will be described mainly with reference to Japanese sentences and with reference to example sentences. If the example sentence "He has already read a book" shown in the conventional technique is taken as an example, if only "read" and "do" are used, the sentence aspects are "progress", " Although there may be "repeat", "state", "repeat state", and the like, the adverb "already" excludes "repeat state", "progress", and "repeat". For this reason, it can be seen that the adverb "already" plays a large role in understanding the aspect content of the sentence.

【００３１】また、もしこの際この文章が「彼は毎日本
を読んでいる。」と「すでに」のかわりに同じく副詞の
「毎日」が入っているならば、この「毎日」のために文
アスペクトの素性は「反復状態」以外にありえないこと
となる。このため、「すでに」のような純然たる副詞の
みならず、「今日」「毎日」というような本来は名詞で
ある語がアスペクトに大きな影響を及ぼすのがわかる。
この他「やっと」、「とうとう」、「昨日から」、「３
月１日」等の具体的期日をあげられよう。例えば、「３
月１日」はこれ以前の日からみれば未来、未来完了とな
り、これ以後の日からみれば過去、過去完了、継続等と
なる。If this sentence contains the adverb "everyday" instead of "he is reading a book every day" and "already," the sentence for this "everyday" The features of the aspect are impossible except for the "repeated state". For this reason, it can be seen that not only a pure adverb such as “already” but also a word that is originally a noun such as “today” and “everyday” greatly affects the aspect.
In addition, "finally", "finally", "from yesterday", "3
A specific date such as "Monday 1" will be given. For example, "3
The “Month 1” is the future and the future is completed when viewed from the previous day, and is the past, the past completed and the continuation is viewed from the day after this.

【００３２】副詞以外の例を示すならば、助詞をあげら
れる。ここに、助詞とはある言葉について、その言葉と
他の言葉の関係を示したり、限定・接続・感動などの意
味を表したりするものをいう。ここの助詞は日本語では
「が」、「を」、「に」、「で」、等であり、英語では
preposition に相当し、「in」、「on」、「from」等で
あり、中国語では介詞を指し、「從」、「到」、
「給」、「在」等である。To show examples other than adverbs, particles can be mentioned. Here, a particle refers to a word that indicates a relationship between the word and another word, or expresses a meaning such as limitation, connection, and impression. Particles here are "ga", "wo", "ni", "de", etc. in Japanese, and in English
preposition is equivalent to "in", "on", "from", etc.
For example, "pay" and "at".

【００３３】さて、各国語とも、助詞は格をもってい
る。例えば、日本語の「で」は、「位置」格、「道具」
格等を有している。さて、本発明ではこれらの格意味に
よりアスペクト意味の多義性を解決することができるこ
とをも発明の上、これをとり入れている。例えば、「彼
は去年から失業している」という文のアスペクト意味は
“進行”、“反復”、“状態”、“習慣”などがある。
しかしこの文での「から」という副詞の格意味は「time
from」（時間 …から）なので、上記の文のアスペク
ト意味を“状態”に解釈することができる。したがっ
て、上記の文の中国語の訳文は「他從去年一直失業著」
（He has been losing his job since lastyear）であ
る。「他從去年正在失業著」（He was losing his job
from last year）というような誤訳の防止がはかれる。The particles in each language have a case. For example, "de" in Japanese means "position" case, "tool"
It has a case. In the present invention, the fact that the polysemy of the aspect meaning can be solved by these case meanings is incorporated into the invention. For example, aspects of the sentence "He has been unemployed since last year" include "progression", "repetition", "state", and "habit".
However, the case of the adverb "kara" in this sentence is "time
from "(time ...), The aspect meaning of the above sentence can be interpreted as “state”. Therefore, the Chinese translation of the above sentence is "other從去year Kazunao unemployment Author"
(He has been losing his job since lastyear). "Other從去year Seizai unemployment Author" (He was losing his job
from last year).

【００３４】他の例文を示すなら、「私は今から君が昨
日まで私にしてやってきたことは不問にする。」という
多少複雑な文では「今」と「昨日」という２つの本来は
名詞であるが副詞として作用する語があるから、このう
ち「昨日」の方は後の方にある助詞「は」で、文全体と
してのアスペクトの意味は有されないのがわかる。以上
は、原始言語を中心にした説明であるが、これをもとに
目的言語を生成する場合にも、その言語固有のアスペク
ト共起要素を文に組み込むことによりアスペクトの表現
が正確となる。上記日本文の「今から」の部分の英訳文
で具体的に示せば、「From now on 」である。To show another example sentence, in the somewhat complicated sentence "I do not care what you have done to me from now until yesterday." Two originally nouns "now" and "yesterday" However, since there is a word that acts as an adverb, it can be seen that "yesterday" is a particle "ha" in the latter part and does not have the meaning of the aspect as a whole sentence. The above description has been focused on the source language. Even when the target language is generated based on the source language, the aspect expression can be accurately expressed by incorporating the language-specific aspect co-occurrence element into the sentence. The English translation of the "imakara" part of the Japanese sentence above is "From now on".

【００３５】他の例文を示すならば、翻訳文が日本文で
ある場合の現在までに事件が完了しその結果が今日継続
している動作であるならば、副詞「とっくに」、「すで
に」がある。また、会話を示す記号「」は、その内部
で独立した文アスペクトを有する。また英訳での「Oh,y
es」、「but 」、ドイツ語の「Ja」、「aber」、日本語
の「なるほど」、「しかし」という品詞間の呼応では両
品詞に関係する部分の文アスペクトは同じであるのが原
則である。そして、以上説明したことは各国言語に共通
にみられる現象である。To show another example sentence, if the case has been completed by now and the result is an operation that continues today when the translated sentence is a Japanese sentence, the adverbs “tokku ni” and “already” is there. The symbol “” indicating a conversation has an independent sentence aspect inside. The English translation of "Oh, y
es, but, German Ja, aber, Japanese naruhaku, but the part-of-speech correspondence between the parts of speech is the same in principle. It is. What has been described above is a phenomenon that is common to all languages.

【００３６】以下、本発明を実施例に基づいてより具体
的かつ詳細に説明する。図１は本発明に係るアスペクト
処理装置の一実施例の構成を示すものである。本図に示
すアスペクト処理装置は、入力部１００、原始言語解析
中間構造生成部２００、解析用辞書部２５０、変換部３
００、変換辞書部３５０、アスペクト処理部４００、原
始言語アスペクト記号辞書部４５０、原始言語他形態素
アスペクト辞書部４５１、アスペクト意味変換辞書部４
５２、目的言語アスペクト記号辞書部４５３、目的言語
生成部５００、生成用辞書５５０、出力部６００及びバ
ッファ７００を備えている。入力部１００はキーボード
などからなり、使用者が解析しようとする原始言語を入
力するものである。原始言語解析中間構造生成部２００
は入力された文について解析用辞書部２５０内を参照し
て、構文及び意味の解析を行い、原始言語に依存する中
間構造を獲得してから、これをバッファ７００に記憶す
る。変換部３００はバッファ７００から中間構造を取り
出し、変換辞書部３５０を参照して、原始言語の構文及
び意味に依存する中間構造を目的言語に依存する中間構
造に変換し、原始言語及び目的言語の動詞アスペクト分
類を判定してから、変換結果をバッファ７００に記憶す
る。アスペクト処理部４００はバッファ７００から目的
言語に依存する中間構造を取り出してから、原始言語ア
スペクト記号辞書部４５０、原始言語他形態素アスペク
ト辞書部４５１、アスペクト意味変換辞書部４５２、目
的言語アスペクト記号辞書部４５３を参照し、原始言語
に依存するアスペクト記号及び関連要素を目的言語に依
存するアスペクト記号及び関連要素に変換する。そし
て、変換された結果をバッファ７００に記憶する。この
際のアスペクト処理部４００の詳細な処理流れを図２に
示す。原始言語アスペクト記号辞書部４５０、目的言語
アスペクト記号辞書部４５３、原始言語他形態素アスペ
クト辞書部４５１、アスペクト意味変換辞書部４５２の
構造をそれぞれ図４、図５、図６、図７に示す。目的言
語生成部５００は、生成用辞書部５５０を参照して、目
的言語を生成し、その翻訳結果をバッファ７００に記憶
する。最後に、翻訳結果を印刷機、ＣＲＴなどの出力部
６００により出力する。Hereinafter, the present invention will be described more specifically and in detail based on examples. FIG. 1 shows the configuration of an embodiment of an aspect processing apparatus according to the present invention. The aspect processing apparatus shown in the figure includes an input unit 100, a source language analysis intermediate structure generation unit 200, an analysis dictionary unit 250, a conversion unit 3
00, conversion dictionary section 350, aspect processing section 400, source language aspect symbol dictionary section 450, source language and other morphological aspect dictionary section 451, aspect meaning conversion dictionary section 4
52, a target language aspect symbol dictionary unit 453, a target language generation unit 500, a generation dictionary 550, an output unit 600, and a buffer 700. The input unit 100 includes a keyboard or the like, and is used to input a source language to be analyzed by a user. Source language analysis intermediate structure generation unit 200
Analyzes the syntax and meaning of the input sentence with reference to the analysis dictionary unit 250 to obtain an intermediate structure depending on the source language, and stores it in the buffer 700. The conversion unit 300 extracts the intermediate structure from the buffer 700, refers to the conversion dictionary unit 350, converts the intermediate structure that depends on the syntax and meaning of the source language into an intermediate structure that depends on the target language, and converts the source language and the target language. After determining the verb aspect classification, the conversion result is stored in the buffer 700. The aspect processing unit 400 extracts the intermediate structure depending on the target language from the buffer 700, and then extracts the source language aspect symbol dictionary unit 450, the source language / other morphological aspect dictionary unit 451, the aspect meaning conversion dictionary unit 452, and the target language aspect symbol dictionary unit. With reference to 453, the source language-dependent aspect symbols and related elements are converted into the target language-dependent aspect symbols and related elements. Then, the converted result is stored in the buffer 700. FIG. 2 shows a detailed processing flow of the aspect processing unit 400 at this time. The structures of the source language aspect symbol dictionary unit 450, target language aspect symbol dictionary unit 453, source language and other morphological aspect dictionary unit 451, and aspect meaning conversion dictionary unit 452 are shown in FIGS. 4, 5, 6, and 7, respectively. The target language generation unit 500 generates the target language with reference to the generation dictionary unit 550, and stores the translation result in the buffer 700. Finally, the translation result is output by the output unit 600 such as a printing machine or a CRT.

【００３７】なお、ここにアスペクト処理するときに参
照する原始言語アスペクト記号辞書部４５０、原始言語
他形態素アスペクト辞書部４５１、アスペクト意味変換
辞書部４５２、目的言語アスペクト記号辞書部４５３な
どの内容は予め作成された上でレジスタに登録（永久記
憶）されている。この際、原始言語アスペクト記号辞書
部４５０及び原始言語他形態素アスペクト辞書部４５１
の内容は言語自身の意味や構文などの情報により作成さ
れたものである。The contents of the source language aspect symbol dictionary unit 450, source language and other morphological aspect dictionary unit 451, aspect meaning conversion dictionary unit 452, and target language aspect symbol dictionary unit 453, which are referred to when performing the aspect processing, are previously determined. It is created and registered (permanently stored) in a register. At this time, the source language aspect symbol dictionary unit 450 and the source language other morphological aspect dictionary unit 451
Is created based on information such as the meaning and syntax of the language itself.

【００３８】以下には、原始言語アスペクト記号辞書部
４５０に格納されているデータ構造をＣプログラム言語
で実現しているのを示す。 int mrknum＝１６；／目的言語アスペクト記号数／ typedef struct aspmarker｛ char poss-interpre ８８０；｝ＡＳＰＭＡＲＫＥＲ；／可能アスペクト意味の構造／ typedef struct sourceasp｛ＡＳＰＭＡＲＫＥＲ marker mrknum ；｝ＳＯＵＲＣＥＡＳＰ；／各アスペクト記号の構造／ＳＯＵＲＣＥＡＳＰ sourceasp＿data ５；／５種類の動詞アスペクト／次に、上記各部のアスペクト処理に際しての動作の内容
を具体的に説明するに先立ち、その説明で使用する技術
用語についてあまり一般的ではないので当業者にとり周
知なものもあるが定義若しくは内容をあらかじめ説明し
ておく。The following shows that the data structure stored in the source language aspect symbol dictionary unit 450 is realized by the C program language. int mrknum = 16; / number of target language aspect symbols / typedef struct aspmarker @ char poss-interpre 880; @ASPMARCHER; / structure of possible aspect meanings / typedef struct sourceasp @ ASPMARCHER marker mrknum; @SOURCEASP; / structure of each aspect symbol / SOURCEASP sourceasp_data 5; / 5 types of verb aspects / Next, prior to specifically describing the content of the operation in the aspect processing of each of the above-mentioned parts, the technical terms used in the description are not so general, so those skilled in the art Although some are well known, their definitions or contents will be explained in advance.

【００３９】原始言語解析中間構造部生成部の生成対象
たる「中間構造」とは、依存構造や概念構造などからな
り、これらは機械翻訳処理の専門用語であり、システム
の演算方法にしたがって、任意に依存構造や概念構造を
選択したり、作成したりすることができるものである。
通常、中間構造は多くの情報を属性値に隠し、構造をよ
り簡単にする。したがって、より少ない規則だけで、中
間構造の変換を行うことが可能となるものである。更
に、中間構造は処理対象の言語とは無関係に、機械翻訳
の演算方法にて処理しえるものである。例えば、「私は
学校へ行く」という簡単な文を対象に日中機械翻訳での
原始言語と目的言語との中間構造を図１８に示す。図１
８のような簡単な中間構造で原始言語の中間構造の変換
処理（構造変換、概念変換、訳語選択など）を行えば、
目的言語の中間構造となる。なお、後に説明する実施例
での原始言語あるいは目的言語の中間構造はいずれも同
じ構造（例えば木構造）とされ、各自の意味や構文など
の情報を格納するものとしている。The "intermediate structure" to be generated by the source language analysis intermediate structure part generating unit is composed of a dependency structure, a conceptual structure, and the like. These are technical terms of machine translation processing, and are arbitrary according to the operation method of the system. It is possible to select or create a dependent structure or a conceptual structure.
Usually, the intermediate structure hides a lot of information in attribute values, making the structure simpler. Therefore, the conversion of the intermediate structure can be performed with fewer rules. Further, the intermediate structure can be processed by a machine translation operation method irrespective of the language to be processed. For example, FIG. 18 shows an intermediate structure between a source language and a target language in Japanese-Chinese machine translation for a simple sentence “I go to school”. FIG.
If you perform the conversion process (structure conversion, concept conversion, translation selection, etc.) of the intermediate structure of the source language with a simple intermediate structure like 8,
The intermediate structure of the target language. The intermediate structures of the source language and the target language in the embodiments described later have the same structure (for example, a tree structure), and store information such as their own meaning and syntax.

【００４０】「解析木」とは、機械翻訳、自然言語処理
の分野では、木構造で言語の処理結果を表示したものを
いう。「同一化」手法（unification)とは、プログラム
言語に用いられる同一化手法と同じであり、同一化演算
を利用して索引値（インデックス値）により関連のある
辞書内を参照し、機械翻訳処理に必要な関連ある値を求
めるものである。例えば、（ａ、ｂ、ｘｘ）を（ａ、
ｂ、Ａ）、（ａ、ｂ、Ｂ）、（ａ、ｂ、Ｃ）、（ｂ、
ｃ、Ｒ）、（ｂ、ｃ、Ｓ）、（ｃ、ｄ、Ａ）及び（ｄ、
ｆ、Ｋ）という知識データベース（辞書）と同一化演算
をすると、ｘｘという変数の値として（ａ、ｂ）に対応
するＡ、Ｂ、Ｃが得られる。In the field of machine translation and natural language processing, an "analysis tree" refers to a representation of a language processing result in a tree structure. The “unification” method (unification) is the same as the unification method used in a programming language, and refers to a related dictionary by an index value (index value) using an equalization operation and performs machine translation processing. To determine the relevant values required for For example, (a, b, xx) is changed to (a,
b, A), (a, b, B), (a, b, C), (b,
c, R), (b, c, S), (c, d, A) and (d,
By performing an equalization operation with a knowledge database (dictionary) f) (K), A, B, and C corresponding to (a, b) are obtained as values of variables xx.

【００４１】「マッチング手法」とは、条件を比較し、
一致するものがあれば該変換値に置換する演算である。
もし、一致するものがない場合には、マッチングの演算
失敗となる。以下、アスペクト処理部４００における処
理流れを図２、３を参照しつつ説明する。なお、処理流
れ図は一文を対象としたものである。まず初期値として
処理に必要のある変数部Ａ、Ｂ、Ｃ、Ｄ、Ｅを全て０
（空集合）に設定する（Ｓ４０１）。ここでは変数部Ａ
は動詞アスペクト分類を格納する。変数部ＢはＢ［ｉ］
（ここに、ｉ＝１、…、ｎ）より構成され、各Ｂ［ｉ
は木構造で処理対象の言語の文の処理結果を表示した解
析木に出現するすべてのアスペクト記号の集合を格納す
る。変数部Ｃは各国語共通の意味表現たる「状況」、
「解釈」及び「共起要素」の形式表現による可能アスペ
クト意味の集合を格納する。変数部Ｄはアスペクト制限
条件を格納する。変数部Ｅはアスペクト意味を格納す
る。次に、原始言語解析木から動詞のアスペクト分類を
取り出し、取り出された値を変数部Ａに格納する（Ｓ４
１０）。本実施例では、入力された原始言語の文につい
て左から右まで出現した原始言語のすべてのアスペクト
記号を図１７のように動詞ノードに格納しているので、
動詞ノードからすべてのアスペクト記号を取り出し順に
各Ｂ［ｉに格納する（Ｓ４２０）。続けて、上記Ａ部
の動詞アスペクト分類とＢ部のすべてのアスペクト記号
Ｂ［ｉの内容を検索要素として、原始言語アスペクト
記号辞書部４５０を参照して、「状況」、「解釈」及び
「共起要素」の形式表現による可能なアスペクト意味の
集合を取り出し（Ｓ４３０）、その結果を変数部Ｃに格
納する。その後、解析木から動詞ノード以外の形態素ノ
ードを取り出し、取り出された形態素及び品詞を検索要
素として、原始言語他形態素アスペクト辞書部４５１を
参照して、そのアスペクト意味制限条件を取り出し、取
り出された各アスペクト意味制限条件毎にこれを変数部
Ｄの各Ｄ［ｉに格納する（Ｓ４４０）と共に、変数ｉ
の初期値を１に設定する（Ｓ４５０）。その後、上記取
り出されたアスペクト意味制限条件を格納するＤ部の各
Ｄ［ｉは空集合（０、すなわち何も取り出されていな
い）か否かを判断する（Ｓ４６０）。空集合でない場合
には、同一化手法により、アスペクト意味制限条件部Ｄ
の各Ｄ［ｉについてその内容に合う可能アスペクト意
味部Ｃの値Ｃ［１ｋを取り出し、取り出されたＣ
１ｋによりＣ１１を設定してから、Ｃ１
Ｃｊ，（ｊ＝２，... ，ｍ）を空集合に設定する
（Ｓ４７０）。ｉを１加算してから（Ｓ４８０）、可能
アスペクト意味を抽出するための処理（Ｓ４６０〜Ｓ４
８０）を繰り返し行う。もし、Ｄ［ｉの値が空集合で
あれば、Ｃ［ｉ１，（ｉ＝１，... ，ｎ）の値によ
りアスペクト意味部Ｅの各値Ｅ［ｉを設定する（Ｓ４
９０）。そして、図７のアスペクト意味変換辞書部４５
２を参照し、マッチング手法によりＥ［ｉの内容をア
スペクト意味変換辞書部４５２の変換条件と比較し、一
致する変換条件の対応変換結果を取り出し、アスペクト
意味部Ｅの各Ｅ［ｉ内の内容を修正する（Ｓ５０
０）。目的言語の動詞分類及びアスペクト意味部Ｅの各
Ｅ［ｉの内容を検索要素として、目的言語アスペクト
記号辞書部４５３を参照して、目的言語アスペクト記号
及び共起要素を取り出し、目的言語生成部の生成に利用
されるように動詞ノードに格納する。これで、アスペク
ト処理部の処理を終える。The "matching method" is to compare conditions,
If there is a match, the operation is replaced with the converted value.
If there is no match, the matching operation fails. Hereinafter, a processing flow in the aspect processing unit 400 will be described with reference to FIGS. The processing flow chart is for one sentence. First, all the variable parts A, B, C, D, and E required for processing as initial values are set to 0.
(Empty set) (S401). Here, variable part A
Stores the verb aspect classification. The variable part B is B [i]
(Where i = 1,..., N), and each B [i
Stores a set of all aspect symbols appearing in the parse tree that displays the processing result of the sentence of the language to be processed in a tree structure. The variable part C is "situation" which is a semantic expression common to each language,
Stores a set of possible aspect meanings in formal expressions of "interpretation" and "co-occurrence element". The variable section D stores an aspect restriction condition. The variable section E stores the aspect meaning. Next, the verb aspect classification is extracted from the source language parse tree, and the extracted value is stored in the variable part A (S4).
10). In this embodiment, all the aspect symbols of the source language appearing from the left to the right in the input source language sentence are stored in the verb node as shown in FIG.
All aspect symbols are extracted from the verb node and stored in each B [i] in the order of extraction (S420). Subsequently, referring to the source language aspect symbol dictionary 450 as a search element, the verb aspect classification of the part A and the contents of all the aspect symbols B [i of the part B are referred to, and the “situation”, “interpretation”, and “ A set of possible aspect meanings by the formal expression of “element” is extracted (S430), and the result is stored in the variable part C. Thereafter, morpheme nodes other than the verb node are extracted from the parse tree, and the extracted morpheme and part of speech are used as search elements to refer to the source language / other morpheme aspect dictionary unit 451 to extract the aspect semantic restriction condition. This is stored in each D [i of the variable part D for each aspect semantic restriction condition (S440), and the variable i
Is set to 1 (S450). Thereafter, it is determined whether each D [i of the D part storing the extracted aspect semantic restriction condition is an empty set (0, that is, nothing has been extracted) (S460). If it is not an empty set, the aspect semantic restriction condition part D
Of the possible aspect meaning part C that matches the content of each D [i of
After setting C 11 by 1 k, C 1
C j, (j = 2,..., M) is set to an empty set (S470). After adding 1 to i (S480), processing for extracting possible aspect meanings (S460 to S4)
80) is repeated. If the value of D [i is an empty set, each value E [i of the aspect meaning part E is set by the value of C [i 1, (i = 1,..., N) (S4
90). Then, the aspect meaning conversion dictionary unit 45 shown in FIG.
2, the contents of E [i are compared with the conversion conditions of the aspect semantic conversion dictionary unit 452 by a matching method, the corresponding conversion result of the matching conversion conditions is extracted, and the contents in each E [i of the aspect semantic part E are extracted. Is corrected (S50
0). The target language aspect symbol and co-occurrence element are extracted with reference to the target language aspect symbol dictionary unit 453 using the contents of each E [i of the target language verb classification and aspect meaning part E as search elements, and the target language generation unit Store it in a verb node to be used for generation. This completes the processing of the aspect processing unit.

【００４２】「彼はすでにその本を読んでいる。」とい
う原始言語を例として、図１、図２及び図３を参照しな
がら、本発明のアスペクト処理装置のアスペクト処理を
説明する。上記の原始言語が入力部１００により入力さ
れると、原始言語解析中間構造生成部２００を介して、
原始言語に依存する中間構造が得られる。この中間構造
について変換用辞書部３５０を参照し変換部３００の処
理を行ってから、図１７に示すような中間結果が得られ
る。この後、図２のアスペクト処理部４００の処理に入
る。図２において、まず変数Ａ、Ｂ、Ｃ、Ｄ、Ｅの初期
値を全て０に設定する（Ｓ４０１）。次に、図１７の動
詞「読む」ノードの原始言語の動詞アスペクト分類ＶＪ
及び目的言語の動詞アスペクト分類ＶＣを取り出して、
それぞれ変数Ａ及びＴに設定する。すると、Ａ＝action
（動き）, Ｔ＝activity（動作）になる。入力された文
には一つのアスペクトているしかないので、Ｂ１
＝「ている」となり，Ｂ２，Ｂ３，... ，な
どは空集合になる（Ｓ４２０）。そして上記のＡ及びＢ
１の値により原始言語アスペクト記号辞書部４５０
内を検索し、図４に示すように、下記の「状況」、「解
釈」及び「共起要素」の形式表現による可能アスペクト
意味集合Ｃを取り出す（Ｓ４３０）。Aspects of the aspect processing of the aspect processing apparatus of the present invention will be described with reference to FIGS. 1, 2 and 3 by taking, as an example, the primitive language "He has already read the book." When the above-mentioned source language is input by the input unit 100, the source language analysis intermediate structure generation unit 200
An intermediate structure that depends on the source language is obtained. After performing the processing of the conversion unit 300 with reference to the conversion dictionary unit 350 for this intermediate structure, an intermediate result as shown in FIG. 17 is obtained. Thereafter, the process of the aspect processing unit 400 shown in FIG. 2 is started. In FIG. 2, first, the initial values of variables A, B, C, D, and E are all set to 0 (S401). Next, the verb aspect classification VJ of the primitive language of the verb "read" node in FIG.
And the verb aspect classification VC of the target language
Set to variables A and T, respectively. Then A = action
(Motion), T = activity. Since the input sentence has only one aspect, B1
= “Is”, and B 2, B 3,... Become an empty set (S420). And A and B above
The source language aspect symbol dictionary 450 based on the value of 1
Then, as shown in FIG. 4, a possible aspect semantic set C in the form of the following “situation”, “interpretation”, and “co-occurrence element” is extracted (S430).

【００４３】Ｃ１１＝（Activity, repeative
−）（動作，反復−）Ｃ１２＝（Activity, progressive −）（動作，
進歩的−）Ｃ１３＝（State, experience −）（状態，
経験−）Ｃ１４＝（State, nonprogressive −）（状態，
非進歩的−）Ｃ１５＝（State, perfective −）（状態，
完了−）Ｃ１６＝（Habitual, habitual−）（常習的，
学習的−）そして、入力された文において、動詞以外の要素、例え
ば、「彼」、「本」、「すでに」などの形態素及び品詞
により、原始言語他形態素アスペクト辞書部４５１内を
検索し、図６に示すように、下記のアスペクト意味制限
条件部Ｄを取り出す（Ｓ４４０）。C 11 = (Activity, repeative
−) (Operation, repetition −) C 12 = (Activity, progressive −) (operation,
Progressive-) C13 = (State, experience-) (State,
Experience-) C14 = (State, nonprogressive-) (State,
Non-progressive −) C 15 = (State, perfective −) (State,
Done-) C16 = (Habitual, habitual-)
(Learning-) Then, in the input sentence, the source language other morphological aspect dictionary unit 451 is searched for elements other than the verb, for example, morphemes and parts of speech such as "he", "book", and "already". As shown in FIG. 6, the following aspect meaning restriction condition part D is extracted (S440).

【００４４】Ｄ１＝｛perfective｝（｛完了｝）続けて、可能アスペクト意味を抽出するために、まず変
数ｉの初期値を１に設定してから（Ｓ４５０）、アスペ
クト制限条件部ＤのＤｉ内は空集合であるか否かを
判断する（Ｓ４６０）。アスペクト制限条件部ＤのＤ
ｉ内は空集合ではないと判定すると、同一化演算を行
い（Ｓ４７０）、下記の「状況」、「解釈」及び「共起
要素」の形式表現による可能アスペクト意味集合Ｃが得
られる。D 1 = {perfective} ({completion}) Then, in order to extract the meaning of the possible aspect, first, the initial value of the variable i is set to 1 (S450), and D of the aspect restriction condition part D is set. It is determined whether or not i is an empty set (S460). D of aspect restriction condition part D
If it is determined that i is not an empty set, an equalization operation is performed (S470), and a possible aspect semantic set C in the form of the following “situation”, “interpretation” and “co-occurrence element” is obtained.

【００４５】Ｃ１１＝（State perfective−）
（状態，完了−）Ｃ１２＝（）Ｃ１３＝（）Ｃ１４＝（）Ｃ１５＝（）Ｃ１６＝（）それから、変数ｉを１加算した後（Ｓ４８０）、Ｄ２
は空集合であるか否かの判断を行う（Ｓ４６０）。Ｄ
２は空集合であるので、Ｃｉ１，（ｉ＝１，
…）により、Ｅ内の各Ｅｉを設定する（Ｓ４９
０）。Ｅｉは下記のようになる。C 11 = (State perfective−)
(State, completed-) C 12 = () C 13 = () C 14 = () C 15 = () C 16 = () Then, after adding 1 to the variable i (S480), D 2
Determines whether the set is an empty set (S460). D
2 is an empty set, so C i 1, (i = 1,
..) To set each E i in E (S49).
0). E i is as follows.

【００４６】Ｅ１＝（State perfective−）（状態，完了−）Ｅ２＝（）：：：そして、アスペクト意味変換辞書部４５２を参照し、ア
スペクト意味部Ｅ内のＥ１の内容と一致する変換条
件がないので、変換結果を獲得できないものと判断す
る。このため、Ｅｉを修正しない。目的言語の対応
動詞分類Ｔはactivity（動作）であるので、activity
（動作）及び（State perfective−）（状態，完了−）
により、目的言語アスペクト記号辞書部４５３内を検索
し、「了，−」という変換結果が得られる。そのうちの
「−」は共起要素がない事を示す。目的言語生成部５０
０内を介して上記のアスペクトの処理結果「了，−」に
より、生成用辞書部５５０内を参照し、「他己經讀了那
本書」という中国語文に翻訳するようになる。最後に出
力部６００により翻訳結果を出力する。E 1 = (State perfective−) (State, completed−) E 2 = () ::: Refer to the aspect meaning conversion dictionary unit 452 and match the content of E 1 in the aspect meaning part E. Since there is no conversion condition, it is determined that the conversion result cannot be obtained. Therefore, E i is not modified. Since the corresponding verb classification T of the target language is activity,
(Operation) and (State perfective-)
As a result, the target language aspect symbol dictionary unit 453 is searched, and a conversion result of “Ryo,-” is obtained. "-" Indicates that there is no co-occurrence element. Target language generator 50
With reference to the generation dictionary unit 550 based on the processing result of the aspect described above through “0”, the translation is translated into a Chinese sentence of “Other Self-Reading Rina Books”. Finally, the output unit 600 outputs the translation result.

【００４７】この処理例では、翻訳文として意味内容を
理解する面からわかり易いものを選定したためアスペク
ト意味変換辞書部４５２を利用しなかったので、次にア
スペクト意味変換辞書部４５２の具体的作用について説
明する。図７に示すようにアスペクト意味変換辞書部４
５２の主な作用は、多数のアスペクトを連用するときに
生じる多義性を解決するものである。例えば、日本語文
によくみかけられる「てしまっている」等の動詞と助動
詞を連用する文型はアスペクト意味を解析するときに状
況及び解釈の変換に際しては多義性の問題が起こり易
い。このため、従来以上にアスペクト素性を的確に演算
するために、処理対象の言語について特に演算規則とし
てアスペクト意味変換辞書を作ったものである。本実施
例では、production-type(マッチング条件→変換結果と
いう形をproduction-type と称する）の意味変換規則、
照合演算、置換演算を採用する。なお、この演算規則は
処理対象の言語に応じて特殊のものとする必要がないた
め、実施しやすい上に、翻訳効率がよくなる。従って、
上記「してしまっている」があるような場合にはアスペ
クト意味変換辞書部４５２を利用して、「てしまってい
る」に対応する原始言語変換条件に対応する目的言語ア
スペクト意味を取り出し、これより目的言語の動詞等を
決定すれば、アスペクト意味を適切に処理できる上に、
翻訳品質も向上できる。In this processing example, since the translated sentence was selected to be easy to understand from the aspect of understanding the semantic content, the aspect meaning conversion dictionary unit 452 was not used. Next, the specific operation of the aspect meaning conversion dictionary unit 452 will be described. I do. As shown in FIG.
The main function of 52 is to solve the ambiguity that occurs when multiple aspects are used in series. For example, a sentence pattern in which a verb such as "teshimaru", which is often found in Japanese sentences, and an auxiliary verb are frequently used, the problem of polysemy is likely to occur when converting the situation and interpretation when analyzing the aspect meaning. For this reason, in order to calculate the aspect feature more accurately than in the past, an aspect meaning conversion dictionary is created particularly as a calculation rule for the language to be processed. In this embodiment, the semantic conversion rule of production-type (the form of matching condition → conversion result is referred to as production-type)
Adopt collation operation and replacement operation. Note that this calculation rule does not need to be special according to the language to be processed, so that it is easy to implement and the translation efficiency is improved. Therefore,
In the case where there is the above-mentioned "it has been done", the target language aspect meaning corresponding to the source language conversion condition corresponding to the "it has been made" is taken out using the aspect meaning conversion dictionary unit 452, and this is extracted. If the verb etc. of the target language is determined, the aspect meaning can be properly processed, and
Translation quality can be improved.

【００４８】以下、これについて説明する。たとえば、
「彼は本を読んでしまっている」という例では、図７に
示しているアスペクト意味変換辞書部４５２を参照の
上、（activity,telic,-)(活動, 終了-)＋(activity,st
ate,-)( 活動, 状態-)→(activity,telic,-)( 活動, 終
了,-) という変換規則を翻訳処理につけ加える。さて、
「彼は本を読んでしまっている」という文には、「しま
う」と「ている」という二つのアスペクト記号があるの
で、そのままでは（activity,telic,-）及び(activity,
state,-)の二つの連用のアスペクト意味が得られる。も
しも、この状態でこの二連用のアスペクト意味について
これ以上の処理をしないと、悪くて、不正確な翻訳結果
（″＊″という記号で表示する）となる可能性が大き
い。しかし、この変換規則によって更に処理を行った結
果、下記のように高い品質の翻訳文が得られることとな
る。Hereinafter, this will be described. For example,
In the example of "he has read the book", referring to the aspect meaning conversion dictionary unit 452 shown in FIG. 7, (activity, telic,-) (activity, end-) + (activity, st
Add the translation rule ate,-) (activity, state-) → (activity, telic,-) (activity, end,-) to the translation process. Now,
In the sentence "He has read the book", there are two aspect symbols "Masu" and "Yes", so (activity, telic,-) and (activity,
You can get the meaning of two aspects of state,-). If no further processing is performed on the meaning of this dual aspect in this state, there is a high possibility that a bad and inaccurate translation result (indicated by the symbol “*”) will result. However, as a result of further processing according to this conversion rule, a high quality translated sentence is obtained as described below.

【００４９】（日本語）彼は本を読んでしまっている。（中国語＊）他讀完著書。（activity,telic）＋(activ
ity,state ）（He has been reading the book over.) （中国語）他讀完書。（activity,telic）（英語）He read the book over.(Japanese) He has read a book. (Chinese *) and other complete books. (Activity, telic) + (activ
ity, state) (He has been reading the book over.) (Activity, telic) (English) He read the book over.

【００５０】[0050]

【発明の効果】以上説明したように本発明は文の共起要
素によりアスペクト意味の多義性を解決できることに注
目してなされたものである。このため、各自然言語に対
して、動詞アスペクト分類及び関連の表層（関係する分
の表現の構造）のアスペクト記号により、可能なアスペ
クト意味を解析可能とし、また可能なアスペクト意味辞
書の作成も可能としたものである。また、アスペクトに
影響し得るほかの要素、例えば、副詞、助詞（日本語に
おける純然たる助詞の他に「おいて」等を含み、中国語
における介詞、英語におけるpreposition 等）なども解
析できるし、他形態素アスペクト辞書も作成できる。と
ころで、原始言語アスペクト記号辞書及び原始言語他形
態素アスペクト辞書の二辞書の情報は言語自身の意味や
構文等の情報により作り出され、しかも各国語で独自に
作成されえるため、処理対象言語の変更に従って変更す
る必要もない。また本発明は、多数のアスペクト記号を
連用して多数のアスペクト意味の組み合わせが生じたと
きには、各言語間のアスペクト表現の差異を参照するた
めのアスペクト変換辞書を参照することにより各言語共
通のアスペクト表現を採用することとしたため、上記ア
スペクト意味変換辞書の言語による特有性がなくなる。
すなわち、原始言語と目的言語を変えても、このアスペ
クト意味変換辞書を変更する必要がない。このため、ｎ
種類言語用の機械翻訳装置を作る場合には、ｎ種類の可
能アスペクト意味辞書、他形態素アスペクト辞書、及び
ｎ（ｎ−１）／２のアスペクト変換辞書を準備すればよ
いこととなる。すなわち、従来の各言語に依存する演
算、解釈及び変換規則の問題点をいっぺんに若しくは共
通に解決することが可能となる。As described above, the present invention has been made by paying attention to the fact that the polysemy of the aspect meaning can be solved by the co-occurrence element of the sentence. Therefore, for each natural language, the possible aspect meaning can be analyzed by the verb aspect classification and the aspect symbol of the related surface (the structure of the related expression), and the possible aspect meaning dictionary can be created. It is what it was. It can also analyze other elements that can affect the aspect, such as adverbs, particles (including "OK" in addition to pure particles in Japanese, interposition in Chinese, preposition in English, etc.), Other morphological aspect dictionaries can also be created. By the way, the information of two dictionaries, a source language aspect symbol dictionary and a source language and other morphological aspect dictionary, is created based on information such as the meaning and syntax of the language itself, and can be created independently in each language. No need to change. Further, when a large number of combinations of aspect meanings are generated by using a large number of aspect symbols consecutively, the present invention refers to an aspect conversion dictionary for referring to a difference in aspect expression between languages, thereby providing an aspect common to each language. Since the expression is adopted, the specificity of the aspect meaning conversion dictionary in the language is lost.
That is, even if the source language and the target language are changed, it is not necessary to change the aspect meaning conversion dictionary. Therefore, n
In the case of making a machine translation device for a type language, n types of possible aspect meaning dictionaries, other morphological aspect dictionaries, and n (n-1) / 2 aspect conversion dictionaries may be prepared. That is, it is possible to solve the problems of the conventional arithmetic, interpretation and conversion rules depending on each language at once or in common.

【００５１】具体的には下記の効果がある。（１）アスペクト意味も考慮しているので、機械翻訳の
質が向上する。（２）アスペクトを処理する際に、目的言語のアスペク
ト記号及び関連共起要素をも同時に生成可能であるた
め、各言語間のアスペクトの差異を解決しえ、機械翻訳
の質が向上する。Specifically, the following effects are obtained. (1) Since the meaning of the aspect is also taken into consideration, the quality of machine translation is improved. (2) When processing aspects, aspect symbols and related co-occurrence elements of the target language can be generated at the same time, so that differences in aspects between languages can be resolved, and the quality of machine translation is improved.

【００５２】（３）アスペクト意味を一つの共通の形式
表現により表わすため、アスペクト意味を解析する際
に、必要とする関連データは翻訳の対象となる言語に無
関係に設定可能であり、このため、処理対象言語のアス
ペクト記号辞書及びアスペクト意味変換辞書を変更する
ことにより容易に他の言語に応用できる。更に、参照用
すべき辞書及び規則の数が減少し、翻訳装置も安価とな
る。(3) Since the meaning of an aspect is represented by one common formal expression, the related data required when analyzing the aspect meaning can be set regardless of the language to be translated. By changing the aspect symbol dictionary and aspect meaning conversion dictionary of the processing target language, it can be easily applied to other languages. Furthermore, the number of dictionaries and rules to be referred to is reduced, and the translator is less expensive.

【００５３】（４）機械翻訳の質が大幅に向上するた
め、人による後編集の必要性が低下する。このため、機
械翻訳の自動性、翻訳作業の効率も向上する。(4) The need for manual post-editing is reduced because the quality of machine translation is greatly improved. For this reason, the automaticity of machine translation and the efficiency of translation work are improved.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に係るアスペクト処理装置の一実施例の
構成図である。FIG. 1 is a configuration diagram of an embodiment of an aspect processing apparatus according to the present invention.

【図２】上記実施例におけるアスペクト処理装置の動作
流れ図である。FIG. 2 is an operation flowchart of the aspect processing apparatus in the embodiment.

【図３】上記実施例におけるアスペクト処理装置の動作
流れ図である。FIG. 3 is an operation flowchart of the aspect processing apparatus in the embodiment.

【図４】上記実施例における原始言語アスペクト記号辞
書部内の辞書のデータ構造の様子を示す図である。FIG. 4 is a diagram showing a state of a data structure of a dictionary in a source language aspect symbol dictionary unit in the embodiment.

【図５】上記実施例における目的言語アスペクト記号辞
書部の辞書のデータ構造の様子を示す図である。FIG. 5 is a diagram showing a state of a data structure of a dictionary of a target language aspect symbol dictionary unit in the embodiment.

【図６】上記実施例における原始言語他形態素アスペク
ト辞書のデータ構造の様子を示す図である。FIG. 6 is a diagram showing a data structure of a source language / other morphological aspect dictionary in the embodiment.

【図７】上記実施例におけるアスペクト変換辞書部の辞
書のデータ構造の様子を示す図である。FIG. 7 is a diagram showing a state of a data structure of a dictionary of an aspect conversion dictionary unit in the embodiment.

【図８】従来技術に係るアスペクト処理装置の動作流れ
図である。FIG. 8 is an operation flowchart of an aspect processing apparatus according to the related art.

【図９】従来技術に係る動詞アスペクト分類用辞書の構
成の様子を示す図である。FIG. 9 is a diagram showing a configuration of a dictionary for verb aspect classification according to the related art.

【図１０】従来技術に係るアスペクトの八つの素性の説
明図である。FIG. 10 is an explanatory diagram of eight features of an aspect according to the related art.

【図１１】従来技術に係るアスペクト記号意味辞書の構
成の様子を示す図である。FIG. 11 is a diagram showing a configuration of an aspect symbol meaning dictionary according to the related art.

【図１２】従来技術に係るアスペクト処理装置の動作の
基本的構成を示す図である。FIG. 12 is a diagram showing a basic configuration of an operation of an aspect processing apparatus according to the related art.

【図１３】従来技術に係る解釈規則辞書の構成の様子を
示す図である。FIG. 13 is a diagram showing a configuration of an interpretation rule dictionary according to the related art.

【図１４】従来技術に係るる変換規則辞書の構成の様子
を示す図である。FIG. 14 is a diagram illustrating a configuration of a conversion rule dictionary according to the related art.

【図１５】本発明に係る状況分類及び関連解釈の内容を
示す図である。FIG. 15 is a diagram showing contents of situation classification and related interpretation according to the present invention.

【図１６】本発明に係るアスペクト解釈分類の内容を示
す図である。FIG. 16 is a diagram showing contents of an aspect interpretation classification according to the present invention.

【図１７】本発明の上記実施例における原始言語解析結
果の木構造を示す図である。FIG. 17 is a diagram showing a tree structure of a source language analysis result in the embodiment of the present invention.

【図１８】中国語文及び日本文における中間構造の例で
ある。FIG. 18 is an example of an intermediate structure in a Chinese sentence and a Japanese sentence.

【符号の説明】[Explanation of symbols]

１００入力部２００原始言語解析中間構造生成部２５０解析用辞書３００変換部４００アスペクト処理部４５０原始言語アスペクト記号辞書４５１原始言語他形態素アスペクト辞書４５２アスペクト意味変換辞書４５３目的言語アスペクト記号辞書５００目的言語生成部５５０生成用辞書６００出力部７００バッファ Reference Signs List 100 input unit 200 source language analysis intermediate structure generation unit 250 analysis dictionary 300 conversion unit 400 aspect processing unit 450 source language aspect symbol dictionary 451 source language other morphological aspect dictionary 452 aspect meaning conversion dictionary 453 target language aspect symbol dictionary 500 target language generation Unit 550 generation dictionary 600 output unit 700 buffer

Claims

(57)【特許請求の範囲】(57) [Claims]

【請求項１】目的言語のアスペクト記号を求めるため
に、原始言語の文アスペクト情報に基づいて、原始言語
の動詞アスペクト分類、アスペクト記号並びに「状
況」、「解釈」及び「共起要素」の形式表現による可能
アスペクト意味を登録している原始言語アスペクト記号
辞書部と、目的言語の動詞アスペクト分類、アスペクト
記号並びに「状況」、「解釈」及び「共起要素」の形式
表現による可能アスペクト意味解釈を登録している目的
言語アスペクト記号辞書部と、動詞及びアスペクト記号
のほかにアスペクト意味に影響し得る形態素、品詞及び
「可能解釈ｉ」（ここに、ｉ＝１，２，３，…と可能な
解釈の数だけ採りえる整数）の形式表現による可能アス
ペクト解釈を登録している原始言語他形態素アスペクト
辞書部と、原始言語の変換条件のアスペクト意味組み合
わせと目的言語のアスペクト意味組み合わせとの対で
「状況」、「解釈」及び「共起要素」の形式表現による
アスペクト意味変換規則を登録しているアスペクト意味
変換辞書部と、解析木における動詞ノードの動詞アスペ
クト分類及びアスペクト記号を検索要素として、前記原
始言語アスペクト記号辞書部内を参照し、すべての可能
な「状況」、「解釈」及び「共起要素」アスペクト意味
組み合わせを取り出す原始言語アスペクト記号情報抽出
部と、動詞及びアスペクト記号以外の形態素、品詞を検
索要素として、前記原始言語他形態素アスペクト辞書部
内を参照し可能アスペクト意味制限の組み合わせを取り
出すアスペクト意味制限条件抽出部と、上記取り出され
たアスペクト意味制限条件に基づいて同一化手法により
上記の可能アスペクト意味の組み合わせの中からありえ
ないアスペクト意味を削除する同一化部と、同一化され
た可能アスペクト意味組み合わせを検索要素として前記
アスペクト意味変換辞書部内を参照し目的言語のアスペ
クト意味組み合わせを獲得する変換部と、目的言語の動
詞分類及び上記のアスペクト意味組み合わせを検索要素
として前記目的言語アスペクト記号辞書部内を参照し、
対応する目的言語アスペクト記号及び共起要素を獲得す
る目的言語アスペクト記号抽出部とを有することを特徴
とするアスペクト処理装置。Claims: 1. To determine an aspect symbol of a target language, a verb aspect classification of the source language, an aspect symbol, and a format of "situation", "interpretation" and "co-occurrence element" based on sentence aspect information of the source language A source language aspect symbol dictionary that registers the possible aspect meanings by expressions, and verb aspect classifications of the target language, aspect symbols, and possible aspect meaning interpretations by formal expressions of "situation", "interpretation" and "co-occurrence element" The registered target language aspect symbol dictionary part, morphemes, parts of speech, and “possible interpretations i” (where i = 1, 2, 3,. The source language and other morphological aspect dictionary part that registers the possible aspect interpretation by the formal expression of the number of interpretations An aspect semantic conversion dictionary unit that registers an aspect semantic conversion rule in the form of "situation", "interpretation", and "co-occurrence element" in a pair of an aspect semantic combination of a conversion condition and an aspect semantic combination of a target language; As a search element, the verb aspect classification and the aspect symbol of the verb node in the parse tree are referred to in the source language aspect symbol dictionary section, and all possible "situation", "interpretation" and "co-occurrence element" aspect semantic combinations are extracted. A source language aspect symbol information extracting unit, an aspect semantic restriction condition extracting unit that extracts combinations of possible aspect semantic restrictions by referring to the source language other morphological aspect dictionary unit as morphemes and parts of speech other than verbs and aspect symbols as search elements, Same based on the extracted aspect semantic restriction condition An identification unit that deletes an impossible aspect meaning from the above combinations of possible aspect meanings according to the method, and an aspect meaning combination of the target language by referring to the aspect meaning conversion dictionary unit as a search element using the identified possible aspect meaning combinations as search elements A conversion unit for acquiring the target language verb classification and the above-mentioned aspect semantic combination as a search element in the target language aspect symbol dictionary unit,
An aspect processing apparatus comprising: a target language aspect symbol extraction unit that acquires a corresponding target language aspect symbol and a co-occurrence element.