JPH0383167A

JPH0383167A - Natural language processing

Info

Publication number: JPH0383167A
Application number: JP2205324A
Authority: JP
Inventors: Karen Jensen; カレン・ジヤンセン
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 1989-08-16
Filing date: 1990-08-03
Publication date: 1991-04-09
Also published as: EP0413132A3; EP0413132A2; US5146406A

Abstract

PURPOSE: To attain an improved data processing for deciding the basic meaning structures of a sentence by analyzing input characters by a syntax rule, and identifying a first word group in a character string as a meaning argument when the meaning argument correctly corresponds to a syntax argument. CONSTITUTION: An input sentence in the form of a word character string is grammar-analyzed by a rule constituting elements accessible to morphology information and syntax information related with a word, and the argument structures of the input sentence are formed. Not one part of an objective language but all process are applied. Next, the meaning argument which is easily identified is identified by a syntax reference. Then, an analyzed segment which is not the argument of a main verve is reexamined, a find 1 adjustment is added to record structures, an unnecessary attribute is eliminated, and the record structures are generally cleaned up before moving to the next constituting element of the system. Thus, the outputted record structures can be used by a machine translation system for exactly translating a certain natural language in another language.

Description

【発明の詳細な説明】Ａ、産業上の利用分野本明細書に記載された本発明は、自然言語処理（ＮＬＰ
）のためのデータ処理方法に関し、具体的には文の基本
意味構造を決定するための改良されたデータ処理方法に
関する。DETAILED DESCRIPTION OF THE INVENTION A. Industrial Application Field The present invention described herein is applicable to natural language processing (NLP).
), and specifically relates to improved data processing methods for determining the basic semantic structure of sentences.

Ｂ、従来の技術自然言語テキストは、述部とそれらの引き数から成る命
題（ｐｒｏｐｏｓｉｔｉｏｎ　）群から構成されると言
える。述部の例は動詞であり、その引き数の例は関連す
る名詞または名詞句である。たとえば、Ｊｏｈｎ　１０
ｖｅｓ　Ｍａｒｙ。B. Conventional Technology A natural language text can be said to be composed of a group of propositions consisting of predicates and their arguments. An example of a predicate is a verb, and an example of its argument is a related noun or noun phrase. For example, John 10
ves Mary.

という文には、動詞″１０ｖｅｓ　”を述部とする。１
つの命題がある。”　１０ｖｅｓ″はこの命題中で、”
　Ｊｏｈｎ　”と”　）ｌａｒｙ″の２つの引き数をも
つ。コンピュータ・システムが自然言語を理解するには
、述部と引き数群を正確に識別できなければならない。In the sentence, the verb "10ves" is used as a predicate. 1
There are two propositions. "10ves" is in this proposition, "
It has two arguments: ``John'' and ``)lary.'' In order for a computer system to understand natural language, it must be able to accurately identify predicates and argument groups.

上記のような単純な文では、これは難しくない６　（上
記の”Ｊｏｈｎ　１０ｖｅｓ　Ｉ（ａｒｙ、　”のよう
に）英語の動詞にその引き数が直接接している場合、コ
ンピュータ文法で動詞に適切な引き数を割り当てるのは
比較的容易である。しかし、実生活上のテキストでよく
現れるようなより複雑な文では、この作業はずっと難し
くなる。引き数がその動詞に密接してないとき、難しい
問題が発生する。In a simple sentence like the one above, this is not difficult.6 If the argument is directly adjacent to an English verb (as in “John 10ves I(ary,” above)), computer grammar will tell you the appropriate verb for the verb. Assigning arguments is relatively easy. However, in more complex sentences, such as those that often occur in real-life texts, this task becomes much more difficult. It is difficult when the argument is not closely related to its verb. A problem occurs.

実際には、引き数が文の上で全く見えなくなることもあ
るが、人間がするように、プログラムがそれを推論しな
ければならない。たとえば、Ｍａｒｙ　ｗａｓ　ｋｉｓ
ｓｅｄ。In reality, the argument may not be visible at all in the statement, but the program has to reason about it, just as a human would. For example, Mary was kissing
sed.

この文では、動詞”　ｋｉｓｓｅｄ　”の唯一の目に見
える引き数は”Ｍａｒｙ”である。しかし、人間はキス
をした人に対応する別の引き数を推論することができる
。もう１つの関連する状況は次のような文に現れる。In this sentence, the only visible argument for the verb "kissed" is "Mary." However, humans can infer another argument that corresponds to the kisser. Another related situation occurs in sentences such as:

Ｗｈｏ　ｄｉｄ　Ｍａｒｙ　ｔｈｉｎｋ　ｔｈａｔ　Ｐ
ｅｔｅｒ　５ａｉｄ　ｔｈａｔＪｏｈｎ　ｋｉｓｓｅｄ
？上記の例でも、動詞の”　ｋ　１ｓｓｅｄ　”に２つの
引き数がある。”　Ｊｏｈｎ″は動詞のすぐ傍にあるが
、第２の引き数である”ｗｈｏ″は動詞から離れている
。Who did Mary think that P?
eter 5aid that John kissed
? In the example above, the verb "k 1ssed" also has two arguments. "John" is right next to the verb, but the second argument, "who", is far away from the verb.

したがって、問題は、見えないものや離れた位置にある
ものを含めて、すべての引き数をそれらの述部と正しく
連係することである。Therefore, the problem is to correctly associate all arguments with their predicates, including invisible and remote ones.

述部−引き数構造を識別する問題、特に、”Ｗｈ。The problem of identifying predicate-argument structures, especially "Wh.

ｄｉｄ　Ｍａｒｙ　ｔｈｉｎｋ　ｔｈａｔ　Ｐｅｔｅｒ
　５ａｉｄ　ｔｈａｔ　Ｊｏｈｎｋｉｓｓｅｄ？”にお
けるような「遠く離れた依存関係」を正しく指定するＨ
Ｈは、言語学及びコンピュータ言語の文献で周知である
。これを実施するための２つの主要な方法が記載されて
いる。Did Mary think that Peter
5aid that Johnkissed? H to correctly specify “distant dependencies” such as in “
H is well known in the linguistics and computer language literature. Two main ways to do this have been described.

＊「空範嗜Ｊ　　（ＥＣ）法＊「機能的不確定性Ｊ　　（ＦＵ）法ＥＣ法は、支配結合（ＧＢ）学派及び一般化句構造文法
（ＧＰＳＧ）学派の言語学者によって主張されている。＊The ``Functional Uncertainty J (EC) Method'' ＊The ``Functional Uncertainty J (FU) Method'' The EC method is advocated by linguists of the Dominant Bond (GB) school and the Generalized Phrase Structure Grammar (GPSG) school. .

（Ｐ、セル（Ｓｅｌｌ）著「現代統辞法理輪講義（Ｌｅ
ｃｔｕｒｅｓ　ｏｎ　Ｃｏｎｔｅ＋＋ｐｏｒａｒｙＳｙ
ｎｔａｃｔｉｃ　Ｔｈｅｏｒｆｅｓ）　Ｊ　、ＣＳ　Ｌ
　Ｉ　Ｎ　スタンフォード大学、米国カリフォルニア州
スタンフォード、１９８５年刊）。この方法は、その文
が最も中立的な形にある場合に転置された構成要素があ
るはずの場所に、空のスロットを含む解析構造を使用す
る。たとえば、”Ａｌ１ｃｅ、　Ｐｅｔｅｒ　５ａｉｄ
　ｔｈａｔＪｏｈｎ　　ｋｉｓｓｅｄ、　にＰｅｔｅｒ
　５ａｉｄ　ｔｈａｔ　Ｊｏｈｎ　ｋｉｓｓｅｄＡｌｉ
ｃｅ、）　”という文は、動詞″ｋｉｓｓｅｄ”のすぐ
後に「空範曙」または「トレースＪ　　（’“ｃＩＩで
表ス）をもつと想像される。というのは、それが、名詞
句”Ａｌ１ｃｅ”が属する場所であるからである。こう
した方向に沿って＋ｍ築されたコンピュータ文法は、実
際にその解析構造または氷中に空スロットを指定する（
第１Ａ図参照）。(P. Sell, “Lectures on Modern Syntactic Law”
tures on Conte++porarySy
ntactic Theorfes) J, C.S.L.
I N Stanford University, Stanford, California, USA, 1985). This method uses a parsing structure that contains empty slots where the transposed components would be if the sentence were in its most neutral form. For example, “Al1ce, Peter 5aid
that John kissed, Peter
5aid that John kissedAli
The sentence ``ce, )'' is imagined to have ``empty day dawn'' or ``trace J'' (expressed in cII) immediately after the verb ``kissed.'' Because that is where the noun phrase "Al1ce" belongs. A computer grammar built along these lines actually specifies empty slots in its analytic structure or ice (
(See Figure 1A).

ＦＵ法は、語負機能文法（ＬＦＧ）理論を信奉する言語
学者によって主張されている。この方法は、解析木中の
空スロットではなく、すべての動詞の特徴（「特徴」と
は主として動詞が持たなければならない必要な引き数の
数と種類を指す）を、文の左から右へと順次評価して、
転置された構成要素が最もうまくはまる場所を見つける
ものである。うまくあてはまる引き数構造を計算するた
めに、形式的表記装置がＬＰＧ文法記述言語に追加され
た。Ｒ，Ｍ、カブラン（Ｋａｐｌａｎ）　、Ａ、ツェー
ネン（Ｚａｅｎｅｎ　）の論文「遠隔依存関係、句構造
、および機能的不確定性（Ｌｏｎｇ−ｄｉｓｔａｎｃｅ
Ｄｅｐｅｎｄｅｎｃｉｅｓ、　Ｃｏｎ５ｔｉｔｕｅｎｔ
　５ｔｒｕｃｔｕｒｅ、　ａｎｄＦｕｎｃｔｉｏｎａｌ
　Ｕｎｃｅｒｔａｉｎｔｙ）　Ｊ　１Ｍ、パルティン（
Ｂａｌｔｉｎ）　、Ａ、クロッホ（Ｋｒｏｃｈ　）共編
「句構造の代替概念（Ａｌｔｅｒｎａｔｉｖｅ　Ｃｏｎ
ｃｅｐｔｉｏｎｓ　ｏｆＰｈｒａｓｅ　５ｔｒｕｃｔｕ
ｒｅ）　Ｊ　、シカゴ大学出版社、１９８７年刊）。こ
うした方向に沿って構築されたコンピュータ言語は、こ
の装置を使用して、その文法規則で、見えない引き数を
割り当てるべき場所を指定する。The FU method is advocated by linguists who believe in word-negative functional grammar (LFG) theory. This method uses all verb features (“features” primarily refer to the number and type of required arguments the verb must have), rather than empty slots in the parse tree, from left to right in the sentence. Evaluate sequentially,
It finds where the transposed components fit best. A formal notation was added to the LPG grammar description language to compute well-fitting argument structures. R, M. Kaplan, A. Zaenen's paper “Long-distance dependencies, phrase structure, and functional uncertainty (Long-distance
Dependencies, Con5tituent
5structure, andFunctional
Uncertainty) J 1M, Partin (
Baltin), A. and Kroch (eds.), “Alternative Conception of Phrase Structure”
ceptions of Phrase 5tructu
re) J, University of Chicago Press, 1987). Computer languages built along these lines use this device to specify in their grammar rules where invisible arguments should be assigned.

本発明の方法は、上記のどちらの方法とも異なっている
。本方法は、ＥＣ法とは次の点で異なる。The method of the present invention is different from both of the methods described above. This method differs from the EC method in the following points.

ａ、どんな種類の空［４またはトレースも使用しない。a, Do not use any kind of empty [4 or trace.

ｂ、構成要素、木、構造に余り依存せず、構文解析によ
って供給されるあらゆる種類の情報を使用する。b. relies less on component, tree, structure and uses all kinds of information provided by parsing.

本方法は、ＦＵ法とは次の点で異なる。This method differs from the FU method in the following points.

ａ、使用するプログラミング言語によってすでに供給さ
れたちの以外のどんな特別の表記装置も使用しない。a. Do not use any special notation devices other than those already provided by the programming language you are using.

５１文中の動詞の特徴（いわゆる「機能情報」）に完全
には依存せず、構文解析によって供給されるあらゆる種
類の情報を使用する。51 does not rely entirely on the features of the verb in the sentence (so-called "functional information"), but uses all kinds of information provided by the parsing.

本方法は、構文解析が完了した後に引き数充填を実行す
るという点で上記のどちらの方法とも異なっている。本
方法は、解析構成要素自体ではなく、ポストプロセッサ
を使用して、構文属性値情報の全領域を操作して、最も
妥当な引き数構造を誘導するものである。This method differs from both of the above methods in that it performs argument filling after parsing is complete. The method uses a post-processor, rather than the parsing component itself, to manipulate the gamut of syntactic attribute value information to derive the most reasonable argument structure.

本方法と、言語理論から生まれたＮＬＰシステムの方法
のもう一つの違いは、後者のシステムの大部分が、論理
プログラミング言語によって供給されたような、ある形
式の統一化を使用していることである。統一化により、
属性値構造の自動突合せが可能になるが、それは、否定
および分離の条件をうまく処理できないなど、いくつか
の欠点をもつ。本方法は、手順式ポストプロセッサを使
用するので、こうした欠点を持たない。Another difference between our method and those of NLP systems born from language theory is that most of the latter systems use some form of unification, such as that provided by logic programming languages. be. Due to unification,
Although automatic matching of attribute value structures is possible, it has some drawbacks, such as not handling negation and disjunction conditions well. The present method does not have these drawbacks because it uses a procedural post-processor.

本方法はきわめて効率的である。ポストプロセッサによ
ってシステムの動作時間は大して増加しない。さらに、
初期解析構成要素は完全に定義域に依存しているので、
システム全体できわめて広い範囲の英語をカバーするこ
とができる。This method is extremely efficient. Postprocessors do not significantly increase system operating time. moreover,
Since the initial analysis components are completely domain dependent,
The entire system can cover an extremely wide range of English.

ＥＣ法とＦＵ法は現在有力な言語理論であるが、どちら
も、今日のＮＬＰ技術を使用する応用分野で広く採用さ
れてはいない。英文テキストの意味解析を含む従来技術
の応用例は、一般に何らかの形の辞書主導の引き数識別
を使用しているが、ＥＣまたはＦＵの技術または形式を
必ずしも採用していない。Although the EC method and the FU method are currently dominant linguistic theories, neither has been widely adopted in today's applied fields using NLP techniques. Prior art applications involving semantic analysis of English text generally use some form of dictionary-driven argument identification, but do not necessarily employ EC or FU techniques or formats.

従来技術による英文のテキストの意味処理の方法は、コ
ンピュータ言語学会第２５回年会報文集（ｔｈｅ　Ｐｒ
ｏｃｅｅｄｉｎｇｓ　ｏｆ　ｔｈｅ　２５ｔｈ　Ａｎｎ
ｕａｌ　Ｍｅ゛ｅｔｉｎｇｏｆ　ｔｈｅ　Ａｓ５ｏｃｉ
ａｔｉｏｎ　ｆｏｒ　ＣｏｍｐｕｔａｔｉｏｎａｌＬｉ
ｎｇｕｉｓｔｉｃｓ）　、スタンフォード大学、１９８
７年７月−９月、ｐｐ、１３１−１３４に記載されてい
る。そこに記載された方法について、以下に簡単に説明
する。この従来技術のシステムは、単一の意味定義域、
すなわち、米海軍の船舶で使用される特定の種類の機械
類の故障報告を処理するように設計されている。この定
義域に属する英文が入力されると、システムは、文の意
味解析を行ない、その意味解析を、いくつの引き数をそ
の文の動詞に関連づけることができるか、及びそれがど
んな種類の引き数かを指定する、基礎フォーマットまた
はテンプレートにマツプする。（１）必須、（２）基本
、（３）非基本の３つの異なる引き数クラスが定義され
ている。必須引き数は構文解析で必ず存在しなければな
らず、存在しないとその解析は失敗する。基本引き数は
、構文中に必ずしも存在する必要はないが、それがない
場合、システムはその役割を果たすいくつかの「最良の
推測」の候補を仮説として取り上げる。したがって、基
本引き数及び必須引き数は、最終的には文の意味構造中
に存在する。非基本引き数は存在してもしなくてもよい
。A method of semantic processing of English text using conventional technology is described in the 25th Annual Report of the Association for Computer Linguistics (the Pr
oceedings of the 25th Ann
ual Meeting of the As5oci
ation for ComputationalLi
nguistics), Stanford University, 198
Published in July-September 7, pp. 131-134. The method described therein will be briefly explained below. This prior art system has a single semantic domain,
That is, it is designed to handle failure reports for certain types of machinery used on U.S. Navy vessels. When an English sentence belonging to this domain is input, the system performs a semantic analysis of the sentence and converts the semantic analysis into determining how many arguments can be associated with the verb in the sentence and what kind of arguments they are. map to a base format or template. Three different argument classes are defined: (1) mandatory, (2) basic, and (3) non-basic. Required arguments must be present in the parse, or the parse will fail. The base argument does not necessarily have to be present in the construct, but if it is absent, the system hypothesizes some "best guess" candidate for the role. Therefore, basic arguments and required arguments ultimately exist in the semantic structure of a sentence. Non-primary arguments may or may not be present.

たとえば、”Ｐｕｍｐ　ｆａｉｌｅｄ、　　（ポンプ故
障）という入力文が与えられると、構文解析の結果は、
主動詞として”ｆａｉｌｅｄ″が与えられ、その構文上
の主語としてｐｕｍｐ　”が与えられるはずである。For example, given the input sentence ``Pump failed,'' the parsing result is
"failed" should be given as the main verb, and "pump" should be given as its syntactic subject.

動詞”ｆａｉｌ″１の基礎テンプレートは、それがＦＡ
ＴＩ　ＥＮＴと呼ばれる１つの引き数を持つことを示す
はずである。次に、マツピング規則は、ｎｐｕｖＡｐｎ
がＰＡＴＩ　ＥＮＴ引き数の良い候補である（引き数は
「役割」とも呼ばれる）とする。次に、制限があるかど
うか検査する。動詞″ｆａｉｌ”では、ＰＡＴＩＥＮＴ
の役割を果たすものは機械装置でなければならないとい
う制限がある。（−般に、こうした情報は、辞書の名詞
″ｐｕｍｐ″の項目に記載のマーク、すなわち＋ＭＥＣ
Ｈで示される。）　　ｐｕｍｐ”は機械装置として合格
しているので、この引き数構造は完成する。すなわち、
”ｆａｉｌｅｄ”は１つの引き数としてそのＰＡＴ、Ｉ
ＥＮＴをもち、これはｐｕｍｐ　”で充填される。The basic template for the verb “fail”1 is that it is FA
It should show that it has one argument called TI ENT. Then the mapping rule is npuvApn
is a good candidate for the PATI ENT argument (the argument is also called a "role"). Next, check to see if there are any restrictions. With the verb "fail", PATIENT
There is a restriction that the device that fulfills this role must be a mechanical device. (-In general, such information is provided by the mark found in the entry for the noun "pump" in the dictionary, i.e. +MEC
Indicated by H. ) pump” passes as a mechanical device, so this argument structure is complete, i.e.
"failed" takes as one argument its PAT, I
ENT, which is filled with pump''.

しかし、この従来技術の引き数充填法には、次のような
いくつかの問題点がある。However, this conventional argument filling method has several problems as follows.

第一に、単語がとり得る可能な意味が、当該の定義域に
属するものだけに厳しく限定される。たとえば、動詞”
ｆａｉｌ　”は、次のような文に関連する意味をもつこ
とができる。First, the possible meanings that a word can have are strictly limited to those belonging to the domain in question. For example, the verb “
"fail" can have meanings associated with sentences such as:

Ｔｈｅ　ｅｑｕｉｐｍｅｎｔ　ｆａｉｌｅｄ。The equipment failed.

土窯では、この動詞は１つの必須引き数（”　ｅｑｕｉ
ｐｍｅｎｔ″）をとる。しかし、このシステムは次のよ
うな文中の動詞″ｆａＬＩｅｄ“を解釈できない。In clay kilns, this verb takes one required argument (“equi
However, this system cannot interpret the verb "faLIed" in the following sentence.

Ｈｉｓ　ｃｏｕｒａｇｅ　ｆａｉｌｅｄ　ｈｉｍ。His course failed him.

Ｔｏｄａｙ　　Ｉ　　ｔｏｏｋ　ｔｈｅ　ｃｈｅｍｉｓ
ｔｒｙ　ｅｘａｍ　ａｎｄ　ｆａｉｌｅｄｍｅ　ａ　ｖ
ｈｏｐｐｅｒ！このシステムは、狭く定義された部分定義域内ではこの
ような文は通常現れないことに依拠している。しかし、
人間による言葉の使い方は予測できないものであり、動
詞”ｆａｉｌ″が海軍船舶の報告で上記のような意味で
は絶対に使用されないとの保証はない。このシステムで
このような文ヲ扱う唯一の方法は、”ｆａｉｌ”に対す
る追加のテンプレートを用いるものであろう。しかし、
追加のテンプレートを用いると、構文解析構成要素に多
くの困難が生じることがある。Today I took the chemis
try exam and failed me av
hopper! This system relies on the fact that such statements do not normally occur within narrowly defined subdomains. but,
Human language usage is unpredictable, and there is no guarantee that the verb "fail" will never be used in the above sense in a naval vessel report. The only way to handle such statements in this system would be to use an additional template for "fail." but,
Using additional templates can introduce many difficulties to the parsing component.

第二に、各動詞ごとに、必須、基本、非基本の３つの引
き数クラスを分離する必要があるために処理が複数にな
る。必須引き数の数は、ある動詞の意味が異なるごとに
変わるので、特定の狭い意味論上の部分定義域の範囲内
でさえ、所与の動詞がいくつの意味をとり得るかを正確
に指定することは非常に難しい。Second, for each verb, there is a need to separate the three argument classes: essential, basic, and non-basic, resulting in multiple processes. Because the number of required arguments changes for different meanings of a verb, it is difficult to specify exactly how many meanings a given verb can have, even within a particular narrow semantic subdomain. very difficult to do.

第三に、充填子が「最良の推測」仮説にすぎない場合で
さえすべての基本引き数を充填する必要があるために、
システムの流れが妨げられる。構文構造中に必須引き数
及び基本引き数、のりストで必要とされるより少しの引
き数しか存在しない場合、その文の文法解析を成功させ
るために、しばしば、システムを停止し、バックアップ
し、再度試行する必要がある。Third, because the filler needs to pad all base arguments even when it is only a "best guess" hypothesis,
System flow is obstructed. If there are fewer arguments than required and basic arguments in a syntactic structure, and fewer arguments than are required in the list, the system often needs to be stopped, backed up, and Need to try again.

第四に、この従来技術のシステムでは、上記の「遠隔依
存関係」など最も巧妙な種類の引き数充填にほとんどま
たは全く関心が払われていない。Fourth, this prior art system pays little or no attention to the most sophisticated types of argument filling, such as the "remote dependencies" mentioned above.

このシステムは、そのような複雑な構造が狭い部分定義
域で現れるとは予想されないということに依拠している
。しかし、自然言語がフレキシブルな性格であるとすれ
ば、この予想は完全に安全なものではない。This system relies on the fact that such complex structures are not expected to appear in narrow subdomains. However, given the flexible nature of natural languages, this conjecture is not completely safe.

上記の理論的引き数充填手法（ＥＣとＦＵ）は、自然言
語の複雑さを処理するものであるが、それ自体が複雑で
あるため、実際の応用例で使用するのは難しい。これら
の従来技術の応用例は、実世界で意味論上の部分定義域
の範囲内で利用できるが、自然言語の複雑さをすべて処
理する技術を提供するものではなく、シたがってその適
用範囲が限られている。The above theoretical argument filling techniques (EC and FU) deal with the complexity of natural language, but their complexity makes them difficult to use in practical applications. Although these prior art applications can be used in the real world within a semantic subdomain, they do not provide a technique that handles all the complexities of natural language, and therefore their scope is limited. is limited.

米国特許第４７３１７３６号明細書に、本明細書で開示
する発明が実施できるデータ処理システムが開示されて
いる。上記の開示を、本明細書に開示する発明の背景技
術として、引用により本明細書に合体する。U.S. Pat. No. 4,731,736 discloses a data processing system in which the invention disclosed herein may be implemented. The above disclosure is incorporated herein by reference as background to the invention disclosed herein.

Ｃ０発明が解決しようとする課題本発明の目的は、英文の引き数構造または基本意味構造
を決定するための改良された方法を提供することである
。OBJECTS OF THE INVENTION It is an object of the present invention to provide an improved method for determining the argument structure or basic semantic structure of English sentences.

本発明の他の目的は、制限された意味定義域に関連する
文だけでなく、英語の任意の入力文に対する引き数構造
を決定する際に成功の確率が高くなるように、広い範囲
の英語をカバーすることである。Another object of the present invention is to provide a method for a wide range of English languages, so that the probability of success is high in determining the argument structure for any input sentence in English, not just sentences related to a restricted semantic domain. It is to cover.

本発明の他の目的は、属性値データ構造に照会し、それ
に追加し、かつそれから差し引くことによって完全に動
作し、ある動詞（または任意の語）を理解できるように
するにはそれがいくつの引き数をもたなければならない
かを指定する任意の述部引き数テンプレートによって制
限されない、手順と結合した、完全にモジュール式の方
法を使って、効率的な意味処理技術を提供することにあ
る。Another object of the invention is that it operates entirely by querying, adding to, and subtracting from attribute value data structures, and how many The objective is to provide an efficient semantic processing technique using a completely modular method coupled with procedures that are not limited by arbitrary predicate argument templates that specify which arguments must have arguments. .

０１課題を解決するための手段上記その他の目的、特徴及び利点は、本明細書に記載す
る発明によって達成される。英語の入力散文の述部−引
き数構造を決定するためのコンピュータによる方法が開
示される。ブランクで分離された単語文字列の形の入力
文を、まず単語に関する形態論情報及び構文情報にのみ
アクセスできる規則構成要素によって文法解析する。こ
の規則構成要素の出力は、属性値対からなるデータ構造
の形のもので、それが次に、言語知識をさらに統合する
部分的に順序通り配列された一組の手順から構成される
、引き数構造構成要素によって処理される。こうした手
順の出力は、同じ属性値構造であるが、今度は意味（す
なわち、意味のある非構文的な）ｒｆ４性の存在によっ
て性能が向上している。これらの意味属性が一つにまと
まって入力文の引き数構造を形成する。01 SUMMARY OF THE INVENTION The above and other objects, features and advantages are achieved by the invention described herein. A computational method for determining the predicate-argument structure of English input prose is disclosed. An input sentence in the form of a string of words separated by blanks is first parsed by a rule component that has access only to morphological and syntactic information about the words. The output of this rule component is in the form of a data structure consisting of attribute value pairs, which in turn consists of a partially ordered set of steps that further integrate linguistic knowledge. Processed by number structure components. The output of such a procedure is the same attribute value structure, but now with improved performance due to the presence of semantic (i.e., meaningful non-syntactic) rf4ness. Together, these semantic attributes form the argument structure of the input sentence.

その結果、本発明は、構文から自然（人為）言語の意味
処理の第１の段階に進むための、完全にモジュール式の
包括的かつ効率的な方法である。As a result, the present invention is a completely modular, comprehensive and efficient method for proceeding from syntax to the first stages of semantic processing of natural (artificial) languages.

本発明は、その設計の対象となっている言語の一部分で
はなく、すべての散文に適用される。本発明は、定義域
特有の意味情報を使用して、構文構成要素の精度または
効率を改善するものではない。The invention applies to all prose, not just the part of the language for which it is designed. The present invention does not use domain-specific semantic information to improve the accuracy or efficiency of syntactic constructs.

したがって、本発明は、今日の大部分のＮＬＰ応用例で
使用されている制限的方法ではなく、自然言語処理（Ｎ
ＬＰ）に対する無制限の広範囲をカバーする方法である
。Therefore, the present invention uses natural language processing (NLP) rather than the restrictive methods used in most NLP applications today.
This is a method that covers an unlimited wide range for LP).

異なる自然言語に対する具体的な規則及び手順はそれぞ
れ異なるが、本発明で具体化された全般的な概念はすべ
ての自然言語に適用可能である。Although the specific rules and procedures for different natural languages vary, the general concepts embodied in this invention are applicable to all natural languages.

英語の入力文字列の基本意味構造を決定するためのＮＬ
Ｐ方法が開示される。本方法によれば、こうした目的を
達成するために、英語の入力文字列を以下のように処理
する。本発明によれば、この方法には２つの段階がある
。第１段階は意味構成要素なしで構文解析を実行するも
のであり、第２段階は意味解析を実行するものである。NL for determining the basic semantic structure of English input strings
P method is disclosed. According to this method, in order to achieve these objectives, an English input string is processed as follows. According to the invention, the method has two stages. The first stage performs syntactic analysis without semantic components, and the second stage performs semantic analysis.

まず、入力文字列を、構文解析構造要素で解析する。−
例を挙げると、好ましい実施例では、ＰＬＮＬＰ英語文
法（ＰＥＧ）と呼ばれる構文解析プログラムを使用する
。（”ＰＬＮＰＬ”はプログラミング言語の名前であり
、自然言語処理用プログラミング言語の頭字語である。First, an input string is parsed using a parsing structure element. −
By way of example, the preferred embodiment uses a parsing program called PLNLP English Grammar (PEG). ("PLNPL" is the name of a programming language and is an acronym for Programming Language for Natural Language Processing.

）ＰＥＧは、各入力文字列ごとに構文解析を行なう。こ
の解析は、属性と値の収集からなる。こうした集合体を
、「レコード」または「レコード構造」と呼ぶ。) PEG parses each input string. This analysis consists of collecting attributes and values. Such collections are called "records" or "record structures."

構文解析の段階で、ＰＥＧは、各動詞への引き数の割当
ての完了を試みない。しかし、ＰＥＧが行なう解析は、
すべての引き数を計算し充填するのに必要なすべての情
報を含んでいる。During the parsing stage, PEG does not attempt to complete the assignment of arguments to each verb. However, the analysis performed by PEG is
Contains all the information needed to calculate and fill in all arguments.

引き数構造は、意味値をもつ追加の属性を用いて、構文
解析レコードの性能を改善することによって作成される
。構文を、しばしば入力文字列の「表面」構造と呼び、
意味を入力文字列の「深層」構造と呼んでいる。以下の
「深層」属性が定義されている。必要に応じて他のもの
が追加されることもある。Argument structures are created by using additional attributes with semantic values to improve the performance of parsing records. Syntax is often referred to as the "surface" structure of the input string,
The meaning is called the "deep" structure of the input string. The following "deep" attributes are defined. Others may be added as needed.

ａ、ＤＳＵＢＪＥＣＴ−命題の深層（すなわち意味上の
）主語、一般に、動作の行為者。a, DSUBJECT - the deep (i.e. semantic) subject of a proposition, generally the agent of an action.

ｂ、ＤＯＢＪＥＣＴ−命題の深層目的語、すなわち行為
者の動作によって最も直接的に影響を受けるエンティテ
ィ。b. DOBJECT - the deep object of a proposition, i.e. the entity most directly affected by the actor's action.

ｃ、ＤＩＮＤＯＢＪ−深層間接目的語、すなわち行為者
の動作を介して何かを経験し、または受は取るエンティ
ティ。c, DINDOBJ - deep indirect object, i.e. an entity that experiences or receives something through the action of an actor.

ｄ、ＤＰＲＥＤＮＯＭ−命題中でＤＳＵＢＪＥＣＴと同
じになるエンティティ。d. DPREDNOM - entity that becomes the same as DSUBJECT in the proposition.

ｅ、ＤＯＢＪＣＯＭＰ−命題中でＤＯＢＪＥＣＴと同じ
になるエンティティ。e, DOBJCOMP - entity that is the same as DOBJECT in the proposition.

構文解析段階を終えた後、本発明によると、レコードは
意味解析段階の第１ステツプに進む。ここでは、構文基
準により、識別の容易な意味引き数を、すなわちその動
詞に近接しかつ密接に関連する意味引き数を識別する。After completing the syntactic parsing stage, according to the invention, the record proceeds to the first step of the semantic parsing stage. Here, syntactic criteria identify semantic arguments that are easy to identify, ie, those that are close to and closely related to the verb.

これは、単に「深層Ｊ引き数層性を加算して、それらが
その「深層」構文解析が与えるのと同じ値を指すように
させる。This simply adds the ``deep J'' arguments so that they point to the same value that the ``deep'' parse gives.

意味解析段階にはさらに２つのステップがある。There are two further steps in the semantic analysis stage.

第２のステップでは、以下のように引き数を指定し完成
する。In the second step, specify and complete the arguments as follows.

＊不定詞節と分詞節の見えない引き数を指定する。*Specify invisible arguments for infinitive clauses and participial clauses.

＊転置された引き数または「遠隔」引き数を指定する。*Specifies transposed or "remote" arguments.

＊受動構文の見えないまたは転置された引き数を指定す
る。* Specify invisible or transposed arguments in passive constructs.

＊英語の間接目的構文の２つの異なる形式に対する引き
数を等しくする。* Equating the arguments for two different forms of English indirect object constructions.

これで、引き数指定ステップは完了する。任意選択で、
他の構文状況が発見された場合にその状況に対する引き
数指定を処理する手順を考案して、このステップを増補
することもできる。This completes the argument specification step. Optionally,
This step can also be augmented by devising procedures to handle argument specifications for other syntactic situations if they are found.

これらの中核的手順のうちで、順序配置がある程度重要
である。たとえば、見えない引き数を不定詞節及び分詞
節に割り当てる部分手順は、リストの最初に配置しなけ
ればならない。転置された（遠隔）引き数は、受動構文
手順の前に配置しなければならない。Among these core steps, the ordering is of some importance. For example, subprocedures that assign invisible arguments to infinitive and participial clauses must be placed first in the list. Transposed (remote) arguments must be placed before passive syntax procedures.

すべての引き数が適切に指定された後、ポストプロセッ
サがその第３のステップを開始して、主動詞の引き数で
はない解析済みセグメントを再検討する。これらの非引
き数セグメントは「付加部Ｊと呼ばれる。この＠討の結
果、ＶＰ（動詞句）、ＡＪＰ（形容詞句）、ＡＶＰ（副
詞句）、ＮＰ（名詞句）、及びＰＰ（前置詞句）という
各主要クラスの非引き数修飾部を指す意味属性の追加に
よって、レコード構造の性能がさらに改善される。After all arguments are properly specified, the post-processor begins its third step to review parsed segments that are not arguments of the main verb. These non-argument segments are called "additional parts J." The result of this @description is VP (verb phrase), AJP (adjective phrase), AVP (adverbial phrase), NP (noun phrase), and PP (prepositional phrase). The performance of the record structure is further improved by the addition of a semantic attribute that points to the non-argument modifier of each major class.

以下の意味属性が定義されており、必要に応じて他のも
のも追加されることがある。The following semantic attributes are defined, and others may be added as needed.

ａ、ＭＯＤＳ−修飾部；指定されない。a, MODS-modifier; not specified.

ｂ、ＮＡＤＪ−名詞を前から修飾する形容詞。b. NADJ - an adjective that modifies a noun from the front.

ｃ、ＰＡＤＪ−述部形容詞、または修飾名詞を後から修
飾する形容詞。c, PADJ - predicate adjective or adjective that later modifies a modified noun.

ｄ、０ＰＳ−操作子；限定辞と限量辞を含む。d, 0PS-manipulator; includes determiner and quantifier.

ｅ、ＰＡＲＴＩＣＬ−動詞と組み合わされて動詞句の引
き数構造の大きな変化を知らせる前置詞または副詞。e, PARTICL - a preposition or adverb that is combined with a verb to signal a major change in the argument structure of the verb phrase.

ｆ、ＰＲＥＤ−各単語の基本形式。f, PRED - the basic form of each word.

ｇ、ＰＲＯＰ−前置修飾部；不定詞句と分詞句を含む。g. PROP - premodifier; includes infinitive phrases and participial phrases.

ｈ、ＲＥＦ−代名詞が指す名詞。h, REF-noun referred to by the pronoun.

このレコード構造に最終的調整を加えて、不必要な属性
を消去し、一般にレコード構造を１．システムの次の構
成要素に移る前にクリーンアップする。ある自然言語を
別の言語により正確に翻訳するために、出力されたレコ
ード構造を機械翻訳装置で使用することができる。この
出力は、高度のテキスト校訂、改良された情報検索、電
子メールの経路設定及びファイル編成、自動テキスト生
成、及び基本意味解析を必要とするその他のＮＬＰ応用
例にも適用できる。Final adjustments to this record structure were made to remove unnecessary attributes and generally reduce the record structure to 1. Clean up before moving on to the next component of the system. The output record structure can be used in a machine translation device to accurately translate one natural language into another. This output can also be applied to advanced text editing, improved information retrieval, email routing and file organization, automatic text generation, and other NLP applications requiring basic semantic analysis.

Ｅ、実施例第２図は、このシステムの全体的構成を示す。E. Example FIG. 2 shows the overall configuration of this system.

このシステムは、入力段階、構文構造を作成する解析段
階、意味引き数が構文引き数と等しい場合に引き数を指
定する予備機解析ステップ（１）と、他のすべての場合
に引き数を決定し、それ自体が意味引き数を指定するス
テップ（２，１）と、付加部を指定するステップ（２，
２）とを含む主ポストプロセッサ（２）、最後に、入力
文字列に対応する基本意味構造である完成した引き数構
造（３）を含んでいる。The system consists of an input stage, a parsing stage that creates a syntactic structure, a preliminary parsing step (1) that specifies the argument if the semantic argument is equal to the syntactic argument, and determines the argument in all other cases. itself specifies a semantic argument (2,1), and specifies an appendix (2,1).
2), and finally a completed argument structure (3), which is the basic semantic structure corresponding to the input string.

入力段階中に、単語の文字列がシステムに入力される。During the input phase, a string of words is input into the system.

これは通常、キーボードでテキストをタイプ入力するこ
とによって行なう。ただし、単語文字列は他のどんな手
段で入力してもよい。たとえば、音声認識構成要素が利
用できる場合、テキストを人間の声で入力することもで
きる。This is typically done by typing text on the keyboard. However, the word string may be input by any other means. For example, if a voice recognition component is available, text can also be entered with a human voice.

構文解析段階は、構文解析構成要素ＰＥＧによって実行
される。ＰＥＧは、各入力文字列ごとに、属性値「レコ
ード」構造の形の解析記述を生成する。より読みやすい
構文解析「木」も、レコード構造に含まれる情報から表
示される。ＰＥＧは非常に大きな英語語寅リスト（シば
しば辞典（レフシコン）と呼ばれる）及び、英語の標準
の形態論的規則にアクセスする。しかし、辞典は、こう
した規則とあいまって、以下の種類の限られた形態論的
情報及び解析情報のみをＰＥＧに提供する。The parsing stage is performed by the parsing component PEG. PEG generates a parsing description in the form of an attribute value "record" structure for each input string. A more readable parsing "tree" is also displayed from the information contained in the record structure. PEG has access to a very large list of English words (often called the lexicon) and the standard morphological rules of the English language. However, in conjunction with these rules, dictionaries provide PEG with only limited morphological and analytical information of the following types:

＊大部分の英語の単語の正書法に基づく形（綴り）＊各
単語が英語でとり得る品詞＊各単語に与えられた各品詞ごとの時制、数などの形態
情報＊各単語が属する様々な構文上の下位範嗜クラスに関す
る情報（たとえば、特定の動詞が他動詞か否か）ＰＥＧは、ＰＬＮＬＰ　（自然言語処理用プログラミン
グ言語）で書かれた大きなプログラムである。ＰＥＧは
、ボトム・アップ式の完全に並列処理のアルゴリズムに
よって駆動される、約２００の増補句構造規則からなる
。これらの規則は、入力文字列の構文解析を生成する。* Orthographic forms (spellings) of most English words * Parts of speech that each word can take in English * Morphological information such as tense and number for each part of speech given to each word * Various constructions to which each word belongs Information about the subcategory classes above (e.g., whether a particular verb is transitive or not) PEG is a large program written in PLNLP (Natural Language Processing Programming Language). PEG consists of approximately 200 augmented phrase structure rules driven by a bottom-up, fully parallel processing algorithm. These rules produce a parse of the input string.

特定の規則が特定の文字列またはその文字列の一部分に
適用できるかどうかは、その文字列をカバーするレコー
ド構造中のある属性の有無及びそれらの値によって決ま
る。ある属性は辞典によって提供され、他の属性は規則
自体によって追加される。ＰＥＧが行なうのは、個々の
単語に対するレコードから始めて、最後にある構造に到
達するまで、次々により大きなレコード構造を構築する
ことにより、文字列の構文記述を生成することである。The applicability of a particular rule to a particular string or portion of a string depends on the presence and value of certain attributes in the record structure covering the string. Some attributes are provided by the dictionary, others are added by the rules themselves. What PEG does is to generate a syntactic description of a string by starting with records for individual words and building successively larger record structures until the last structure is reached.

最終構造に到達すると、入力文字列全体が解析される。Once the final structure is reached, the entire input string is parsed.

ＰＥＧは形態論的情報及び構文情報だけを使用し、意味
情報は全く使用しないで解析を行なうことに留意された
い。Note that PEG performs analysis using only morphological and syntactic information and no semantic information.

多くの情報が属性値解析構造に含まれている。A lot of information is contained in the attribute value parsing structure.

この情報の一部は単純である。たとえば、動詞句（７）
Ｌ／コード中のＰＡＳＴＩＩｔ性は、オンまたはオフの
値を持つ。それが「オン」の場合、その動詞句は、過去
時制であることを示す。ある情報はより複雑である。た
とえば、動詞句は属性５ＵＢＪＥＣＴをもつが、この属
性は、その値として、その名詞句に属するすべての情報
と共に、その主語として働く名詞句をカバーする他のレ
コード構造全体をｔ旨すポインタをもつ。Some of this information is simple. For example, verb phrase (7)
The PASTIIIt property in the L/code has a value of on or off. If it is "on", it indicates that the verb phrase is in the past tense. Some information is more complex. For example, a verb phrase has the attribute 5UBJECT, which has as its value a pointer to the entire other record structure covering the noun phrase serving as its subject, along with all the information belonging to that noun phrase. .

ＰＥＧによって指定されるすべての属性は、構文属性で
ある。そのうちには、入力文字列の構文引き数を参照す
るものがある（ＳＵＢＪＥＣＴなど）。本明細書に開示
する方法によれば、意味論的後処理の第１段階では、意
味引き数が構文引き数と正確に対応するとき、その意味
引き数を識別する。これを行なうため、ポストプロセッ
サは、単にレコードにある意味属性を追加して、それが
その構文解析と同じ値を指すようにさせる（たとえば、
ＤＳＵＢ　Ｊ　ＥＣＴは、５ＵＢＪＥＣＴが指すのと同
じレコードを指す）。All attributes specified by PEG are syntactic attributes. Some of them (such as SUBJECT) refer to syntactic arguments of the input string. According to the method disclosed herein, the first stage of semantic post-processing identifies a semantic argument when it corresponds exactly to a syntactic argument. To do this, the postprocessor simply adds some semantic attribute to the record and causes it to point to the same value as its parse (e.g.
DSUB JECT points to the same record that 5UBJECT points to).

後処理の次の段階では、引き数識別の残りのすべてのケ
ースを処理する。The next stage of post-processing deals with all remaining cases of argument identification.

第１に、ポストプロセッサは、不定詞または分詞形の動
詞補語を探したり、主語の名詞句に隣接するのではなく
、文の終りに現れる分詞形の主語補語を探して、レコー
ド構造を走査する。動詞補語である現在分詞を見つけた
（ステップ４）と仮定する。たとえば、Ｊｏｈｎ＋　ｉｎ　ｍｙ　ｏｐｉｎｉｏｒ＋、　１ｉｋ
ｅｓ　ｅｎｔｅｒｔａｉｎｉｎｇＶＯｌｅｎ。First, the postprocessor scans the record structure looking for verb complements in the infinitive or participial form, or for participial subject complements that occur at the end of a sentence rather than adjacent to the subject noun phrase. . Assume that we have found a present participle that is a verb complement (step 4). For example, John+ in my opinior+, 1ik
es entertaining VOlen.

の文でｅｎｔｅｒｔａｉｎｉｎｇ　ｗｏｍｅｎ″は）主
動詞″１ｉｋｅｓ″の補語である現在分詞節を形成する
。In the sentence, ``entertaining women'') forms a present participial clause that is the complement of the main verb ``1ikes''.

この節の意味上の目的語（ＤＯＢＪＥＣＴ）は名詞“ｗ
ｏｍｅｎ″であるｏ　”　Ｗｏｍｅｎ″′は構文上の目
的語として現れており、後処理の第１段階で意味上の目
的語であると識別されている。しかし、ｅｎｔｅｒｔａ
ｉｎｉｎｇ　ｖｏ■ｅｎ″の構文上の主語はない〇さら
に、この段階では、”　ｅｎ　ｔｅｒ　ｔａ　ｉｎ　ｉ
ｎｇ　”を行なっているのが誰なのかわからないのでｅ
ｎｔｅｒｔａｉｎｉｎｇｗｏｍｅｎ″の正しい意味上の
主語を決定することは不可能である。女を楽しませてい
るのはＪｏｈｎかもしれないし、他の誰かかもしれない
。食違がＪｏｈｎを楽しませている可能性すらある。そ
６場合は、これは分詞節ではなく　、”ｅｎｔｅｒｔａ
ｉｎｉｎｇが形容詞として働く名詞句となる。したがっ
て、仮の意味上の主語（ｘｘ”）がこの場合の属性［５
ＵＢＪＥＣＴの値として指定される。これで、上記の文
の動詞”ｅｎｔｅｒｔａｉｎ　”の引き数指定はチアす
る。システムの後の構成要素で、“’　ｅｎｔｅｒｔａ
　ｉｎの最も可能性の高いＤＳＵＢＪＥＣＴを計算する
。The semantic object (DOBJECT) of this clause is the noun “w
o ``Women'''' appears as a syntactic object and is identified as a semantic object in the first stage of post-processing.However, enterta
There is no syntactic subject for "ining vo■en"〇Furthermore, at this stage, "en ter ta in i
ng” because I don’t know who is doing it.
It is impossible to determine the correct semantic subject of "intertainingwomen". It may be John who is entertaining the woman, or it may be someone else. It is even possible that the discrepancy is entertaining John. Yes. In case 6, this is not a participial clause, but “enterta”
ining becomes a noun phrase that acts as an adjective. Therefore, the tentative semantic subject (xx”) is the attribute [5
Specified as the value of UBJECT. Now, the argument specification of the verb "entertain" in the above sentence is supported. In a later component of the system, "' enter
Compute the most likely DSUBJECT of in.

第２Ａ図の他のすべてのケースでは、ＤＳＵＥＪＥＣＴ
がすでに節に存在する場合、手順は停社する。そうでな
い場合は、主動の構文上の主語力・当該の節のＤＳＵＢ
ＪＥＣＴとして指定される（ステップ５）。分詞形の主
語補語の例として、次の文を考えてみる。In all other cases in Figure 2A, DSUEJECT
already exists in the clause, the procedure is suspended. If not, the syntactic subject force of the main verb and the DSUB of the clause
designated as JECT (step 5). As an example of a participial subject complement, consider the following sentence.

Ｍａｒｙ、　ａｓ　ｙｏｕ　ｐｒｅｄｉｃｔｅｄ、　ａ
ｒｒｉｖｅｄ　ｅｘｃｉｔｅｄｌｙｗａｖｉｎｇ　ｈｅ
ｒ　ｈａｎｄ。Mary, as you predicted, a
rrived excitedly waving he
r hand.

”ｗａｖｉｎｇ　ｈｅｒ　ｈａｎｄｓ″は）主動詞”　
ａｒｒｉｖｅｄ　”の補語ではなく、主動の主語″Ｍａ
ｒｙ　”にかかるまたは補足する分詞節である。Ｗａｖ
ｉ０ｇ“は、構文上の目的語（ｈｅｒ　ｈａｎｄｓ）を
もち、これは直ちにその意味上の目的語ＤＯＢＪＥＣＴ
として識別される。しかし、″ｗａｖｉｎｇ　ｈｅｒ　
ｈａｎｄｓ“の構文上の主語はない。ステップ５で、”
Ｍａｒｙ″をその意味上の主語（ＤＳＵＢ　Ｊ　ＥＣＴ
）として識別する。“waving her hands” is the main verb.”
It is not the complement of ``arrived'', but the main subject of ``Ma''
It is a participial clause that depends on or supplements “ry”.Wav
i0g" has a syntactic object (her hands), which immediately follows its semantic object DOBJECT
Identified as. However, “waving her”
There is no syntactic subject for “hands”. In step 5, “
Mary” as its semantic subject (DSUB J ECT
).

第２Ｂ図は、遠隔引き数を処理する手順の流れ図である
。現在４つの異なる遠隔状況が処理される。FIG. 2B is a flow diagram of a procedure for processing remote arguments. Currently four different remote situations are handled.

ａ１話題提示二″Ａｌ１ｃｅ、　Ｐｅｔｅｒ　５ａｉｄ
　ｔｈａｔ　Ｊｏｈｎｋｉｓｓｇｄ。a1 Topic Presentation 2''Al1ce, Peter 5aid
that johnkissgd.

ｂ、疑問詞を使った疑問”Ｗｈｏ　ｃｌｉｄ　Ｐｅｔｅ
ｒ　５ａｙｔｈａｔ　Ｊｏｈｎ　ｋｉｓｓｅｄ？　”Ｃ
１関係節：”Ｔｈ１ｓ　ｉｓ　ｔｈｅ　ｇｉｒｌ　ｗｈ
ｏ　Ｐｅｔｅｒ　５ａｉｄｔｈａｔ　Ｊｏｈｎ　ｋｉｓ
ｓｅｄ、　”ｄ、自由関係：　”Ｉ　ｋｎｏｗ　ｗｈｏ
　Ｐｅｔｅｒ　５ａｉｄ　ｔｈａｔＪｏｈｎ　ｋｉｓｓ
ｅｄ。b. Questions using interrogative words “Who clid Pete
r 5ay that John kissed? "C.
1 relative clause: “Th1s is the girl wh
o Peter 5aid that John kiss
sed, ``d, free relation: ``I know who
Peter 5aid that John kiss
ed.

この遠隔引き数処理手順では、遠隔前置要素（たとえば
、上記の文でイタリック体の単語）を識別して、この要
素をＦＲＮＴＮＰと呼ぶ。次に、遠隔関係の候補となる
節を識別し、これをＣＡＮＤと呼ぶ。ＣＡＮＤは通常主
動詞を後から修飾する節のリストの最後の節である。こ
のリストは補語節と不定詞節を含むが、たとえば、従属
接続詞によって導入された従属節は含まない。This remote argument processing procedure identifies the remote prefix element (eg, the italicized word in the sentence above) and calls this element FRNTNP. Next, we identify a node that is a candidate for a remote relationship, and call this a CAND. CAND is usually the last clause in the list of clauses that later modify the main verb. This list includes complement clauses and infinitive clauses, but does not include dependent clauses introduced by subordinating conjunctions, for example.

ＣＡＮＤ自体が次の文のように等位接続の動詞句を含む
場合は（ステップ６）、Ｗｈｏ　ｄｉｄ　Ｐｅｔｅｒ　ｓａｙ　ｔｈａｔ　Ｊｏ
ｈｎ　ｋｉｓｓｅｄ　ａｎｄｈｕｇｇｅｄ？等位接続動詞句を分離する手順に送られる（第２Ｊ図参
照）。最終的に、分離された要素が遠隔手順に戻される
。次に考慮することは、ＣＡＮＤが次の文のように懸垂
前置詞を含むかどうかである（ステップ７）。If CAND itself contains a coordinating verb phrase (step 6), as in the following sentence: Who did Peter say that Jo
hn kissed and hugged? It is sent to a procedure that separates coordinating verb phrases (see Figure 2J). Finally, the separated elements are returned to the remote procedure. The next consideration is whether CAND contains a hanging preposition, as in the following sentence (step 7).

Ｗｈｏ　ｄｉｄ　Ｐｅｔｅｒ　ｓａｙ　ｔｈａｔ　Ｊｏ
ｈｎ　ｒａｎ　１ｎｔｏ　？上記の文では、ＣＡＮＤは
、”Ｊｏｈｎ　ｒａｎ　１ｎｔｏ″であり、懸垂前置詞
は“１ｎｔｏ　”である。このような場合、ＦＲＮＴＮ
Ｐが懸垂前置詞の目的語として指定され（Ｊｏｈｎ　ｒ
ａｎ　１ｎｔｏ　ｗｈｏ？）　、手順は終了する。この
節は、必要に応じて次の引き数識別に利用できる。Who did Peter say that Jo?
hn ran 1nto? In the above sentence, CAND is "John ran 1nto" and the hanging preposition is "1nto". In such a case, FRNTN
P is specified as the object of a suspended preposition (John r
An 1nto who? ), the procedure ends. This clause can be used to identify the next argument if necessary.

ＣＡＮＤ中に等位接続関係がなく、懸垂前置詞もない場
合、この手順の主な仕事が始まる。The main task of this procedure begins when there are no coordinating relationships in the CAND and no hanging prepositions.

遠隔依存関係の引き数を正しく識別するには、複雑な１
組の情報が必要である（ステップ８）。Correctly identifying arguments for remote dependencies requires a complex
A set of information is required (step 8).

第１に、ＣＡＮＤ中の主動詞の下位範噌クラスが重要で
ある。作為動詞か、授与動詞か、それともそのどちらで
もないか。作為動詞は目的語と目的補語を取る。”Ｔｈ
ｅｙ　ｅｌｅｃｔｅｄ　ｈｉｍ　（目的語）ｐｒｅｓｉ
ｄｅｎｔ　（目的補語）、″授与動詞は間接目的語と目
的語を取る。”Ｔｈｅｙ　ｇａｖｅ　ｈｅｒ　（間接目
的語）ａｎ　ａｗａｒｄ　（目的語）、″第２に、ＣＡ
ＮＤは時制節か、それとも不定詞節か。第３に、どれだ
けの引き数がすでに節中にあるか。第４に、すでに存在
している引き数ならびに遠隔要素にどんな特徴があるの
か。First, the subcategory class of the main verb in CAND is important. Is it an action verb, a conferring verb, or neither? Actual verbs take an object and an object complement. “Th
ey selected him (object) presi
dent (objective complement), ``Awarding verbs take an indirect object and an object.'' They give her (indirect object) an award (object),'' Second, CA
Is ND a tense clause or an infinitive clause? Third, how many arguments are already in the clause? Fourth, what are the characteristics of the already existing arguments and remote elements?

ここで特に重要な特徴ＨＵＭは、いくつかの代名詞（′
ｖｈｏ科　Ｉ″＋　　”ｙｏｕ″、ｈｅ″　”　ｓｈｅ
　”など）といくつかの名詞（“ｍａｎ″、°“ｗｏｍ
ａｎ″ｃｈｉｌｄ”など）に付加される。この特徴は、
明らかに意味のある解釈をもっているが、ここでは、意
味論上のものとみなすべきではなく、他の特徴と同じく
、単語リストのメンバに付加できる特徴である。これは
、それが付加される単語が、ある状況で構文上ある挙動
を示す確率が高いことを示す。したがって、辞典にこの
特徴が記載されていることは、意味情報なしで構文解析
構成要素が働くという先の主張と矛盾するものではない
。The particularly important feature here is that HUM has several pronouns ('
vho family I″+ “you”, he″” she
”) and some nouns (“man”, °“wom”, etc.) and some nouns (“man”, °“wom
an″child,″ etc.). This feature is
Although it clearly has a meaningful interpretation, it should not be considered here as a semantic one, but rather a feature that can be attached to a member of a word list like any other feature. This indicates that the word to which it is attached has a high probability of exhibiting a certain syntactic behavior in a certain situation. Therefore, the inclusion of this feature in the dictionary is consistent with the earlier claim that parsing components work without semantic information.

上記の情報に基づいて、この手順は、遠隔依存関係を解
決する（ステップ９）。次の文で、Ｗｈｏ　ｄｉｄ　Ｊ
ｏｈｎ　ｗａｎｔ　ｔｏ　ｋｉｓｓ？ＦＲＮＴＮＰ　（
ｗｈｏ　”　）が、動詞″ｋｉｓｓ　”のＤＯＢＪＥＣ
Ｔとして指定される。これは動詞の動作の影響を直接受
ける。次の文では、Ｗｈｏ　ｄｉｄ　Ｊｏｈｎ　ｗａｎｔ　ｔｏ　ｗｒｉｔ
ｅ？ＦＲＮＴＮＰ　（”ｗｈｏ″）が、動詞″ｗｒｉｔ
ｅ″のＤＩＮＤＯＢＪとして指定される。これは、動詞
の動作を介しであるものを受は取る。次の文では、Ｗｈ
ａｔ　ｄｉｄ　Ｊｏｈｎ　ｗａｎｔ　ｔｏ　ｗｒｉｔｅ
？ＦＲＮＴＮＰ　（ｗｈａｔ　”　）が、動詞″ｗｒｉ
ｔｅ”のＤＯＢ　Ｊ　ＥＣＴとしてｔ舌足される。Based on the above information, the procedure resolves remote dependencies (step 9). In the next sentence, Who did J
ohn want to kiss? FRNTNP (
who”) is the verb “kiss” DOBJEC
Designated as T. This is directly influenced by the action of the verb. In the next sentence, Who did John want to write?
e? FRNTNP (“who”) is the verb “write”
e'' is designated as DINDOBJ. This is specified as DINDOBJ of ``e''.
at did John want to write
? FRNTNP (what”) is the verb “wri”
t is added as DOB J ECT of "te".

受動構文に出会うと（第２Ｃ図）、ポストプロセッサは
、受動国中に懸垂前置詞があるかどうか再度１１べる（
ステップ１０）。たとえば、Ｔｈｅ　ｈｏｕｓｅ　ｗａ
ｓ　ｂｒｏｋｅｎ　１ｎｔｏ。When a passive construction is encountered (Figure 2C), the postprocessor checks again to see if there is a suspended preposition in the passive (Figure 2C).
Step 10). For example, The house wa
s broken 1nto.

上記の文の１ｎｔｏ″のような懸垂前置詞が存在する場
合、構文上の主語（この場合は”ｔｈｅ　ｈｏｕｓｅ　
”　）が懸垂前置詞の目的語として指定される（”ｂｒ
ｏｋｅｎｉｎｔｏ　ｔｈｅ　ｈｏｕｓｅ″）。次に、手
順は先に飛んで、可能性のある”ｂｙ″前置詞句を見つ
け（ステップ１３）、意味上の主語を識別する。When a suspended preposition is present, such as ``1nto'' in the sentence above, the syntactic subject (in this case ``the house
” ) is specified as the object of a suspended preposition (”br
The procedure then jumps ahead to find possible "by" prepositional phrases (step 13) and identify the semantic subject.

受動構文の意味上の引き数を正しく適切に識別するため
に、懸垂前置詞がない場合、複雑な１絹の情報を照会す
る。第１に、主動詞の下位範曙クラスが重要である。作
為動詞か、授与動詞か、それともそのどちらでもないか
（上記参照）。第２に、どれだけの引き数がすでに筒中
にあるか。第３に、節の構文上の主語にどんな特徴があ
るか。In order to correctly and appropriately identify the semantic argument of a passive construction, in the absence of a suspended preposition, a complex piece of information is queried. First, the subcategory class of the main verb is important. Is it an action verb, a conferring verb, or neither (see above)? Second, how many arguments are already in the barrel? Third, what characteristics does the syntactic subject of the clause have?

ここで特に重要な特徴ＡＮＩＭは、人間ではない、生物
を指す名詞に付加される点以外は、すべての点でＨＵＭ
と同様である。この特徴も明らかに意味のある解釈をも
っているが、ここでは意味論上のものとみなさない。The particularly important feature here is that ANIM is HUM
It is similar to Although this feature clearly has a meaningful interpretation, it will not be treated as a semantic one here.

次に、手順は、上記の情報に基づいてＤＳＵＢＪＥＣＴ
以外の意味引き数を指定する（ステップ１２）。次の受
身文では、Ｔｈｅ　ｍａｎ　ｗａｓ　ｅｌｅｃｔｅｄ　Ｐｒｅｓｉ
ｄｅｎｔ。Next, the procedure is to create a DSUBJECT based on the above information.
(step 12). In the following passive sentence, The man was selected Presi
dent.

”ｔｈｅ　ｍａｎ　”がＤ　ＯＢ　Ｊ　Ｅ　ＣＴ、、　
”Ｐｒｅｓｉｄｅｎｔ″がＤＯＢ　Ｊ　ＣＯＭＰである
。次の文では、ｌｉｅ　ｗａｓ　ｇｉｖｅｎ　ａ　ｍａ
ｎｄａｔｅ。``the man'' is DOB J E CT...
“President” is DOB J COMP. In the next sentence, lie was given a ma
ndate.

”ｈｅ“がＤＩＮＤＯＢＪ　（あるものを受は取る人）
で、”ａ　ｍａｎｄａｔｅ　’がＤＯＢＪＥＣＴ（受は
取られるもの）である。次の文では、Ｙｏｕ　ｗｅｒｅ　１ｎｖｉｔｅｄ。“he” is DINDOBJ (the one who takes something)
So, "a mandate" is DOBJECT.In the next sentence, you are invited.

ｙｏｕ″がＤＯＢＪＥＣＴである。ＤＯＢＪＣＯＭＰま
たはＤ　ＩＮＤＯＢＪはない。you'' is DOBJECT. There is no DOBJCOMP or D INDOBJ.

手順は、次に受動構文のＤＳＵＢＪＥＣＴを指定する。The procedure then specifies the passive construct DSUBJECT.

′ｂｙ″前置詞句が存在するかどうか検索して調べる（
ステップ１３）。ない場合は、ＤＳＵＢＪＥＣＴが”ｘ
ｘ″の値に指定される。”ｘｘ”は、後で必要な情報が
処理されたとき、解決できる。″ｂｙ＝ｌ前置詞句が存
在する場合、ＤＳＵＢＪＥＣＴは、その前置詞句の目的
語を指すように指定すれる（ステップ１４）。たとえば
、次の文で、Ｙｏｕ　ｗｅｒｅ　１ｎｖｉｔｅｃｌ　ｂ
ｙ　ｔｈｅ　Ｐｒｅｓｉｄｅｎｔ。Search to see if the ``by'' prepositional phrase exists (
Step 13). If not, DSUBJECT is “x”
x". "xx" can be resolved later when the necessary information is processed. If a "by=l" prepositional phrase is present, DSUBJECT points to the object of that prepositional phrase. (Step 14). For example, in the following sentence, You were 1nvitecl b
y the President.

”　ｔｈｅ　Ｐｒｅｓｉｄｅｎｔ″がＤＳＵＢＪＥＣＴ
である。"the President" is DSUBJECT
It is.

この文は能動文”Ｔｈｅ　Ｐｒｅｓｉｄｅｎｔ　１ｎｖ
ｉｔｅｄ　ｙｏｕ。This sentence is an active sentence “The President 1nv
ited you.

と等価な受動文である。どちらの文でも、ＤＳＵＢＪＥ
ＣＴが”ｔｈｅ　Ｐｒｅｓｉｄｅｎｔ″　ＤＯＢＪＥＣ
Ｔがｙｏｕ″である。このように、能動文と受動文の表
面上の構文形式は全く異なっているが、基礎となる引き
数構造は、それらが意味上類似していることを指摘して
いる。is a passive sentence equivalent to . In both sentences, DSUBJE
CT is “the President” DOBJEC
T is you''.In this way, although the superficial syntactic forms of active and passive sentences are completely different, the underlying argument structures are similar in meaning. There is.

次の部分手順（第２Ｄ図）は、間接目的構文を処理する
ものである。この手順の目的は、′ｖｅｇａｖｅ　ｈｉ
＋ｎ　ａ　ｍａｎｄａｔｅ、　”と”１ｉｆｅ　ｇａｖ
ｅ　ａ　ｍａｎｄａｔｅｔｏ　ｈｉｍ、　”のような文
の意味上の類似性を指摘することである。この点で、間
接目的手順は、受動手順と同じ目的をもつが、ずっと単
純である。第１段階で、”ｔｏ″前置詞句の目的語を見
つける。The next sub-step (Figure 2D) deals with indirect object constructs. The purpose of this step is to
+na mandate, ``and''1ife gav
The aim is to point out the semantic similarity of sentences like ``e a mandate to him,''. In this respect, the indirect objective procedure has the same objective as the passive procedure, but is much simpler. , find the object of the "to" prepositional phrase.

次の段階で、この目的語をＤＩＮＤＯＢＪ属性の値に指
定する。この段階が完了すると、どちらの文もＤＳＵＢ
ＪＥＣＴ″ｗｅ″、ＤＩＮＤＯＢＪ”ｈｉｍ’ｓ及びＤ
ＯＢＪＥＣＴ″ａ　ｍａｎｄａｔｅ″をもつ。The next step is to specify this object as the value of the DINDOBJ attribute. Once this stage is complete, both statements are DSUB
JECT"we", DINDOBJ"him's and D
It has an OBJECT "a mandate".

コレで、ポストプロ・セッサの中心部分の第１ステツプ
（２，１）が終了して、意味上の属性にその値が指定さ
れ、入力文字列に対する引き数構造が構築される。第２
ステツプ（２，２）では、意味上の付加部、すなわち引
き数ではないが、最終的な意味構造にとって重要な修飾
部を指定する。This completes the first step (2,1) of the central part of the post-processor, assigning values to the semantic attributes, and constructing an argument structure for the input string. Second
Step (2, 2) specifies semantic additions, that is, modifiers that are not arguments but are important to the final semantic structure.

この第２ステツプの第１相では、述部形容詞をその主語
名詞句と連係させる（第２Ｅ図）。述部形容詞構文の例
は次の文である。The first phase of this second step associates the predicate adjective with its subject noun phrase (Figure 2E). An example of a predicate adjective construction is the following sentence:

Ｍａｒｙ　ａｎｄ　Ｊｏｈｎ　ａｒｅ、　ａｎｄ　ａｌ
ｗａｙｓ　ｗｉｌｌ　ｂｅ。Mary and John are, and al
ways will be.

ｈａｐｐｙ　。Happy.

形容詞″ｈａｐｐｙ”は主語名詞句″Ｍａｒｙ　ａｎｄ
　Ｊｏｈｎ　”のＩＩ　Ｍａ、ｙｎと’　Ｊｏｈｎ　”
にかかるが、ＮＰからずっと離れている。句が”ｈａｐ
ｐｙ　Ｊｏｈｎ　ａｎｄ　ｈａｐｐｙ　Ｍａｒｙ　”で
ある場合は、問題はない。しかし、上記の文は形容詞が
主語から離れているので、構文解析によって、直接圧し
い接続はできない。この点で、述部形容詞をそのＮＰと
連係させる問題は、遠隔依存関係の問題と同じである。The adjective "happy" is the subject noun phrase "Mary and
II Ma, yn and 'John'
However, it is far away from NP. The phrase is “hap”
py John and happy Mary”, there is no problem. However, in the above sentence, the adjective is far from the subject, so it cannot be directly connected by parsing. In this point, the predicate adjective is The problem of linking with NP is the same as the problem of remote dependencies.

その問題を解決する第１ステツプは、主語名詞句のリス
トを構成することである（ステップ１５）。主ＭＮＰが
１つしかない場合、リストは、１つのメンバを含むリス
トである。リストの各メンバ上で、ＰＡＤＪＩ性を作成
する（ステップ１６）。その値は述部形容詞を指すポイ
ンタである。The first step in solving the problem is to construct a list of subject noun phrases (step 15). If there is only one primary MNP, the list is a list containing one member. Create a PADJI property on each member of the list (step 16). Its value is a pointer to the predicate adjective.

次に、主語ＮＰのリストの次のメンバを処理する。Next, process the next member of the list of subjects NP.

リストが空になると、手順は終了する。The procedure ends when the list is empty.

すべての引き数が識別されると、ポストプロセッサは主
な句範１１ＶＰ、ＮＰ　（ＰＰを含む）、ＡＪＰ、ＡＶ
Ｐを考慮して、すべての非引き数修飾部にその固有の意
味属性が指定されていることを確認する（これらの非引
き数修飾部も「付加部」と呼ばれる）。Once all arguments have been identified, the post-processor uses the main phrase categories 11VP, NP (including PP), AJP, AV
Ensure that all non-argument modifiers have their unique semantic attributes specified, considering P (these non-argument modifiers are also called "appendants").

考慮すべき第１の範嗜は、動詞句である（第２Ｆ図）。The first category to consider is the verb phrase (Figure 2F).

動詞句自体が等位接続のＶＰを含む場合、それは等位接
続ｖＰを分類する手順に送られる（ステップ１７）。分
類された要素は主手順に戻される。ＶＰが等位接続セグ
メントではない場合、前Ｗ修飾部と後置修飾部を含むす
べての修飾部のリストが構成される（ステップ１８）。If the verb phrase itself contains a coordinating VP, it is sent to a procedure for classifying coordinating vPs (step 17). The classified elements are returned to the main procedure. If the VP is not a co-connected segment, a list of all modifiers is constructed, including pre-W modifiers and post-modifiers (step 18).

そのリストの各メンバについて、ａ、それがＮＰである場合は、ＮＰ修飾部を処理する手
順に送られる（ステップ１９）。For each member of that list: a, if it is an NP, it is sent to a procedure that processes NP modifiers (step 19);

ｂ、それがＰＰである場合は、（１）ＰＰ中の前置詞と同じ名前をもつ属性がＶＰ上に
作成される（ステップ２０）。b. If it is a PP: (1) An attribute with the same name as the preposition in the PP is created on the VP (step 20).

（２）ＰＰの目的語がこの属性の値として指定される（
ステップ２０）。(2) The object of PP is specified as the value of this attribute (
Step 20).

（３）セグメントがＮＰ修飾部を処理する手順に送られ
る（第２Ｇ図参照）。(3) The segment is sent to a procedure that processes NP modifiers (see Figure 2G).

Ｃ３それが形容詞句または副詞句である場合、ＭＯＤＳ
属性がＶＰ上に作成され、その値はＡＪＰまたはＡＶＰ
を指すポインタである（ステップ２１）。C3 If it is an adjective phrase or an adverb phrase, MODS
Attribute is created on VP and its value is AJP or AVP
(step 21).

ｄ、それが埋め込まれた節（たとえば、従属節）である
場合、ＰＲＯＰｒＲ性がＶＰ上に作成され、その値は埋
込みクラスを指すポインタである（ステップ２２）。d. If it is an embedded clause (eg, a dependent clause), a PROPrR property is created on the VP and its value is a pointer to the embedded class (step 22).

次に、修飾部のリストの次のメンバを処理する（ステッ
プ２３）。リストが空になると、手順は終了する。Next, the next member of the list of modifiers is processed (step 23). The procedure ends when the list is empty.

考慮すべき次の範曙は、名詞句（第２Ｇ図）である。名
詞句自体が等位接続ＮＰを含む場合は、等位接続ＮＰを
分離する手順に送られる（ステップ２４）。分離された
要素は主手順に戻される。The next category to consider is noun phrases (Figure 2G). If the noun phrase itself contains a coordinate NP, it is sent to a procedure for separating the coordinate NP (step 24). The separated elements are returned to the main procedure.

ＮＰが等位接続セグメントでない場合、前置修飾部と後
置修飾部を含むすべての修飾部のリストが構成される（
ステップ２５）。そのリストの各メンバについて、ａ、それが限定辞または限量辞（ｔｈｅ、　ａｌ　ｔｈ
ｉｓ＋ｓｏｍｅ＋　ａｌｌなどの単語）である場合、Ｏ
ＰＳ　（操作子）属性がＮＰ上に作成され、その値は限
定辞または限量辞を指すポインタである（ステップ２６
）。If NP is not a co-connected segment, a list of all modifiers including pre-modifiers and post-modifiers is constructed (
Step 25). For each member of that list, a, it is a determiner or quantifier (the, al th
is+some+words such as all), then O
A PS (manipulator) attribute is created on the NP whose value is a qualifier or a pointer to a quantifier (step 26
).

ｂ、それが限定辞または限量辞以外の形容詞句である場
合、ＮＡＤＪｇ性がＮＰ上に作成され、その値は形容詞
句を指すポインタである（ステップ２７）。b. If it is a determiner or an adjective phrase other than a determiner, an NADJg property is created on the NP and its value is a pointer to the adjective phrase (step 27).

ｃ、ＮＰが動名詞（”−ｉｎｇ″で終わる名詞）の場合
、（１）問題の句が所有形容詞である場合は、ＤＳＵＢＪ
ＥＣＴ属性がＮＰ上に作成され、その値は所有形容詞を
指すポインタである（ステップ２８）。c. If NP is a gerund (a noun ending in "-ing"), (1) If the phrase in question is a possessive adjective, DSUBJ
An ECT attribute is created on the NP and its value is a pointer to the possessive adjective (step 28).

（２）動名詞ＮＰ中に所有形容詞がない場合は、ＤＳＵ
ＢＪＥＣＴ属性がＮＰ上に作成され、その値はＸＸ″で
ある（ステップ２９）。この値は、より多くの情報が利
用できるとき変更できる。(2) If there is no possessive adjective in the gerund NP, DSU
A BJECT attribute is created on the NP and its value is XX'' (step 29). This value can be changed when more information is available.

ｄ１問題の句が埋込み節（たとえば、関係節）である場
合、ＰＲＯＰ属性がＶＰ上に作成され、その値は埋込み
節を指すポインタである（ステップ３０）。If the phrase in the d1 problem is an embedded clause (eg, a relative clause), a PROP attribute is created on the VP and its value is a pointer to the embedded clause (step 30).

ｅ、それがＰＰである場合、（１）ＰＰ中の前置詞と同じ名前をもつ属性がＮＰ上に
作成される（ステップ３１）。e, if it is a PP: (1) An attribute with the same name as the preposition in the PP is created on the NP (step 31).

（２）ＰＰの目的語がこの属性の値として指定される（
ステップ３１）。(2) The object of PP is specified as the value of this attribute (
Step 31).

（３）そのセグメントがＮＰ修飾部を処理する（すなわ
ち、現在説明している第２Ｇ図の手順）手順に戻される
。(3) The segment is returned to the procedure that processes the NP modifier (ie, the procedure of FIG. 2G currently being described).

ｆ２問題の句が上記以外のものである場合、Ｍ○ＤＳ属
性がＮＰ上に作成され、その値はこの他の句を指すポイ
ンタである（ステップ３２）。If the phrase in the f2 problem is other than the above, an M*DS attribute is created on the NP whose value is a pointer to this other phrase (step 32).

次に、修飾部のリストの次のメンバが処理される（ステ
ップ３３）。リストが空になると、手順が終了する。Next, the next member of the list of modifiers is processed (step 33). The procedure ends when the list is empty.

形容詞向及び副詞句の意味上の修飾部を識別する（第２
Ｈ図と第２Ｉ図）手順の各ステップは同一である。まず
、すべての修飾部のリストを構成する（ステップ３４）
。そのリストの各メンバについて、ａ、それがＰＰである場合、（１）ＰＰの前置詞と同じ名前をもつ属性がＡＪＰまた
はＡＶＰ上に作成される（ステップ３５）。Identifying the semantic modifiers of adjectives and adverbial phrases (Second
Each step of the procedure (Figure H and Figure 2I) is identical. First, construct a list of all modifiers (step 34)
. For each member of that list: a, if it is a PP, (1) an attribute with the same name as the preposition of the PP is created on the AJP or AVP (step 35);

（２）ＰＰの目的語がこの属性の値として指定される（
ステップ３５）。(2) The object of PP is specified as the value of this attribute (
Step 35).

（３）そのセグメントがＮＰ修飾部を処理する手順に送
られる（第２Ｇ図参照）。(3) The segment is sent to a procedure that processes NP modifiers (see Figure 2G).

ｂ、そうでない場合は、ＭＯＤＳ属性がＡＪＰまたはＡ
ＶＰ上に作成され、その値は考慮中のリスト・メンバを
ｔ旨すポインタである（ステップ３６）。b. Otherwise, the MODS attribute is AJP or A
is created on the VP, whose value is a pointer to the list member under consideration (step 36).

次に、修飾部のリストの次のメンバが処理される（ステ
ップ３７）。リストが空になると、手順が終了する。Next, the next member of the list of modifiers is processed (step 37). The procedure ends when the list is empty.

第２Ｊ図は、等位接続動詞句を分離する処理を示す。こ
の手順は、ＶＰ修飾部を処理する処理手順から呼び出さ
れる（第２Ｆ図参照）。まず、より大きな等位接続ＶＰ
内にあるすべてのＶＰＰからリストが構成される（ステ
ップ３８）。次に、ＶＰのリストの各メンバについて、ａ１等位接続ＶＰが共通の構文上の主語をもつ場合、そ
の主語がリストの各メンバＶＰに分配される（ステップ
３９）。その１例は以下の通りである。Figure 2J shows the process of separating coordinating verb phrases. This procedure is called from the procedure that processes the VP modifier (see Figure 2F). First, the larger coordinate connection VP
A list is constructed from all VPPs within (step 38). Next, for each member of the list of VPs: If the a1 coordinating VPs have a common syntactic subject, that subject is distributed to each member VP of the list (step 39). One example is as follows.

Ｊｏｈｎ　ｃａｍｅ　ｉｎ　ａｎｄ　ｓａｔ　ｄｏｗｎ
。John came in and sat down
.

上記の文で、”　ＪＯｈｎ　”は、等位接続ＶＰ″ｃａ
ｍｅｉｎ″と５Ｂｔ　ｄｏｗｎ″の両方の構文上の主語
である。In the above sentence, “JOhn” is a coordinate connection VP”ca
It is the syntactic subject of both "main" and "5Bt down".

６０節が受動態の場合（たとえば、・Ｊｏｈｎ　ｗａｓ
ｈｕｇｇｅｄ　ａｎｄ　ｋｉｓｓｅｄ、　”　）　、各
メンバＶＰ中ノＰＡＳＳＩＶＥ属性がオンにセットされ
る（ステップ４０）。If verse 60 is in the passive voice (for example, John was
``hugged and kissed,''), the PASSIVE attribute in each member VP is set on (step 40).

Ｃ１最終的ｖＰ中に構文上の直接目的語がある場合、そ
の目的語がリストの各他動詞メンバＶＰに分配される（
ステップ４１）。その１例は次の通りである。C1 If there is a syntactic direct object in the final vP, the object is distributed to each transitive member VP of the list (
Step 41). One example is as follows.

Ｊｏｈｎ　ｗｒｏｔｅ　ａｎｄ　ｓｉｇｎｅｄ　ｔｈｅ
　ｄｏｃｕｍｅｎｔ。John wrote and signed the
document.

上記の文で１″ｔｈｅ　ｄｏｃｕｍｅｎｔ″は、等位接
続ＶＰ”　ｗｒｏｔｅ　”と”ｓｉｇｎｅｄ”の両方の
構文上の目的である。In the above sentence, 1"the document" is the syntactical purpose of both the coordinate connections VP "wrote" and "signed".

ｄ、その節に遠隔依存関係が存在する場合、前置要素が
リストの各メンバＶＰに分配される。（ステップ４２）
。その１例は次の通りである。d. If a remote dependency exists on that clause, the prefix element is distributed to each member VP of the list. (Step 42)
. One example is as follows.

Ｗｈａｔ　ｄｉｄ　Ｐｅｔｅｒ　ｓａｙ　ｔｈａｔ　Ｊ
ｏｈｎ　ｗｒｏｔｅ　ａｎｄＳ　ｉｇｎｅｄ？上記の文で、前置疑問詞”　ｗｈａｔ　”がＶ　Ｐ　”
　ｗｒｏｔｅ　”と、ＶＰ″ｓｉｇｎｅｄ″に分配され
る。依存関係は、後で第２Ｂ図に示す手順によって解決
される。What did Peter say that J
ohn wrote and signaled? In the above sentence, the prepositional interrogative “what” is V P ”
"wrote" and VP "signed". The dependencies are later resolved by the procedure shown in FIG. 2B.

ｅ、上記のすべての分配が行なわれた後、ＶＰリストの
個々の各メンバが引き数を識別するため主手順に送られ
る（ステップ４３）。次いで、ＶＰのリストの次のメン
バが処理される（ステップ４４）。リストが空になると
、手順は終了する。e. After all the above distributions have taken place, each individual member of the VP list is sent to the main procedure to identify the argument (step 43). The next member of the list of VPs is then processed (step 44). The procedure ends when the list is empty.

第２に図は、等位接続名詞句を分類する処理を示す。こ
の手順は、ＮＰ修飾部を処理する処理手順から呼び出さ
れる（第２Ｇ図参照）。まず、より大きな等位接続ＮＰ
内にあるすべてのＮＦＬからリストが構成される（ステ
ップ４５）。次に、ＮＰのリストの各メンバについて、ａ、ＭＯＤｓｆｆ性が親ＮＰ上に作成され、その値はリ
ストのＮＰメンバを指すポインタである（ステップ４６
）。この結果、すべての等位接続ＮＰがそれらの１ｆＡ
ＮＰのもとてＭＯＤＳとしてリストされることになる。Second, the figure shows the process of classifying coordinating noun phrases. This procedure is called from the procedure that processes NP modifiers (see Figure 2G). First, the larger coordinate connection NP
A list is constructed from all the NFLs within (step 45). Next, for each member of the list of NPs, a MODsff property is created on the parent NP whose value is a pointer to the NP member of the list (step 46
). As a result, all co-connected NPs have their 1fA
It will be listed as MODS under NP.

ｂ、ＮＰリストの個々の各メンバがそのように措置され
た後、引き数を識別するため主手順に送られる（ステッ
プ４７）。次に、ＮＰのリストの次のメンバが処理され
る（ステップ４８）。リストが空になると、手順が終了
する。b. After each individual member of the NP list is so disposed of, it is sent to the main procedure to identify the arguments (step 47). Next, the next member of the list of NPs is processed (step 48). The procedure ends when the list is empty.

第３図は、遠隔依存関係をもつ文°“Ａ１１ｃｅ。FIG. 3 shows a sentence “A11ce” that has a remote dependency relationship.

Ｐｅｔｅｒ　５ａｉｄ　ｔｈａｔ　Ｊｏｈｎ　ｋｉｓｓ
ｅｄ、　”の処理の際に行なわれる２つの構造段階を示
す。第１は、解析木４９として省略形で示した構文構造
である。この解析木は、ＰＥＧがこの入力文字列を処理
した後にレコード構造中に実際存在する少数の属性と値
のみを使って作成される。この解析木では、”　ＡＩ　
ｉｃｅ″が動詞″ｋｉｓｓｅｄ″から離されており、こ
の２つの単語の間に何らかの意味のある関係の指示はな
い。Peter 5aid that John kiss
The first is a syntactic structure, abbreviated as parse tree 49. This parse tree is a syntactic structure that occurs after PEG has processed this input string. It is created using only a small number of attributes and values that actually exist in the record structure.
``ice'' is separated from the verb ``kissed'' and there is no indication of any meaningful relationship between the two words.

第２は、基本意味構造、引き数構造５０であり、一種の
図として省略形で示されている。この図は、本発明で開
示された意味属性と引き数のみを使用し、それらの引き
数がその動詞によって正しくグループ分けされるように
インデントすることによって作成される。この引き数構
造では、”　ＡＩ　ｉｃｅ“は動詞″ｋｉｓｓｅｄ”の
ＤＯＢＪＥＣＴとして正しく連係されている。The second is the basic semantic structure, argument structure 50, which is shown in abbreviated form as a kind of diagram. This diagram is created by using only the semantic attributes and arguments disclosed in this invention and indenting the arguments so that they are correctly grouped by their verbs. In this argument structure, "AI ice" is correctly linked as a DOBJECT of the verb "kissed".

出力されるレコード構造は、ある自然言語を別の言語に
より正確に翻訳するために機械翻訳装置で使用される。The output record structure is used by a machine translation device to more accurately translate one natural language into another.

この出力は、高度のテキスト校訂、改良された情報検索
、電子メールの経路設定及びファイル編成、及び基本意
味解析を必要とするその他のＮＬＰ応用例にも適用でき
る。This output can also be applied to advanced text editing, improved information retrieval, email routing and file organization, and other NLP applications that require basic semantic analysis.

本発明は、前記の米国特許４７３１７３５号明細書に開
示されているようなデータ処理システムで走行するコン
ピュータ・プログラムとして具体化される。本発明を具
体化したプログラムは、システムのメモリに記憶され、
実行装置によって実行される。自然言語の単語の文字列
は、キーボード、大容量記憶装置、接続端末、または通
信リンクから実行装置に入力できる。本発明の構文解析
段階と意味解析段階は、データ処理システム中の本発明
のプログラム実施例によって実行される。The present invention is embodied as a computer program running on a data processing system such as that disclosed in the aforementioned U.S. Pat. No. 4,731,735. A program embodying the invention is stored in the memory of the system,
Executed by an execution unit. Strings of natural language words may be entered into the execution unit from a keyboard, mass storage device, attached terminal, or communication link. The parsing and semantic parsing steps of the present invention are performed by a programmatic embodiment of the present invention in a data processing system.

本発明のプログラム実施例によって出力された意味属性
レコード構造は、入力文字列の意味上の特徴づけとして
、表示装置、プリンタ、大容量記憶装置、通信リンク、
またはメモリの別の区画に出力することができる。表示
画面上でユーザに直接表示でき、同じデータ処理システ
ムまたは他のデータ処理システム上で走行する利用処理
またはプログラムに入力できる。The semantic attribute record structure output by the program embodiment of the present invention may include a display device, a printer, a mass storage device, a communication link, a
Or it can be output to another partition of memory. It can be displayed directly to a user on a display screen and input into a utilization process or program running on the same data processing system or on another data processing system.

本発明の特定の実施例を開示したが、当業者なら容易に
理解できるように、本発明の精神と範囲から逸脱するこ
となくこの具体的実施例に変更を加えることができる。Although a specific embodiment of the invention has been disclosed, those skilled in the art will readily recognize that changes may be made to the specific embodiment without departing from the spirit and scope of the invention.

【図面の簡単な説明】[Brief explanation of drawings]

第１Ａ図及び第１Ｂ図は、現在の言語学理論による見え
ない引き数の決定するための従来の方法を示す図である
。第２図は、引き数構造を決定するための方法の流れ図を
示す第２Ａ図ないし第２に図の関係を示す図である。第２Ａ図は、流れ図の不定詞補語及び分詞補語を処理す
る部分を示す図である。第２Ｂ図は、流れ図の遠隔引き数を処理する部分を示す
図である。第２Ｃ図は、流れ図の受動構文を処理する部分を示す図
である。第２Ｄ図は、流れ図の前置詞″ｔｏ”をもつ前置詞句中
に間接目的語がある間接目的構文を処理する部分を示す
図である。第２Ｅ図は、流れ図の述部形容詞を主部名詞句と連係す
る部分を示す図である。第２Ｆ図は、流れ図の動詞句修飾部を処理する部分を示
す図である。第２Ｇ図は、流れ図の名詞句修飾部を処理する部分を示
す図である。第２Ｈ図は、流れ図の形容詞句修飾部を処理する部分を
示す図である。第２１図は、流れ図の動詞句の修飾部を処理する部分を
示す図である。第２Ｊ図は、流れ図の等位接続動詞句を処理する部分を
示す図である。第２に図は、流れ図の等位接続名詞句を処理する部分を
示す図である。第３図は、遠隔依存関係の適切な解決法を示す例として
、文”Ａｌ１ｃｅ、　Ｐｅｔｅｒ　５ａｉｄ　ｔｈａｔ
　Ｊｏｈｎｋｉｓｓｅｄ、　”の完成した基本意味構造
（引き数構造）の解析木と表示を示す図である。FIGS. 1A and 1B are diagrams illustrating conventional methods for determining invisible arguments according to current linguistic theory. FIG. 2 is a diagram illustrating a flowchart of a method for determining argument structure; FIG. FIG. 2A is a diagram showing a portion of the flowchart that processes infinitive complements and participial complements. FIG. 2B is a diagram illustrating the portion of the flowchart that processes remote arguments. FIG. 2C is a diagram illustrating the portion of the flowchart that processes passive constructs. FIG. 2D is a diagram showing the portion of the flowchart that processes an indirect object construction in which the indirect object is in a prepositional phrase with the preposition "to." FIG. 2E is a diagram showing the part of the flowchart in which predicate adjectives are linked with subject noun phrases. FIG. 2F is a diagram showing the portion of the flowchart that processes the verb phrase modifier. FIG. 2G is a diagram showing the portion of the flowchart that processes the noun phrase modification portion. FIG. 2H is a diagram showing the portion of the flowchart that processes the adjective phrase modifier portion. FIG. 21 is a diagram showing a portion of the flowchart that processes the modification part of the verb phrase. FIG. 2J is a diagram illustrating the portion of the flowchart that processes coordinating verb phrases. The second figure is a diagram showing the portion of the flowchart that processes coordinating noun phrases. Figure 3 shows the sentence "Al1ce, Peter 5aid that
FIG. 3 is a diagram showing a parse tree and display of the completed basic semantic structure (argument structure) of Johnkissed.

Claims

【特許請求の範囲】[Claims]

（１）自然言語による言語表現を形成する複数の単語か
ら成る文字列を入力するステップと、入力文字列の構文構造を誘導する１組の構文規則によっ
て入力文字列を解析し、前記単語の構文引き数を識別す
るステップと、意味引き数が前記構文引き数と正確に対応するとき、前
記文字列中の第１の単語グループを意味引き数として識
別するステップとを有し、さらに、前記第１グループに
はない、前記文字列中の第２の単語グループの意味引き
数を識別するために、不定詞節及び分詞節の滅失してい
る引き数を指定するサブステップと、遠隔引き数を指定するサブステップと、受動構文の滅失しているまたは離れた引き数を指定する
サブステップと、間接目的構文の引き数を指定するサブステップと、前記入力単語文字列の正規化意味構造を実現する前記指
定引き数を出力するステップとを有することを特徴とす
る自然言語の単語文字列の基本意味構造を決定する自然
言語処理方法。(1) inputting a character string consisting of a plurality of words forming a linguistic expression in a natural language; parsing the input character string according to a set of syntactic rules that guide the syntactic structure of the input character string; identifying a first group of words in the string as a semantic argument when the semantic argument corresponds exactly with the syntactic argument; specifying missing arguments of infinitive clauses and participial clauses in order to identify semantic arguments of a second group of words in the string that are not in one group; a substep of specifying a missing or remote argument of a passive construct; a substep of specifying an argument of an indirect object construct; and realizing a normalized semantic structure of the input word string. A natural language processing method for determining a basic semantic structure of a word string of a natural language, comprising the step of outputting the specified argument.

（２）間接目的構文の引き数を指定するステップの後で
、述部形容詞をその主語名詞句と連係するステップ、動詞句修飾部をその動詞と連係するステップ、名詞句修
飾部をその名詞と連係するステップ、形容詞句修飾部を
その形容詞と連係するステップ、及び副詞句修飾部をその副詞と連係するステップを含み、修飾部が意味論的に前記入力単語文字列に連係される、請求項１に記載の自然言語処理方法。(2) After the step of specifying the argument of the indirect object construction, a step of associating the predicate adjective with its subject noun phrase, a step of associating the verb phrase modifier with the verb, and a step of associating the noun phrase modifier with the noun. associating, associating an adjective phrase modifier with its adjective, and associating an adverb phrase modifier with its adverb, wherein the modifier is semantically associated with the input word string. 1. The natural language processing method according to 1.

（３）前記指定ステップで、前記解析ステップで累積さ
れた構文情報のすべてを含む前記構文構造を走査する手
順によって基本意味構造が作成される、請求項１に記載
の自然言語処理方法。(3) The natural language processing method according to claim 1, wherein, in the specifying step, a basic semantic structure is created by a procedure of scanning the syntactic structure including all of the syntactic information accumulated in the parsing step.

（４）前記正規化意味構造が、意味論的に重要な新しい
属性と値とを前記構文構造に追加することによって実現
される、請求項３に記載の自然言語処理方法。(4) The natural language processing method according to claim 3, wherein the normalized semantic structure is realized by adding new semantically important attributes and values to the syntactic structure.

（５）自然言語による言語表現を形成する複数の単語か
らなる文字列を入力するステップと、入力文字列の構文構造を誘導する１組の構文規則によっ
て入力文字列を解析し、前記単語の構文引き数を識別す
るステップとを有し、さらに、前記入力文字列の意味引
き数を識別するために、不定詞節と分詞節の滅失してい
る引き数を指定するサブステップと、遠隔引き数を指定するサブステップと、受動構文の滅失しているまたは離れた引き数を指定する
サブステップと、間接目的構文の引き数を指定するサブステップと、前記入力単語文字列の正規化意味構造を実現する前記指
定引き数を出力するステップとを有することを特徴とす
る自然言語の単語文字列の基本意味構造を決定するため
の自然言語処理方法。(5) inputting a character string consisting of a plurality of words forming a linguistic expression in natural language; parsing the input character string according to a set of syntactic rules that guide the syntactic structure of the input character string; identifying missing arguments of infinitive clauses and participial clauses to identify semantic arguments of the input string; and remote arguments. a substep of specifying the missing or remote argument of the passive construction; a substep of specifying the argument of the indirect object construction; and a substep of specifying the normalized semantic structure of the input word string. A natural language processing method for determining a basic semantic structure of a word string of a natural language, comprising the step of outputting the specified argument for realizing the specified argument.

（６）間接目的構文の引き数を指定する前記ステップの
後で、述部形容詞をその主語名詞句と連係するステップ、動詞句修飾部をその動詞と連係するステップ、名詞句修
飾部をその名詞と連係するステップ、形容詞句修飾部を
その形容詞と連係するステップ、及び副詞句修飾部をその副詞と連係するステップを含み、修飾部が意味論的に前記入力単語文字列に連係される、請求項５に記載の自然言語処理方法。(6) After the above step of specifying the argument of the indirect object construction, the step of associating the predicate adjective with its subject noun phrase, the step of associating the verb phrase modifier with its verb, and the step of associating the noun phrase modifier with its noun , associating an adjective phrase modifier with its adjective, and associating an adverbial phrase modifier with its adverb, wherein the modifier is semantically associated with the input word string. The natural language processing method according to item 5.

（７）前記指定ステップで、前記解析ステップで累積さ
れた構文情報のすべてを含む前記構文構造を走査する手
順によって基本意味構造が作成される、請求項５に記載
の自然言語処理方法。(7) The natural language processing method according to claim 5, wherein, in the specifying step, a basic semantic structure is created by a procedure of scanning the syntactic structure including all of the syntactic information accumulated in the parsing step.

（８）前記正規化意味構造が、意味論的に重要な新しい
属性と値を前記構文構造に追加することによって達成で
きるという、請求項７に記載の自然言語処理方法。8. The natural language processing method of claim 7, wherein the normalized semantic structure is achieved by adding new semantically important attributes and values to the syntactic structure.

（９）自然言語の言語表現を形成する複数の単語から構
成される文字列を入力するステップと、入力文字列の構
文構造を誘導する、意味情報から自由な１組の構文規則
によって入力文字列を解析し、前記単語の構文引き数を
識別するステップとを有し、さらに、前記文字列の意味引き数を識別するために、不定詞節と
分詞節の滅失している引き数を指定するサブステップと
、遠隔引き数を指定するサブステップと、受動構文の滅失しているまたは離れた引き数を指定する
サブステップと、間接目的構文の引き数を指定するサブステップと、前記入力単語文字列の正規化意味構造を提供するレコー
ド中に前記指定引き数を出力するステップとを有するこ
とを特徴とする自然言語の単語文字列の基本意味構造を
決定する自然言語処理方法。(9) inputting a character string composed of a plurality of words forming a linguistic expression of a natural language; and a step of inputting a character string consisting of a plurality of words forming a linguistic expression of a natural language; and identifying the syntactic arguments of the word, and further specifying the missing arguments of infinitive clauses and participial clauses to identify the semantic arguments of the string. a substep of specifying a remote argument; a substep of specifying a missing or remote argument of a passive construction; a substep of specifying an argument of an indirect object construction; and outputting the specified argument in a record providing a normalized semantic structure of the string.

（１０）間接目的構文の引き数を指定する前記ステップ
の後で、述部形容詞をその主語名詞句と連係するステップ、動詞句修飾部をその動詞と連係するステップ、名詞句修
飾部をその名詞と連係するステップ、形容詞句修飾部を
その形容詞と連係するステップ、及び副詞句修飾部をその副詞と連係するステップを含み、修飾部が意味論的に前記入力単語文字列に連係される、請求項９に記載の自然言語処理方法。(10) After the above step of specifying the argument of the indirect object construction, a step of associating the predicate adjective with its subject noun phrase, a step of associating the verb phrase modifier with the verb, and a step of associating the noun phrase modifier with its noun. , associating an adjective phrase modifier with its adjective, and associating an adverbial phrase modifier with its adverb, wherein the modifier is semantically associated with the input word string. The natural language processing method according to item 9.

（１１）前記指定ステップで、前記解析ステップで累積
された構文情報のすべてを含む前記構文構造を走査する
手順によって基本意味構造が作成される、請求項９に記
載の自然言語処理方法。(11) The natural language processing method according to claim 9, wherein, in the specifying step, a basic semantic structure is created by a procedure of scanning the syntactic structure including all of the syntactic information accumulated in the parsing step.

（１２）前記正規化意味構造が、意味論的に重要な新し
い属性と値を前記構文構造に追加することによって達成
される、請求項１１に記載の自然言語処理方法。(12) The natural language processing method of claim 11, wherein the normalized semantic structure is achieved by adding new semantically important attributes and values to the syntactic structure.