JP2005157602A

JP2005157602A - Conversation control device, conversation control method, and those programs

Info

Publication number: JP2005157602A
Application number: JP2003393282A
Authority: JP
Inventors: Yosuke Hayashi; 洋介林
Original assignee: Aruze Corp
Current assignee: Universal Entertainment Corp
Priority date: 2003-11-25
Filing date: 2003-11-25
Publication date: 2005-06-16
Anticipated expiration: 2023-11-25
Also published as: JP4475628B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a conversation control device and a conversation control method for controlling a conversation with a user and achieving enhancement of a conversation database by automatically learning a new word, etc. <P>SOLUTION: This control device includes a conversation database means in which a plurality of items of topic specifying information are stored; a conversation control means for storing a talking history defined by previous conversations and previous responses and selecting response sentences by collating the topic specifying information and speech information defined by the talking history; and a new word registration means for storing the word included in the speech of the user into the database means, as new topic specifying information when the word does not correspond to any of the topic specifying information stored in the database means, outputting a sentence inquiring the meaning of the word to the user and storing the response returned from the user as a meaning information sentence corresponding to the topic specifying information. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、会話制御装置、会話制御方法並びにこれらのプログラムに関し、より詳しくは自動的に新しい言葉などを学習することにより会話データベースの充実を図ることが可能な、会話制御装置、会話制御方法並びにこれらのプログラムに関する。 The present invention relates to a conversation control device, a conversation control method, and a program thereof. More specifically, the present invention relates to a conversation control device, a conversation control method, and a program capable of enhancing a conversation database by automatically learning new words and the like. Regarding these programs.

近年、コンピュータを中心とした情報処理の手段として開発され進展してきたディジタル信号処理、高集積度ＬＳＩ製造技術、ＤＶＤなどディスクメディアの微細精密加工製造技術の高度化を背景にして、機械が自然な音声を話すようになったり、人の発する言葉を理解して適切な回答を返すことができる会話制御技術が提供されるようになってきている。 In recent years, machines have become more natural due to the advancement of digital signal processing, high-integration LSI manufacturing technology, and fine precision processing manufacturing technology for disk media such as DVDs, which have been developed and advanced as information processing means centering on computers. Conversation control technology has come to be provided that can speak a voice or understand a language spoken by a person and return an appropriate answer.

このような会話制御技術においては、会話の履歴、話題の推移などが加味された人間同士の会話と同様な自然な対話を実現可能な会話制御装置が提案されている（特許文献１）。
特開２００２−３５８３０４号公報 In such a conversation control technology, a conversation control apparatus has been proposed that can realize a natural conversation similar to a conversation between humans in consideration of conversation history, topic transition, and the like (Patent Document 1).
JP 2002-358304 A

一般に、上記のような会話制御装置は、ユーザの発話とこの発話に対応する回答文を予め記憶・登録する会話データベース手段を用意しておき、ユーザの発話に応じて、会話データベース手段からこれに対応する回答文を検索してユーザに返すことにより、会話制御装置に発話に意味認識を行わせ、この意味認識に基づいてユーザと会話制御装置間の会話を成立させるようになっている。従って、ユーザと会話制御装置間の会話の内容は、会話データベース手段に記憶・登録されている内容に依存しており、会話データベース手段に記憶・登録されている内容が多岐多様な単語、言葉であればあるほど、ユーザと会話制御装置間の会話の内容も変化に富んだ充実したものとなる。 In general, the conversation control device as described above prepares a conversation database means for storing and registering a user's utterance and an answer sentence corresponding to the utterance in advance, and the conversation database means responds to the user's utterance. By retrieving the corresponding answer sentence and returning it to the user, the conversation control device is made to recognize the meaning of the utterance, and the conversation between the user and the conversation control device is established based on the meaning recognition. Therefore, the contents of the conversation between the user and the conversation control device depend on the contents stored and registered in the conversation database means, and the contents stored and registered in the conversation database means are a wide variety of words and words. The more content there is, the richer and more varied the content of the conversation between the user and the conversation control device.

現状の会話制御装置の構築においては、会話制御装置の意味認識を完成させるに当たり、言葉、単語等の意味を個別にオペレータ（人間）による入力作業により会話データベース手段に登録することにより行っている。このような会話データベース手段に登録作業は、どのような言葉、単語を登録するのか、その言葉、単語についてどのような内容を登録するかを準備しておく必要があり、会話データベース手段を充実したものとするためには登録すべき言葉、単語数が数千から数万となるため、登録の準備に必要な作業及びオペレータによる入力作業は膨大な量となり、結果として会話制御装置の製品化の障害となっていた。 In the construction of the current conversation control device, in order to complete the recognition of the conversation control device, the meaning of words, words, etc. is individually registered in the conversation database means by an input operation by an operator (human). In order to register in such a conversation database means, it is necessary to prepare what kind of words and words are to be registered and what kind of contents are to be registered for the words and words. Since the number of words and words to be registered is several thousand to several tens of thousands for registration, the amount of work required for preparation for registration and input by the operator is enormous, resulting in the commercialization of a conversation control device. It was an obstacle.

本発明の目的は、自動的に新しい言葉などを学習することにより会話データベースの充実を図ることが可能なユーザと会話制御可能な会話制御装置並びに会話制御方法を提供することにある。 An object of the present invention is to provide a conversation control apparatus and a conversation control method capable of controlling a conversation with a user who can enhance a conversation database by automatically learning new words and the like.

上記課題を解決するための手段として、本発明は以下のような特徴を有する。 As means for solving the above problems, the present invention has the following features.

本発明の第１の態様は、ユーザからの発話に応答する回答を出力することが可能な会話制御装置として提案される。 A first aspect of the present invention is proposed as a conversation control device capable of outputting an answer in response to a user's utterance.

この会話制御装置は、複数の話題特定情報を記憶する会話データベース手段（会話データベース）と、従前の会話及び従前の回答により定まる談話履歴を記憶し、この談話履歴により定まる話題特定情報と発話情報とを照合して回答文を選択する会話制御手段（会話制御部）と、利用者の発話に含まれる語が、会話データベース手段に記憶された話題特定情報の内いずれにも相当しない場合、その語を新たな話題特定情報として会話データベース手段に記憶させるとともに、利用者にその語の意味を質問する文を出力し、利用者から返された返答をその話題特定情報に対応する意味情報文として会話データベース手段に記憶させる新単語登録手段（新単語登録部）とを有することを特徴としている。 This conversation control device stores a conversation database means (conversation database) for storing a plurality of topic specifying information, a conversation history determined by previous conversations and previous answers, topic specification information and speech information determined by the conversation history, If the word included in the user's utterance does not correspond to any of the topic identification information stored in the conversation database means Is stored as new topic identification information in the conversation database means, a sentence that asks the user the meaning of the word is output, and the response returned from the user is spoken as a semantic information sentence corresponding to the topic identification information. It has a new word registering means (new word registering part) stored in the database means.

この会話制御装置によれば、利用者と会話を行うことにより新たな語を会話データベース手段に追加登録することが可能となり、オペレータによる新語登録処理などの労力を要することなく、より記憶内容の豊富な会話制御装置を作成することが可能となる。 According to this conversation control device, it is possible to additionally register new words in the conversation database means by having a conversation with the user, and the memory contents are more abundant without requiring labor such as new word registration processing by the operator. It is possible to create a simple conversation control device.

上記の会話制御装置において、新単語登録手段は、ある話題特定情報について記憶された意味情報文を用いて、その話題特定情報の意味情報文の検証を行うための検証質問文を生成し、利用者がこの検証質問文に対して肯定する返答を返した場合には、その意味情報文の信頼度を示す数値を増加させ、次回検証質問文を生成する場合には、信頼度を示す数値が最も大きい意味情報文を用いて、その意味情報文の検証を行うための検証質問文を生成するようにしてもよい。
この会話制御装置によれば、自動的に新たに登録する語の意味内容を検証し、最も適切な意味をその語の意味として登録することが可能となる。 In the above conversation control device, the new word registering means generates and uses a verification question sentence for verifying the semantic information sentence of the topic specifying information using the semantic information sentence stored for the topic specifying information. When the person returns an affirmative response to the verification question sentence, the numerical value indicating the reliability of the semantic information sentence is increased, and when the next verification question sentence is generated, the numerical value indicating the reliability is A verification question sentence for verifying the semantic information sentence may be generated using the largest semantic information sentence.
According to this conversation control device, it is possible to automatically verify the meaning content of a newly registered word and register the most appropriate meaning as the meaning of the word.

また、上記の会話制御装置において、新単語登録手段は、ある話題特定情報について記憶された意味情報文を用いてその意味情報文の検証を行うための検証質問文を生成し、利用者がこの検証質問文に対して否定する返答を返した場合には、その話題特定情報の意味を問いただすための問い合わせ質問文を生成し、利用者がこの問い合わせ質問文に対する返答を返した場合に、その返答をその話題特定情報の新たな意味情報文として会話データベース手段に記憶させるようにしてもよい。 Further, in the above conversation control device, the new word registration means generates a verification question sentence for verifying the semantic information sentence using the semantic information sentence stored for a certain topic specifying information, and the user When a negative response is returned to the verification question message, an inquiry question message for asking the meaning of the topic specific information is generated, and when the user returns a response to this inquiry question message, the response May be stored in the conversation database means as a new semantic information sentence of the topic identification information.

かかる会話制御装置によれば、利用者から提供される意味を複数登録しておき、のちにこの複数登録されたものの中から、最適のものをその語の意味として登録することが可能となる。 According to such a conversation control device, it is possible to register a plurality of meanings provided by the user and then register the optimum one among the plurality of registered ones as the meaning of the word.

本発明の第２の態様は、ユーザからの発話に応答する回答を出力することが可能な会話制御方法として提案される。 The second aspect of the present invention is proposed as a conversation control method capable of outputting an answer in response to an utterance from a user.

この会話制御方法は、複数の話題特定情報を予め記憶させるステップと、利用者の発話に含まれる語が話題特定情報として未だ登録されていない場合、その語について、利用者に意味を問い合わせる問い合わせ質問文を出力するステップと、問い合わせ質問に対する利用者の回答を取得するステップと、この語を新たな話題特定情報として記憶させるとともに、回答をその話題特定情報の意味情報文として記憶させるステップとを有することを特徴としている。
この会話制御方法によれば、利用者と会話を行うことにより新たな語を追加登録することが可能となり、オペレータによる新語登録処理などの労力を要することなく、より記憶内容の豊富な会話制御方法を実現することが可能となる。 The conversation control method includes a step of storing a plurality of topic specifying information in advance, and an inquiry question that inquires the user about the meaning of a word included in the user's utterance if the word is not yet registered as the topic specifying information. A step of outputting a sentence; a step of acquiring a user's answer to the inquiry question; and storing the word as new topic specifying information and storing the answer as a semantic information sentence of the topic specifying information. It is characterized by that.
According to this conversation control method, a new word can be additionally registered by having a conversation with the user, and a conversation control method with abundant memory contents without requiring labor such as new word registration processing by the operator. Can be realized.

上記の会話制御方法は、ある話題特定情報について記憶された意味情報文を用いて、その話題特定情報の意味情報文の検証を行うための検証質問文を生成し、出力するステップと、利用者がこの検証質問文に対して肯定する返答を返した場合には、その意味情報文の信頼度を示す数値を増加させるステップと、話題特定情報について次に検証質問文を生成する場合には、信頼度を示す数値（肯定回数情報）が最も大きい意味情報文を用いて、その意味情報文の検証を行うための検証質問文を生成するステップとをさらに有するようにしてもよい。
この会話制御方法によれば、自動的に新たに登録する語の意味内容を検証し、最も適切な意味をその語の意味として登録することが可能となる。 The above conversation control method uses a semantic information sentence stored with respect to certain topic specifying information to generate and output a verification question sentence for verifying the semantic information sentence of the topic specifying information; Returns an affirmative response to this verification question sentence, the step of increasing the numerical value indicating the reliability of the semantic information sentence, and the next generation of the verification question sentence for the topic identification information, There may be further included a step of generating a verification question sentence for verifying the semantic information sentence by using the semantic information sentence having the largest numerical value (acknowledgment number information) indicating the reliability.
According to this conversation control method, it is possible to automatically verify the meaning content of a newly registered word and register the most appropriate meaning as the meaning of the word.

また、上記の会話制御方法は、ある話題特定情報について記憶された意味情報文を用いてその意味情報文の検証を行うための検証質問文を生成し、出力するステップと、利用者がこの検証質問文に対して否定する返答を返した場合には、その話題特定情報の意味を問いただすための問い合わせ質問文を生成し、出力するステップと、利用者がこの問い合わせ質問文に対する返答を返した場合に、その返答をその話題特定情報の新たな意味情報文として記憶させるステップとをさらに有していてもよい。 Further, the above conversation control method uses a semantic information sentence stored for a certain topic specifying information to generate and output a verification question sentence for verifying the semantic information sentence, and the user performs this verification. When a negative response is returned to the question text, a query question text is generated to ask the meaning of the topic specific information, and the output step, and the user returns a response to the query text And a step of storing the response as a new semantic information sentence of the topic specifying information.

本発明によれば、自動的に新しい言葉などを学習することにより会話データベースの充実を図ることが可能なユーザと会話制御可能な会話制御装置並びに会話制御方法及びこれらのプログラムを提供することが可能となる。 According to the present invention, it is possible to provide a conversation control device, a conversation control method, and a program thereof capable of controlling conversation with a user who can automatically enhance a conversation database by automatically learning new words and the like. It becomes.

以下に図面を参照しながら本発明の第１の実施の形態を説明する。
本実施の形態は、話者の発話内容及び／又は発話内容に対する回答文を出力可能な会話制御装置に関する。 A first embodiment of the present invention will be described below with reference to the drawings.
The present embodiment relates to a conversation control apparatus capable of outputting a speaker's utterance content and / or an answer sentence to the utterance content.

［１．会話制御装置の構成例］
［１．１．全体構成］
図１は、本実施の形態に係る学習機能を有する会話制御装置１の概略構成図である。 [1. Configuration example of conversation control device]
[1.1. overall structure]
FIG. 1 is a schematic configuration diagram of a conversation control apparatus 1 having a learning function according to the present embodiment.

会話制御装置１は、たとえばコンピュータ、ワークステーションなどの情報処理装置であって、演算処理装置（ＣＰＵ）、主メモリ（ＲＡＭ）、読出し専用メモリ（ＲＯＭ）、入出力装置（Ｉ／Ｏ）、ハードディスク装置等の外部記憶装置を具備している装置で構成されている。前記ＲＯＭ、もしくは外部記憶装置などに情報処理装置を会話制御装置１として機能させるためのプログラム、もしくは会話制御方法をコンピュータに実行させるためのプログラムが記憶されており、該プログラムを主メモリ上に載せ、ＣＰＵがこれを実行することにより会話制御装置１が実現される。また、上記プログラムは必ずしも当該装置内の記憶装置に記憶されていなくともよく、磁気ディスク、光ディスク、光磁気ディスク、ＣＤ（Compact Disc）、ＤＶＤ（Digital Video Disc）などのコンピュータ読み取り可能なプログラム記録媒体や、外部の装置（例えば、ＡＳＰ（アプリケーション・サービス・プロバイダ）のサーバなど）から提供され、これを主メモリに乗せる構成であっても良い。 The conversation control device 1 is an information processing device such as a computer or a workstation, for example, an arithmetic processing unit (CPU), a main memory (RAM), a read only memory (ROM), an input / output device (I / O), a hard disk. It is composed of a device having an external storage device such as a device. A program for causing the information processing device to function as the conversation control device 1 or a program for causing the computer to execute the conversation control method is stored in the ROM or an external storage device, and the program is loaded on the main memory. The conversation control device 1 is realized by the CPU executing this. In addition, the program does not necessarily have to be stored in a storage device in the apparatus, and a computer-readable program recording medium such as a magnetic disk, an optical disk, a magneto-optical disk, a CD (Compact Disc), a DVD (Digital Video Disc), etc. Alternatively, it may be provided from an external device (for example, a server of an ASP (Application Service Provider)), and this may be placed on the main memory.

図１に示すように、会話制御装置１は、入力部１００と、音声認識部２００と、会話制御部３００と、文解析部４００と、会話データベース５００と、出力部６００と、音声認識辞書記憶部７００と、新単語登録部８００を備えている。 As shown in FIG. 1, the conversation control device 1 includes an input unit 100, a speech recognition unit 200, a conversation control unit 300, a sentence analysis unit 400, a conversation database 500, an output unit 600, and a speech recognition dictionary storage. Unit 700 and new word registration unit 800.

［入力部］
入力部１００は、利用者から入力された入力情報を取得するものである。この入力部１００は、例えば、マイクロホンが挙げられる。入力部１００は、取得した発話内容に対応する音声を音声信号として音声認識部２００に出力する。 [Input section]
The input unit 100 acquires input information input from a user. An example of the input unit 100 is a microphone. The input unit 100 outputs the voice corresponding to the acquired utterance content to the voice recognition unit 200 as a voice signal.

［音声認識部］
音声認識部２００は、入力部１００で取得した発話内容に基づいて、発話内容に対応する文字列を特定するものである。具体的には、入力部１００から音声信号が入力された音声認識部２００は、入力された音声信号に基づいて、この音声信号を、音声認識辞書記憶部７００に格納されている辞書および会話データベース５００と照合して、音声信号から推測される音声認識結果を出力する。なお、図３に示す構成例では、音声認識部２００は、会話制御部３００に会話データベース５００の記憶内容の取得を要求し、会話制御部３００が要求に応じて取得した会話データベース５００の記憶内容を受け取るようになっているが、音声認識部２００が直接会話データベース５００の記憶内容を取得して音声信号との比較を行う構成であってもかまわない。 [Voice recognition part]
The voice recognition unit 200 specifies a character string corresponding to the utterance content based on the utterance content acquired by the input unit 100. Specifically, the speech recognition unit 200 to which the speech signal is input from the input unit 100, based on the input speech signal, uses this speech signal as a dictionary and a conversation database stored in the speech recognition dictionary storage unit 700. A speech recognition result estimated from the speech signal is output in comparison with 500. In the configuration example illustrated in FIG. 3, the speech recognition unit 200 requests the conversation control unit 300 to acquire the storage content of the conversation database 500, and the storage content of the conversation database 500 acquired by the conversation control unit 300 in response to the request. However, the voice recognition unit 200 may directly acquire the stored contents of the conversation database 500 and compare it with the voice signal.

［音声認識部の構成例］
図２に、音声認識部２００の構成例を示す機能ブロック図を示す。音声認識部２００は、特徴抽出部２００Ａと、バッファメモリ（ＢＭ）２００Ｂと、単語照合部２００Ｃと、バッファメモリ（ＢＭ）２００Ｄと、候補決定部２００Ｅと、単語仮説絞込部２００Ｆを有している。単語照合部２００Ｃ及び単語仮説絞込部２００Ｆは音声認識辞書記憶部７００に接続されており、候補決定部２００Ｅは会話制御部３００に接続されている。 [Configuration example of voice recognition unit]
FIG. 2 is a functional block diagram illustrating a configuration example of the voice recognition unit 200. The speech recognition unit 200 includes a feature extraction unit 200A, a buffer memory (BM) 200B, a word matching unit 200C, a buffer memory (BM) 200D, a candidate determination unit 200E, and a word hypothesis narrowing unit 200F. Yes. The word matching unit 200C and the word hypothesis narrowing unit 200F are connected to the speech recognition dictionary storage unit 700, and the candidate determination unit 200E is connected to the conversation control unit 300.

単語照合部２００Ｃに接続された音声認識辞書記憶部７００は、音素隠れマルコフモデル（以下、隠れマルコフモデルをＨＭＭという。）を記憶している。音素ＨＭＭは、各状態を含んで表され、各状態はそれぞれ以下の情報を有する。（ａ）状態番号、（ｂ）受理可能なコンテキストクラス、（ｃ）先行状態、及び後続状態のリスト、（ｄ）出力確率密度分布のパラメータ、及び（ｅ）自己遷移確率及び後続状態への遷移確率から構成されている。なお、本実施形態において用いる音素ＨＭＭは、各分布がどの話者に由来するかを特定する必要があるため、所定の話者混合ＨＭＭを変換して生成する。ここで、出力確率密度関数は３４次元の対角共分散行列をもつ混合ガウス分布である。また、単語照合部２００Ｃに接続された音声認識辞書記憶部７００は単語辞書を記憶している。単語辞書は、音素ＨＭＭの各単語毎にシンボルで表した読みを示すシンボル列を格納する。 The speech recognition dictionary storage unit 700 connected to the word matching unit 200C stores a phoneme hidden Markov model (hereinafter, the hidden Markov model is referred to as HMM). The phoneme HMM is represented including each state, and each state has the following information. (A) state number, (b) acceptable context class, (c) list of preceding and subsequent states, (d) parameters of output probability density distribution, and (e) self-transition probabilities and transitions to subsequent states. It consists of probabilities. Note that the phoneme HMM used in the present embodiment is generated by converting a predetermined speaker mixed HMM because it is necessary to specify which speaker each distribution is derived from. Here, the output probability density function is a mixed Gaussian distribution having a 34-dimensional diagonal covariance matrix. In addition, the speech recognition dictionary storage unit 700 connected to the word matching unit 200C stores a word dictionary. The word dictionary stores a symbol string indicating a symbolic reading for each word of the phoneme HMM.

話者の発声音声はマイクロホンなどに入力されて音声信号に変換された後、特徴抽出部２００Ａに入力される。特徴抽出部２００Ａは、入力された音声信号をＡ／Ｄ変換した後、特徴パラメータを抽出し、これを出力する。特徴パラメータを抽出し、これを出力する方法としては様々なものが考えられるが、例えば一例としては、ＬＰＣ分析を実行し、対数パワー、１６次ケプストラム係数、Δ対数パワー及び１６次Δケプストラム係数を含む３４次元の特徴パラメータを抽出する方法などが挙げられる。抽出された特徴パラメータの時系列はバッファメモリ（ＢＭ）２００Ｂを介して単語照合部２００Ｃに入力される。 The voice of the speaker is input to a microphone or the like and converted into an audio signal, and then input to the feature extraction unit 200A. The feature extraction unit 200A performs A / D conversion on the input audio signal, extracts feature parameters, and outputs them. There are various methods for extracting and outputting feature parameters. For example, as an example, LPC analysis is performed, and logarithmic power, 16th-order cepstrum coefficient, Δlogarithmic power, and 16th-order Δcepstrum coefficient are calculated. Examples include a method of extracting 34-dimensional feature parameters. The extracted time series of feature parameters is input to the word matching unit 200C via the buffer memory (BM) 200B.

単語照合部２００Ｃは、ワン−パス・ビタビ復号化法を用いて、バッファメモリ２００Ｂを介して入力される特徴パラメータのデータに基づいて、音声認識辞書記憶部７００に記憶された音素ＨＭＭと単語辞書とを用いて単語仮説を検出し、尤度を計算して出力する。ここで、単語照合部２００Ｃは、各時刻の各ＨＭＭの状態毎に、単語内の尤度と発声開始からの尤度を計算する。尤度は、単語の識別番号、単語の開始時刻、先行単語の違い毎に個別にもつ。また、計算処理量の削減のために、音素ＨＭＭ及び単語辞書とに基づいて計算される総尤度のうちの低い尤度のグリッド仮説を削減するようにしてもよい。単語照合部２００Ｃは、検出した単語仮説とその尤度の情報を発声開始時刻からの時間情報（具体的には、例えばフレーム番号）とともにバッファメモリ２００Ｄを介して候補決定部２００Ｅ及び単語仮説絞込部２００Ｆに出力する。 The word matching unit 200C uses the one-pass Viterbi decoding method to generate a phoneme HMM and a word dictionary stored in the speech recognition dictionary storage unit 700 based on feature parameter data input via the buffer memory 200B. Is used to detect the word hypothesis, and the likelihood is calculated and output. Here, the word matching unit 200C calculates the likelihood in the word and the likelihood from the start of utterance for each state of each HMM at each time. The likelihood is individually provided for each word identification number, word start time, and difference between preceding words. In order to reduce the amount of calculation processing, the low likelihood grid hypothesis among the total likelihoods calculated based on the phoneme HMM and the word dictionary may be reduced. The word matching unit 200C includes information on the detected word hypothesis and its likelihood along with time information from the utterance start time (specifically, for example, a frame number) and the candidate determination unit 200E and the word hypothesis narrowing down via the buffer memory 200D. Output to the unit 200F.

候補決定部２００Ｅは、会話制御部３００を参照して、検出した単語仮説と所定の談話空間内の話題特定情報とを比較し、検出した単語仮説の内、所定の談話空間内の話題特定情報と一致するものがあるか否かを判定し、一致するものがある場合は、その一致する単語仮説を認識結果として出力し、一方一致するものがない場合は、単語仮説絞込部２００Ｆに単語仮説の絞込を行うよう要求する。 The candidate determining unit 200E refers to the conversation control unit 300, compares the detected word hypothesis with the topic specifying information in the predetermined discourse space, and among the detected word hypotheses, the topic specifying information in the predetermined discourse space If there is a match, the matching word hypothesis is output as a recognition result. If there is no match, the word hypothesis narrowing unit 200F Request to narrow down hypotheses.

候補決定部２００Ｅの動作例を説明する。今、単語照合部２００Ｃが複数の単語仮説「カンタク」「カタク」「カントク」およびその尤度（認識率）を出力し、所定の談話空間は「映画」に関するものでありその話題特定情報には「カントク（監督）」は含まれているが、「カンタク（干拓）」及び「カタク（仮託）」は含まれていないとする。また「カンタク」「カタク」「カントク」の尤度（認識率）は「カンタク」が最も高く「カントク」は最も低く、「カタク」は両者の中間であったとする。 An operation example of the candidate determination unit 200E will be described. Now, the word matching unit 200C outputs a plurality of word hypotheses “Kantaku”, “Katak”, “Kantoku” and the likelihood (recognition rate), and the predetermined discourse space relates to “movie”, and the topic specifying information includes “Kantoku (director)” is included, but “Kantaku (reclaimed)” and “Katak (temporary contract)” are not included. Further, the likelihood (recognition rate) of “Kantaku”, “Katak”, and “Kantoku” is highest in “Kantaku”, lowest in “Kantoku”, and “Katak” is in between.

上記の状況において、候補決定部２００Ｅは、検出した単語仮説と所定の談話空間内の話題特定情報とを比較して、単語仮説「カントク」が、所定の談話空間内の話題特定情報と一致するものであると判定し、単語仮説「カントク」を認識結果として出力し、会話制御部に渡す。このように処理することにより、現在扱われている話題「映画」に関連した「カントク（監督）」が、より上位の尤度（認識率）を有する単語仮説「カンタク」「カタク」に優先されて選択され、その結果会話の文脈に即した音声認識結果を出力することが可能となる。 In the above situation, the candidate determining unit 200E compares the detected word hypothesis with the topic specifying information in the predetermined discourse space, and the word hypothesis “Kantoku” matches the topic specifying information in the predetermined discourse space. The word hypothesis “Kantoku” is output as a recognition result and passed to the conversation control unit. By processing in this way, “Kantoku (Director)” related to the topic “Movie” currently being handled is prioritized over the word hypotheses “Kantaku” and “Katak” with higher likelihood (recognition rate). As a result, it is possible to output a speech recognition result in accordance with the context of the conversation.

一方、一致するものがない場合は、候補決定部２００Ｅからの単語仮説の絞込を行う要求に応じて単語仮説絞込部２００Ｆが認識結果を出力するよう動作する。単語仮説絞込部２００Ｆは、単語照合部２００Ｃからバッファメモリ２００Ｄを介して出力される複数個の単語仮説に基づいて、音声認識辞書記憶部７００に記憶された統計的言語モデルを参照して、終了時刻が等しく開始時刻が異なる同一の単語の単語仮説に対して、当該単語の先頭音素環境毎に、発声開始時刻から当該単語の終了時刻に至る計算された総尤度のうちの最も高い尤度を有する１つの単語仮説で代表させるように単語仮説の絞り込みを行った後、絞り込み後のすべての単語仮説の単語列のうち、最大の総尤度を有する仮説の単語列を認識結果として出力する。本実施形態においては、好ましくは、処理すべき当該単語の先頭音素環境とは、当該単語より先行する単語仮説の最終音素と、当該単語の単語仮説の最初の２つの音素とを含む３つの音素並びをいう。 On the other hand, if there is no match, the word hypothesis narrowing unit 200F operates to output the recognition result in response to a request for narrowing down word hypotheses from the candidate determination unit 200E. The word hypothesis narrowing-down unit 200F refers to a statistical language model stored in the speech recognition dictionary storage unit 700 based on a plurality of word hypotheses output from the word matching unit 200C via the buffer memory 200D. For the same word hypothesis of the same word with the same end time but different start time, the highest likelihood of the calculated total likelihood from the utterance start time to the end time of the word for each head phoneme environment of the word After narrowing down word hypotheses so that they are represented by a single word hypothesis having a degree, a word string of a hypothesis having the maximum total likelihood is output as a recognition result among the word strings of all the word hypotheses after narrowing down To do. In the present embodiment, preferably, the first phoneme environment of the word to be processed is three phonemes including the final phoneme of the word hypothesis preceding the word and the first two phonemes of the word hypothesis of the word. Say a line.

単語仮説絞込部２００Ｆによる単語絞込処理の例を図３を参照しながら説明する。図３は、単語仮説絞込部２００Ｆの処理の一例を示すタイミングチャートである。
例えば（ｉ−１）番目の単語Ｗi-１の次に、音素列ａ１，ａ２，…，ａnからなるｉ番目の単語Ｗiがくるときに、単語Ｗi-１の単語仮説として６つの仮説Ｗａ，Ｗｂ，Ｗｃ，Ｗｄ，Ｗｅ，Ｗｆが存在しているとする。ここで、前者３つの単語仮説Ｗａ，Ｗｂ，Ｗｃの最終音素は／ｘ／であるとし、後者３つの単語仮説Ｗｄ，Ｗｅ，Ｗｆの最終音素は／ｙ／であるとする。終了時刻ｔeにおいて単語仮説Ｗａ，Ｗｂ，Ｗｃを前提とする３つの仮説と、単語仮説Ｗｄ，Ｗｅ，Ｗｆを前提とする１の仮説が残っているものとすると、先頭音素環境が等しい前者３つ仮説のうち、総尤度が最も高い仮説一つを残し、その他を削除する。 An example of word narrowing processing by the word hypothesis narrowing unit 200F will be described with reference to FIG. FIG. 3 is a timing chart showing an example of processing of the word hypothesis narrowing-down unit 200F.
For example, when the i-th word Wi consisting of the phoneme sequence a1, a2,..., An comes after the (i-1) -th word Wi-1, six hypotheses Wa, Assume that Wb, Wc, Wd, We, and Wf exist. Here, it is assumed that the final phoneme of the former three word hypotheses Wa, Wb, and Wc is / x /, and the final phoneme of the latter three word hypotheses Wd, We, and Wf is / y /. Assuming that three hypotheses premised on the word hypotheses Wa, Wb, and Wc and one hypothesis premised on the word hypotheses Wd, We, and Wf remain at the end time te, the former three with the same initial phoneme environment Of the hypotheses, one of the hypotheses with the highest total likelihood is left and the others are deleted.

なお、単語仮説Ｗｄ，Ｗｅ，Ｗｆを前提とする仮説は先頭音素環境が他の３つの仮説と違うため、すなわち、先行する単語仮説の最終音素がｘではなくｙであるため、この単語仮説Ｗｄ，Ｗｅ，Ｗｆを前提とする仮説は削除しない。すなわち、先行する単語仮説の最終音素毎に１つのみ仮説を残す。 Note that the hypothesis premised on the word hypothesis Wd, We, Wf is that the leading phoneme environment is different from the other three hypotheses, that is, the final phoneme of the preceding word hypothesis is y instead of x, so this word hypothesis Wd , We and Wf are not deleted. That is, only one hypothesis is left for each final phoneme of the preceding word hypothesis.

以上の実施形態においては、当該単語の先頭音素環境とは、当該単語より先行する単語仮説の最終音素と、当該単語の単語仮説の最初の２つの音素とを含む３つの音素並びとして定義されているが、本発明はこれに限らず、先行する単語仮説の最終音素と、最終音素と連続する先行する単語仮説の少なくとも１つの音素とを含む先行単語仮説の音素列と、当該単語の単語仮説の最初の音素を含む音素列とを含む音素並びとしてもよい。
以上の実施の形態において、特徴抽出部２００Ａと、単語照合部２００Ｃと、候補決定部２００Ｅと、単語仮説絞込部２００Ｆとは、例えば、デジタル電子計算機などのコンピュータで構成され、バッファメモリ２００Ｂ，２００Ｄと、音声認識辞書記憶部７００とは、例えばハードディスク、メモリなどの記憶装置で構成される。 In the above embodiment, the first phoneme environment of the word is defined as three phoneme sequences including the final phoneme of the word hypothesis preceding the word and the first two phonemes of the word hypothesis of the word. However, the present invention is not limited to this, the phoneme string of the preceding word hypothesis including the final phoneme of the preceding word hypothesis, and at least one phoneme of the preceding word hypothesis continuous with the last phoneme, and the word hypothesis of the word A phoneme sequence including a phoneme string including the first phoneme may be used.
In the above embodiment, the feature extraction unit 200A, the word collation unit 200C, the candidate determination unit 200E, and the word hypothesis narrowing unit 200F are configured by a computer such as a digital electronic computer, for example, and include a buffer memory 200B, The 200D and the speech recognition dictionary storage unit 700 are configured by a storage device such as a hard disk or a memory.

以上の実施の形態においては、単語照合部２００Ｃ、単語仮説絞込部２００Ｆとを用いて音声認識を行っているが、本発明はこれに限られず、例えば、音素ＨＭＭを参照する音素照合部と、例えばＯｎｅ
ＰａｓｓＤＰアルゴリズムを用いて統計的言語モデルを参照して単語の音声認識を行う音声認識部とで構成してもよい。
また、本実施の形態では、音声認識部２００は会話制御装置１の一部分として説明するが、音声認識部２００、音声認識辞書記憶部７００、及び会話データベース５００より構成される、独立した音声認識装置とすることも可能である。 In the above embodiment, speech recognition is performed using the word matching unit 200C and the word hypothesis narrowing unit 200F. However, the present invention is not limited to this. For example, a phoneme matching unit that refers to a phoneme HMM For example, One
You may comprise with the speech recognition part which performs speech recognition of a word with reference to a statistical language model using a Pass DP algorithm.
In the present embodiment, the voice recognition unit 200 is described as a part of the conversation control device 1, but an independent voice recognition device including the voice recognition unit 200, the voice recognition dictionary storage unit 700, and the conversation database 500. It is also possible.

［音声認識部の動作例］
つぎに図４を参照しながら音声認識部２００の動作について説明する。図４は、音声認識部２００の動作例を示すフロー図である。入力部１００より音声信号を受け取ると、音声認識部２００は入力された音声の特徴分析を行い、特徴パラメータを生成する（ステップＳ４０１）。次に、この特徴パラメータと音声認識辞書記憶部７００に記憶された音素ＨＭＭ及び言語モデルとを比較して、所定個数の単語仮説及びその尤度を取得する（ステップＳ４０２）。次に、音声認識部２００は、取得した所定個数の単語仮説と検出した単語仮説と所定の談話空間内の話題特定情報とを比較し、検出した単語仮説の内、所定の談話空間内の話題特定情報と一致するものがあるか否かを判定する（ステップＳ４０３、Ｓ４０４）。一致するものがある場合は、音声認識部２００はその一致する単語仮説を認識結果として出力する（ステップＳ４０５）。一方、一致するものがない場合は、音声認識部２００は取得した単語仮説の尤度に従って、最大尤度を有する単語仮説を認識結果として出力する（ステップＳ４０６）。 [Operation example of voice recognition unit]
Next, the operation of the speech recognition unit 200 will be described with reference to FIG. FIG. 4 is a flowchart showing an operation example of the voice recognition unit 200. When receiving a speech signal from the input unit 100, the speech recognition unit 200 performs a feature analysis of the input speech and generates a feature parameter (step S401). Next, this feature parameter is compared with the phoneme HMM and language model stored in the speech recognition dictionary storage unit 700 to obtain a predetermined number of word hypotheses and their likelihoods (step S402). Next, the speech recognition unit 200 compares the acquired predetermined number of word hypotheses with the detected word hypothesis and the topic identification information in the predetermined discourse space, and among the detected word hypotheses, the topic in the predetermined discourse space It is determined whether there is a match with the specific information (steps S403 and S404). If there is a match, the speech recognition unit 200 outputs the matching word hypothesis as a recognition result (step S405). On the other hand, if there is no match, the speech recognition unit 200 outputs the word hypothesis having the maximum likelihood as the recognition result according to the acquired likelihood of the word hypothesis (step S406).

［音声認識辞書記憶部］
再び、図１に戻って、会話制御装置１の構成例の説明を続ける。
音声認識辞書記憶部７００は、標準的な音声信号に対応する文字列を格納するものである。この照合をした音声認識部２００は、その音声信号に対応する単語仮説に対応する文字列を特定し、その特定した文字列を文字列信号として会話制御部３００に出力する。 [Voice recognition dictionary storage]
Returning to FIG. 1 again, the description of the configuration example of the conversation control device 1 will be continued.
The voice recognition dictionary storage unit 700 stores a character string corresponding to a standard voice signal. The voice recognition unit 200 that has performed the collation identifies a character string corresponding to the word hypothesis corresponding to the voice signal, and outputs the identified character string to the conversation control unit 300 as a character string signal.

［文解析部］
次に、図５を参照しながら文解析部４００の構成例について説明する。図５は、会話制御装置１の部分拡大ブロック図であって、会話制御部３００及び文解析部４００の具体的構成例を示すブロック図である。なお、図５においては、会話制御部３００、文解析部４００、および会話データベース５００のみ図示しており、その他の構成要素の表示は省略されている。 [Sentence Analysis Department]
Next, a configuration example of the sentence analysis unit 400 will be described with reference to FIG. FIG. 5 is a partially enlarged block diagram of the conversation control device 1, and is a block diagram illustrating a specific configuration example of the conversation control unit 300 and the sentence analysis unit 400. In FIG. 5, only the conversation control unit 300, the sentence analysis unit 400, and the conversation database 500 are shown, and the display of other components is omitted.

文解析部４００は、入力部１００又は音声認識部２００で特定された文字列を解析するものである。この文解析部４００は、本実施の形態では、図５に示すように、文字列特定部４１０と、形態素抽出部４２０と、形態素データベース４３０と、入力種類判定部４４０と、発話種類データベース４５０とを有している。文字列特定部４１０は、入力部１００及び音声認識部２００で特定された一連の文字列を一文節毎に区切るものである。この一文節とは、文法の意味を崩さない程度に文字列をできるだけ細かく区切った一区切り文を意味する。具体的に、文字列特定部４１０は、一連の文字列の中に、ある一定以上の時間間隔があるときは、その部分で文字列を区切る。文字列特定部４１０は、その区切った各文字列を形態素抽出部４２０及び入力種類判定部４４０に出力する。尚、以下で説明する「文字列」は、一文節毎の文字列を意味するものとする。 The sentence analysis unit 400 analyzes the character string specified by the input unit 100 or the speech recognition unit 200. In this embodiment, as shown in FIG. 5, the sentence analysis unit 400 includes a character string identification unit 410, a morpheme extraction unit 420, a morpheme database 430, an input type determination unit 440, and an utterance type database 450. have. The character string specifying unit 410 divides a series of character strings specified by the input unit 100 and the speech recognition unit 200 into one sentence. This one-sentence means a delimiter sentence in which character strings are divided as finely as possible without breaking the meaning of the grammar. Specifically, when there is a certain time interval or more in a series of character strings, the character string specifying unit 410 divides the character string at that portion. The character string specifying unit 410 outputs the divided character strings to the morpheme extracting unit 420 and the input type determining unit 440. It should be noted that “character string” described below means a character string for each phrase.

［形態素抽出部］
形態素抽出部４２０は、文字列特定部４１０で区切られた一文節の文字列に基づいて、その一文節の文字列の中から、文字列の最小単位を構成する各形態素を第一形態素情報として抽出するものである。ここで、形態素とは、本実施の形態では、文字列に現された語構成の最小単位を意味するものとする。この語構成の最小単位としては、例えば、名詞、形容詞、動詞などの品詞が挙げられる。
各形態素は、図６に示すように、本実施の形態ではm１,m２,m３…,と表現することができる。図６は、文字列とこの文字列から抽出される形態素との関係を示す図である。図６に示すように、文字列特定部４１０から文字列が入力された形態素抽出部４２０は、入力された文字列と、形態素データベース４３０に予め格納されている形態素群(この形態素群は、それぞれの品詞分類に属する各形態素についてその形態素の見出し語・読み・品詞・活用形などを記述した形態素辞書として用意されている)とを照合する。その照合をした形態素抽出部４２０は、その文字列の中から、予め記憶された形態素群のいずれかと一致する各形態素(m１,m２、…)を抽出する。この抽出された各形態素を除いた要素(n１,n２,n３…)は、例えば助動詞等が挙げられる。 [Morpheme extraction unit]
The morpheme extraction unit 420 sets, as first morpheme information, each morpheme constituting the minimum unit of the character string from the character string of the one phrase according to the character string of the one sentence divided by the character string specifying unit 410. To extract. Here, in this embodiment, the morpheme means the minimum unit of the word structure represented in the character string. Examples of the minimum unit of the word structure include parts of speech such as nouns, adjectives and verbs.
As shown in FIG. 6, each morpheme can be expressed as m1, m2, m3... In the present embodiment. FIG. 6 is a diagram illustrating a relationship between a character string and a morpheme extracted from the character string. As shown in FIG. 6, the morpheme extraction unit 420 to which the character string is input from the character string specifying unit 410 includes the input character string and a morpheme group stored in advance in the morpheme database 430. And each morpheme belonging to the part-of-speech classification is prepared as a morpheme dictionary describing the morpheme entry word, reading, part-of-speech, utilization form, etc.). The matched morpheme extraction unit 420 extracts each morpheme (m1, m2,...) That matches one of the previously stored morpheme groups from the character string. Examples of the elements (n1, n2, n3...) Excluding each extracted morpheme include auxiliary verbs.

この形態素抽出部４２０は、抽出した各形態素を第一形態素情報として管理部３１０に出力する。なお、第一形態素情報は構造化されている必要はない。ここで「構造化」とは、文字列の中に含まれる形態素を品詞等に基づいて分類し配列することをいい、たとえば発話文である文字列を、「主語＋目的語＋述語」などの様に、所定の順番で形態素を配列してなるデータに変換することを言う。もちろん、構造化した第一形態素情報を用いたとしても、それが本実施の形態を実現を妨げることはない。 The morpheme extraction unit 420 outputs each extracted morpheme to the management unit 310 as first morpheme information. Note that the first morpheme information need not be structured. Here, “structured” means to classify and arrange morphemes contained in a character string based on the part of speech, for example, a character string that is an utterance sentence, such as “subject + object + predicate”. In the same way, it refers to conversion into data obtained by arranging morphemes in a predetermined order. Of course, even if structured first morpheme information is used, it does not prevent the present embodiment from being realized.

［入力種類判定部］
入力種類判定部４４０は、文字列特定部４１０で特定された文字列に基づいて、発話内容の種類(発話種類)を判定するものである。この発話種類は、発話内容の種類を特定する情報であって、本実施の形態では、例えば図７に示す「発話文のタイプ」を意味する。図７は、「発話文のタイプ」と、その発話文のタイプを表す二文字のアルファベット、及びその発話文のタイプに該当する発話文の例を示す図である。 [Input type determination unit]
The input type determination unit 440 determines the type of utterance content (speech type) based on the character string specified by the character string specifying unit 410. This utterance type is information for specifying the type of utterance content, and in the present embodiment, it means, for example, the “spoken sentence type” shown in FIG. FIG. 7 is a diagram illustrating an example of an “uttered sentence type”, a two-letter alphabet representing the type of the spoken sentence, and an spoken sentence corresponding to the type of the spoken sentence.

ここで、「発話文のタイプ」は、本実施の形態では、図７に示すように、陳述文(D ; Declaration)、時間文(T ; Time)、場所文(L ; Location)、反発文(N ; Negation)などから構成される。この各タイプから構成される文は、肯定文又は質問文で構成される。「陳述文」とは、利用者の意見又は考えを示す文を意味するものである。この陳述文は本実施の形態では、図７に示すように、例えば"私は佐藤が好きです"などの文が挙げられる。「場所文」とは、場所的な概念を伴う文を意味するものである。「時間文」とは、時間的な概念を伴う文を意味するものである。「反発文」とは、陳述文を否定するときの文を意味する。「発話文のタイプ」についての例文は図７に示す通りである。 Here, in the present embodiment, the “spoken sentence type” includes a statement sentence (D; Declaration), a time sentence (T; Time), a location sentence (L; Location), and a repulsive sentence, as shown in FIG. (N; Negation). The sentence composed of each type is composed of an affirmative sentence or a question sentence. The “declaration sentence” means a sentence indicating a user's opinion or idea. In the present embodiment, the statement includes a sentence such as “I like Sato” as shown in FIG. “Place sentence” means a sentence with a place concept. “Time sentence” means a sentence with a temporal concept. “Rebound sentence” means a sentence when a statement is denied. An example sentence for “spoken sentence type” is as shown in FIG.

入力種類判定部４４０が「発話文のタイプ」を判定するには、入力種類判定部４４０は、本実施の形態では、図８に示すように、陳述文であることを判定するための定義表現辞書、反発文であることを判定するための反発表現辞書等を用いる。具体的に、文字列特定部４１０から文字列が入力された入力種類判定部４４０は、入力された文字列に基づいて、その文字列と発話種類データベース４５０に格納されている各辞書とを照合する。その照合をした入力種類判定部４４０は、その文字列の中から、各辞書に関係する要素を抽出する。 In order for the input type determination unit 440 to determine the “spoken sentence type”, in this embodiment, the input type determination unit 440 defines a definition expression for determining that it is a statement sentence as shown in FIG. A dictionary, a repulsive expression dictionary for determining that the sentence is a repelled sentence, and the like are used. Specifically, the input type determination unit 440 to which the character string is input from the character string specifying unit 410 compares the character string with each dictionary stored in the utterance type database 450 based on the input character string. To do. The input type determination unit 440 that has performed the collation extracts elements related to each dictionary from the character string.

この入力種類判定部４４０は、抽出した要素に基づいて、「発話文のタイプ」を判定する。例えば、入力種類判定部４４０は、ある事象について陳述している要素が文字列の中に含まれる場合には、その要素が含まれている文字列を陳述文として判定する。入力種類判定部４４０は、判定した「発話文のタイプ」を回答取得部３５０に出力する。 The input type determination unit 440 determines “spoken sentence type” based on the extracted elements. For example, when an element that describes a certain event is included in a character string, the input type determination unit 440 determines the character string that includes the element as a statement. The input type determination unit 440 outputs the determined “spoken sentence type” to the answer acquisition unit 350.

［会話データベース］
次に、会話データベース５００が記憶するデータのデータ構成例について図９を参照しながら説明する。図９は、会話データベース５００が記憶するデータのデータ構成の一例を示す概念図である。 [Conversation database]
Next, a data configuration example of data stored in the conversation database 500 will be described with reference to FIG. FIG. 9 is a conceptual diagram illustrating an example of a data configuration of data stored in the conversation database 500.

前記会話データベース５００は、図９に示すように、話題を特定するための話題特定情報８１０を予め複数記憶する。又、それぞれの話題特定情報８１０は、他の話題特定情報と関連づけられていてもよく、例えば、図９に示す例では、話題特定情報Ｃが特定されると、この話題特定情報Ｃに関連づけられている他の話題特定情報Ａ、Ｂ，Ｄが定まるように記憶されている。
具体的には、話題特定情報８１０は、本実施の形態では、利用者から入力されると予想される入力内容又は利用者への回答文に関連性のある「キーワード」を意味する。 As shown in FIG. 9, the conversation database 500 stores a plurality of pieces of topic specifying information 810 for specifying topics in advance. Each topic specifying information 810 may be associated with other topic specifying information. For example, in the example shown in FIG. 9, when the topic specifying information C is specified, it is associated with the topic specifying information C. The other topic specifying information A, B, D is stored so as to be determined.
Specifically, in the present embodiment, the topic identification information 810 means “keywords” that are relevant to the input content expected to be input by the user or the answer sentence to the user.

話題特定情報８１０には、一又は複数の話題タイトル８２０が対応付けられて記憶されている。話題タイトル８２０は、一つの文字、複数の文字列又はこれらの組み合わせからなる形態素により構成されている。各話題タイトル８２０には、利用者への回答文８３０が対応付けられて記憶されている。また、回答文８３０の種類を示す複数の回答種類は、回答文８３０に対応付けられている。 One or more topic titles 820 are stored in the topic specifying information 810 in association with each other. The topic title 820 is composed of morphemes composed of one character, a plurality of character strings, or a combination thereof. Each topic title 820 stores an answer sentence 830 to the user in association with it. A plurality of answer types indicating the type of the answer sentence 830 are associated with the answer sentence 830.

次に、ある話題特定情報と他の話題特定情報との関連づけについて説明する。図１０は、ある話題特定情報８１０Ａと他の話題特定情報８１０Ｂ、８１０Ｃ_１〜８１０Ｃ_４、８１０Ｄ_１〜８１０Ｄ_３…との関連付けを示す図である。なお、以下の説明において「関連づけされて記憶される」とは、ある情報Ｘを読み取るとその情報Ｘに関連づけられている情報Ｙを読み取りできることをいい、例えば、情報Ｘのデータの中に情報Ｙを読み出すための情報（例えば、情報Ｙの格納先アドレスを示すポインタ、情報Ｙの格納先物理メモリアドレス、論理アドレスなど）が格納されている状態を、「情報Ｙが情報Ｘに『関連づけされて記憶され』ている」というものとする。 Next, the association between certain topic specifying information and other topic specifying information will be described. FIG. 10 is a diagram showing an association between certain topic specifying information 810A and other topic specifying information 810B, 810C _{1 to} 810C ₄ , 810D _{1 to} 810D ₃ . In the following description, “stored in association” means that when information X is read, information Y associated with the information X can be read. For example, information Y in the data of the information X Is stored as information (for example, a pointer indicating the storage destination address of information Y, a physical memory address of the storage destination of information Y, and a logical address). "Remembered".

図１０に示す例では、話題特定情報は他の話題特定情報との間で上位概念、下位概念、同義語、対義語（本図の例では省略）が関連づけされて記憶させることができる。本図に示す例では、話題特定情報８１０Ａ(＝「映画」)に対する上位概念の話題特定情報として話題特定情報８１０Ｂ(＝「娯楽」)が話題特定情報８１０Ａに関連づけされて記憶されており、たとえば話題特定情報(「映画」)に対して上の階層に記憶される。 In the example shown in FIG. 10, topic specific information can be stored in association with other topic specific information in association with a higher concept, a lower concept, a synonym, and a synonym (omitted in the example of this figure). In the example shown in this figure, topic specifying information 810B (= “entertainment”) is stored in association with the topic specifying information 810A as topic specifying information of the higher concept for the topic specifying information 810A (= “movie”). The topic specific information (“movie”) is stored in the upper hierarchy.

また、話題特定情報８１０Ａ(＝「映画」)に対する下位概念の話題特定情報８１０Ｃ_１（＝「監督」）、話題特定情報８１０Ｃ_２（＝「主演」）、話題特定情報８１０Ｃ_３（＝「配給会社」）、話題特定情報８１０Ｃ_４（＝「上映時間」）、および話題特定情報８１０Ｄ_１（＝「七人の侍」）、話題特定情報８１０Ｄ_２（＝「乱」）、話題特定情報８１０Ｄ_３（＝「用心棒」）、…、が話題特定情報８１０Ａに関連づけされて記憶されている。 Further, topic specific information 810C ₁ (= “director”), topic specific information 810C ₂ (= “starring”), topic specific information 810C ₃ (= “distribution company” for the topic specific information 810A (= “movie”) )), Topic identification information 810C ₄ (= “screening time”), topic identification information 810D ₁ (= “Seven Samurai”), topic identification information 810D ₂ (= “Ran”), topic identification information 810D ₃ ( = "Bouncer"), ... are stored in association with the topic identification information 810A.

又、話題特定情報８１０Ａには、同義語９００が関連づけられている。この例では、話題特定情報８１０Ａであるキーワード「映画」の同義語として「作品」、「内容」、「シネマ」が記憶されている様子を示している。このような同意語を定めることにより、発話にはキーワード「映画」は含まれていないが「作品」、「内容」、「シネマ」が発話文等に含まれている場合に、話題特定情報８１０Ａが発話文等に含まれているものとして取り扱い、またその逆の取り扱いをすることを可能とする。 In addition, the synonym 900 is associated with the topic identification information 810A. In this example, “works”, “contents”, and “cinema” are stored as synonyms of the keyword “movie” that is the topic identification information 810A. By defining such synonyms, the topic specifying information 810A is obtained when the utterance does not include the keyword “movie” but includes “works”, “contents”, and “cinema” in the utterance sentence or the like. Can be handled as being included in an utterance, etc., and vice versa.

本実施の形態にかかる会話制御装置１は、会話データベース５００の記憶内容を参照することにより、ある話題特定情報を特定するとその話題特定情報に関連づけられて記憶されている他の話題特定情報及びその話題特定情報の話題タイトル、回答文などを高速で検索・抽出することが可能となる。 When the conversation control device 1 according to the present embodiment identifies certain topic identification information by referring to the stored content of the conversation database 500, the other topic identification information stored in association with the topic identification information and It is possible to search and extract topic titles and answer sentences of topic identification information at high speed.

次に、話題タイトル（「第二形態素情報」ともいう）のデータ構成例について、図１１を参照しながら説明する。図１１は、話題タイトルのデータ構成例を示す図である。 Next, a data configuration example of a topic title (also referred to as “second morpheme information”) will be described with reference to FIG. FIG. 11 is a diagram illustrating a data configuration example of topic titles.

話題特定情報８１０Ｄ_１、８１０Ｄ_２、８１０Ｄ_３、…はそれぞれ複数の異なる話題タイトル８２０_１、８２０_２、…、話題タイトル８２０_３、８２０_４、…、話題タイトル８２０_５、８２０_６、…、…を有している。本実施の形態では、図１１に示すように、それぞれの話題タイトル８２０は、第一特定情報１００１と、第二特定情報及１００２と、第三特定情報１００３によって構成される情報である。
ここで、第一特定情報１００１は、本実施の形態では、話題を構成する主要な形態素を意味するものである。第一特定情報の例としては、例えば文を構成する主語が挙げられる。また、第二特定情報１００２は、本実施の形態では、第一特定情報１００１と密接な関連性を有する形態素を意味するものである。この第二特定情報１００２は、例えば目的語が挙げられる。更に、第三特定情報１００３は、本実施の形態では、ある対象についての動きを示す形態素、又は名詞等を修飾する形態素を意味するものである。この第三特定情報１００３は、例えば動詞、副詞又は形容詞が挙げられる。なお、第一特定情報１００１、第二特定情報１００２、第三特定情報１００３それぞれの意味は上述の内容に限定される必要はなく、別の意味を第一特定情報１００１、第二特定情報１００２、第三特定情報１００３に与えても、これらから文の内容を把握可能な限り、本実施の形態は成立する。 The topic identification information 810D ₁ , 810D ₂ , 810D ₃ ,... Includes a plurality of different topic titles 820 ₁ , 820 ₂ ,..., Topic titles 820 ₃ , 820 ₄ ,..., Topic titles 820 ₅ , 820 ₆ ,. Have. In the present embodiment, as shown in FIG. 11, each topic title 820 is information including first specific information 1001, second specific information and 1002, and third specific information 1003.
Here, the 1st specific information 1001 means the main morpheme which comprises a topic in this Embodiment. As an example of the first specific information, for example, a subject constituting a sentence can be cited. The second specific information 1002 means a morpheme having a close relationship with the first specific information 1001 in the present embodiment. The second specific information 1002 includes, for example, an object. Further, in the present embodiment, the third identification information 1003 means a morpheme that indicates a movement of a certain object or a morpheme that modifies a noun or the like. The third specific information 1003 includes, for example, a verb, an adverb, or an adjective. The meanings of the first identification information 1001, the second identification information 1002, and the third identification information 1003 do not have to be limited to the above-described contents, and other meanings are defined as the first identification information 1001, the second identification information 1002, Even if it is given to the third specific information 1003, as long as the content of the sentence can be grasped from these, this embodiment is established.

例えば、主語が「七人の侍」、形容詞が「面白い」である場合には、図１１に示すように、話題タイトル（第二形態素情報）８２０_２は、第一特定情報１００１である形態素「七人の侍」と、第三特定情報１００３である形態素「面白い」とから構成されることになる。なお、この話題タイトル８２０_２には第二特定情報である形態素は含まれておらず、該当する形態素がないことを示すための記号「＊」が第二特定情報として格納されている。 For example, the subject is "Seven Samurai" and the adjective is "interesting", as shown in FIG. 11, the topic title (second morpheme information) 820 ₂ is the first specification information 1001 morpheme " It consists of “Seven Samurai” and the morpheme “Funny” which is the third specific information 1003. Incidentally, this is the topic title 820 ₂ not included morpheme is the second specifying information, the symbol for indicating that there is no corresponding morpheme "*" is stored as the second specific information.

なお、この話題タイトル８２０_２（七人の侍；＊；面白い）は、「七人の侍は面白い」の意味を有する。この話題タイトル８２０を構成する括弧内は、以下では左から第一特定情報１００１、第二特定情報１００２、第三特定情報１００３の順番となっている。また、話題タイトル８２０のうち、第一から第三特定情報に含まれる形態素がない場合には、その部分については、「＊」を示すことにする。 The topic title 820 ₂ (Seven Samurai; *; Interesting) has the meaning of “Seven Samurai is interesting”. In the parentheses constituting the topic title 820, the first specific information 1001, the second specific information 1002, and the third specific information 1003 are in the following order from the left. In addition, in the topic title 820, when there is no morpheme included in the first to third specific information, “*” is indicated for the portion.

なお、上記話題タイトル８２０を構成する特定情報は、上記のような第一から第三特定情報のように三つに限定されるものではなく、話題タイトル８２０は更に他の特定情報（第四特定情報、又はそれ以上）を有する構成としてもよい。 The specific information constituting the topic title 820 is not limited to three like the first to third specific information as described above, and the topic title 820 further includes other specific information (fourth specific information). Information or more).

次に、回答文８３０について図１２を参照しながら説明する。本実施の形態では、回答文８３０は、図１２に示すように、利用者から発話された発話文のタイプに対応した回答をするために、陳述(D ; Declaration)、時間(T ; Time)、場所(L ; Location)、否定(N ; Negation)などのタイプ(回答種類)に分類され、タイプを示す情報を附されている。また、回答文８３０について肯定文は「Ａ」とし、質問文は「Ｑ」とする。 Next, the answer sentence 830 will be described with reference to FIG. In the present embodiment, as shown in FIG. 12, the answer sentence 830 includes a statement (D; Declaration) and a time (T; Time) in order to make an answer corresponding to the type of utterance sentence uttered by the user. , Location (L; Location), negation (N; Negation), and other types (answer types), and information indicating the type is attached. Further, regarding the answer sentence 830, the affirmative sentence is “A” and the question sentence is “Q”.

次に、話題特定情報、およびこの話題特定情報に対応付けされた話題タイトル，回答文の関係について説明する。図１３は、ある話題特定情報「佐藤」に対応付けされた話題タイトル，回答文の具体例を示す図である。
例えば、話題タイトル（８２０）１−１が(佐藤；＊；好き){これは、「佐藤が好きです」に含まれる形態素を抽出したもの}である場合には、その話題タイトル（８２０）１−１に対応する回答文（８３０）１−１は、(DA；陳述肯定文「私も佐藤が好きです」)、(TA；時間肯定文「私は打席に立ったときの佐藤が好きです」)など各回答種類に対応した回答文の集合として準備されている。後述する回答取得部３５０は、その話題タイトル８２０に対応付けられた回答文８３０、すなわち各回答種類に対応した回答文の集合からいずれか一を取得する。 Next, the relationship between the topic identification information, the topic title associated with the topic identification information, and the answer sentence will be described. FIG. 13 is a diagram illustrating a specific example of a topic title and an answer sentence associated with certain topic specifying information “Sato”.
For example, when the topic title (820) 1-1 is (Sato; *; likes) {this is an extracted morpheme included in "I like Sato"}, the topic title (820) 1 Answer sentence (830) 1-1 corresponding to -1 is (DA; statement affirmation sentence "I also like Sato"), (TA; time affirmation sentence "I like Sato when I was standing at bat ")", Etc. are prepared as a set of answer sentences corresponding to each answer type. An answer acquisition unit 350 to be described later acquires any one from an answer sentence 830 associated with the topic title 820, that is, an answer sentence set corresponding to each answer type.

［会話制御部］
ここで図５に戻り、会話制御部３００の構成例を説明する。
会話制御部３００は、会話制御装置１内の各構成要素（音声認識部２００，文解析部４００、会話データベース５００，出力部６００，新単語登録部８００）間のデータの受け渡しを制御するとともに、発話に応答する回答文の決定、出力を行う機能を有する。 [Conversation control unit]
Here, returning to FIG. 5, a configuration example of the conversation control unit 300 will be described.
The conversation control unit 300 controls the exchange of data between each component (speech recognition unit 200, sentence analysis unit 400, conversation database 500, output unit 600, new word registration unit 800) in the conversation control device 1. It has a function to determine and output an answer sentence that responds to an utterance.

前記会話制御部３００は、本実施の形態では、図５に示すように、管理部３１０と、話題特定情報検索部３２０と、省略文補完部３３０と、話題検索部３４０と、回答取得部３５０とを有している。 In the present embodiment, as shown in FIG. 5, the conversation control unit 300 includes a management unit 310, a topic identification information search unit 320, an abbreviated sentence complement unit 330, a topic search unit 340, and an answer acquisition unit 350. And have.

［管理部］
管理部３１０は、会話制御部３００の全体の動作を管理・制御する機能を有する。また、管理部３１０は談話履歴を記憶し、且つ必要に応じて更新する機能を有する。管理部３１０は話題特定情報検索部３２０と、省略文補完部３３０と、話題検索部３４０と、回答取得部３５０からの要求に応じて、記憶している談話履歴の全部又は一部をこれら各部に渡す機能を有する。 [Management Department]
The management unit 310 has a function of managing and controlling the overall operation of the conversation control unit 300. The management unit 310 has a function of storing the discourse history and updating it as necessary. In response to requests from the topic identification information search unit 320, the abbreviated sentence complement unit 330, the topic search unit 340, and the answer acquisition unit 350, the management unit 310 converts all or part of the stored discourse history into these units. The function to pass to.

「談話履歴」とは、ユーザと会話制御装置１間の会話の話題や主題を特定する情報であって、後述する談話履歴は「着目話題特定情報」「着目話題タイトル」「利用者入力文話題特定情報」「回答文話題特定情報」の少なくともいずれか一つを含む情報である。また、談話履歴に含まれる「着目話題特定情報」「着目話題タイトル」「回答文話題特定情報」は直前の会話によって定められたものに限定されず、過去の所定期間の間に着目話題特定情報」「着目話題タイトル」「回答文話題特定情報」となったもの、若しくはそれらの累積的記録であってもよい。 The “discourse history” is information for specifying the topic and subject of the conversation between the user and the conversation control device 1, and the discourse history to be described later is “target topic specification information”, “target topic title”, “user input sentence topic” This information includes at least one of “specific information” and “answer sentence topic specific information”. In addition, “focused topic identification information”, “focused topic title”, and “answer sentence topic specific information” included in the discourse history are not limited to those determined by the previous conversation, but focused topic identification information during a past predetermined period. "Remarked topic title", "Reply sentence topic specific information", or a cumulative record thereof.

また、管理部３１０は音声認識部２００からの出力を受け取ると、これを文解析部４００に渡し、文解析部４００が返す第一形態素情報を新単語登録部８００に渡す。新単語登録部８００が新単語登録処理を行う場合は、この第一形態素情報に対する回答出力処理等は、新単語登録部８００により行われる。一方、新単語登録部８００が新単語登録処理を行わない場合は、新単語登録部８００が管理部３１０に会話制御処理を行うよう通知し、管理部３１０はこの通知に応答して、第一形態素情報を話題特定情報検索部３２０にこの第一形態素情報を渡すように動作する。 Further, upon receiving the output from the speech recognition unit 200, the management unit 310 passes this to the sentence analysis unit 400, and passes the first morpheme information returned by the sentence analysis unit 400 to the new word registration unit 800. When the new word registration unit 800 performs a new word registration process, the new word registration unit 800 performs an answer output process for the first morpheme information. On the other hand, when the new word registration unit 800 does not perform the new word registration process, the new word registration unit 800 notifies the management unit 310 to perform the conversation control process, and the management unit 310 responds to the notification by first The morpheme information operates to pass the first morpheme information to the topic identification information search unit 320.

以下、会話制御部３００を構成する管理部３１０以外の各部について説明する。 Hereinafter, each part other than the management part 310 which comprises the conversation control part 300 is demonstrated.

［話題特定情報検索部］
話題特定情報検索部３２０は、第一形態素情報と談話範囲に属する各話題特定情報とを照合し、各話題特定情報の中から、第一形態素情報を構成する形態素と一致する話題特定情報を検索するものである。具体的に、話題特定情報検索部３２０は、管理部３１０を経由して形態素抽出部４２０から入力された第一形態素情報が「佐藤」及び「好き」の二つの形態素で構成される場合には、入力された第一形態素情報と話題特定情報群とを照合する。 [Topic specific information search part]
The topic identification information search unit 320 collates the first morpheme information with each topic identification information belonging to the discourse range, and searches the topic identification information for topic identification information that matches the morpheme constituting the first morpheme information. To do. Specifically, the topic identification information search unit 320, when the first morpheme information input from the morpheme extraction unit 420 via the management unit 310 is composed of two morphemes "Sato" and "like" The input first morpheme information and the topic specifying information group are collated.

この照合をした話題特定情報検索部３２０は、着目話題タイトル８２０focus（前回までに検索された話題タイトル）に第一形態素情報を構成する形態素（例えば「佐藤」）が含まれているときは、その着目話題タイトル８２０focusを回答取得部３５０に出力する。一方、着目話題タイトル８２０focusに第一形態素情報を構成する形態素が含まれていないときは、話題特定情報検索部３２０は、第一形態素情報に基づいて利用者入力文話題特定情報を決定し、入力された第一形態素情報及び利用者入力文話題特定情報を省略文補完部３３０に出力する。なお、「利用者入力文話題特定情報」は、第一形態素情報に含まれる形態素の内、利用者が話題としている内容に該当する形態素に相当する話題特定情報、若しくは第一形態素情報に含まれる形態素の内、利用者が話題としている内容に該当する可能性がある形態素に相当する話題特定情報をいう。 The topic identification information search unit 320 that has performed this collation, when a morpheme constituting the first morpheme information (for example, “Sato”) is included in the focused topic title 820focus (topic title searched up to the previous time), The subject topic title 820focus is output to the answer acquisition unit 350. On the other hand, when the morpheme constituting the first morpheme information is not included in the focused topic title 820focus, the topic identification information search unit 320 determines the user input sentence topic identification information based on the first morpheme information and inputs it. The first morpheme information and the user input sentence topic specifying information are output to the abbreviated sentence complementing unit 330. "User input sentence topic specific information" is included in the topic specific information corresponding to the morpheme corresponding to the content that the user is talking about or the first morpheme information among the morphemes included in the first morpheme information. The topic specific information corresponding to the morpheme which may correspond to the content which the user is talking about among morphemes.

［省略文補完部］
省略文補完部３３０は、前記第一形態素情報を、前回までに検索された話題特定情報８１０（以下、「着目話題特定情報」）及び前回の回答文に含まれる話題特定情報８１０（以下、「回答文話題特定情報」という）を利用して、補完することにより複数種類の補完された第一形態素情報を生成する。例えば発話文が「好きだ」という文であった場合、省略文補完部３３０は、着目話題特定情報「佐藤」を、第一形態素情報「好き」に含めて、補完された第一形態素情報「佐藤、好き」を生成する。 [Abbreviated sentence completion part]
The abbreviated sentence complementing unit 330 uses the first morpheme information as the topic specifying information 810 (hereinafter referred to as “focused topic specifying information”) searched up to the previous time and the topic specifying information 810 (hereinafter referred to as “ A plurality of types of complemented first morpheme information is generated by complementing using “answer sentence topic specifying information”). For example, when the utterance sentence is a sentence “I like”, the abbreviated sentence complementing unit 330 includes the topic topic identification information “Sato” in the first morpheme information “like” and the complemented first morpheme information “ "Sato likes".

すなわち、第一形態素情報を「Ｗ」、着目話題特定情報や回答文話題特定情報の集合を「Ｄ」とすると、省略文補完部３３０は、第一形態素情報「Ｗ」に集合「Ｄ」の要素を含めて、補完された第一形態素情報を生成する。 In other words, if the first morpheme information is “W” and the set of the topic topic identification information and the answer sentence topic specification information is “D”, the abbreviated sentence complementing unit 330 adds the set “D” to the first morpheme information “W”. Complemented first morpheme information including elements is generated.

これにより、第一形態素情報を用いて構成される文が、省略文であって日本語として明解でない場合などにおいて、省略文補完部３３０は、集合「Ｄ」を用いて、その集合「Ｄ」の要素（例えば、”佐藤”）を第一形態素情報「Ｗ」に含めることができる。この結果、省略文補完部３３０は、第一形態素情報「好き」を補完された第一形態素情報「佐藤、好き」に置換する。なお、補完された第一形態素情報「佐藤、好き」は、「佐藤が好きだ」という発話内容に対応する。 As a result, when the sentence constructed using the first morpheme information is an abbreviated sentence and is not clear as Japanese, the abbreviated sentence complementing unit 330 uses the set “D” to set the set “D”. (For example, “Sato”) can be included in the first morpheme information “W”. As a result, the abbreviated sentence complementing unit 330 replaces the first morpheme information “like” with the complemented first morpheme information “Sato, like”. The complemented first morpheme information “Sato, I like” corresponds to the utterance content “I like Sato”.

すなわち、省略文補完部３３０は、利用者の発話内容が省略文である場合などであっても、集合「Ｄ」を用いて省略文を補完することができる。この結果、省略文補完部３３０は、第一形態素情報から構成される文が省略文であっても、その文が適正な日本語となるようにすることができる。 That is, the abbreviated sentence complementing unit 330 can supplement the abbreviated sentence using the set “D” even when the user's utterance content is an abbreviated sentence. As a result, even if the sentence composed of the first morpheme information is an abbreviated sentence, the abbreviated sentence complementing unit 330 can make the sentence in proper Japanese.

また、省略文補完部３３０が、前記集合「Ｄ」に基づいて、補完後の第一形態素情報に一致する話題タイトル８２０を検索する。補完後の第一形態素情報に一致する話題タイトル８２０を発見した場合は、省略文補完部３３０はこの話題タイトル８２０を回答取得部３５０に出力する。回答取得部３５０は、省略文補完部３３０で検索された適切な話題タイトル８２０に基づいて、利用者の発話内容に最も適した回答文８３０を出力することができる。 In addition, the abbreviated sentence complementing unit 330 searches for the topic title 820 that matches the first morpheme information after completion based on the set “D”. When a topic title 820 that matches the first morpheme information after complement is found, the abbreviated sentence complement unit 330 outputs the topic title 820 to the answer acquisition unit 350. The answer acquisition unit 350 can output the answer sentence 830 most suitable for the user's utterance content based on the appropriate topic title 820 searched by the abbreviated sentence complementing unit 330.

尚、省略文補完部３３０は、集合「Ｄ」の要素を第一形態素情報に含めるだけに限定されるものではない。この省略文補完部３３０は、着目話題タイトルに基づいて、その話題タイトルを構成する第一特定情報、第二特定情報又は第三特定情報のいずれかに含まれる形態素を、抽出された第一形態素情報に含めても良い。 Note that the abbreviated sentence complementing unit 330 is not limited to only including elements of the set “D” in the first morpheme information. The abbreviated sentence complementing unit 330 extracts the first morpheme extracted from the morpheme included in any one of the first specific information, the second specific information, or the third specific information constituting the topic title based on the topic title of interest. It may be included in the information.

［話題検索部］
話題検索部３４０は、省略文補完部３３０で話題タイトル８２０が決まらなかったとき、第一形態素情報と、利用者入力文話題特定情報に対応する各話題タイトル８２０とを照合し、各話題タイトル８２０の中から、第一形態素情報に最も適する話題タイトル８２０を検索するものである。
具体的に、省略文補完部３３０から検索命令信号が入力された話題検索部３４０は、入力された検索命令信号に含まれる利用者入力文話題特定情報及び第一形態素情報に基づいて、その利用者入力文話題特定情報に対応付けられた各話題タイトル８２０の中から、その第一形態素情報に最も適した話題タイトル８２０を検索する。話題検索部３４０は、その検索した話題タイトル８２０を検索結果信号として回答取得部３５０に出力する。 [Topic Search Department]
When the topic title 820 is not determined by the abbreviated sentence complementing unit 330, the topic search unit 340 collates the first morpheme information with each topic title 820 corresponding to the user input sentence topic specifying information, and each topic title 820 The topic title 820 that is most suitable for the first morpheme information is searched for.
Specifically, the topic search unit 340 to which the search command signal is input from the abbreviated sentence complement unit 330 is used based on the user input sentence topic identification information and the first morpheme information included in the input search command signal. The topic title 820 most suitable for the first morpheme information is searched from the topic titles 820 associated with the person input sentence topic identification information. The topic search unit 340 outputs the searched topic title 820 to the answer acquisition unit 350 as a search result signal.

図１３は、ある話題特定情報８１０（＝「佐藤」）に対応付けされた話題タイトル８２０，回答文８３０の具体例を示す図である。図１３に示すように、例えば、話題検索部３４０は、入力された第一形態素情報「佐藤、好き」に話題特定情報８１０（＝「佐藤」）が含まれるので、その話題特定情報８１０（＝「佐藤」）を特定し、次に、その話題特定情報８１０（＝「佐藤」）に対応付けられた各話題タイトル（８２０）１-１,１-２,…と入力された第一形態素情報「佐藤、好き」とを照合する。 FIG. 13 is a diagram showing a specific example of the topic title 820 and the answer sentence 830 associated with certain topic specifying information 810 (= “Sato”). As shown in FIG. 13, for example, the topic search unit 340 includes the topic identification information 810 (= “Sato”) in the input first morpheme information “Sato, I like”, so the topic identification information 810 (= First, the first morpheme information that is input as each topic title (820) 1-1, 1-2,... Associated with the topic specifying information 810 (= “Sato”) Match “Sato, I like”.

話題検索部３４０は、その照合結果に基づいて、各話題タイトル（８２０）１-１〜１-２の中から、入力された第一形態素情報「佐藤、好き」と一致する話題タイトル（８２０）１-１(佐藤；＊；好き)を特定する。話題検索部３４０は、検索した話題タイトル（８２０）１-１(佐藤；＊；好き)を検索結果信号として回答取得部３５０に出力する。 The topic search unit 340, based on the comparison result, the topic title (820) that matches the input first morpheme information “Sato, I like” from among the topic titles (820) 1-1 and 1-2. Specify 1-1 (Sato; *; likes). The topic search unit 340 outputs the searched topic title (820) 1-1 (Sato; *; likes) to the answer acquisition unit 350 as a search result signal.

［回答取得部］
回答取得部３５０は、話題検索部３４０で検索された話題タイトル８２０に基づいて、その話題タイトル８２０に対応付けられた回答文８３０および感情フラグ８４０を取得する。また、回答取得部３５０は、話題検索部３４０で検索された話題タイトル８２０に基づいて、その話題タイトル８２０に対応付けられた各回答種類と、入力種類判定部４４０で判定された発話種類とを照合する。その照合をした回答取得部３５０は、各回答種類の中から、判定された発話種類と一致する回答種類を検索する。 [Answer section]
The answer acquisition unit 350 acquires an answer sentence 830 and an emotion flag 840 associated with the topic title 820 based on the topic title 820 searched by the topic search unit 340. Further, the answer acquisition unit 350 determines each answer type associated with the topic title 820 and the utterance type determined by the input type determination unit 440 based on the topic title 820 searched by the topic search unit 340. Match. The answer acquisition unit 350 that has performed the collation searches for an answer type that matches the determined utterance type from among the answer types.

図１３に示すように、例えば、回答取得部３５０は、話題検索部３４０で検索された話題タイトル８２０が話題タイトル１-１（佐藤；＊；好き）である場合には、その話題タイトル１-１に対応付けられている回答文１-１（DA,TAなどの拡開投手類語とに用意された回答文の集合）の中から、入力種類判定部４４０で判定された「発話文のタイプ」（例えばDA）と一致する回答種類（DA）に対応する回答文を特定する。この回答種類(DA)に対応する回答文を特定した回答取得部３５０は、特定した回答種類（DA）に対応付けされた回答文「私も佐藤が好きです。」を取得する。
ここで、上記"DA"、"TA"等のうち、"A"は、肯定形式を意味する。従って、発話種類及び回答種類に"A"が含まれているときは、ある事柄について肯定することを示している。また、発話種類及び回答種類には、"DQ"、"TQ"等の種類を含めることもできる。この"DQ"、"TQ"等のうち"Q"は、ある事柄についての質問を意味する。 As shown in FIG. 13, for example, when the topic title 820 searched by the topic search unit 340 is the topic title 1-1 (Sato; *; likes), the answer obtaining unit 350- “Type of utterance sentence” determined by the input type determination unit 440 from the answer sentence 1-1 (a set of answer sentences prepared for the expanded pitcher synonyms such as DA and TA) associated with 1. ”(For example, the answer sentence corresponding to the answer type (DA) matching the DA) is identified. The answer acquisition unit 350 that has identified the answer sentence corresponding to the answer type (DA) obtains the answer sentence “I also like Sato” associated with the identified answer type (DA).
Here, among the “DA”, “TA”, etc., “A” means an affirmative form. Therefore, when “A” is included in the utterance type and the answer type, it indicates that a certain matter is affirmed. In addition, types such as “DQ” and “TQ” can be included in the utterance type and the answer type. Of these “DQ”, “TQ”, etc., “Q” means a question about a certain matter.

回答種類が上記質問形式(Q)からなるときは、この回答種類に対応付けられる回答文は、肯定形式(A)で構成される。この肯定形式(A)で作成された回答文としては、質問事項に対して回答する文等が挙げられる。例えば、発話文が「あなたはスロットマシンを操作したことがありますか?」である場合には、この発話文についての発話種類は、質問形式(Q)となる。この質問形式(Q)に対応付けられる回答文は、例えば「私はスロットマシンを操作したことがあります」(肯定形式(A))が挙げられる。 When the answer type is the above question format (Q), the answer text associated with the answer type is configured in an affirmative format (A). Examples of the answer sentence created in this affirmative form (A) include a sentence that answers a question item. For example, when the utterance sentence is “Have you operated the slot machine?”, The utterance type for this utterance sentence is a question form (Q). An example of an answer sentence associated with the question format (Q) is “I have operated a slot machine” (affirmative format (A)).

一方、発話種類が肯定形式(A)からなるときは、この回答種類に対応付けられる回答文は、質問形式(Q)で構成される。この質問形式(Q)で作成された回答文としては、発話内容に対して聞き返す質問文、又は特定の事柄を聞き出す質問文等が挙げられる。例えば、発話文が「私はスロットマシンで遊ぶのが趣味です」である場合には、この発話文についての発話種類は、肯定形式(A)となる。この肯定形式(A)に対応付けられる回答文は、例えば"パチンコで遊ぶのは趣味ではないのですか?"(特定の事柄を聞き出す質問文(Q))が挙げられる。 On the other hand, when the utterance type is an affirmative form (A), the answer sentence associated with the answer type is configured with a question form (Q). Examples of the answer sentence created in the question format (Q) include a question sentence that is replied to the utterance content or a question sentence that asks a specific matter. For example, if the utterance sentence is “I am playing with a slot machine”, the utterance type for this utterance sentence is an affirmative form (A). The answer sentence associated with this affirmative form (A) is, for example, “isn't it a hobby to play with pachinko?” (Question sentence (Q) to ask for a specific matter).

回答取得部３５０は、取得した回答文８３０を回答文信号として管理部３１０に出力する。回答取得部３５０から回答文信号が入力された管理部３１０は、入力された回答文信号を出力部６００に出力する。 The response acquisition unit 350 outputs the acquired response text 830 to the management unit 310 as a response text signal. The management unit 310 to which the answer sentence signal is input from the answer acquisition unit 350 outputs the input answer sentence signal to the output unit 600.

出力部６００は、回答取得部３５０で取得された回答文を出力するものである。この出力部６００は、例えば、スピーカ、ディスプレイなどが挙げられる。具体的に、管理部３１０から回答文が入力された出力部６００は、入力された回答文に基づいて、音声をもってその回答文、例えば、「私も佐藤が好きです」を出力する。 The output unit 600 outputs the answer sentence acquired by the answer acquisition unit 350. Examples of the output unit 600 include a speaker and a display. Specifically, the output unit 600 to which an answer sentence is input from the management unit 310 outputs the answer sentence, for example, “I also like Sato” with a voice based on the input answer sentence.

［新単語登録部］
次に、新単語登録部８００について、図１，図１４を参照しながら説明する。図１４は、新単語登録部８００の構成例を示す機能ブロック図である。
図１に示すように、新単語登録部８００は、会話制御部３００、会話データベース５００とデータ交換可能に接続されているとともに、出力部６００に出力可能に接続されている。 [New word registration department]
Next, the new word registration unit 800 will be described with reference to FIGS. FIG. 14 is a functional block diagram illustrating a configuration example of the new word registration unit 800.
As shown in FIG. 1, the new word registration unit 800 is connected to the conversation control unit 300 and the conversation database 500 so as to be able to exchange data, and is connected to the output unit 600 so as to be able to output data.

図１４に示すように、新単語登録部８００は、登録確認部１４０１と、この登録確認部１４０１に接続された意味問い合わせ部１４０２及び意味検証部１４０３と、意味問い合わせ部１４０２に接続された登録処理部１４０４とを有している。登録確認部１４０１，意味検証部１４０３，登録処理部１４０４は会話データベース５００に接続されており、意味問い合わせ部１４０２，意味検証部１４０３は出力部６００に接続されている。 As illustrated in FIG. 14, the new word registration unit 800 includes a registration confirmation unit 1401, a semantic inquiry unit 1402 and a semantic verification unit 1403 connected to the registration confirmation unit 1401, and a registration process connected to the semantic inquiry unit 1402. Part 1404. The registration confirmation unit 1401, the semantic verification unit 1403, and the registration processing unit 1404 are connected to the conversation database 500, and the semantic inquiry unit 1402 and the semantic verification unit 1403 are connected to the output unit 600.

［登録確認部］
登録確認部１４０１は、会話制御部３００から渡された第一形態素情報を受け取り、この第一形態素情報に含まれる形態素が会話データベース５００に話題特定情報８１０として登録されているか否か、及びその形態素が話題特定情報８１０として登録されている場合にその話題特定情報８１０の意味情報８４０（その形態素の定義、又は定義に相当する説明、解釈などをいう）が確定しているか否かを調べる。会話データベース５００を調べた結果、その形態素が会話データベース５００に話題特定情報８１０として未だ登録されていない場合は、登録確認部１４０１は意味問い合わせ部１４０２に意味問い合わせ処理を行うよう要求する。一方、その形態素が会話データベース５００に話題特定情報８１０として登録されてはいるが、その意味情報が確定していない場合は、登録確認部１４０１は意味検証部１４０３にその話題特定情報（形態素）８１０の意味検証処理を行うよう要求する。 [Registration Confirmation Department]
The registration confirmation unit 1401 receives the first morpheme information passed from the conversation control unit 300, whether or not the morpheme included in the first morpheme information is registered as the topic identification information 810 in the conversation database 500, and the morpheme. Is registered as the topic specifying information 810, it is checked whether or not the semantic information 840 of the topic specifying information 810 (which means definition of the morpheme or explanation, interpretation corresponding to the definition, etc.) is confirmed. As a result of examining the conversation database 500, if the morpheme is not yet registered as the topic identification information 810 in the conversation database 500, the registration confirmation unit 1401 requests the semantic inquiry unit 1402 to perform semantic inquiry processing. On the other hand, if the morpheme is registered as topic identification information 810 in the conversation database 500 but the semantic information is not fixed, the registration confirmation unit 1401 sends the topic identification information (morpheme) 810 to the semantic verification unit 1403. Requests that the semantic verification process be performed.

なお、ここで第一形態素情報に含まれる形態素であって、新単語登録部８００の処理対象となる形態素は、第一特定情報１００１に限定されてもよいし、第一から第三までのすべての特定情報１００１〜１００３を処理対象としてもよい。 Here, the morphemes included in the first morpheme information, and the morphemes to be processed by the new word registration unit 800 may be limited to the first specific information 1001, or all of the first to third morphemes. The specific information 1001 to 1003 may be the processing target.

［意味問い合わせ部、登録処理部］
意味問い合わせ部１４０２は、登録確認部１４０１から意味問い合わせ処理を行うよう要求されると、その形態素（話題特定情報）について、話者に意味を問い合わせる問い合わせ質問文を出力する。例えば、その形態素が「佐藤」であれば、意味問い合わせ部１４０２はその形態素「佐藤」と予め用意してある問い合わせ質問文テンプレートを用いて「佐藤って何ですか？」という問い合わせ質問文を生成する。生成された問い合わせ質問文は出力部６００に渡され、出力部６００により話者に提示される。 [Semantic inquiry part, registration processing part]
When the semantic inquiry unit 1402 is requested by the registration confirmation unit 1401 to perform semantic inquiry processing, the semantic inquiry unit 1402 outputs an inquiry question sentence for inquiring the speaker about the meaning of the morpheme (topic specifying information). For example, if the morpheme is “Sato”, the semantic inquiry unit 1402 generates an inquiry question sentence “What is Sato?” Using the inquiry question sentence template prepared in advance with the morpheme “Sato”. To do. The generated inquiry question sentence is passed to the output unit 600 and presented to the speaker by the output unit 600.

話者がこの問い合わせ質問文に対する回答を会話制御装置１に返すと、この回答は入力部１００，音声認識部２００、会話制御部３００、文解析部４００を介して新単語登録部８００に渡される。登録確認部１４０１はこの回答を意味問い合わせ部１４０２にそのまま渡す。意味問い合わせ部１４０２は、この回答を登録確認部１４０１を介して受け取り、回答が前述の問い合わせ質問文に対する回答であるか否かを判定する。例えば「知らないな」「わからない」など回答でないものと判断した場合は、意味問い合わせ部１４０２は別の問い合わせ質問文を生成して、出力する。一方、それが回答であると判定した場合は、その話題特定情報（形態素）及び回答を登録処理部１４０４に渡す。登録処理部１４０４はその話題特定情報（形態素）及び回答を会話データベース５００に新たな話題特定情報８１０及びその意味情報８４０として登録する。なお、この時点では意味情報８４０は確定した意味情報８４０ではなく、意味検証部１４０３の意味検証処理を受ける必要のある意味情報（未確定の意味情報）である。 When the speaker returns an answer to the inquiry question sentence to the conversation control device 1, the answer is passed to the new word registration unit 800 via the input unit 100, the speech recognition unit 200, the conversation control unit 300, and the sentence analysis unit 400. . The registration confirmation unit 1401 passes this answer to the semantic inquiry unit 1402 as it is. The semantic inquiry unit 1402 receives this answer via the registration confirmation unit 1401 and determines whether or not the answer is an answer to the inquiry question sentence. For example, if it is determined that the answer is not “I don't know” or “I don't know”, the semantic inquiry unit 1402 generates and outputs another inquiry question sentence. On the other hand, when it is determined that it is an answer, the topic specifying information (morpheme) and the answer are passed to the registration processing unit 1404. The registration processing unit 1404 registers the topic specifying information (morpheme) and the answer as new topic specifying information 810 and its semantic information 840 in the conversation database 500. At this time, the semantic information 840 is not the confirmed semantic information 840 but semantic information (undefined semantic information) that needs to be subjected to the semantic verification processing of the semantic verification unit 1403.

［意味検証部］
意味検証部１４０３は、登録確認部１４０１から意味検証処理を行うよう要求されると、会話データベース５００からその話題特定情報の意味情報８４０を取り出し、この意味情報の正否・適否を話者に問いただす検証質問文を生成、出力する。出力された検証質問文は出力部６００に渡され、話者（ユーザ）に提示される。例えば、話題特定情報８１０が「佐藤」、その未確定の意味情報８４０が「ねばり強いバッティングで人気のプロ野球選手」として、会話データベース５００に登録されている場合、意味検証部１４０３は「佐藤は、ねばり強いバッティングで人気のプロ野球選手ですよね？」という検証質問文を生成し、出力するように動作する。なお、その話題特定情報８１０について複数の未確定の意味情報８４０が存在している場合は、所定の条件を満たす一の意味情報８４０を選択して、検証質問文の生成を行う。「所定の条件」は例えば後述の肯定回数情報が最大であること、などである。 [Semantic verification section]
When the semantic verification unit 1403 is requested by the registration confirmation unit 1401 to perform the semantic verification process, the semantic verification unit 1403 retrieves the semantic information 840 of the topic identification information from the conversation database 500, and verifies whether the semantic information is correct or not. Generate and output a question sentence. The output verification question sentence is transferred to the output unit 600 and presented to the speaker (user). For example, when the topic identification information 810 is registered in the conversation database 500 as “Sato” as the topic identification information 810 and “unsuccessful semantic information 840 as“ professional baseball player with strong and strong batting ”, the meaning verification unit 1403 displays“ It works to generate and output a verification question sentence saying, "Is it a popular professional baseball player with sticky and strong batting?" When a plurality of undefined semantic information 840 exists for the topic identification information 810, one semantic information 840 satisfying a predetermined condition is selected and a verification question sentence is generated. The “predetermined condition” is, for example, that affirmative number information described later is the maximum.

話者はこの検証質問文に対する回答を会話制御装置１に返すと、この回答は入力部１００，音声認識部２００、会話制御部３００、文解析部４００を介して新単語登録部８００に渡される。登録確認部１４０１はこの回答を意味検証部１４０３にそのまま渡す。意味検証部１４０３は、この回答を登録確認部１４０１を介して受け取り、回答が前述の検証質問文に対する肯定か否かを判定する。例えば回答として受け取った話者の発話が「そうだよ」「うん」「ＹＥＳ」などであれば肯定と判定して、前述の意味情報の肯定回数情報を増加するよう会話データベース５００を更新する。一方、肯定でないと判定した場合は、その意味情報の肯定回数を増加させないか、或いは減少させるよう会話データベース５００を更新する。 When the speaker returns an answer to the verification question sentence to the conversation control device 1, the answer is passed to the new word registration unit 800 via the input unit 100, the speech recognition unit 200, the conversation control unit 300, and the sentence analysis unit 400. . The registration confirmation unit 1401 passes this answer to the semantic verification unit 1403 as it is. The semantic verification unit 1403 receives this answer via the registration confirmation unit 1401, and determines whether the answer is affirmative with respect to the verification question sentence. For example, if the utterance of the speaker received as an answer is “Yes”, “Yes”, “YES” or the like, it is determined as affirmative, and the conversation database 500 is updated so as to increase the affirmative number information of the above-described semantic information. On the other hand, if it is determined that it is not affirmative, the conversation database 500 is updated so as not to increase or decrease the number of affirmations of the semantic information.

［２．会話制御装置の動作（会話制御方法の実行）］
上記構成を有する会話制御装置１が動作することにより会話制御方法が実行されることとなる。図１５は、本実施の形態に係る会話制御装置１の動作により実行される会話制御方法の手順を示すフロー図である。
先ず、入力部１００が、利用者からの発話内容を取得するステップを行う（ステップＳ１５０１）。具体的には、入力部１００は、利用者の発話内容を構成する音声を取得する。入力部１００は、取得した音声を音声信号として音声認識部２００に出力する。なお、入力部１００は、利用者からの音声ではなく、利用者から入力された文字列(例えば、テキスト形式で入力された文字データ)を取得してもよい。 [2. Operation of conversation control device (execution of conversation control method)]
The conversation control method is executed by the operation of the conversation control apparatus 1 having the above configuration. FIG. 15 is a flowchart showing the procedure of the conversation control method executed by the operation of conversation control apparatus 1 according to the present embodiment.
First, the input unit 100 performs a step of acquiring the utterance content from the user (step S1501). Specifically, the input unit 100 acquires the voice that constitutes the utterance content of the user. The input unit 100 outputs the acquired voice to the voice recognition unit 200 as a voice signal. Note that the input unit 100 may acquire a character string (for example, character data input in a text format) input from the user instead of the voice from the user.

次いで、音声認識部２００が、入力部１００で取得した発話内容に基づいて、発話内容に対応する文字列を特定するステップを行う（ステップＳ１５０２）。具体的には、入力部１００から音声信号が入力された音声認識部２００は、入力された音声信号に基づいて、その音声信号に対応する単語仮説（候補）を特定する。音声認識部２００は、特定した単語仮説（候補）に対応付けられた文字列を取得し、取得した文字列を文字列信号として会話制御部３００に出力する。 Next, the voice recognition unit 200 performs a step of specifying a character string corresponding to the utterance content based on the utterance content acquired by the input unit 100 (step S1502). Specifically, the speech recognition unit 200 to which the speech signal is input from the input unit 100 specifies a word hypothesis (candidate) corresponding to the speech signal based on the input speech signal. The voice recognition unit 200 acquires a character string associated with the identified word hypothesis (candidate), and outputs the acquired character string to the conversation control unit 300 as a character string signal.

そして、文字列特定部４１０が、音声認識部２００で特定された一連の文字列を一文毎に区切るステップを行う(ステップＳ１５０３)。具体的には、管理部３１０から文字列信号（あるいは形態素信号）が入力された文字列特定部４１０は、その入力された一連の文字列の中に、ある一定以上の時間間隔があるときは、その部分で文字列を区切る。文字列特定部４１０は、その区切った各文字列を形態素抽出部４２０及び入力種類判定部４４０に出力する。なお、文字列特定部４１０は、入力された文字列がキーボードから入力された文字列である場合には、句読点又はスペース等のある部分で文字列を区切るのが好ましい。 Then, the character string specifying unit 410 performs a step of dividing the series of character strings specified by the voice recognition unit 200 for each sentence (step S1503). Specifically, the character string specifying unit 410 to which a character string signal (or morpheme signal) is input from the management unit 310 has a certain time interval or more in the input series of character strings. , Delimit the string at that part. The character string specifying unit 410 outputs the divided character strings to the morpheme extracting unit 420 and the input type determining unit 440. In addition, when the input character string is a character string input from the keyboard, the character string specifying unit 410 preferably divides the character string at a part such as a punctuation mark or a space.

その後、形態素抽出部４２０が、文字列特定部４１０で特定された文字列に基づいて、文字列の最小単位を構成する各形態素を第一形態素情報として抽出するステップを行う(ステップＳ１５０４)。具体的に、文字列特定部４１０から文字列が入力された形態素抽出部４２０は、入力された文字列と、形態素データベース４３０に予め格納されている形態素群とを照合する。なお、その形態素群は、本実施の形態では、それぞれの品詞分類に属する各形態素について、その形態素の見出し語・読み・品詞・活用形などを記述した形態素辞書として準備されている。 Thereafter, the morpheme extraction unit 420 performs a step of extracting each morpheme constituting the minimum unit of the character string as the first morpheme information based on the character string specified by the character string specifying unit 410 (step S1504). Specifically, the morpheme extraction unit 420 to which the character string is input from the character string specifying unit 410 collates the input character string with a morpheme group stored in advance in the morpheme database 430. In this embodiment, the morpheme group is prepared as a morpheme dictionary in which each morpheme belonging to each part-of-speech classification describes a morpheme entry word, reading, part-of-speech, utilization form, and the like.

この照合をした形態素抽出部４２０は、入力された文字列の中から、予め記憶された形態素群に含まれる各形態素と一致する各形態素(m１,m２、…)を抽出する。形態素抽出部４２０は、抽出した各形態素を第一形態素情報として管理部３１０に出力する。管理部３１０は、第一形態素情報を新単語登録部８００に渡す。 The matched morpheme extraction unit 420 extracts each morpheme (m1, m2,...) That matches each morpheme included in a previously stored morpheme group from the input character string. The morpheme extraction unit 420 outputs each extracted morpheme to the management unit 310 as first morpheme information. The management unit 310 passes the first morpheme information to the new word registration unit 800.

新単語登録部８００は、第一形態素情報に含まれる形態素が話題特定情報８１０として登録済みであり、且つその意味情報８４０が確定しているかを判定する（ステップＳ１５０５）。 The new word registration unit 800 determines whether the morpheme included in the first morpheme information has been registered as the topic identification information 810 and the semantic information 840 has been determined (step S1505).

形態素が話題特定情報８１０として登録されていないか、又は形態素が話題特定情報８１０として登録されているがその意味情報８４０が確定してしない場合は、新単語登録部８００は新単語登録処理を実行する（ステップＳ１５０６）。図１６に、新単語登録処理の処理例を示す。 If the morpheme is not registered as the topic identification information 810 or if the morpheme is registered as the topic identification information 810 but its semantic information 840 is not fixed, the new word registration unit 800 executes a new word registration process (Step S1506). FIG. 16 shows an example of new word registration processing.

まず、新単語登録部８００は、第一形態素情報の全部又は一部を入力として受け付ける（ステップＳ１６００）。
ステップ１５０６における判定の結果、この第一形態素情報に含まれる形態素が会話データベース５００に話題特定情報８１０として未だ登録されていない場合（ステップＳ１６０１、ＮＯ）は、新単語登録部８００は意味問い合わせ処理を行う（ステップＳ１６０２）。すなわち、新単語登録部８００はその形態素（話題特定情報）について、話者に意味を問い合わせる問い合わせ質問文を出力する。生成された問い合わせ質問文は出力部６００に渡され、出力部６００により話者に提示される。 First, the new word registration unit 800 accepts all or part of the first morpheme information as an input (step S1600).
If the result of determination in step 1506 is that the morpheme included in the first morpheme information has not yet been registered as topic identification information 810 in the conversation database 500 (step S1601, NO), the new word registration unit 800 performs semantic inquiry processing. This is performed (step S1602). That is, the new word registration unit 800 outputs an inquiry question sentence that inquires the speaker about the meaning of the morpheme (topic specifying information). The generated inquiry question sentence is passed to the output unit 600 and presented to the speaker by the output unit 600.

話者がこの問い合わせ質問文に対する回答を会話制御装置１に返すと、この回答は入力部１００，音声認識部２００、会話制御部３００、文解析部４００を介して新単語登録部８００に渡され、返答受付処理（ステップＳ１６０３）が行われる。 When the speaker returns an answer to the inquiry question sentence to the conversation control device 1, the answer is passed to the new word registration unit 800 via the input unit 100, the speech recognition unit 200, the conversation control unit 300, and the sentence analysis unit 400. Then, a response acceptance process (step S1603) is performed.

新単語登録部８００は、返答受付処理（ステップＳ１６０３）において取得した回答が前述の問い合わせ質問文に対する回答であるか否かを判定し、回答でないものと判断した場合（ステップＳ１６０４、ＮＯ）は、新単語登録部８００は別の問い合わせ質問文を生成して、出力し、（ステップＳ１６０２）、さらに返答受付を行う（ステップＳ１６０３）。一方、それが回答であると判定した場合（ステップＳ１６０４、ＹＥＳ）は、その話題特定情報（形態素）及び回答を会話データベース５００に新たな話題特定情報８１０及びその意味情報８４０として登録する（ステップＳ１６０５）。この後図１５に示すフローに戻り、ステップＳ１５０１において受け付けた発話についての会話制御処理を終了する。 The new word registration unit 800 determines whether or not the answer acquired in the response acceptance process (step S1603) is an answer to the above-described inquiry question sentence, and determines that it is not an answer (step S1604, NO) The new word registration unit 800 generates and outputs another inquiry question sentence (step S1602), and further receives a response (step S1603). On the other hand, if it is determined that it is an answer (step S1604, YES), the topic specifying information (morpheme) and the answer are registered in the conversation database 500 as new topic specifying information 810 and its semantic information 840 (step S1605). ). Thereafter, returning to the flow shown in FIG. 15, the conversation control process for the utterance accepted in step S1501 is terminated.

一方、ステップＳ１６０１においてその形態素が会話データベース５００に話題特定情報８１０として登録されてはいるが、その意味情報が確定していない場合（ステップＳ１６０１、ＹＥＳ）は、新単語登録部８００は、その話題特定情報（形態素）８１０の意味検証処理を行う。すなわち、新単語登録部８００は会話データベース５００の記憶内容を参照して、話題特定情報（形態素）８１０の意味情報を読み取り、最もＹＥＳと回答された回数が多い意味情報文、すなわち肯定回数情報が最大である意味情報文を検証質問文として取得する（ステップＳ１６０６）。図１７は、ある話題特定情報８１０であって、意味情報８４０が未確定の状態である話題特定情報８１０のデータ構成を示している。この例では、意味情報は複数の意味情報文１７０１，１７０２，１７０３、…が登録されている。意味情報文１７０１，１７０２，１７０３、…にはそれぞれ、ＹＥＳと回答された回数を示す肯定回数情報「１０」、「７」、「３」、…が記憶されている。この例では、ステップＳ１６０６で選択される意味情報文は、意味情報文１７０１「ねばり強いバッティングで人気のプロ野球選手」が選択され、検証質問文の生成に用いられる。 On the other hand, if the morpheme is registered as topic identification information 810 in the conversation database 500 in step S1601, but the semantic information is not finalized (YES in step S1601), the new word registration unit 800 The meaning verification process of the specific information (morpheme) 810 is performed. That is, the new word registration unit 800 refers to the stored contents of the conversation database 500, reads the semantic information of the topic identification information (morpheme) 810, and the semantic information sentence that is most frequently answered as YES, that is, the affirmative frequency information The maximum semantic information sentence is acquired as a verification question sentence (step S1606). FIG. 17 shows the data structure of the topic identification information 810 that is certain topic identification information 810 and whose semantic information 840 is in an unconfirmed state. In this example, a plurality of semantic information sentences 1701, 1702, 1703,... Are registered as semantic information. In the semantic information sentences 1701, 1702, 1703,..., Positive number information “10”, “7”, “3”,. In this example, as the semantic information sentence selected in step S1606, the semantic information sentence 1701 “popular professional baseball player with sticky and strong batting” is selected and used to generate a verification question sentence.

図１６に戻り新単語登録処理の説明を続ける。ステップＳ１６０６で出された検証質問文に対して話者が回答を会話制御装置１に返すと、この回答は入力部１００，音声認識部２００、会話制御部３００、文解析部４００を介して新単語登録部８００に渡され、返答受付処理（ステップＳ１６０７）が行われる。 Returning to FIG. 16, the description of the new word registration process will be continued. When the speaker returns an answer to the conversation control device 1 in response to the verification question sentence issued in step S1606, the answer is sent via the input unit 100, the speech recognition unit 200, the conversation control unit 300, and the sentence analysis unit 400. It is passed to the word registration unit 800, and a response acceptance process (step S1607) is performed.

新単語登録部８００はこの回答が検証質問文の内容を肯定しているか否かを判定し（ステップＳ１６０８）、肯定していると判定する場合（ステップＳ１６０８、ＹＥＳ）は、新単語登録部８００は、ステップＳ１６０６において選択された意味情報文の肯定回数情報をインクリメントするよう、会話データベース５００の内容を更新する（ステップＳ１６０９）。例えば、意味情報文１７０１について肯定された場合、その肯定回数情報「１０」は「１１」に更新されることとなる。ステップＳ１６０９の実行後、図１５に示すフローに戻り、ステップＳ１５０１において受け付けた発話についての会話制御処理を終了する。 The new word registration unit 800 determines whether or not this answer affirms the content of the verification question sentence (step S1608). When it is determined that the answer is affirmative (YES in step S1608), the new word registration unit 800 Updates the contents of the conversation database 500 to increment the affirmative number information of the semantic information sentence selected in step S1606 (step S1609). For example, when the semantic information sentence 1701 is affirmed, the affirmation count information “10” is updated to “11”. After executing step S1609, the process returns to the flow shown in FIG. 15, and the conversation control process for the utterance accepted in step S1501 is terminated.

一方、ステップＳ１６０８において、肯定していないと判定した場合（ステップＳ１６０８、ＮＯ）は、新単語登録部８００は、その話題特定情報８１０についての別の意味情報文を取得するよう、ステップＳ１６０２からステップＳ１６０５の処理を行う。ここで新たに取得された意味情報文は、意味情報８４０に追加されて登録される。ステップＳ１６０５の処理実行後、図１５に示すフローに戻り、ステップＳ１５０１において受け付けた発話についての会話制御処理を終了する。
以上で新単語登録処理（ステップ１５０７）の説明を終了する。 On the other hand, if it is determined in step S1608 that the determination is not affirmative (NO in step S1608), the new word registration unit 800 performs steps from step S1602 to acquire another semantic information sentence for the topic identification information 810. The process of S1605 is performed. The newly acquired semantic information sentence is added to the semantic information 840 and registered. After executing the process in step S1605, the process returns to the flow shown in FIG. 15, and the conversation control process for the utterance accepted in step S1501 is terminated.
This is the end of the description of the new word registration process (step 1507).

続いて、図１５に戻り、会話制御処理の残りの部分について説明する。
ステップＳ１５０６において、第一形態素情報に含まれる形態素が話題特定情報８１０として登録済みであり、且つその意味情報８４０が確定していると判定される場合（ステップＳ１５０６、ＹＥＳ）は、新単語登録部８００は会話制御部３００の管理部３００に、会話制御処理を続行するよう要求する。この要求に応じて管理部３１０は、文解釈部４００に発話文のタイプの判定を行うよう要求するとともに、第一形態素情報を話題特定情報検索部３２０に渡す。
発話文のタイプの判定を行うよう要求されると、文解釈部４００の入力種類判定部４４０が、文字列特定部４１０で特定された一文を構成する各形態素に基づいて、「発話文のタイプ」を判定するステップを行う（ステップＳ１５０７）。具体的には、文字列特定部４１０から文字列が入力された入力種類判定部４４０は、入力された文字列に基づいて、その文字列と発話種類データベース４５０に格納されている各辞書とを照合し、その文字列の中から、各辞書に関係する要素を抽出する。この要素を抽出した入力種類判定部４４０は、抽出した要素に基づいて、その要素がどの「発話文のタイプ」に属するのかを判定する。入力種類判定部４４０は、判定した「発話文のタイプ」（発話種類）を回答取得部３５０に出力する。 Next, returning to FIG. 15, the remaining part of the conversation control process will be described.
If it is determined in step S1506 that the morpheme included in the first morpheme information has already been registered as the topic identification information 810 and the semantic information 840 has been confirmed (YES in step S1506), a new word registration unit 800 requests the management unit 300 of the conversation control unit 300 to continue the conversation control process. In response to this request, the management unit 310 requests the sentence interpretation unit 400 to determine the type of the spoken sentence, and passes the first morpheme information to the topic identification information search unit 320.
When it is requested to determine the type of the spoken sentence, the input type determining unit 440 of the sentence interpreting unit 400 determines “the type of the spoken sentence” based on each morpheme constituting one sentence specified by the character string specifying unit 410. Is determined (step S1507). Specifically, the input type determination unit 440, to which the character string is input from the character string specifying unit 410, determines the character string and each dictionary stored in the utterance type database 450 based on the input character string. Collation is performed, and elements related to each dictionary are extracted from the character string. The input type determination unit 440 that extracted this element determines to which “spoken sentence type” the element belongs based on the extracted element. The input type determination unit 440 outputs the determined “spoken sentence type” (speech type) to the answer acquisition unit 350.

そして、話題特定情報検索部３２０が、形態素抽出部４２０で抽出された第一形態素情報と着目話題タイトル８２０focusとを比較するステップを行う(ステップＳ１５０８)。
第一形態素情報を構成する形態素と着目話題タイトル８２０focusとが一致する場合、話題特定情報検索部３２０は、その話題タイトル８２０を回答取得部３５０に出力する。一方、話題特定情報検索部３２０は、第一形態素情報を構成する形態素と話題タイトル８２０とが一致しなかった場合には、入力された第一形態素情報及び利用者入力文話題特定情報を検索命令信号として省略文補完部３３０に出力する。 Then, the topic specifying information search unit 320 compares the first morpheme information extracted by the morpheme extraction unit 420 with the topic topic title 820focus (step S1508).
If the morpheme constituting the first morpheme information matches the topic topic title 820focus, the topic identification information search unit 320 outputs the topic title 820 to the answer acquisition unit 350. On the other hand, when the morpheme constituting the first morpheme information and the topic title 820 do not match, the topic identification information search unit 320 retrieves the input first morpheme information and user input sentence topic identification information. This is output to the abbreviated sentence complementing unit 330 as a signal.

その後、省略文補完部３３０が、話題特定情報検索部３２０から入力された第一形態素情報に基づいて、着目話題特定情報及び回答文話題特定情報を、入力された第一形態素情報に含め、省略文を保管するステップを行う(ステップＳ１５０９)。具体的には、第一形態素情報を「Ｗ」、着目話題特定情報及び回答文話題特定情報の集合を「Ｄ」とすると、省略文補完部３３０は、第一形態素情報「Ｗ」に話題特定情報「Ｄ」の要素を含めて、補完された第一形態素情報を生成し、この補完された第一形態素情報と集合「Ｄ」に関連づけされたすべての話題タイトル８２０とを照合し、補完された第一形態素情報と一致する話題タイトル８２０があるか検索する。補完された第一形態素情報と一致する話題タイトル８２０がある場合は、省略文補完部３３０は、その話題タイトル８２０を回答取得部３５０に出力する。一方、補完された第一形態素情報と一致する話題タイトル８２０を発見しなかった場合は、省略文補完部３３０は、第一形態素情報と利用者入力文話題特定情報とを話題検索部３４０に渡す。 After that, the abbreviated sentence complementing unit 330 includes the focused topic specifying information and the answer sentence topic specifying information in the input first morpheme information based on the first morpheme information input from the topic specifying information search unit 320 and omits it. A step of storing the sentence is performed (step S1509). Specifically, when the first morpheme information is “W” and the set of the topic topic identification information and the answer sentence topic identification information is “D”, the abbreviated sentence complementing unit 330 identifies the topic as the first morpheme information “W”. Complemented first morpheme information including the element of information “D” is generated, and the complemented first morpheme information is collated with all topic titles 820 associated with the set “D” to be complemented. Whether there is a topic title 820 that matches the first morpheme information is searched. If there is a topic title 820 that matches the complemented first morpheme information, the abbreviated sentence complementing unit 330 outputs the topic title 820 to the answer acquisition unit 350. On the other hand, when the topic title 820 that matches the supplemented first morpheme information is not found, the abbreviated sentence complementing unit 330 passes the first morpheme information and the user input sentence topic specifying information to the topic search unit 340. .

次いで、話題検索部３４０は、第一形態素情報と、利用者入力文話題特定情報とを照合し、各話題タイトル８２０の中から、第一形態素情報に適した話題タイトル８２０を検索するステップを行う(ステップＳ１５１０)。具体的には、省略文補完部３３０から検索命令信号が入力された話題検索部３４０は、入力された検索命令信号に含まれる利用者入力文話題特定情報及び第一形態素情報に基づいて、その利用者入力文話題特定情報に対応付けられた各話題タイトル８２０の中から、その第一形態素情報に適した話題タイトル８２０を検索する。話題検索部３４０は、その検索の結果得られた話題タイトル８２０を検索結果信号として回答取得部３５０に出力する。 Next, the topic search unit 340 collates the first morpheme information with the user input sentence topic identification information, and performs a step of searching for the topic title 820 suitable for the first morpheme information from each topic title 820. (Step S1510). Specifically, the topic search unit 340 to which the search command signal is input from the abbreviated sentence complement unit 330 is based on the user input sentence topic identification information and the first morpheme information included in the input search command signal. A topic title 820 suitable for the first morpheme information is searched from the topic titles 820 associated with the user input sentence topic identification information. The topic search unit 340 outputs the topic title 820 obtained as a result of the search to the answer acquisition unit 350 as a search result signal.

次いで、回答取得部３５０が、話題特定情報検索部３２０、省略文補完部３３０，あるいは話題検索部３４０で検索された話題タイトル８２０に基づいて、判定された利用者の発話種類と話題タイトル８２０に対応付けられた各回答種類とを照合する。この照合をした回答取得部３５０は、各回答種類の中から、判定された発話種類と一致する回答種類を検索する(ステップＳ１５１１)。 Next, the answer acquisition unit 350 determines the utterance type and the topic title 820 of the user determined based on the topic title 820 searched by the topic identification information search unit 320, the abbreviated sentence complement unit 330, or the topic search unit 340. Collate each associated answer type. The answer acquisition unit 350 that has performed the collation searches for the answer type that matches the determined utterance type from among the answer types (step S1511).

具体的に、話題検索部３４０から検索結果信号と、入力種類判定部４４０から「発話文のタイプ」とが入力された回答取得部３５０は、入力された検索結果信号に対応する「話題タイトル」と、入力された「発話文のタイプ」とに基づいて、その「話題タイトル」に対応付けられている回答種類群の中から、「発話文のタイプ」(DAなど)と一致する回答種類を特定する。 Specifically, the answer acquisition unit 350 to which the search result signal is input from the topic search unit 340 and the “spoken sentence type” is input from the input type determination unit 440 is the “topic title” corresponding to the input search result signal. And the answer type that matches the “spoken sentence type” (such as DA) from the answer type group associated with the “topic title” based on the entered “spoken sentence type”. Identify.

この回答取得部３５０は、会話データベース５００から、特定した回答種類に対応付けられた回答文８３０を取得する。回答取得部３５０は、管理部３１０を介して、取得した回答文８３０を出力部６００に出力する。管理部３１０から回答文が入力された出力部６００は、入力された回答文８３０を出力する(ステップＳ１５１２)。 The answer acquisition unit 350 acquires an answer sentence 830 associated with the specified answer type from the conversation database 500. The answer acquisition unit 350 outputs the acquired answer sentence 830 to the output unit 600 via the management unit 310. The output unit 600 to which the answer sentence is input from the management unit 310 outputs the input answer sentence 830 (step S1512).

［３．利点］
本会話制御装置１によれば、会話制御装置１が話者と会話を行うことにより、自動的に会話データベース５００の記憶内容に新しい単語、言葉が追加蓄積されるため、オペレータの単語の意味の入力作業を必要とすることなく、会話データベース５００の記憶内容の向上を図ることが可能となる。 [3. advantage]
According to the conversation control apparatus 1, since the conversation control apparatus 1 has a conversation with the speaker, new words and words are automatically added and stored in the stored contents of the conversation database 500. It is possible to improve the stored contents of the conversation database 500 without requiring input work.

［４．変形例］
（１）会話制御装置１の音声認識部２００、会話制御部３００、文解析部４００の構成及び動作は、第１の実施の形態において説明したものに限られず、会話データベース手段を利用して利用者の発話内容に応じた回答を返すものであれば、第１の実施の形態において説明したもの以外のどのような音声認識部２００、会話制御部３００、文解析部４００であっても、本発明に係る会話制御装置１の構成要素として使用可能である。 [4. Modified example]
(1) The configurations and operations of the speech recognition unit 200, the conversation control unit 300, and the sentence analysis unit 400 of the conversation control device 1 are not limited to those described in the first embodiment, and are utilized using conversation database means. Any speech recognition unit 200, conversation control unit 300, and sentence analysis unit 400 other than those described in the first embodiment can be used as long as they return an answer according to the content of the user's utterance. It can be used as a component of the conversation control device 1 according to the invention.

（２）上記実施の形態の説明では、会話制御装置１への発話の入力は音声によるものとして説明したが、会話制御装置１への発話の入力は音声に限られるものではなく、キーボード、タッチパネル、ポインティングデバイスなどの文字入力手段による文字列データとして会話制御装置１へ発話を入力し、会話制御装置１がこの文字列データとして入力された発話に対して会話データベース５００を用いて回答文を出力する構成としても本発明は成立する。 (2) In the description of the above embodiment, the input of the utterance to the conversation control device 1 has been described as being by voice, but the input of the utterance to the conversation control device 1 is not limited to voice, but a keyboard, a touch panel An utterance is input to the conversation control device 1 as character string data by a character input means such as a pointing device, and the conversation control device 1 outputs an answer sentence to the utterance input as the character string data using the conversation database 500. The present invention is also established as a configuration.

［５．その他］
上記実施の形態では、特定話題情報８１０及びその意味情報文８４０を自動的に登録するものとしたが、これらに加えて話題タイトル８２０、回答文８３０を話者の発話から抽出して会話制御装置１が自動的に登録する構成としてもよいし、或いは特定話題情報８１０及びその意味情報文８４０のみ会話制御装置１が自動的に登録し、のちにオペレータが登録された特定話題情報８１０及びその意味情報文８４０を参照して話題タイトル８２０、回答文８３０を追加するようにしてもよい。 [5. Others]
In the above embodiment, the specific topic information 810 and its semantic information sentence 840 are automatically registered, but in addition to these, the topic title 820 and the answer sentence 830 are extracted from the utterance of the speaker, and the conversation control device. 1 may be automatically registered, or the conversation control device 1 automatically registers only the specific topic information 810 and its semantic information sentence 840, and then the specific topic information 810 and the meaning of the operator registered. The topic title 820 and the answer sentence 830 may be added with reference to the information sentence 840.

会話制御装置の構成例を示す機能ブロック図Functional block diagram showing a configuration example of a conversation control device 音声認識部の構成例を示す機能ブロック図Functional block diagram showing a configuration example of the speech recognition unit 単語仮説絞込部の処理を示すタイミングチャートTiming chart showing processing of word hypothesis filtering unit 音声認識部の動作例を示すフロー図Flow diagram showing an example of the operation of the voice recognition unit 会話制御装置の部分拡大ブロック図Partial enlarged block diagram of the conversation control device 文字列とこの文字列から抽出される形態素との関係を示す図The figure which shows the relationship between the character string and the morpheme extracted from this character string 「発話文のタイプ」と、その発話文のタイプを表す二文字のアルファベット、及びその発話文のタイプに該当する発話文の例を示す図The figure which shows the example of the utterance sentence which corresponds to the type of the utterance sentence, the two letter alphabet which shows the type of the utterance sentence, and the type of the utterance sentence 文のタイプとそのタイプを判定するための辞書の関係を示す図The figure which shows the relationship between the type of sentence and the dictionary for judging the type 会話データベースが記憶するデータのデータ構成の一例を示す概念図Conceptual diagram showing an example of the data structure of data stored in the conversation database ある話題特定情報と他の話題特定情報との関連付けを示す図The figure which shows the correlation with a certain topic specific information and other topic specific information 話題タイトル（「第二形態素情報」ともいう）のデータ構成例を示す図Data structure example of topic title (also called “second morpheme information”) 回答文のデータ構成例を説明するための図Illustration for explaining an example of the data structure of an answer sentence ある話題特定情報に対応付けされた話題タイトル，回答文の具体例を示す図The figure which shows the concrete example of the topic title and reply sentence which are matched with the certain topic specific information 新単語登録部の構成例を示す機能ブロック図Functional block diagram showing a configuration example of the new word registration unit 会話制御方法の手順例を示すフロー図Flow chart showing an example of the procedure for the conversation control method 新単語登録処理の手順例を示すフロー図Flow chart showing an example of new word registration process 未確定の意味情報のデータ構成例を示す図Diagram showing an example of the data structure of undefined semantic information

符号の説明Explanation of symbols

１ … 会話制御装置
１００ … 入力部
２００ … 音声認識部
３００ … 会話制御部
４００ … 文解析部
５００ … 会話データベース
６００ … 出力部
７００ … 音声認識辞書記憶部
８００ … 新単語登録部
１４０１ … 登録確認部
１４０２ … 意味問い合わせ部
１４０３ … 意味検証部
１４０４ … 登録処理部 DESCRIPTION OF SYMBOLS 1 ... Conversation control apparatus 100 ... Input part 200 ... Speech recognition part 300 ... Conversation control part 400 ... Sentence analysis part 500 ... Conversation database 600 ... Output part 700 ... Speech recognition dictionary memory | storage part 800 ... New word registration part 1401 ... Registration confirmation part 1402 ... Semantic inquiry unit 1403 ... Semantic verification unit 1404 ... Registration processing unit

Claims

ユーザからの発話に応答する回答を出力することが可能な会話制御装置において、
複数の話題特定情報を記憶する会話データベース手段と、
従前の会話及び従前の回答により定まる談話履歴を記憶し、この談話履歴により定まる話題特定情報と発話情報とを照合して回答文を選択する会話制御手段と、
利用者の発話に含まれる語が、前記会話データベース手段に記憶された話題特定情報の内いずれにも相当しない場合、その語を新たな話題特定情報として会話データベース手段に記憶させるとともに、利用者にその語の意味を質問する文を出力し、利用者から返された返答をその話題特定情報に対応する意味情報文として会話データベース手段に記憶させる新単語登録手段と
を有することを特徴とする会話制御装置。 In a conversation control device capable of outputting an answer in response to an utterance from a user,
A conversation database means for storing a plurality of topic specific information;
A conversation control means for storing a conversation history determined by a previous conversation and a previous answer, and selecting a reply sentence by comparing topic identification information and utterance information determined by the conversation history;
If a word included in the user's utterance does not correspond to any of the topic identification information stored in the conversation database means, the word is stored in the conversation database means as new topic identification information, and the user Conversation characterized by having a new word registration means for outputting a sentence asking about the meaning of the word and for storing a response returned from the user in the conversation database means as a semantic information sentence corresponding to the topic specifying information Control device.

請求項１に記載の会話制御装置において、前記新単語登録手段は、ある話題特定情報について記憶された意味情報文を用いて、その話題特定情報の意味情報文の検証を行うための検証質問文を生成し、利用者がこの検証質問文に対して肯定する返答を返した場合には、その意味情報文の信頼度を示す数値を増加させ、次回検証質問文を生成する場合には、信頼度を示す数値が最も大きい意味情報文を用いて、その意味情報文の検証を行うための検証質問文を生成することを特徴とする会話制御装置。 2. The conversation control device according to claim 1, wherein the new word registration means uses a semantic information sentence stored for a certain topic specifying information to verify a semantic information sentence of the topic specifying information. When the user returns an affirmative response to the verification question sentence, the numerical value indicating the reliability of the semantic information sentence is increased, and when the next verification question sentence is generated, the A conversation control device characterized by generating a verification question sentence for verifying a semantic information sentence using a semantic information sentence having the largest numerical value indicating the degree.

請求項１に記載の会話制御装置において、前記新単語登録手段は、ある話題特定情報について記憶された意味情報文を用いてその意味情報文の検証を行うための検証質問文を生成し、利用者がこの検証質問文に対して否定する返答を返した場合には、その話題特定情報の意味を問い質すための問い合わせ質問文を生成し、利用者がこの問い合わせ質問文に対する返答を返した場合に、その返答をその話題特定情報の新たな意味情報文として会話データベース手段に記憶させることを特徴とする、会話制御装置。 2. The conversation control apparatus according to claim 1, wherein the new word registration unit generates a verification question sentence for verifying the semantic information sentence using the semantic information sentence stored with respect to certain topic specifying information, and uses the verification question sentence. When the user returns a negative response to this verification question message, a query question message is generated to interrogate the meaning of the topic specific information, and the user returns a response to this query question message And storing the response in the conversation database means as a new semantic information sentence of the topic identification information.

ユーザからの発話に応答する回答を出力することが可能な会話制御方法において、
複数の話題特定情報を予め記憶させるステップと、
利用者の発話に含まれる語が話題特定情報として未だ登録されていない場合、その語について、利用者に意味を問い合わせる問い合わせ質問文を出力するステップと、
前記問い合わせ質問文に対する利用者の回答を取得するステップと、
前記語を新たな話題特定情報として記憶させるとともに、前記回答をその話題特定情報の意味情報文として記憶させるステップと
を有することを特徴とする、会話制御方法。 In a conversation control method capable of outputting an answer in response to an utterance from a user,
Storing a plurality of pieces of topic specific information in advance;
If a word included in the user's utterance is not yet registered as topic identification information, a step of outputting an inquiry question for inquiring the user about the meaning of the word,
Obtaining a user's answer to the inquiry question;
Storing the word as new topic specifying information and storing the answer as a semantic information sentence of the topic specifying information.

請求項４に記載の会話制御方法において、
ある話題特定情報について記憶された意味情報文を用いて、その話題特定情報の意味情報文の検証を行うための検証質問文を生成し、出力するステップと、
利用者がこの検証質問文に対して肯定する返答を返した場合には、その意味情報文の信頼度を示す数値を増加させるステップと、
前記話題特定情報について次に検証質問文を生成する場合には、信頼度を示す数値が最も大きい意味情報文を用いて、その意味情報文の検証を行うための検証質問文を生成するステップと
をさらに有することを特徴とする会話制御方法。 The conversation control method according to claim 4,
Generating and outputting a verification question sentence for verifying the semantic information sentence of the topic specifying information using the semantic information sentence stored for the topic specifying information; and
If the user returns an affirmative response to the verification question sentence, increasing the numerical value indicating the reliability of the semantic information sentence;
Next, when generating a verification question sentence for the topic identification information, using a semantic information sentence having the largest numerical value indicating reliability, generating a verification question sentence for verifying the semantic information sentence; A conversation control method characterized by further comprising:

請求項４に記載の会話制御方法において、
ある話題特定情報について記憶された意味情報文を用いてその意味情報文の検証を行うための検証質問文を生成し、出力するステップと、
利用者がこの検証質問文に対して否定する返答を返した場合には、その話題特定情報の意味を問いただすための問い合わせ質問文を生成し、出力するステップと、
利用者がこの問い合わせ質問文に対する返答を返した場合に、その返答をその話題特定情報の新たな意味情報文として記憶させるステップと
をさらに有することを特徴とする会話制御方法。 The conversation control method according to claim 4,
Generating and outputting a verification question sentence for verifying the semantic information sentence using the semantic information sentence stored with respect to certain topic identification information;
If the user returns a negative response to the verification question sentence, a step of generating and outputting an inquiry question sentence for asking the meaning of the topic identification information;
And a step of storing the response as a new semantic information sentence of the topic specifying information when the user returns a response to the inquiry question sentence.

請求項１から請求項３のいずれかに記載の会話制御装置としてコンピュータを機能させるためのプログラム。 The program for functioning a computer as a conversation control apparatus in any one of Claims 1-3.

請求項４から請求項６のいずれかに記載の会話制御方法をコンピュータに実行させるためのプログラム。 The program for making a computer perform the conversation control method in any one of Claims 4-6.