JPH04184397A

JPH04184397A - Voice controller

Info

Publication number: JPH04184397A
Application number: JP2313546A
Authority: JP
Inventors: Atsushi Ookumo; 篤大蜘蛛
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1990-11-19
Filing date: 1990-11-19
Publication date: 1992-07-01
Anticipated expiration: 2013-12-02
Also published as: JP2830460B2

Abstract

PURPOSE: To allow a machine to be controlled with the feeling of a conversation between people, by providing a controlled variable setting section that determines a controlled variable from at least two of the following: the recognition result from a voice recognizing section, the classification result from a vocabulary classifying section, and the controlled variable stored in a memory. CONSTITUTION: The vocabulary classifying section 3 classifies the recognized word as either a basic word or an auxiliary word. For a basic word, the 1st controlled variable setting section 4 executes control by the value registered for the recognized word, and this quantity is stored in a memory 6. For an auxiliary word, the word is sent from the vocabulary classifying section 3 to the 2nd controlled variable setting section 5, which executes the control; this controlled variable is also stored in the memory 6. The control section 7 controls an appliance 8 according to the controlled variable. Because the quantity controlled immediately before is stored in the memory, an adverb that is input by voice next can be given a meaning. Voice input can therefore proceed as if people were conversing, which relieves the tedium of dealing with the voice controller and lets control operations be carried out with a feeling close to conversation with people.

Description

DETAILED DESCRIPTION OF THE INVENTION

Field of Industrial Application

The present invention relates to a voice control device that lets a user operate equipment by voice input in a manner close to a conversation between people.

Conventional Technology

With the development of mechanical and electrical technology, we are surrounded by a wide variety of devices. Many of them provide some kind of adjustment control, such as a dial, a lever, or buttons. When a person makes such an adjustment, their keen senses let them reach the optimal value after a few trials.

Speech recognition, on the other hand, is still a developing technology and cannot yet be called mature. Depending on the method and the application, however, some systems have reached practical use, and speech recognition is increasingly being built into various devices.

Conventionally, when a machine is operated by speech recognition, in most cases a person utters a word, the machine recognizes it, and the device is operated according to the result. For operations that are repeated many times in succession, such as adjustments, the user must utter the same word over and over, which can become bothersome.

In a conversation between people, by contrast, both parties follow the flow of the conversation. An adverb alone, such as "a little more" or "the other way", which modifies an adjective used earlier in the conversation, is therefore sometimes enough to be understood.

Problems to Be Solved by the Invention

Machines, however, lack this kind of flexibility, so one cannot talk to them as one would to a person. The mismatch with ordinary conversation between people makes such devices feel very unnatural to use, and adjustment operations by speech recognition end up being more bothersome rather than less.

The present invention is intended to solve the conventional problems described above. Its object is to provide a voice control device that lets the user control a machine with the feeling of conversing with another person when performing, by voice input, operations that are repeated many times in succession, such as adjustments.

Means for Solving the Problems

To achieve this object, the present invention comprises: a speech recognition section that recognizes speech input; a word meaning classification section that classifies the result recognized by the speech recognition section into two or more classes according to word meaning; a control amount setting section that sets a control amount according to the classification result from the word meaning classification section; and a memory that stores the control amount most recently set by the control amount setting section. The control amount setting section determines the control amount from at least two of the following: the recognition result from the speech recognition section, the classification result from the word meaning classification section, and the control amount stored in the memory.

Operation

With the above configuration, the present invention stores the most recently controlled amount in the memory. As a result, when the next voice input is an adverb that modifies an adjective from the preceding exchange and has no meaning by itself, that adverb can still be given a meaning. This makes it possible to realize a voice control device whose control operations feel closer to a conversation between people.

Embodiment

An embodiment of the present invention is described below with reference to the drawing. The figure is a block diagram of a voice control device according to an embodiment of the present invention.

In the figure, 1 is a microphone for speech input; 2 is a speech recognition section that recognizes the speech; 3 is a word meaning classification section that classifies the word recognized by the speech recognition section 2 as a basic word or an auxiliary word; 4 is a first control amount setting section that determines the control amount when the word recognized by the speech recognition section 2 is a basic word; 5 is a second control amount setting section that determines the control amount when the recognized word is an auxiliary word; 6 is a memory that stores the control amount last set by the first control amount setting section 4 or the second control amount setting section 5; and 7 is a control section that controls the controlled device 8 according to the control amount determined by the first control amount setting section 4 or the second control amount setting section 5.

The operation of the voice control device configured as described above is explained below, taking as an example the case where the controlled device 8 is a volume control and the operation is to turn the volume up or down.

First, several items must be set in advance. The words to be used for speech recognition are registered in the speech recognition section 2. The word meaning classification section 3 is given, for each word registered in the speech recognition section 2, information on whether it is a basic word or an auxiliary word. The basic words are two words with opposite meanings, optionally together with their synonyms. The auxiliary words are words that affirm or negate the previously uttered word, words that express degree, and the like. In the first control amount setting section 4, the amount to be controlled is registered for each basic word as a value to be added to or subtracted from the value before control. In the second control amount setting section 5, the control amount is registered for each auxiliary word as a multiplier applied to the control performed immediately before.

For the volume adjustment example, "louder" and "quieter" are registered as basic words, and "a little more" is registered as an auxiliary word. The adjustment amounts are set to +2.0 for "louder", -2.0 for "quieter", and +0.5 for "a little more".
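The registered settings can be pictured as two small lookup tables plus a classification step. The following is a minimal sketch, not the patent's implementation: the names `BASIC_WORDS`, `AUXILIARY_WORDS` and `classify` are hypothetical, and the English strings stand in for the Japanese words of the embodiment.

```python
from typing import Optional

# Hypothetical tables mirroring the embodiment's example settings.
# Basic words: amount added to the current value (first setting section 4).
BASIC_WORDS = {"louder": 2.0, "quieter": -2.0}
# Auxiliary words: multiplier applied to the previous control amount
# (second setting section 5).
AUXILIARY_WORDS = {"a little more": 0.5}


def classify(word: str) -> Optional[str]:
    """Return "basic", "auxiliary", or None for an unregistered word."""
    if word in BASIC_WORDS:
        return "basic"
    if word in AUXILIARY_WORDS:
        return "auxiliary"
    return None  # assumption: the source does not say how unknown words are handled
```

Keeping the two tables separate reflects the distinction in the text: a basic word carries an absolute amount, while an auxiliary word only has meaning relative to the previous control.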

When a spoken word is input from the microphone 1, the speech recognition section 2 recognizes it and sends the recognition result, as a word, to the word meaning classification section 3. The word meaning classification section 3 determines whether that word is a basic word or an auxiliary word.

For a basic word, control is performed by the value registered in advance for that word in the first control amount setting section 4. At the same time, the amount of control performed is stored in the memory 6.

If "louder" is input, the volume is raised from its current value by the registered value, that is, by +2.0, and at the same time this control amount of +2.0 is stored in the memory.
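The basic-word step amounts to one addition and one store. A minimal sketch, assuming a hypothetical table `BASIC_AMOUNTS` and function `apply_basic_word`, and an arbitrary starting volume of 5.0 that does not come from the source:

```python
# Hypothetical table of registered amounts for the basic words.
BASIC_AMOUNTS = {"louder": 2.0, "quieter": -2.0}


def apply_basic_word(volume: float, word: str) -> tuple[float, float]:
    """Return (new volume, amount to store in memory) for a basic word."""
    amount = BASIC_AMOUNTS[word]
    return volume + amount, amount


# Starting volume 5.0 is an illustrative assumption.
new_volume, stored = apply_basic_word(5.0, "louder")
```

Here `new_volume` is 7.0 and `stored` is 2.0, the value that a later auxiliary word would refer back to.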

When the recognition result is an auxiliary word, the word is sent from the word meaning classification section 3 to the second control amount setting section 5. The previous control amount stored in the memory 6 is then read out and multiplied by the value registered in advance for the word, and control is performed by the product. This control amount is also stored in the memory 6. For example, when "a little more" is input, the value registered for this word, +0.5, is multiplied by the preceding control amount held in the memory 6 (here +2.0), and the volume is adjusted by +1.0. At the same time, this control amount of +1.0 is stored in the memory 6.
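The whole flow, including the memory, can be sketched as a small class. This is an illustration under stated assumptions rather than the patent's implementation: the class and attribute names are hypothetical, English strings stand in for the Japanese words, the starting volume is arbitrary, and the handling of an auxiliary word with nothing yet in memory (ignored here) is not specified in the source.

```python
from typing import Optional

BASIC = {"louder": 2.0, "quieter": -2.0}   # amounts to add (section 4)
AUXILIARY = {"a little more": 0.5}         # multipliers (section 5)


class VoiceController:
    """Hypothetical sketch of sections 3 to 7 acting on a volume value."""

    def __init__(self, volume: float) -> None:
        self.volume = volume                   # state of controlled device 8
        self.memory: Optional[float] = None    # memory 6: last control amount

    def handle(self, word: str) -> float:
        """Apply one recognized word and return the control amount used."""
        if word in BASIC:
            amount = BASIC[word]
        elif word in AUXILIARY and self.memory is not None:
            # Scale the previous control amount by the registered multiplier.
            amount = self.memory * AUXILIARY[word]
        else:
            return 0.0  # assumption: unknown word or empty memory is a no-op
        self.volume += amount
        self.memory = amount  # every control amount is stored for next time
        return amount


ctrl = VoiceController(volume=5.0)   # starting volume is an assumption
first = ctrl.handle("louder")          # control amount +2.0
second = ctrl.handle("a little more")  # 0.5 * 2.0 = +1.0
```

Because the sign of the stored amount carries over, saying "quieter" and then "a little more" lowers the volume by a further 1.0, which is how the adverb takes its meaning from the preceding word.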

The control section 7 controls the controlled device 8 (here, the volume control) according to the control amount sent from the first control amount setting section 4 or the second control amount setting section 5.

In this embodiment the speech recognition section 2 recognizes single words, but the unit of recognition may be something other than a word, such as a sentence.

Effects of the Invention

As described above, the present invention stores the most recently controlled amount in the memory, so that when the next voice input is an adverb that modifies an adjective from the preceding exchange and has no meaning by itself, that adverb can still be given a meaning. Consequently, when a device is controlled by voice input many times in succession, the user can speak as if talking with another person, which reduces the bother of using a voice control device. The result is a voice control device whose control operations feel closer to a conversation between people.

Brief Description of the Drawing

The figure is a block diagram of a voice control device according to an embodiment of the present invention.

1: microphone; 2: speech recognition section; 3: word meaning classification section; 4: first control amount setting section; 5: second control amount setting section; 6: memory; 7: control section; 8: controlled device.

Agent: Patent attorney Akira Kokaji, and two others

Claims

A voice control device comprising: a speech recognition section that recognizes speech input; a word meaning classification section that classifies the result recognized by the speech recognition section into two or more classes according to word meaning; a control amount setting section that sets a control amount according to the classification result from the word meaning classification section; and a memory that stores the control amount most recently set by the control amount setting section, wherein the control amount setting section determines the control amount from at least two of: the recognition result from the speech recognition section, the classification result from the word meaning classification section, and the control amount stored in the memory.