JPH0451299A

JPH0451299A - Voice recognition controller

Info

Publication number: JPH0451299A
Application number: JP2159793A
Authority: JP
Inventors: Tetsuo Furuya; 古谷　哲夫; Gichu Ota; 義注太田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1990-06-20
Filing date: 1990-06-20
Publication date: 1992-02-19

Abstract

PURPOSE:To inform a user of a word or phrase to be recognized by a main body device in standard voice at the time of the recognition when the user makes a request at an optional point of time by providing a voice input means, a voice recognizing means, a control means which controls the operation of the main body device, a voice reproducing means, and a manual operation means. CONSTITUTION:The user operates a help key 5 and then an interruption signal is inputted to the interruption terminal 46 of a control part 4 through a 1st encoder 6. The control part 4 when detecting the interruption signal sends a standard voice reproduction command to a voice recognition part 3. The voice recognition part 3 when receiving the reproduction command detects the address where data on the standard voice of a word of a number on a 2nd memory 33 is recorded according to the number of the current word to be recognized. Then the data is read out in order and decoded in real time when necessary to output an analog voice signal to an amplifier 8 by the D/A conversion part of an input/output part 34, thereby outputting a voice from a speaker 9.

Description

【発明の詳細な説明】［産業上の利用分野］本発明は、利用者の操作に基づき装置の運転制御を行う
音声認識制御装置に係り、特に、利用者の発声入力に基
づいて１例えば空肩機等の装置の運転制御を行うのに好
適な音声認識制御装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a voice recognition control device that controls the operation of a device based on a user's operation. The present invention relates to a voice recognition control device suitable for controlling the operation of devices such as shoulder aircraft.

［従来の技術］従来の音声認識制御装置は、例えば特開昭５６−１０２
６３５号公報に記載された「空調機の制御装置」が知ら
れている。[Prior Art] A conventional voice recognition control device is disclosed in, for example, Japanese Patent Application Laid-open No. 56-102.
2. Description of the Related Art "Air conditioner control device" described in Japanese Patent No. 635 is known.

この従来例は、利用者があらかじめ運転命令語の音声を
発声して登録しておき、例えば空調機を操作するときは
、利用者が上記運転命令語を発声して、空調機がこれを
認識することにより行う。In this conventional example, the user vocalizes and registers the operating command in advance. For example, when operating the air conditioner, the user speaks the operating command and the air conditioner recognizes it. Do by doing.

つまり利用者が発声した音声と上記登録した音声とを比
較して、最も類似した音声の運転命令語を選択し、これ
に対応する動作を行う。In other words, the voice uttered by the user is compared with the registered voice, the most similar driving command is selected, and the corresponding operation is performed.

また、利用者の手動による操作をも可能とし、音声によ
る操作と手動による操作との切り替えは、音声または手
動操作により行うことができる。Further, manual operation by the user is also possible, and switching between voice operation and manual operation can be performed by voice or manual operation.

本従来例によれば、利用者は空調機の操作手段として、
音声入力または手動操作を自由に選択して用いる。こと
ができる。According to this conventional example, the user can operate the air conditioner by
Freely select and use voice input or manual operation. be able to.

［発明が解決しようとする課題］上記従来技術は、利用者が登録した運転命令語を忘れた
り、利用者の発声の仕方が長期的に変動する場合につい
ての配慮がなされていなかった。[Problems to be Solved by the Invention] The above-mentioned conventional technology does not take into account cases where the user forgets the registered driving command words or the way the user speaks changes over time.

つまり、利用者が登録した運転命令語と大きく異なる音
声を発声入力すると、空調機はどの登録音声とも類似し
ない音声として認識不能の応答をするか、他の運転命令
語と誤認識して利用者の意図しない動作をすることがあ
る。このような場合、および利用者が運転命令語を全く
忘れてしまった場合には、利用者は登録した運転命令語
の記録を見直すことになる。しかし、上記記録が残って
いない場合、あるいは同じ単語でも利用者の発声の仕方
が登録時と異なっていると思われる場合には、利用者は
あらためて運転命令語の音声を登録し直さなければなか
った。In other words, if the user inputs a voice that is significantly different from the registered driving command, the air conditioner will respond with an unrecognizable voice that is not similar to any registered voice, or it will misrecognize it as another driving command and the user will may behave unintentionally. In such a case, or if the user has completely forgotten the driving command, the user will have to review the record of the registered driving command. However, if the above record does not remain, or if the way the user pronounces the same word is different from when it was registered, the user will have to re-register the audio of the driving command. Ta.

また、特定の話者が発声した運転命令語を最初から標準
音声として登録しておき、利用者毎の音声の登録は行わ
ず、上記標準音声を比較の対象として不特定の利用者が
発声する運転命令語を認識する方式とした場合には、利
用者に上記標準音声そのものを知らせることについての
配慮がなされていなかった。つまり、利用者が上記登録
されている運転命令語の音声を発声しても発声の仕方が
上記標準音声と大きく異なる場合には、その音声は正し
く認識されないおそれがある。このように利用者が発声
する運転命令語が何回も連続して正しく認識されない場
合、利用者は上記標準的音声、つまり正しく認識される
音声を知る手段が容易されていなかった。In addition, the driving commands uttered by a specific speaker are registered as standard voices from the beginning, and the voices are not registered for each user, but are uttered by unspecified users using the standard voices as a comparison target. When using a system that recognizes driving command words, no consideration was given to informing the user of the standard voice itself. In other words, even if the user utters the voice of the registered driving command word, if the manner of utterance is significantly different from the standard voice, the voice may not be recognized correctly. In this way, when the driving commands uttered by the user are not recognized correctly many times in a row, the user has no easy way to know the standard voice, that is, the voice that is correctly recognized.

さらに、上記従来技術は、利用者が任意の時点で、空調
機の動作設定状態を音声により知りたいという要望を満
たしていなかった。つまり、冷房。Furthermore, the above-mentioned conventional technology does not satisfy the user's desire to know the operating setting state of the air conditioner by voice at any time. In other words, air conditioning.

送風等の運転状態や目標温度等の設定情報を、利用者が
要求した時点で音声により知らせる機能が備わっていな
かった。したがって、利用者はこれらの情報を知りたい
場合、空調機の本体または操作器の表示を見る必要があ
る。つまり操作を音声で行うことにより、盲人でも操作
ができ、暗がりでも操作ができるようになるが、操作に
必要な動作設定状態の情報は視覚により確認する必要が
あり、盲人の使用や暗がり等での使用には不便が生じる
という問題があった。It did not have a function to notify the user of operating status such as air blowing, setting information such as target temperature, etc. by voice when requested by the user. Therefore, if a user wants to know this information, he or she needs to look at the display on the air conditioner's main body or controller. In other words, by using voice commands, even blind people can perform operations even in the dark, but the information on the operating settings necessary for operation must be confirmed visually, making it difficult for blind people to use the machine or in the dark. There was a problem in that it was inconvenient to use.

本発明の目的は、上記従来技術の問題点を解決し、制御
すべき本体装置が認識の対象とする語句を、利用者の任
意の時点での要求により、認識の際に比較の標準とする
音声で利用者に知らせることができる音声認識制御装置
を提供することにある。An object of the present invention is to solve the above-mentioned problems of the prior art, and to set the words and phrases to be recognized by the main unit to be controlled as standards for comparison at the time of recognition at the user's request at any time. An object of the present invention is to provide a voice recognition control device that can notify a user by voice.

また、本体装置の動作設定状態の情報を、利用者の任意
の時点での要求により音声で利用者に知らせることので
きる音声認識制御装置を提供することにある。Another object of the present invention is to provide a voice recognition control device that can notify a user of information on the operational setting state of a main body device by voice according to the user's request at any time.

［課題を解決するための手段］上記目的を達成するために１本発明に係る音声認識制御
装置のもっとも基本的な構成は、利用者の発声入力によ
る運転操作に基づき、制御すべき本体装置の運転制御を
行う音声認識制御装置において、利用者の発声する命令
語の音声を入力する音声入力手段と、その入力された音
声の特徴量を抽出し、あらかじめ登録されている命令語
の標準音声の特徴量と比較することにより、該命令語を
認識して認識結果を出力する音声認識手段と、前記認識
結果に基づいて前記本体装置の運転制御を行う制御手段
と、この制御手段からの指示信号に基づき前記標準音声
の再生を行う音声再生手段と、利用者が手動操作を行う
手動操作部と、その手動操作を表わす信号を前記制御手
段に出力する手動操作手段とを備え、利用者の特定の手
動操作により前記標準音声の再生を行うようにしたもの
である。[Means for Solving the Problems] In order to achieve the above object, the most basic configuration of the voice recognition control device according to the present invention is to control the main body device to be controlled based on the driving operation by the user's voice input. A voice recognition control device that performs driving control includes a voice input means for inputting the voice of the command word uttered by the user, and a voice input means for inputting the voice of the command word uttered by the user. A voice recognition means that recognizes the command word and outputs a recognition result by comparing it with a feature amount, a control means that controls the operation of the main unit based on the recognition result, and an instruction signal from the control means. an audio reproduction means for reproducing the standard audio based on the user's identification; a manual operation section for manual operation by the user; and a manual operation means for outputting a signal representing the manual operation to the control means; The standard audio is played back by manual operation.

より詳しく述べれば、上記目的は以下の手段により達成
できる。More specifically, the above object can be achieved by the following means.

音声認識制御装置の操作器に、認識対象とする語句の音
声再生、あるいは動作設定状態等の音声出力を利用者が
要求するための特定のスイッチ等を設ける。そして音声
認識制御装置の本体には上記認識対象語句の音声あるい
は上記動作設定状態等の情報を表現する音声を再生する
音声再生手段、および上記特定のスイッチ等の操作を検
知して上記音声再生手段を動作させる制御手段を設ける
。The operating device of the voice recognition control device is provided with a specific switch or the like for the user to request audio playback of words to be recognized or audio output of operation setting status, etc. The main body of the voice recognition control device includes a voice reproduction means for reproducing the voice of the recognition target word or a voice expressing information such as the operation setting state, and a voice reproduction means that detects the operation of the specific switch etc. A control means is provided to operate the.

つまり、上記音声再生手段は、上記認識対象語句や上記
情報等を表現する音声の音声データを記録保持する記録
手段、外部からの制御信号に基づき上記音声データを選
択する選択手段、および該選択された音声データを音声
信号に復号化して出力する復号化手段からなる。In other words, the audio reproduction means includes a recording means for recording and holding audio data of the speech expressing the recognition target phrase and the information, a selection means for selecting the audio data based on an external control signal, and a selection means for selecting the audio data based on an external control signal. It consists of a decoding means that decodes the recorded audio data into an audio signal and outputs it.

なお、上記認識対象語句の音声は、認識の際に利用者の
発声する入力音声と比較する標準音声としてあらかじめ
登録しである音声とする。Note that the voice of the recognition target phrase is a voice that has been registered in advance as a standard voice to be compared with the input voice uttered by the user during recognition.

また、上記制御手段は、現時点の上記動作設定状態等を
常に保持し、上記スイッチ等の操作を検知すると上記保
持している動作設定状態等に基づき、これを表現する上
記音声データを選択して再生させる制御信号を上記音声
再生手段に出力する。Further, the control means always maintains the current operation setting state, etc., and when detecting an operation of the switch, etc., selects the audio data representing the operation setting state, etc., based on the held operation setting state, etc. A control signal for reproduction is output to the audio reproduction means.

利用者が上記音声出力の要求をする方法は、上記特定の
スイッチ等の操作による代わりに要求を示す特定の語句
を定め、利用者が該語句を発声し音声認識制御装置がこ
れを認識して上記音声出力を行う方法としてもよい、つ
まり、音声認識制御装置が有する音声認識手段に該語句
を認識したことを示す信号を出力する機能を設け、上記
制御手段が該信号を検知することにより上記音声出力を
行う構成とすればよい。The method for the user to request the above-mentioned voice output is to define a specific phrase indicating the request instead of operating the specific switch, etc., the user utters the phrase, and the voice recognition control device recognizes this. The above-mentioned voice output may be performed; in other words, the voice recognition means of the voice recognition control device is provided with a function of outputting a signal indicating that the word/phrase has been recognized, and the control means detects the signal. It may be configured to output audio.

［作用］利用者が認識対象語句の音声再生を要求して特定のスイ
ッチ等の操作を行うと、制御手段がこれを検知して該認
識対象語句の音声データを選択して再生させる制御信号
を音声再生手段に出力する。[Operation] When the user requests audio playback of the recognition target word and operates a specific switch, etc., the control means detects this and sends a control signal to select and reproduce the audio data of the recognition target word. Output to audio reproduction means.

この音声再生手段は、前記制御信号を入力し、これに基
づき前記認識対象語句の音声データを順次選択入力し、
これを復号化して音声信号として出力する。上記音声デ
ータは認識の際に利用者の発声する入力音声との比較に
用いる標準音声のデータとする。したがって、利用者は
任意の時点で認識対象語句の音声再生を要求することが
でき、かつ認識の際に標準音声として用いている音声を
知ることができる。The audio reproduction means inputs the control signal and sequentially selects and inputs the audio data of the recognition target words based on the control signal,
This is decoded and output as an audio signal. The above audio data is standard audio data used for comparison with the input audio uttered by the user during recognition. Therefore, the user can request audio playback of the recognition target phrase at any time, and can also know the audio used as the standard audio during recognition.

また、利用者が動作設定状態等の音声出力を要求して上
記特定のスイッチ等の操作を行うと、上記制御手段がこ
れを検知して保持している現時点の動作設定状態等に基
づき、これを表現する音声データを選択して再生させる
制御信号を上記音声再生手段に出力する。上記音声再生
手段は該制御信号を入力し、これが示す音声データを順
次選択入力し、これを復号化して音声信号として出力す
る。したがって、利用者は任意の時点で操作設定情報等
を音声により知ることができる。In addition, when the user requests audio output of the operation setting status, etc. and operates the specific switch, etc., the control means detects this and outputs the voice output based on the current operation setting status etc. A control signal for selecting and reproducing audio data expressing the is output to the audio reproducing means. The audio reproduction means inputs the control signal, sequentially selects and inputs audio data indicated by the control signal, decodes the data, and outputs the decoded data as an audio signal. Therefore, the user can obtain operation setting information and the like by voice at any time.

さらに、利用者が上記音声出力を要求する手段として特
定のスイッチ等の操作の代わりに特定の語句を発声入力
する方式では１次のように動作する。すなわち、上記音
声認識手段は、通常、音声認識の動作中以外には利用者
の発声入力を待機している。利用者が上記特定の語句を
発声すると該音声！！識手段は該発声を入力して音声認
識の動作を行い、上記特定の語句を認識するとこれを示
す信号を出力する。上記制御手段は該信号を検知すると
上記特定のスイッチ等の操作を検知した場合と全く同様
の動作を行い、上記音声再生手段に所定の音声の出力を
行わせる。したがって、利用者は任意の時点で特定の語
句を発声することにより、認識対象の語句や、その時点
での動作設定状態の情報等を音声により知ることができ
る。なお、上記音声認識手段が利用者の発声入力を待機
していない期間、つまり音声認識の動作を行なっている
期間は、利用者が運転命令語等の語句を発声入力中か、
その直後の期間である。よってこの期間には、利用者は
認識対象の語句や動作設定状態等の音声出力を要求する
ことはなく、利用者が要求しうる任意の時点で上記音声
出力を行うことができる。Furthermore, in a method in which the user vocally inputs a specific phrase instead of operating a specific switch or the like as a means for requesting the audio output, the system operates as follows. That is, the voice recognition means normally waits for the user's voice input except when voice recognition is in progress. When the user utters the above specific words, the corresponding voice! ! The recognition means inputs the utterance and performs a voice recognition operation, and when it recognizes the specific word or phrase, outputs a signal indicating this recognition. When the control means detects the signal, it performs the same operation as when detecting the operation of the specific switch, etc., and causes the sound reproduction means to output a predetermined sound. Therefore, by uttering a specific word or phrase at any time, the user can hear the word or phrase to be recognized, information on the operation settings at that time, and the like. Note that during the period when the voice recognition means is not waiting for the user's voice input, that is, the voice recognition operation is performed, it is possible to determine whether the user is inputting words such as driving commands or the like by voice.
This is the period immediately after that. Therefore, during this period, the user does not request the voice output of words to be recognized, operation settings, etc., and the voice output can be performed at any time the user requests.

［実施例］以下５本発明の各実施例を第１図ないし第１６図を参照
して説明する。[Embodiments] Each of the five embodiments of the present invention will be described below with reference to FIGS. 1 to 16.

まず、本発明による音声認識制御装置の一実施例として
、音声認識形空調機の制御装置の構成を第１図ないし第
５図を参照して説明する。First, as an embodiment of the voice recognition control device according to the present invention, the configuration of a voice recognition type air conditioner control device will be described with reference to FIGS. 1 to 5.

第１図は、本発明の一実施例に係る音声認識形空調機の
制御系のブロック図、第２図は、第１図の空調機の外観
の一例を示す略示構成図、第３図は、第１図の音声認識
部の一構成例を示すブロック図、第４ｒＭは、制御部の
一構成例を示すブロック図、第５図は、音声合成器の一
構成例を示すブロック図である。FIG. 1 is a block diagram of a control system of a voice recognition type air conditioner according to an embodiment of the present invention, FIG. 2 is a schematic configuration diagram showing an example of the external appearance of the air conditioner shown in FIG. 1, and FIG. is a block diagram showing an example of the configuration of the speech recognition section in FIG. 1, 4rM is a block diagram showing an example of the configuration of the control section, and FIG. be.

第１図において、マイクロホン１は、利用者が空調器を
操作するために運転命令語を発声入力するものである０
発声入力された音声信号は増幅器２を介して音声認識部
３に入力される。音声認識部３は、アナログ音声信号を
入力し、その音声信号とあらかじめ登録されている語句
の標準音声との特徴量どうしを比較して、その音声信号
がどの語句のものであるかを認識して該語句の番号等を
制御部４に出力するものである。これは、市販の音声認
識ＬＳＩ、あるいは汎用１チツプ型マイクロプロセツサ
等を用いて構成することができる。In FIG. 1, a microphone 1 is used by the user to vocally input operating commands to operate the air conditioner.
The input voice signal is input to the voice recognition section 3 via the amplifier 2. The speech recognition unit 3 inputs an analog speech signal, compares the feature values of the speech signal and standard speech of words and phrases registered in advance, and recognizes which word or phrase the speech signal belongs to. The number of the word or phrase is output to the control section 4. This can be constructed using a commercially available voice recognition LSI or a general-purpose one-chip microprocessor.

手動操作キー群１４は、利用者が手動により空調機の操
作を行うためのキースイッチ等である。The manual operation key group 14 includes key switches and the like for the user to manually operate the air conditioner.

これらの操作で生じる信号は第２のエンコーダ１５を介
して制御部４に入力される。Signals generated by these operations are input to the control section 4 via the second encoder 15.

ヘルプキー５は、利用者が認識対象単語の標準音声の再
生や、設定温度等の空調機の動作状態等の情報の音声出
力を要求するために操作するキースイッチ等である。こ
の操作で生じうる信号は第１のエンコーダ６を介して制
御部４に入力される。The help key 5 is a key switch or the like that is operated by the user to request the reproduction of the standard voice of the recognition target word or the voice output of information such as the operating status of the air conditioner such as the set temperature. A signal that may be generated by this operation is input to the control unit 4 via the first encoder 6.

音声合成器７は、制御部４から語句の番号等を入力し、
あらかじめ内部のメモリに記録されている該番号に対応
する音声の符号化データを順次復号化し、アナログ音声
信号として出力するものである。これは市販の音声合成
ＬＳＩ等を用いて構成することができる。上記音声信号
は増幅器８で増幅され、スピーカ９から音声として出力
される。The speech synthesizer 7 inputs the word number etc. from the control unit 4,
The audio encoded data corresponding to the number recorded in advance in the internal memory is sequentially decoded and output as an analog audio signal. This can be constructed using a commercially available speech synthesis LSI or the like. The audio signal is amplified by an amplifier 8 and output as audio from a speaker 9.

制御部４は、利用者の発声入力やキー操作を表わす信号
、および温度、湿度等の情報を入力し。The control unit 4 inputs signals representing the user's vocal inputs and key operations, and information such as temperature and humidity.

これらに基づいて空調機の制御を行うものである。The air conditioner is controlled based on these.

これは、市販の汎用マイクロプロセッサ等を用いて構成
することができる。センサ１１は、室内。This can be configured using a commercially available general-purpose microprocessor or the like. The sensor 11 is indoors.

外の温度、湿度等を検知して電気信号に変換するもので
あり、その電気信号は第３のエンコーダＩＱを介して制
御部４に入力される６空調機駆動回路１２は、制御部４からの制御信号に基づ
き、空調機機構部１３を駆動する電気信号を発声するも
のである。空調機機構部１３は、空気圧縮機や送風ファ
ンなど、空調の動作を行う駆動部分である。It detects the outside temperature, humidity, etc. and converts it into an electrical signal, and the electrical signal is input to the control unit 4 via the third encoder IQ. Based on the control signal, an electric signal for driving the air conditioner mechanism section 13 is generated. The air conditioner mechanism section 13 is a drive section that performs air conditioning operations, such as an air compressor or a blower fan.

操作器１６は、利用者が発声または手動により空調機の
操作を行う部分であり、空調機のリモートコントローラ
等である。この操作器１６は、マイクロホン１．増幅器
２、ヘルプキー５、第１のエンコーダ６、手動操作キー
群１４．第２のエンコーダ１５により構成され、空調機
本体１７と結線１８により結合されている。The operating device 16 is a part through which the user operates the air conditioner by speaking or manually, and is a remote controller for the air conditioner. This operating device 16 is connected to the microphone 1. Amplifier 2, help key 5, first encoder 6, manual operation key group 14. It is composed of a second encoder 15 and is connected to the air conditioner main body 17 by a connection 18.

第１図に示す音声認識形空調機の外観の一例を第２図に
示す。FIG. 2 shows an example of the appearance of the voice recognition type air conditioner shown in FIG. 1.

次に、音声認識部３の一構成例を第３図に示して説明す
る。Next, an example of the configuration of the voice recognition section 3 will be described with reference to FIG. 3.

演算部３１は、第２のメモリ３３にあらかじめ記録され
たプログラムに従い、ディジタル信号を入力してこれに
演算を施し、結果をディジタル信号で出力するものであ
る。第１のメモリ３２は。The calculation section 31 inputs a digital signal, performs calculations on it, and outputs the result in the form of a digital signal, according to a program pre-recorded in the second memory 33. The first memory 32 is.

演算部３１が演算中に一時的にデータを記録する書き変
え可能なメモリであり、汎用ＲＡＭ　（ランダムアクセ
スメモリ）等である。第２のメモリ３３は、プログラム
やデータを半永久的に記録する読み出し専用メモリであ
り、汎用ＦＲＯＭ　（プログラマプルリードオンリーメ
モリ）等である。これには、演算部３１が音声認識のた
めの演算を行うプログラムや音声認識の際に参照する、
語句の標準音声の特徴量のデータ等を記録する。入出力
部３４は、演算部３１が外部との間で信号を入出力する
際に介するインタフェース回路であり、Ａ／Ｄ　（アナ
ログ−ディジタル）変換器、　Ｄ／Ａ　（ディジタル−
アナログ）変換器を含む。Ａ／Ｄ端子３５は入出力部３
４内のＡ／Ｄ変換器にアナログ信号を入力する端子であ
り、利用者の発声する音声信号を入力する。Ｄ／Ａ端子
３６は入出力部３４内のＤ／Ａ変換器からのアナログ信
号を出力する端子である。このアナログ信号は音声合成
器７と共用の増幅器８、スピーカ９により音声として出
力される０通信端子３７は外部との間でディジタル信号
の入出力を行う端子であり、ここでは制御部４との間で
信号の入出力を行う。This is a rewritable memory in which data is temporarily recorded while the calculation unit 31 is calculating, and is a general-purpose RAM (random access memory) or the like. The second memory 33 is a read-only memory that semi-permanently records programs and data, such as a general-purpose FROM (programmer pull read only memory). This includes a program that the calculation unit 31 uses to perform calculations for voice recognition, a program that the calculation unit 31 refers to during voice recognition, etc.
Data such as the feature amount of the standard speech of the phrase is recorded. The input/output unit 34 is an interface circuit through which the calculation unit 31 inputs and outputs signals to and from the outside, and includes an A/D (analog-digital) converter, a D/A (digital-digital) converter, and an A/D (analog-digital) converter.
analog) converter. A/D terminal 35 is input/output section 3
This is a terminal for inputting analog signals to the A/D converter in 4, and inputs audio signals uttered by the user. The D/A terminal 36 is a terminal that outputs an analog signal from the D/A converter in the input/output section 34. This analog signal is output as audio by an amplifier 8 shared with the voice synthesizer 7 and a speaker 9. The communication terminal 37 is a terminal for inputting and outputting digital signals with the outside. Signals are input and output between the two.

次に制御部４の一構成例を第４図に示して説明する。Next, an example of the configuration of the control section 4 will be described with reference to FIG. 4.

演算部４１は、第２のメモリ４３にあらかじめ記録され
たプログラムに従い、ディジタル信号を入力してこれに
演算を施し、結果をディジタル信号で出力するものであ
る。第１のメモリ４２は、演算部４１が演算中に一時的
にデータを記録する書き変え可能なメモリであり、汎−
用ＲＡＭ等である。第２のメモリ４３はプログラムやデ
ータを半永久的に記録する読み出し専用メモリであり、
汎用ＦＲＯＭ等である。The calculation unit 41 inputs a digital signal, performs calculation on it, and outputs the result as a digital signal, according to a program recorded in advance in the second memory 43. The first memory 42 is a rewritable memory in which data is temporarily recorded while the calculation unit 41 is calculating, and is a general-purpose memory.
RAM etc. The second memory 43 is a read-only memory that records programs and data semi-permanently,
This is a general-purpose FROM, etc.

入出力部４４は、演算部４１が外部との間で信号を入出
力する際に介するインタフェース回路である。ここでは
第１のエンコーダ６、第２のエンコーダ１５、第３のエ
ンコーダ１０からの信号を入力し、音声合成器７、空調
機駆動回路１２に信号を出力する。また、音声認識部３
との間で信号の入出力を行う。The input/output unit 44 is an interface circuit through which the calculation unit 41 inputs and outputs signals to and from the outside. Here, signals from the first encoder 6, second encoder 15, and third encoder 10 are input, and the signals are output to the speech synthesizer 7 and the air conditioner drive circuit 12. In addition, the voice recognition unit 3
Inputs and outputs signals between the

通信端子４５は、音声認識部３．音声合成器７との間で
信号の入出力を行う端子である。ここでは音声、単語の
番号等を出力し、認識結果の単語の番号等を入力する０
割込端子４６は外部から割込信号を入力する端子であり
、その割り込み信号の入力により演算部４１が割込演算
処理を行う。The communication terminal 45 connects the voice recognition unit 3. This is a terminal for inputting and outputting signals to and from the speech synthesizer 7. Here, the voice, word number, etc. are output, and the word number, etc. of the recognition result is input.
The interrupt terminal 46 is a terminal for inputting an interrupt signal from the outside, and upon input of the interrupt signal, the arithmetic unit 41 performs interrupt arithmetic processing.

次に、音声合成器７の一構成例を第５図に示して説明す
る。Next, an example of the configuration of the speech synthesizer 7 will be described with reference to FIG. 5.

メモリ５２は、データを半永久的に記録する読み出し専
用メモリであり、汎用ＦＲＯＭ等である。The memory 52 is a read-only memory that records data semi-permanently, and is a general-purpose FROM or the like.

メモリ５２には語句の音声の符号化データがあらかじめ
記録されている。演算部５１は音声出力する語句の番号
を外部から入力し、メモリ５２上の該番号に対応する符
号化データを選択して順次読み出して、リアルタイムで
復号化する演算を行う。The memory 52 has recorded in advance encoded data of speech of words. The calculation unit 51 inputs the number of a word to be outputted from the outside, selects and sequentially reads the encoded data corresponding to the number on the memory 52, and performs a decoding operation in real time.

その復号化した音声信号はＤ／Ａ変換器５４を介してア
ナログ音声信号として出力される。The decoded audio signal is outputted as an analog audio signal via the D/A converter 54.

次に、上記構成の音声認識形空調機の制御動作について
、利用者の操作とこれに対応する各部の動作を第６図な
いし第１６図を参照して説明する。Next, regarding the control operation of the voice recognition type air conditioner having the above configuration, the user's operation and the corresponding operation of each part will be explained with reference to FIGS. 6 to 16.

第６図は、空調機の運転開始から停止までの制御部４の
動作を示すフローチャート、第７図は、音声認識部３の
音声認識動作を示すフローチャート、第８図は、ヘルプ
キー５の操作による制御部４の割込処理を示すフローチ
ャート、第９図は、音声認識部３の標準音声再生の動作
を示すフローチャート、第１０図は、発声によるヘルプ
要求の場合の音声認識部３の動作を示すフローチャート
、第１１図は、音声出力の項目分類図、第１２図は、音
声出力の順序の一例を示す説明図、第１３図は。6 is a flowchart showing the operation of the control unit 4 from the start of operation to the stop of the air conditioner, FIG. 7 is a flowchart showing the voice recognition operation of the voice recognition unit 3, and FIG. 8 is the operation of the help key 5. FIG. 9 is a flowchart showing the operation of the voice recognition unit 3 for standard voice reproduction, and FIG. 10 is a flowchart showing the operation of the voice recognition unit 3 in the case of a voiced help request. FIG. 11 is an item classification diagram of audio output, FIG. 12 is an explanatory diagram showing an example of the order of audio output, and FIG. 13 is a flowchart shown.

次のヘルプ要求までに時間制限を設ける動作のフローチ
ャート、第１４図は、−室以上のりジェクト回数で認識
対象単語を自動再生する動作のフローチャート、第１５
図は、一定時間内に次の発声入力がない場合のみガイダ
ンス音声を出力する動作のフローチャート、第１６図は
、認識対象単語の音声再生中の発声入力により操作を行
うフローチャートである。FIG. 14 is a flowchart of the operation of setting a time limit until the next help request, and FIG.
The figure is a flowchart of an operation for outputting a guidance voice only when there is no next voice input within a certain period of time, and FIG. 16 is a flowchart for performing operations based on a voice input while the voice of a recognition target word is being reproduced.

まず、空調機の運転開始から停止までの間の利用者の操
作とこれに対応する制御部４の動作を第６図のフローチ
ャートを参照しながら説明する。First, the user's operations from the start to the stop of the air conditioner and the corresponding operations of the control unit 4 will be explained with reference to the flowchart in FIG. 6.

空調機の電源が投入されると、制御部４は空調機の運転
を指示する単語、例えば「冷房」、「暖房」、「送風」
、「除湿」の単語番号を音声認識部３に送信する。音声
認識部３は、これらを認識対象単語の番号として受信し
、以後利用者の発声入力があれば、その発声が上記認識
対象単語のいずれの音声であるかの認識を行う（以上５
−１）。When the air conditioner is powered on, the control unit 4 issues words that instruct the operation of the air conditioner, such as "cooling,""heating," and "ventilation."
, transmits the word number of "dehumidification" to the speech recognition unit 3. The speech recognition unit 3 receives these as the numbers of the recognition target words, and if there is a voice input from the user from now on, it recognizes which of the above recognition target words the utterances are.
-1).

利用者が、例えば「冷房」と発声すると音声認識部３は
上記認識の動作を行い、認識結果を認識単語番号として
受信する。つまり、正しく認識すれば「冷房」の単語番
号０を送信する。もし上記発声音声がいずれの認識対象
単語のものとも認識できない場合、音声認識部３は認識
不能を示すリジェクトコードを送信する。制御部４は上
記認識単語番号またはりジエクトコードを受信すると（
以上５−２）、リジェクトコードの場合は（ｓ　−３）
「もう−度お願いします６」等の利用者に再発声を促す
リジェクトメツセージの音声を出力する。つまり該リジ
ェクトメツセージの語句の番号を音声合成器７に送信し
、音声合成器７はこれを受信すると上記語句の音声を出
力する。そして再び利用者の発声を待機する（以上５−
４）　、認識単語番号の場合は（ｓ−３）、制御部４は
空調機の運転、この例では冷房運転を開始し、「２４度
、弱風で冷房します、」等のその時点での設定温度。When the user utters, for example, "air conditioning," the voice recognition unit 3 performs the above recognition operation and receives the recognition result as a recognized word number. In other words, if it is recognized correctly, the word number 0 of "air conditioner" is transmitted. If the uttered voice cannot be recognized as that of any recognition target word, the voice recognition unit 3 transmits a reject code indicating unrecognizability. When the control unit 4 receives the recognition word number or redirect code, (
Above 5-2), in case of reject code (s -3)
Outputs the voice of a rejection message that prompts the user to repeat the message, such as "Please try again 6". That is, the number of the word of the rejected message is transmitted to the speech synthesizer 7, and upon receiving this, the speech synthesizer 7 outputs the sound of the word. Then, it waits for the user to speak again (5-
4) In the case of the recognized word number (s-3), the control unit 4 starts the operation of the air conditioner, in this example, the cooling operation, and at that point, such as "24 degrees, cooling with weak wind," etc. Set temperature.

風量など、空調機の動作設定状態を利用者に知らせるメ
ツセージを音声出力する（以上５−５）　。A message informing the user of the operating settings of the air conditioner, such as the air volume, is output as voice (5-5 above).

そして動作設定状態の変更項目を指定する単語、例えば
「止まれ」、「温度」、「風」の単語番号を音声認識部
３に送信する（ｓ−６）　、利用者が、例えば設定温度
を変更するために「温度」と発声すると、音声認識部３
は該発声を正しく認識すれば「温度」の単語番号５を送
信する。制御部４は上記認識単語番号またはりジェクト
コードを受信すると（以上５−７）、リジェクトコード
の場合は（ｓ−８）、上記リジェクトメツセージの音声
を出力し、再び利用者の発声を待機する（ｓ−９）　。Then, the user transmits the word number of a word specifying a change item in the operation setting state, such as "stop", "temperature", or "wind" to the voice recognition unit 3 (s-6), and the user changes, for example, the temperature setting. When you say "temperature" to
If it correctly recognizes the utterance, it will transmit word number 5 of "temperature". When the control unit 4 receives the recognized word number or rejection code (5-7), if it is a reject code (s-8), it outputs the voice of the reject message and waits for the user to speak again. (s-9).

認識単語番号の場合で（ｓ−８）、これが「止まれ」の
場合には（ｓ−１０）、制御部は「停止します、」等の
メツセージを音声出力して（Ｓ−１１）、空調機を停止
する（ｓ−１２）、ｒ止まれ」以外の場合には、制御部
４は「温度を高くしますか、低くしますか、」等の、利
用者に設定変更の指示を促すガイダンス音声を出力する
（ｓ−１３）。If it is a recognized word number (s-8), and if it is "stop" (s-10), the control unit outputs a message such as "stop" (S-11), and the air conditioner is activated. In cases other than "Stop the machine (s-12), R Stop", the control unit 4 provides guidance prompting the user to change settings, such as "Do you want to raise or lower the temperature?" Output audio (s-13).

そして設定変更を指示する単語、例えば「高く」、「低
く」の単語番号を音声認識部３に受信する（ｓ−１４）
、利用者が、例えば設定温度を低くするために「低く」
と発声すると、音声認識部３は該発声を正しく認識すれ
ば、「低く」の単語番号８を送信する。制御部４は上記
認識単語番号またはりジェクトコードを受信すると（以
上５−１５）、リジェクトコードの場合は（ｓ−１６）
、上記リジェクトメツセージの音声を出力し、再び利用
者の発声を待機する（ｓ−１７）−１！識単語番号の場
合は（ｓ−１５）、制御部４は「２３度、弱風で冷房し
ます、」等の、設定変更後の動作設定状態を利用者に知
らせるメツセージを音声出力する（以上５−１８）、そ
して空調機の動作設定状態を変更する。この例では設定
温度を１度低くする（ｓ−１９）。Then, the speech recognition unit 3 receives the word numbers for instructing a setting change, such as "high" and "low" (s-14).
, the user can select "lower" to lower the set temperature, for example.
When the voice recognition unit 3 correctly recognizes the utterance, it transmits the word number 8 of "low". When the control unit 4 receives the recognition word number or rejection code (5-15 above), if it is a reject code, (s-16)
, outputs the audio of the reject message and waits for the user to speak again (s-17)-1! In the case of the recognition word number (s-15), the control unit 4 outputs a voice message informing the user of the operation setting status after the setting change, such as "23 degrees, cooling with weak wind." 5-18), and change the operating setting state of the air conditioner. In this example, the set temperature is lowered by 1 degree (s-19).

なお、上記でガイダンス音声出力（ｓ−１３）またはり
ジェクトメッセージ音声出力（ｓ　−１７）の後、利用
者が発声を行うまでの時間・に制限を設け、その制限時
間内利用者が発声を行わなかった場合には前の段階の発
声の認識結果（ｓ−７）。In addition, in the above, after the guidance voice output (s-13) or the reject message voice output (s-17), a time limit is set for the user to speak, and the user must speak within that time limit. If not performed, the recognition result of the previous step's utterance (s-7).

つまり「温度」等も無効として再度、その発声を待機す
る段階（ｓ−７）に戻る方式とすることもできる。In other words, it is also possible to set "temperature" etc. as invalid and return to the step (s-7) of waiting for the utterance of the "temperature".

次に音声認識部３の音声認識動作、つまり発声入力され
た音声の単語を認識する動作を第７図のフローチャート
を参照しながら説明する。Next, the speech recognition operation of the speech recognition section 3, that is, the operation of recognizing words of input speech will be explained with reference to the flowchart of FIG.

音声認識部３は利用者の発声入力を待機している。音量
の変化等により発声入力の開始を検出すると（ｔ−１）
、音声認識部３は音量の変化等をもとに発声入力の終了
を検出し、発声の開始から終了までの区間を単語音声と
して切り出す、同時に上記区間の単語音声の特徴量をリ
アルタイムで抽出し、これを第１のメモリ３２上に記録
保持する（以上ｔ−２）、次に、音声認識部３は、入力
音声つまり上記切り出した単語音声と、全認識対象単語
の標準音声との間で特徴量の比較演算を行う、つまり特
徴量どうしの相違度を計算する。上記認識対象単語は制
御部４から受信した単語番号に対応するものであり、１
同車語番号の組を受信する毎に認識対象単語をすべてこ
れらに対応するものに更新する（以上ｔ−Ｓ）、そして
相違度の最小値を選出する（ｔ−４）　、その最小値が
あらかじめ定めた所定のしきい値を超える場合は（を−
５）、上記入力音声はいずれの認識対象単語の音声でも
ないと見なして認識不能を示すリジェクトコードを制御
部４に送信する（ｔ−７）、上記最小値が上記しきい値
以下の場合は（ｔ−５）、その最小値を与える標準音声
の電話番号を認識結果として制御部４に送信する（ｔ−
６）。The voice recognition unit 3 is waiting for the user's voice input. When the start of vocal input is detected due to a change in volume, etc. (t-1)
, the speech recognition unit 3 detects the end of the speech input based on changes in volume, etc., cuts out the section from the start to the end of the speech as word speech, and at the same time extracts the feature amount of the word speech in the said section in real time. , this is recorded and held on the first memory 32 (t-2 above).Next, the speech recognition unit 3 performs the following operations: A comparison operation is performed on the feature quantities, that is, the degree of difference between the feature quantities is calculated. The recognition target word corresponds to the word number received from the control unit 4, and is 1
Every time a set of same vehicle language numbers is received, all recognition target words are updated to those corresponding to these (t-S), and the minimum value of the degree of dissimilarity is selected (t-4), and the minimum value is If the predetermined threshold value is exceeded, (-
5), consider that the input speech is not the speech of any recognition target word, and send a reject code indicating unrecognizability to the control unit 4 (t-7); if the minimum value is less than the threshold; (t-5), and transmits the standard voice phone number that gives the minimum value to the control unit 4 as a recognition result (t-5).
6).

次に、利用者が認識対象単語や動作設定状態等を知りた
くて、これらの音声再生、音声出力を要求してヘルプキ
ー５を操作した場合の制御部４の動作を、第８図のフロ
ーチャートを参照しながら説明する。Next, the flowchart in FIG. 8 shows the operation of the control unit 4 when the user operates the help key 5 to request audio playback and audio output in order to know the recognition target words, operation setting status, etc. This will be explained with reference to.

利用者が任意の時点で上記要求ができるようにするため
、第８図に示す制御部４の動作は割込処理とする。つま
り、第６図に示す動作中に割込処理を行う、利用者がヘ
ルプキー５の操作を行うことにより、第１のエンコーダ
６を介して制御部４の割込端子４６に割込信号が入力さ
れる。In order to enable the user to make the above request at any time, the operation of the control section 4 shown in FIG. 8 is performed as an interrupt process. That is, when the user operates the help key 5 to perform interrupt processing during the operation shown in FIG. is input.

制御部４は該割込信号を検出すると、音声認識部３およ
び音声合成器７に、これらが音声再生。When the control unit 4 detects the interrupt signal, the control unit 4 sends the voice recognition unit 3 and the voice synthesizer 7 to reproduce the voice.

音声出力中であればこれを中止させる再生中止コマンド
を送信する。なお、音声認識部３の音声再生動作につい
ては後で説明する（以上ｕ−１）。If audio output is in progress, a playback stop command is sent to stop it. Note that the voice reproduction operation of the voice recognition unit 3 will be explained later (u-1 above).

そして制御部４は、認識対象単語の標準音声を再生する
場合には音声認識部３に標準音声再生コマンドを送信し
、動作設定状態等の音声出力をする場合には該動作設定
状態等を表現する語句の番号を音声合成器７に送信する
。音声認識部３は該再生コマンドを受信すると、その時
点での認識対象単語の標準音声を順次再生する。また、
設定温度、風量等の動作設定状態を「温度２４度、弱風
に設定されています。」等のように音声出力する場合に
は、例えば「温度」、「２４度」、「弱風」、「冷房に
」、「設定されています、」の各語句の番号を順に音声
合成器７に送信する。これにより音声合成器７は上記語
句の音声を順次出力する。Then, the control unit 4 transmits a standard voice reproduction command to the voice recognition unit 3 when reproducing the standard voice of the word to be recognized, and when outputting the voice of the operation setting state etc., expresses the operation setting state etc. The number of the word to be used is sent to the speech synthesizer 7. When the speech recognition unit 3 receives the reproduction command, it sequentially reproduces the standard speech of the word to be recognized at that time. Also,
If you want to output the operation settings such as set temperature, air volume, etc., such as "Temperature is set to 24 degrees, low wind.", for example, "temperature", "24 degrees", "low wind", The numbers of the words "cooling" and "set" are sequentially transmitted to the speech synthesizer 7. Thereby, the speech synthesizer 7 sequentially outputs the speech of the above-mentioned words.

上記のように音声出力する情報は空調機の動作設定状態
の他に、現在の気温、湿度等の環境情報、直前のメツセ
ージ、ガイダンス音声を聞き返したい場合のためのこれ
らの繰り返し出力等が考えられる（以上ｕ−２）、そし
て、ヘルプ回数カウンタを増加する。ただし、所定数に
達した場合はリセットする。つまり、上記の認識対象単
語、動作設定状態、環境情報、直前のメツセージ、ガイ
ダンスの各項目に順序を定め、利用者が一回ヘルプキー
５を操作する毎に該順序に従って一項目づつ音声再生、
音声出力を行う、つまり、上記ヘルプ回数カウンタは該
項目のカウンタである。したがって、全項目の音声再生
、出力を終了すれば該ヘルプ回数カウンタをリセットす
る（以上ｕ−３）。As mentioned above, in addition to the operating settings of the air conditioner, the information to be output audibly may include environmental information such as current temperature and humidity, previous messages, and repeated outputs of these in case you want to listen back to the guidance voice. (U-2), and the help count counter is increased. However, if the predetermined number is reached, it will be reset. In other words, an order is set for each item of the above-mentioned recognition target word, operation setting state, environment information, last message, and guidance, and each time the user operates the help key 5, the audio is played back one item at a time according to the order.
A voice output is performed, that is, the above-mentioned help number counter is a counter for the item. Therefore, when the audio reproduction and output of all items is completed, the help number counter is reset (u-3).

以上で制御部４は、利用者のヘルプキー５操作に対する
割込処理を終了する。With this, the control unit 4 ends the interrupt processing in response to the user's operation of the help key 5.

なお、−回のヘルプキー５の操作につき上記複数の項目
の音声の再生、出力を連続して行ってもよいことはもち
ろんである。It goes without saying that the audio of the plurality of items described above may be continuously played back and output for each operation of the help key 5 - times.

次に、音声認識部３の認識対象単語の標準音声の再生動
作を、第９図のフローチャートを参照しながら説明する
。Next, the reproduction operation of the standard speech of the word to be recognized by the speech recognition section 3 will be explained with reference to the flowchart of FIG.

音声認識部３は、標準音声の再生コマンドを制御部４か
ら受信すると、その時点での認識対象単語の番号をもと
に、第２のメモリ３３上の該番号の単語の標準音声のデ
ータが記録されているアドレスを検知する（ｖ−１）、
そして第２のメモリ３３上の該アドレスのデータを順次
読み出し、必要によりリアルタイムで復号化して（ｖ−
２）、入出力部３４のＤ／Ａ変換器によりアナログ音声
信号として増幅器８に出力し、スピーカ９から音声が出
力される（ｖ−３）、上記（ｖ−２）、（ｖ−３）の動
作を繰り返し１つの単語の標準音声の再生を終わると（
ｖ−５）、次の単語の標準音声の再生を行う、全認識対
象単語の標準音声を再生し終わるまで上記（ｖ−２）か
ら（ｖ−５）までの動作を繰り返す（ｖ−６）、もし標
準音声の再生中に、利用者がヘルプキー５の操作を行っ
た場合には、制御部４から再生中止コマンドが送信され
、音声認識部３はこれを受信すると上記標準音声の再生
を中止する（ｖ−４）。When the speech recognition section 3 receives a standard speech reproduction command from the control section 4, based on the number of the word to be recognized at that time, the speech recognition section 3 reads the data of the standard speech of the word with the number in the second memory 33. Detecting the recorded address (v-1),
Then, the data at the address on the second memory 33 is sequentially read out, and if necessary, decoded in real time (v-
2), The D/A converter of the input/output unit 34 outputs it as an analog audio signal to the amplifier 8, and the audio is output from the speaker 9 (v-3), (v-2), (v-3) above. After repeating the operation and finishing playing the standard voice of one word (
v-5), play the standard voice of the next word; repeat the operations from (v-2) to (v-5) until the standard voice of all words to be recognized has been played (v-6); If the user operates the help key 5 while the standard voice is being played back, a playback stop command is sent from the control unit 4, and when the voice recognition unit 3 receives this command, it starts playing the standard voice. Abort (v-4).

なお、上記標準音声の音声データは通常、その標準音声
の特徴量とは別に、Ａｔ）ＰＣＭ　（適応差分符号化）
データ等の音声再生に適した形で第２のメモリ３３上に
記録しておく、場合によっては標準音声の特徴量から再
生用の音声データを算出してもよい。Note that the audio data of the above-mentioned standard voice is usually processed using At) PCM (adaptive differential coding) in addition to the feature amount of the standard voice.
The audio data for reproduction may be calculated from the characteristic amount of standard audio recorded on the second memory 33 in a form suitable for audio reproduction, such as data.

以上の標準音声の再生動作は利用者の任意の要求時点で
行うため、音声認識部３の割込処理とする。Since the standard voice reproduction operation described above is performed at any time requested by the user, it is an interrupt process of the voice recognition unit 3.

次に、利用者がヘルプキー５の操作の代わりに、「ヘル
プ」等の特定の単語の発声入力によって標準音声の再生
や動作設定状態等の音声出力を要求する例における音声
認識部３の動作を、第７図。Next, the operation of the voice recognition unit 3 in an example in which the user requests the reproduction of a standard voice or the voice output of an operation setting state, etc. by inputting a specific word such as "help" instead of operating the help key 5. , Figure 7.

第１０図のフローチャートを参照しながら説明する。This will be explained with reference to the flowchart in FIG.

第１０図には、第７図からの変更点のみ記す。In FIG. 10, only changes from FIG. 7 are shown.

ここでは上記特定の単語を「ヘルプ」とする例について
説明する。Here, an example in which the above specific word is "help" will be explained.

音声認識部３は、制御部４から送信される認識対象単語
の他に、通常ｒヘルプ」を認識対象単語とする。利用者
がｒヘルプ」と発声すると、音声認識部３は該発声を入
力して、音声認識の動作を行う（第７図ｔ−１からｔ−
５まで）、その発声音声と「ヘルプ」の標準音声との相
違度が最小であり、つまり該発声音声の認識結果が「ヘ
ルプ」であれば（ｔ−８）、音声認識部３は制御部４の
割込端子４６に割り込み信号を送信する。これにより制
御部４は第８図に示す、ヘルプボタン５の操作が行われ
たのと同じ動作を行う（ｔ−９）。In addition to the recognition target word transmitted from the control unit 4, the speech recognition unit 3 uses "normal r help" as a recognition target word. When the user utters "r help", the voice recognition unit 3 inputs the utterance and performs voice recognition operation (from t-1 to t- in Figure 7).
5), if the degree of difference between the uttered voice and the standard voice of "help" is the minimum, that is, if the recognition result of the uttered voice is "help" (t-8), the voice recognition unit 3 An interrupt signal is sent to the interrupt terminal 46 of No. 4. As a result, the control section 4 performs the same operation as when the help button 5 is operated, as shown in FIG. 8 (t-9).

上記認識結果が「ヘルプ」以外であれば（ｔ−８）、音
声認識部３は制御部４の通信端子４５に該認識結果を示
す単語番号を送信する（ｔ−６）　。If the recognition result is other than "help" (t-8), the voice recognition unit 3 transmits a word number indicating the recognition result to the communication terminal 45 of the control unit 4 (t-6).

ここで、利用者が音声再生、音声出力を要求する項目の
分類の一例を第１１図に示す。Here, FIG. 11 shows an example of the classification of items for which the user requests audio playback and audio output.

「■認識対象単語」は利用者がその時点で発声入力でき
る単語であり、音声認識部３は利用者の発声音声と該単
語の標準音声とを比較して該発声された単語を認識する
。"■ Recognition target word" is a word that the user can input aloud at that time, and the speech recognition unit 3 compares the user's uttered voice with the standard voice of the word and recognizes the uttered word.

「■動作設定状態」は、その時点での空調機の設定温度
、風量、および冷房、暖房等の運転状態等の情報である
。"■ Operation setting state" is information such as the set temperature of the air conditioner, the air volume, and the operating state of cooling, heating, etc. at that time.

「■環境情報」は、室内外の気温、湿度等の情報である
。上記■、■は利用者が空調機の運転。"■Environmental information" is information such as indoor and outdoor temperature and humidity. For ■ and ■ above, the user operates the air conditioner.

停止を行ったり、運転中に温度、風量等の設定を変更す
る際の参考にする。Use this as a reference when stopping or changing settings such as temperature and air volume during operation.

「■直前メツセージ」は利用者の発声入力による操作に
応じて制御部４が出力する音声であり。"■ Last minute message" is a voice output by the control unit 4 in response to the user's voice input operation.

設定変更後の動作設定状態を表現する音声や、利用者の
発声を促すガイダンス音声等である。これは利用者が聞
き漏らし等で再度間きたい場合などに要求する。These include audio that expresses the operational setting state after the settings have been changed, and guidance audio that prompts the user to speak. This is requested when the user misses something and wants to pause again.

利用者にとってどの項目が優先的に必要な情報であるか
は、運転中、停止中等の空調機の動作状態、リジェクト
（認識不能）後、ガイダンス音声出力後など利用者が次
に行うべき操作（発声入力）により異なる。そこで、空
調機の動作状態や利用者が行うべき操作に応じて各項目
の音声再生、音声出力の順序を適切に設定する。その−
例を第１２図に示して説明する。Which information is prioritized for the user depends on the operational status of the air conditioner, such as when it is running or stopped, and what operations the user should perform next, such as after rejecting (unrecognizable) or after outputting guidance voice. Varies depending on voice input). Therefore, the order of audio playback and audio output of each item is appropriately set depending on the operating state of the air conditioner and the operations to be performed by the user. That-
An example will be explained with reference to FIG.

空調機が電源投入後で停止中のときは、利用者が空調機
の運転を開始するために必要な情報は必要性の順に■認
識対象単語、つまり「冷房」等の運転を指示する単語、
■環境情報、つまり現在の気温等である。■動作設定状
態は「停止中です、」等の表現になる。よって音声再生
、音声出力の順序は■、■、■の順とする。したがって
、例えば利用者がヘルプキー５を操作すると制御部４は
最初に■認識対象単語の音声再生を行い、その後−定時
間内に再びヘルプキー５の操作が行われると。When the air conditioner is stopped after the power is turned on, the information necessary for the user to start operating the air conditioner is sorted in order of necessity ■ Recognition target words, words that instruct operation such as "cooling";
■Environmental information, such as current temperature. ■The operation setting status is expressed as "stopped." Therefore, the order of audio playback and audio output is ■, ■, ■. Therefore, for example, when the user operates the help key 5, the control section 4 first performs (1) audio reproduction of the recognition target word, and then - when the help key 5 is operated again within a fixed period of time.

制御部４は■環境情報の音声出力を行う、その−定時間
内にヘルプキー５の操作が行われず、その後再び該操作
が行われた場合には、制御部４は改めて■の音声再生か
ら始める。The control unit 4 performs audio output of environmental information. If the help key 5 is not operated within a certain period of time and the operation is performed again after that, the control unit 4 starts again from the audio reproduction of ■. start.

利用者が「冷房」等の運転を指示する単語を発声入力し
、これが正しく認識されれば、制御部４は空調機の運転
を開始する。その発声がリジェクト（認識不能）となっ
た場合には利用者が誤った単語を発声しているか１発声
の仕方が標準音声と大きく異なっている場合もあり、必
要な情報は認識対象単語の標準音声であるので、音声再
生の項目は■認識対象単語のみとする。なお、リジェク
トとなってから利用者が発声入力をし直すまでに制限時
間を設けている場合には該時間内のみ上記■のみの再生
とする。以後のりジエクトの場合についてはすべて上記
と同様する。The user vocally inputs a word instructing the air conditioner to operate, such as "cooling", and if this is correctly recognized, the control unit 4 starts operating the air conditioner. If the utterance is rejected (unrecognizable), the user may have uttered the wrong word, or the utterance may be significantly different from the standard voice, so the necessary information may be based on the standard of the word to be recognized. Since it is audio, the audio playback items are only ■words to be recognized. Note that if a time limit is set between when the user inputs the voice again after the rejection occurs, only the above item (2) will be played within the time limit. All subsequent cases of redirection are the same as above.

空調機の運転開始時には、その時点での設定温度、風量
等を知らせるメツセージが出力される。When the air conditioner starts operating, it outputs a message informing it of the set temperature, air volume, etc. at that time.

よって運転開始直後は利用者が該メツセージを聞き返し
たい場合があり、また、一定時間内は空調機の設定温度
、風量等の動作設定状態の変更が不可であり、利用者が
動作設定状態を変更する発声入力を行っても無効となる
。よって、該一定時間内は音声出力の項目は■直前のメ
ツセージ、つまり動作設定状態を知らせるメツセージの
みとする。Therefore, the user may want to hear the message back immediately after the start of operation, and the operating settings such as the air conditioner's temperature and air volume cannot be changed within a certain period of time, so the user may not be able to change the operating settings. Even if you make a voice input to do so, it will be invalid. Therefore, during the certain period of time, the audio output items are only the message immediately before (1), that is, the message informing the operation setting state.

上記一定時間が経過し空調機の動作設定状態の変更が可
となった後は、利用者が該変更を行うのに必要な情報は
必要性の順に■認識対象単語、■動作設定状態、■環境
情報であり、音声再生、音声出力の順序も上記のように
する。After the above-mentioned certain period of time has passed and it is possible to change the air conditioner's operating settings, the information necessary for the user to make the change is in the order of necessity: ■ Word to be recognized, ■ Operating settings, ■ This is environmental information, and the order of audio playback and audio output is also as described above.

利用者が「温度」等の動作設定状態の変更項目の単語を
発声しこれが正しく認識されれば、「温度を高くします
か、低くしますか、」等の変更の指示を促すガイダンス
音声が出力される。この時点で利用者が最も必要な情報
は、現在の設定温度等の動作設定状態であるので、音声
再生、音声出力の順序は■動作設定状態、■認識対象単
語、■環境情報の順とする。When the user speaks the words to change the operating settings, such as "temperature," and the words are recognized correctly, a guidance voice prompts the user to change the temperature, such as "do you want to raise or lower the temperature?" Output. The information that the user most needs at this point is the operating settings such as the current temperature setting, so the order of audio playback and audio output should be: ■ Operating settings, ■ Words to be recognized, and ■ Environmental information. .

利用者が「低く」等の変更の指示の発声入力を行いこれ
が正しく！！識されると、空調機の動作設定状態が変更
され、変更後の動作設定状態を知らせるメツセージが出
力される。この時点では運転開始直後と全く同様、動作
設定状態の変更が不可の期間中は音声出力の項目は■直
前のメツセージのみとし、該期間の経過後は■認識対象
単語、■動作設定状態、■環境情報の順とする。The user vocalizes the instruction to change things, such as "lower", and this is correct! ! When the air conditioner is recognized, the operating setting state of the air conditioner is changed, and a message is output to notify the changed operating setting state. At this point, just like immediately after the start of operation, during the period when the operation setting state cannot be changed, the audio output item is only ■the previous message, and after the period has passed, ■recognition target word, ■operation setting state, The order will be environmental information.

利用者が空調機の停止を指示する発声を行った場合にも
上記と同様に、運転再開不可の期間中は■空調機の停止
を知らせる直前のメツセージのみとし、該期間の経過後
は停止中であるので■認識対象単語、■環境情報、■動
作設定状態の順とする。In the same way as above, even if the user gives an instruction to stop the air conditioner, during the period when operation cannot be restarted, only the message immediately before informing the user that the air conditioner has stopped will be sent, and after the period has passed, the air conditioner will be stopped. Therefore, the order is ■word to be recognized, ■environmental information, and ■operation setting state.

次に、１回のヘルプキー５の操作から次の該操作までの
間に時間制限を設ける例における制御部４の動作を、第
１３図のフローチャートを参照しながら説明する。Next, the operation of the control unit 4 in an example in which a time limit is set between one operation of the help key 5 and the next operation will be explained with reference to the flowchart of FIG. 13.

利用者が１回のヘルプキー５の操作から一定時間内に次
のヘルプキー５の操作を行った場合には、次の順序の項
目の音声再生、音声出力を行い、その一定時間の経過後
改めてヘルプキー５の操作を行った場合には最初の項目
の音声再生、音声出力から始める。If the user operates the next help key 5 within a certain period of time after one operation of the help key 5, the next order of items will be played and output as audio, and after the specified period of time has elapsed. When the help key 5 is operated again, audio reproduction and audio output of the first item are started.

利用者がヘルプキー５の操作を行うと、制御部４は上記
一定時間を測定するタイマをスタートさせる（ｕ−６）
、そして１回の音声再生、音声出力を行い（ｕ−２）、
ヘルプ回数カウンタを増加する（ｕ−３）。When the user operates the help key 5, the control unit 4 starts a timer that measures the above-mentioned fixed time (u-6).
, and performs one audio playback and audio output (u-2),
Increment the help number counter (u-3).

次に、利用者がヘルプキー５の操作を行うと制御部４は
上記タイマをチエツクし、上記一定時間が経過していれ
ばヘルプ回数カウンタをリセットする。よって最初の項
目の音声再生、音声出力が行われる。その一定時間が経
過していなければヘルプ回数カウンタは前回増加したま
まの状態であるから、次の項目の音声再生、音声出力が
行われる。このようにすれば利用者が前回のヘルプキー
５の操作の直後以外にヘルプキー５を操作すれば常に必
要性の高い順に音声再生、音声出力を得ることができ、
途中の項目から得ることが無い。そして現在行われてい
る音声再生、音声出力が不要な場合はヘルプキー５の操
作によりこれを途中で打ち切り、次の項目の音声再生、
音声出力を得ることができる。Next, when the user operates the help key 5, the control section 4 checks the timer and resets the help number counter if the predetermined time has elapsed. Therefore, the audio reproduction and audio output of the first item are performed. If the certain period of time has not elapsed, the help count counter remains incremented from the previous time, and the next item is played back and output. In this way, if the user operates the help key 5 other than immediately after the previous operation of the help key 5, the user can always obtain audio playback and audio output in the order of necessity.
There is nothing to be gained from intermediate items. If the current audio playback or audio output is not required, press the help key 5 to abort the audio playback or audio output for the next item.
You can get audio output.

なお、ヘルプキー５を一定時間以上押し続ける、一定の
短い時間内に複数回連続して操作する等の特定の操作を
定め、これによって現在行われている音声再生、音声出
力を中止し、次以降の項目の音声再生、音声出力もすべ
て中止することもできる。つまり該特定の操作により、
制御部４は再生中止コマンドのみを音声認識部３．音声
合成器７に送信し、次以降の項目に関する標準音声再生
コマンドまたは語句の番号を音声認識部３または音声合
成器７に送信しない。In addition, specific operations such as holding down the help key 5 for more than a certain period of time or operating it multiple times in a row within a certain short period of time are specified, and the current audio playback or audio output is stopped and the next operation is performed. You can also cancel all audio playback and audio output for subsequent items. In other words, by the specific operation,
The control unit 4 sends only the playback stop command to the voice recognition unit 3. Standard audio reproduction commands or phrase numbers regarding the next and subsequent items are not transmitted to the speech recognition unit 3 or the speech synthesizer 7.

また、ヘルプキー５の操作の代わりに「ヘルプ」等の特
定の語句の発声入力により音声認識部３が認識対象単語
の音声再生を行う場合には、該音声再生中に上記特定の
単語の発声入力を行っても、音声認識部３は認識動作を
行っていないので該特定の単語は認識されない、したが
って、この場合には音声認識部３に標準音声再生の終了
信号を制御部４に送信する機能を設け、制御部４は該終
了信号を受信してから上記一定時間内に該特定の単語の
発声入力が行われた場合には、次の項目の音声出力を行
う。In addition, when the speech recognition unit 3 plays back the recognition target word by vocalizing a specific word such as "help" instead of operating the help key 5, the specific word is uttered during the audio playback. Even if an input is made, the specific word is not recognized because the speech recognition section 3 is not performing a recognition operation. Therefore, in this case, the speech recognition section 3 sends a standard speech reproduction end signal to the control section 4. The control section 4 outputs the next item as a voice if the specific word is voiced within the predetermined time after receiving the end signal.

次に、利用者の発声音声が一定回数以上リジエクト（認
識不能）となった場合に自動的に認識対象単語の標準音
声の再生を行う例における制御部４の動作を、第６図、
第１４図のフローチャートを参照しながら説明する。第
１４図には、第６図からの変更点のみを記す。Next, FIG. 6 shows the operation of the control unit 4 in an example in which the standard voice of the word to be recognized is automatically played back when the voice uttered by the user is rejected (unrecognizable) more than a certain number of times.
This will be explained with reference to the flowchart shown in FIG. FIG. 14 shows only the changes from FIG. 6.

利用者の発声音声がリジェクトとなると、制御部４はリ
ジェクトカウンタを増加する（ｓ−２０）。When the voice uttered by the user is rejected, the control unit 4 increases the reject counter (s-20).

リジェクトカウンタがあらかじめ定めた一定値を超える
と（ｓ−２１）、制御部４は音声認識部３に標準音声再
生コマンドを送信し、認識対象単語の標準音声が自動的
に再生される（ｓ−２２）。When the reject counter exceeds a predetermined value (s-21), the control unit 4 sends a standard voice playback command to the speech recognition unit 3, and the standard voice of the recognition target word is automatically played back (s-21). 22).

この後、制御部４は上記リジェクトカウンタをリセット
する（ｓ−２３）、利用者の発声音声が正しく認識され
た場合は制御部４は上記リジェクトカウンタをリセット
しく５−２４）、以後の動作を行う、このようにすれば
利用者の発声音声が何回もリジェクトとなった場合に、
利用者に特別な操作をさせずに、発声入力可能な単語を
正しい発声方法で音声利用者に知らせることができる。After that, the control unit 4 resets the above-mentioned reject counter (s-23). If the user's uttered voice is correctly recognized, the control unit 4 resets the above-mentioned reject counter 5-24), and performs the subsequent operation. In this way, if the user's voice is rejected many times,
To inform a voice user of words that can be input by voice using the correct voice method without requiring the user to perform any special operations.

また、第１１図の■から■までの項目別、あるいは全項
目の音声再生、音声出力の頻度のカウンタを設け、つま
りカウンタとタイマとを連動させて一定期間あたりの回
数を数え、その頻度が一定値を超えた場合１項目別ある
いは全項目の音声出力する情報をより詳細な内容のもの
に自動的に切り替え、逆に該頻度が一定値以下となった
場合には上記情報をより簡略な内容のものに自動的に切
り替えることもできる。In addition, a counter is provided for the frequency of audio playback and audio output for each item or all items from ■ to ■ in Figure 11. In other words, the counter and timer are linked to count the number of times per a certain period of time, and the frequency can be calculated. If the frequency exceeds a certain value, the audio output information for each item or all items will be automatically switched to more detailed content, and conversely, if the frequency falls below a certain value, the above information will be changed to a simpler one. You can also automatically switch to the content.

例えば、■認識対象単語の標準音声については、ヘルプ
キー５の操作頻度が一定値を越えればｒ止めたいときは
止まれ、温度を変えたいときは温度、風の強さを変えた
いときは風と言って下さい、」など、発声方法を詳しく
説明するものに自動的に切り替え、ヘルプキー５の操作
頻度が上記一定値以下となれば「止まれ、温度、風」等
の簡単なものに自動的に切り替える。このようにすれば
利用者が操作方法に不慣れでヘルプキー５の操作が頻繁
な場合には、よりわかりやすい情報を利用者に提供する
ことができ、利用者が操作方法に慣れてきてヘルプキー
５の操作頻度が少なくなった場合には、不必要な情報を
利用者に聞かせることがない。For example, ■ For the standard voice of the word to be recognized, if the frequency of operation of the help key 5 exceeds a certain value, it will stop if you want to stop it, it will turn off if you want to change the temperature, it will turn off the wind when you want to change the strength of the wind, etc. If the frequency of operation of the help key 5 falls below the above-mentioned certain value, the voice will automatically change to a simple phrase such as ``Stop, temperature, wind.'' Switch. In this way, if the user is unfamiliar with the operation method and frequently presses the help key 5, more easily understandable information can be provided to the user. When the frequency of operations decreases, unnecessary information is not asked to the user.

また、上記１項目の音声再生、音声出力開始直後に利用
者が該音声再生、音声出力を不必要として中止し、次の
項目の音声再生、音声出力を要求してヘルプキー５の操
作を行うことが゛頻繁に生じた場合、上記中止された項
目を必要性が高くない項目として自動的に音声再生、音
声出力の順序を後退せさることができる。つまり、上記
中止の頻度に応じて各項目の音声再生、音声出力の順序
を自動的に入れ替えることができる。これは１項目の音
声再生、音声出力開始からの経過時間を測定する前記タ
イマを利用し、その経過時間が一定値以下の場合の頻度
が一定値を越えた場合、自動的に該項目の音声再生、音
声出力の順序を後退させる。In addition, immediately after the start of the audio playback or audio output for the above one item, the user cancels the audio playback or audio output as unnecessary, requests the audio playback or audio output for the next item, and operates the help key 5. If this occurs frequently, the canceled item can be automatically moved back in the order of audio playback and audio output as an item that is not highly necessary. In other words, the order of audio reproduction and audio output of each item can be automatically changed depending on the frequency of the above-mentioned cancellation. This uses the above-mentioned timer that measures the elapsed time from the start of audio playback and audio output for one item, and if the elapsed time is less than a certain value and the frequency exceeds a certain value, the audio of the item is automatically Reverses the order of playback and audio output.

次に設定温度を変更する場合のように利用者の２段階以
上の発声入力が必要な場合に、前段階の発声入力から一
定時間内に次段階の発声入力が行われない場合のみ利用
者の発声入力が促すガイダンス音声を出力する例におけ
る制御部４の動作を、第６図、第１５図のフローチャー
トを参照しながら説明する。第１５図には、第６図から
の変更点のみ記す。Next, when the user's voice input is required in two or more stages, such as when changing the temperature setting, the user's voice input is only required if the voice input of the next stage is not performed within a certain period of time after the voice input of the previous stage. The operation of the control unit 4 in an example in which a guidance voice prompted by voice input is output will be described with reference to flowcharts shown in FIGS. 6 and 15. In FIG. 15, only changes from FIG. 6 are shown.

例えば利用者が設定温度を変更する場合、慣れた利用者
は「温度」、「低く」と続けて発声し。For example, when a user wants to change the temperature setting, an experienced user will say "temperature" and "lower" in succession.

慣れない利用者は「温度」と発声して「温度を高くしま
すか、低くしますか、」等のガイダンス音声を聞いてか
ら「低く」と発声する方式である。Unaccustomed users should say ``temperature'' and listen to a voice guidance such as ``Do you want to raise or lower the temperature?'' and then say ``lower.''

利用者の「温度」または「風」の発声音声が正しく認識
されると（ｓ−１０）、制御部４は音声認識部３に「高
く」、「低く」等の設定変更の指示をする単語の番号を
送信する。音声認識部３は、これらの単語を認識対象単
語とする。利用者が一定時間内に発声を行えば（ｓ−２
６，５−２７）、その発声音声が正しく認識された場合
は（Ｓ−２８）、制御部４は変更後の動作設定状態を知
らせるメツセージの音声を出力して動作設定状態を変更
する（ｓ−１８）、発声音声がリジェクトとなった場合
は（ｓ−２８）＋制御部４は利用者に設定変更の指示を
促すガイダンス音声を出力する（ｓ−１３）。When the user's uttered voice of "temperature" or "wind" is correctly recognized (s-10), the control unit 4 instructs the voice recognition unit 3 to change the settings such as "high" or "low". Send your number. The speech recognition unit 3 uses these words as recognition target words. If the user speaks within a certain period of time (s-2
6, 5-27), if the uttered sound is correctly recognized (S-28), the control unit 4 outputs a message sound informing the changed operation setting state and changes the operation setting state (S-28). -18) If the uttered voice is rejected (s-28), the control unit 4 outputs a guidance voice prompting the user to change the settings (s-13).

利用者が上記一定時間内に設定変更指示の発声を行わな
ければ（ｓ−２７）、制御部４は該指示の発声を促すガ
イダンス音声を出力する（Ｓ−１３）０例えば利用者が
「温度」から少し間をおいて「低く」と発声し、いずれ
も正しく認識された場合は設定温度が低くなり、「温度
」は正しく認識されたが「低く」はりジェクトとなった
場合は、「温度を高くしますか、低くしますか、」のガ
イダンス音声が出力される。If the user does not utter a setting change instruction within the above-mentioned fixed time (s-27), the control unit 4 outputs a guidance voice prompting the user to utter the instruction (S-13)0 For example, if the user ", then say "low" after a short pause. If both are correctly recognized, the set temperature will be lowered. If "temperature" is correctly recognized but "low" is ejected, "temperature" will be "Do you want to raise it higher or lower it?" guidance voice is output.

次に、認識対象単語の音声再生中の利用者の発声により
空調機の操作を行う例における音声認識部３の動作を、
第９図、第１６図のフローチャートを参照しながら説明
する。第１６図には、第９図からの変更点のみ記す。Next, the operation of the voice recognition unit 3 in an example in which the air conditioner is operated by the user's utterance while the voice of the recognition target word is being played back is as follows.
This will be explained with reference to the flowcharts of FIGS. 9 and 16. In FIG. 16, only the changes from FIG. 9 are shown.

利用者がヘルプキー５を操作し、音声認識部３が認識対
象単語の音声再生を行っている際に、利用者が目的の単
語の音声再生の直後から次の単語の音声再生終了までの
間に何らかの発声を行えば、音声認識部３が発声入力待
ちの状態で該目的の単語が発声入力されたと同じ動作を
行う。例えば音声認識部３が利用者の要求により「止ま
れ」、「温度」、・・・のように認識対象単語の音声再
生を行っていて、利用者が「止まれ」の音声再生直後か
ら次の「温度Ｊの音声再生終了までの間に「アラ」等の
何らかの発声を行えば、音声！！諏部３は以後の音声再
生を中止し、発声入力待ちの状態を「止まれ」の発声が
行われたのと同じ動作を行う、つまり、「止まれ」の単
語番号を制御部４に送信し、制御部４は空調機を停止す
る。When the user operates the help key 5 and the voice recognition unit 3 is playing back the voice of the word to be recognized, the period from immediately after the user plays the voice of the target word until the end of the voice playback of the next word. If a word is uttered, the speech recognition unit 3 performs the same operation as if the target word had been uttered while waiting for the utterance input. For example, if the speech recognition unit 3 is playing back words to be recognized such as "stop", "temperature", etc. at the user's request, the user may start the next word "stop" immediately after playing the words "stop". If you say something like "Ara" until the end of the audio playback of Temperature J, you can hear the sound! ! The Sube 3 stops the subsequent audio playback, and performs the same operation as when the utterance of "stop" is performed in the state of waiting for voice input, that is, sends the word number of "stop" to the control unit 4, The control unit 4 stops the air conditioner.

音声認識部３が１単語の音声再生中に利用者が何らかの
発声を行えば（ｖ−７）、音声認識部３は該単語および
それ以降の単語の音声再生を中止し、したがって、最初
の単語音声再生に利用者が発声を行った場合にはこの動
作は行わない（以上ｖ−８）、１単語の音声再生終了後
から次の単語の音声再生開始までの間に利用者が何らか
の発声を行えば（ｖ−９からｖ−１１まで）、音声認識
部３は以後の音声再生を中止し、直前に音声再生した単
語の番号を制御部４の通信端子４５に送信する（ｖ−１
２）、このようにすれば、利用者は目的の単語を知った
時点で空調機の操作を行うことができ、全単語の音声再
生が終わるのを待って改めて単語の発声入力を行う必要
がない。If the user makes any utterance while the voice recognition unit 3 is reproducing the voice of one word (v-7), the voice recognition unit 3 stops the voice reproduction of that word and the following words, and therefore the first word This operation is not performed if the user utters a voice during audio playback (see v-8 above). If this is done (from v-9 to v-11), the speech recognition unit 3 stops the subsequent audio playback and transmits the number of the word that was just played back to the communication terminal 45 of the control unit 4 (v-1
2) In this way, the user can operate the air conditioner as soon as he or she knows the desired word, and does not have to wait until the audio playback of all the words has finished before inputting the word aloud again. do not have.

なお、上記の、「ヘルプ」等特定の単語の発声入力によ
り認識対象単語の標準音声再生を行う例においで、その
特定の単語を含めて複数の話者の標準音声が登録されて
いる場合には、その特定の単語の発声入力があれば、発
声音声と最も相違度の小さい標準音声を与える話者の標
準音声の再生を行う。In addition, in the above example of playing back the standard voice of the recognition target word by inputting a specific word such as "help", if the standard voice of multiple speakers including that specific word is registered. If the utterance input of the specific word is input, the speaker reproduces the standard voice of the speaker who gives the standard voice with the least difference from the uttered voice.

例えば、認識対象単語の標準音声が話者Ａ、Ｂ。For example, the standard voices of the word to be recognized are speakers A and B.

Ｃについて登録されている場合１話者りが「ヘルプ」等
と上記特定の単語を発声入力し、その発声音声と最も相
違度が小さい標準音声が話者Ａの特定単語の標準音声で
あれば、音声認識部３は話者Ａについて登録した認識対
象単語の標準音声の再生を行う、このようにすれば利用
者は自分の発声に最も近い標準音声を知ることができる
。If C is registered, if one speaker inputs the above specific word such as "help" and the standard voice with the smallest difference from that uttered voice is the standard voice of speaker A's specific word. , the speech recognition unit 3 plays back the standard speech of the recognition target word registered for speaker A. In this way, the user can know the standard speech that is closest to his or her own utterance.

［発明の効果］以上詳細に説明したように、本発明によれば、音声認識
制御装置において利用者の特定の操作または特定の単語
の発声入力により、認識対象語句の標準音声の再生や、
装置の動作状態等を表現する音声の出力の動作を割込処
理により行なっているので、利用者は任意の時点で認識
対象語句の音声再生や、装置の動作状態等を表現する音
声の出力を要求することができる。したがって、利用者
は発声入力できる命令語や、運転操作に必要あるいは参
考となる情報が不明な場合に、これを視覚によらず音声
により知ることができ、特に盲人の使用あるいは暗がり
での使用に便利な音声認識制御装置を提供することがで
きる。[Effects of the Invention] As described above in detail, according to the present invention, the voice recognition control device can reproduce the standard voice of the recognition target word or phrase by the user's specific operation or vocal input of a specific word.
Since the operation of outputting audio expressing the operating status of the device is performed by interrupt processing, the user can play back the audio of the recognition target phrase or output audio expressing the operating status of the device at any time. can be requested. Therefore, when the user is unsure of command words that can be input vocally or information that is necessary or helpful for driving operations, the user can know this by voice instead of visually, which is especially useful for blind people or for use in the dark. A convenient voice recognition control device can be provided.

このとき、認識対象語句を知らせる音声は、利用者の発
声入力した語句を認識する際に、その発声音声と比較す
る標準音声を用いているので、利用者は正しく認識され
る発声方法を知ることができる。At this time, the voice that informs the recognition target word uses a standard voice that is compared with the voice input when recognizing the word input by the user, so the user does not need to know how to pronounce it correctly. I can do it.

また、上記認識対象語句や装置の動作状態等の音声出力
は、利用者が要求を行った時点において、つまり該時点
において可能な操作や該時点の装置の動作状態等に応じ
て、利用者にとって必要性の高い情報から順に音声出力
を行なってでいるので、利用者は該時点において必要性
の高い順に上記の情報を得ることができる。In addition, the audio output of the above-mentioned words to be recognized, the operating state of the device, etc., is made available to the user at the time the user makes the request, that is, depending on the operations available at that time and the operating state of the device at that time. Since the audio output is performed in order of necessity, the user can obtain the above-mentioned information in order of necessity at the time.

さらに、認識対象語句の音声出力中に、目的の語句の音
声出力から一定時間内に利用者が何らかの発声を行えば
、以後の音声出力を中止し、該語句の発声入力が行われ
てこれを正しく認識したのと同じ動作を行うこともでき
る。よって、利用者は目的の語句を知った後も不必要に
残りの音声出力を聞いたのち、改めて目的の語句の発声
入力を行う必要がない。Furthermore, if the user utters something within a certain period of time after the target word is output while the recognition target word is being output, further voice output will be stopped and the word will be input as a voice. You can also perform the same action if it is correctly recognized. Therefore, even after the user knows the target word/phrase, there is no need to unnecessarily listen to the remaining audio output and then re-input the target word/phrase.

また、一つの操作のために複数段階の発声入力が必要な
場合には、全段階の発声入力が行われてから一定時間内
に次段階の発声入力が行われない場合にのみ、次の発声
入力を促す案内音声を出力することができるので、操作
に慣れた利用者が不必要に案内音声を聞かされることな
く、また操作に不慣れな利用者は必要な案内音声を得る
ことができる。In addition, if multiple stages of voice input are required for one operation, the next voice input will be performed only if the next level of voice input is not performed within a certain period of time after all stages of voice input are completed. Since a guidance voice prompting input can be output, a user who is accustomed to the operation will not be forced to listen to the guidance voice unnecessarily, and a user who is not accustomed to the operation can obtain the necessary guidance voice.

上記を総合して１本発明によれば、制御すべき本体装置
が認識の対象とする語句を、利用者の任意の時点での要
求により、対象の際に比較の標準とする音声で利用者に
知らせることができる音声認識制御装置を提供すること
ができる。To summarize the above, according to the present invention, the main body device to be controlled can, at the user's request at any time, recognize words and phrases that are to be recognized by the user in a voice that is used as a standard for comparison. It is possible to provide a voice recognition control device that can notify the user.

また５本体装置の動作設定状態の情報を、利用者の任意
の時点での要求により音声で利用者に知らせることので
きる音声認識制御装置を提供することができる。Further, it is possible to provide a voice recognition control device that can notify a user of information on the operation setting state of the main unit 5 by voice according to the user's request at any time.

【図面の簡単な説明】[Brief explanation of the drawing]

第１図は、本発明の一実施例に係る音声認識制御装置の
制御系のブロック図、第２図は、第１図の空調機の外観
の一例を示す略示構成図、第３図は、第１図の音声認識
部の一構成例を示すブロック図、第４図は、制御部の一
構成例を示すブロック図、第５図は、音声合成器の一構
成例を示すブロック図、第６図は、空調機の運転開始か
ら停止までの制御部４の動作を示すフローチャート、第
７図は、音声認識部３の音声認識動作を示すフローチャ
ート、第８図は、ヘルプキー５の操作による制御部４の
割込処理を示すフローチャート、第９図は、音声認識部
３の標準音声再生の動作を示すフローチャート、第１０
図は、発声によるヘルプ要求の場合の音声認識部３の動
作を示すフローチャート、第１１図は、音声出力の項目
分類、第１２図は、音声出力の順序の一例を示す説明図
、第１３図は１次のヘルプ要求までに時間制限を設ける
動作のフローチャート、第１４図は、−室以上のりジェ
クト回数で認識対象単語を自動再生する動作のフローチ
ャート、第１５図は、一定時間内に次の発声入力がない
場合のみガイダンス音声を出力する動作のフローチャー
ト、第１６図は、認識対象単語の音声再生中の発声入力
により操作を行うフローチャートである。１・・・マイクロホン、３・・・音声認識部、４・・・
制御部、５・・・ヘルプキー、７・・・音声合成器、９
・・・スピーカ、１２・・・空調機駆動回路、１４・・
・手動操作キ３１・・・演算部、３２・・・第１のメモ
リ、３３・・・第２のメモリ、３４・・・入出力部、４
１・・・演算部、４２・・・第１のメモリ、４３・・・
第２のメモリ、４４・・・入出力部、５１・・・演算部
、５２・・・メモリ、５３・・・デコーダ、５４・・・
Ｄ／Ａ変換器。FIG. 1 is a block diagram of a control system of a voice recognition control device according to an embodiment of the present invention, FIG. 2 is a schematic configuration diagram showing an example of the external appearance of the air conditioner shown in FIG. 1, and FIG. , FIG. 4 is a block diagram showing an example of the configuration of the control section, FIG. 5 is a block diagram showing an example of the configuration of the speech synthesizer, 6 is a flowchart showing the operation of the control unit 4 from the start of operation to the stop of the air conditioner, FIG. 7 is a flowchart showing the voice recognition operation of the voice recognition unit 3, and FIG. 8 is the operation of the help key 5. FIG. 9 is a flowchart showing the interrupt processing of the control unit 4 according to FIG. 9, and FIG.
11 is a flowchart showing the operation of the voice recognition unit 3 in the case of a voiced help request. FIG. 11 is an item classification of voice output. FIG. 12 is an explanatory diagram showing an example of the order of voice output. FIG. 13 14 is a flowchart of the operation to set a time limit until the first help request, FIG. 14 is a flowchart of the operation to automatically reproduce the recognition target word with a number of times of - room or more, and FIG. FIG. 16 is a flowchart of an operation for outputting guidance voice only when there is no voice input. FIG. 16 is a flowchart for performing an operation based on voice input while the voice of a recognition target word is being reproduced. 1...Microphone, 3...Speech recognition section, 4...
Control unit, 5... Help key, 7... Speech synthesizer, 9
...Speaker, 12...Air conditioner drive circuit, 14...
・Manual operation key 31...Calculation unit, 32...First memory, 33...Second memory, 34...Input/output unit, 4
1... Arithmetic unit, 42... First memory, 43...
2nd memory, 44... input/output unit, 51... calculation unit, 52... memory, 53... decoder, 54...
D/A converter.

Claims

【特許請求の範囲】１、利用者の発声入力による運転操作に基づき、制御す
べき本体装置の運転制御を行う音声認識制御装置におい
て、利用者の発声する命令語の音声を入力する音声入力手段
と、その入力された音声の特徴量を抽出し、あらかじめ登録
されている命令語の標準音声の特徴量と比較することに
より、該命令語を認識して認識結果を出力する音声認識
手段と、前記認識結果に基づいて前記本体装置の運転制御を行う
制御手段と、この制御手段からの指示信号に基づき前記標準音声の再
生を行う第１の音声再生手段と、利用者が手動操作を行
う手動操作部と、その手動操作を表わす信号を前記制御
手段に出力する手動操作手段とを備え、利用者の特定の手動操作により前記標準音声の再生を行
うことを特徴とする音声認識制御装置。２、第１の音声再生手段は、利用者の発声入力する特定
の語句を認識することにより、標準音声の再生を行うも
のであることを特徴とする請求項１記載の音声認識制御
装置。３、第１の音声再生手段は、複数の話者について標準音
声およびその特徴量を登録しておき、特定語句の発声入
力があったときに、その発声入力の特徴量が前記登録さ
れた特徴量と最小相違度を与える発声を入力した話者に
ついて、該話者の命令語の標準音声の再生を行うもので
あることを特徴とする請求項２記載の音声認識制御装置
。４、利用者の発声入力による運転操作に基づき、制御す
べき本体装置の運転制御を行う音声認識制御装置におい
て、利用者の発声する命令語の音声を入力する音声入力手段
と、その入力された音声の特徴量を抽出し、あらかじめ登録
されている命令語の標準音声の特徴量と比較することに
より、該命令語を認識して認識結果を出力する音声認識
手段と、前記認識結果に基づいて前記本体装置の運転制御を行う
制御手段と、この制御手段からの指示信号に基づき、あらかじめ記録
された情報語句の音声を選択して再生する第２の音声再
生手段と、利用者が手動操作を行う手動操作部と、その手動操作を
表わす信号を前記制御手段に出力する手動操作手段とを
備え、利用者の特定の手動操作または特定の語句の発声入力の
いずれかにより、その情報語句の音声を選択して再生を
行うことを特徴とする音声認識制御装置。５、第２の音声再生手段は、利用者の特定の手動操作ま
たは特定の語句の発声入力のいずれかが行われた時点に
おける、制御すべき本体装置の動作状態、および利用者
が行いうる操作に基づき、利用者の必要性に応じた順序
で、命令語の標準音声または情報語句の音声を選択して
再生を行うものであることを特徴とする請求項４記載の
音声認識制御装置。６、第２の音声再生手段は、利用者の１回の特定の手動
操作または特定の語句の発声入力のいずれか毎に、利用
者の必要性に応じた順序に従い、命令語の標準音声また
は情報語句の音声の中から１項目の音声を選択して再生
を行い、その音声の再生の開始または終了のいずれかか
ら所定時間内に、特定の手動操作または特定の語句の発
声入力のいずれかが行われた場合にのみ、次の順序の一
項目の音声を選択して再生を行うものであることを特徴
とする請求項５記載の音声認識制御装置。７、利用者の発声入力する語句が認識不足となる回数に
基づいて自動的に命令語の標準音声の再生を行うことを
特徴とする請求項１ないし６記載のいずれかの音声認識
制御装置。８、利用者の特定の手動操作または特定の語句の発声入
力の頻度に基づき、あらかじめ記録された情報語句の音
声を選択して発声する該情報語句を自動的に変更するこ
とを特徴とする請求項４記載の音声認識制御装置。９、音声再生中に、利用者が特定の手動操作または特定
の語句の発声入力のいずれかを行う頻度に基づき、各項
目の音声の選択再生順序を自動的に入れ替えることを特
徴とする請求項６記載の音声認識制御装置。１０、利用者の発声入力による運転操作に基づき、制御
すべき本体装置の運転制御を行う音声認識制御装置にお
いて、利用者の発声する命令語の音声を入力する音声入力手段
と、その入力された音声の特徴量を抽出し、あらかじめ登録
されている命令語の標準音声の特徴量と比較することに
より、該命令語を認識して認識結果を出力する音声認識
手段と、前記認識結果に基づいて前記本体装置の運転制御を行う
制御手段と、この制御手段からの指示信号に基づき前記標準音声の再
生を行う第１の音声再生手段と、利用者が手動操作を行
う手動操作部と、その手動操作を表わす信号を前記制御
手段に出力する手動操作手段とを備え、利用者の複数段階の命令語の発声入力により１個の運転
操作を行う場合に、前段階の命令語の発声入力ののち、
一定時間内に次段階の命令語の発声入力がない場合にの
み、利用者に前記次段階の命令語の発声を促す内容の情
報音声の再生を行うことを特徴とする音声認識制御装置。[Scope of Claims] 1. In a voice recognition control device that controls the operation of a main unit to be controlled based on a driving operation based on a user's voice input, a voice input means for inputting the voice of a command word uttered by a user. and a voice recognition means that extracts the feature amount of the input voice and compares it with the feature amount of a standard voice of the command word registered in advance, thereby recognizing the command word and outputting a recognition result; a control means for controlling the operation of the main unit based on the recognition result; a first audio reproduction means for reproducing the standard audio based on an instruction signal from the control means; and a manual operation for manual operation by the user. A voice recognition control device comprising: an operation section; and a manual operation means for outputting a signal representing the manual operation to the control means, and playing back the standard voice according to a specific manual operation by a user. 2. The voice recognition control device according to claim 1, wherein the first voice reproduction means reproduces the standard voice by recognizing a specific phrase inputted by the user. 3. The first audio reproduction means registers standard voices and their features for a plurality of speakers, and when a specific word is uttered, the feature of the utterance input is used as the registered feature. 3. The speech recognition control device according to claim 2, wherein the speech recognition control device reproduces a standard voice of a command word of a speaker who inputs an utterance giving the amount of difference and the minimum degree of difference. 4. In a voice recognition control device that controls the operation of the main unit to be controlled based on the driving operation by the user's voice input, a voice input means for inputting the voice of the command word uttered by the user; a voice recognition means for extracting a feature amount of a voice and comparing it with a feature amount of a standard voice of the command word registered in advance to recognize the command word and outputting a recognition result; a control means for controlling the operation of the main unit; a second audio reproduction means for selecting and reproducing the audio of pre-recorded information words based on an instruction signal from the control means; and a manual operation unit that outputs a signal representing the manual operation to the control means, and the control means outputs the audio of the information word by either the user's specific manual operation or the voice input of the specific word. A voice recognition control device that selects and plays. 5. The second audio reproduction means is configured to control the operating state of the main device to be controlled and the operations that the user can perform at the time when either a specific manual operation or vocal input of a specific word is performed by the user. 5. The voice recognition control device according to claim 4, wherein standard voices of command words or voices of information words are selected and reproduced in an order according to the needs of the user. 6. The second audio reproduction means reproduces the standard voice of the command word or One item of audio is selected from the information word/phrase audio and played back, and within a predetermined time from either the start or end of the audio playback, either a specific manual operation or vocal input of a specific word/phrase is performed. 6. The voice recognition control device according to claim 5, wherein the voice recognition control device selects and reproduces the voice of one item in the next order only when the voice recognition is performed. 7. The voice recognition control device according to any one of claims 1 to 6, characterized in that the standard voice of the command word is automatically reproduced based on the number of times a word input by the user is insufficiently recognized. 8. A claim characterized in that the voice of the information phrase recorded in advance is selected and the voiced information phrase is automatically changed based on a user's specific manual operation or the frequency of voice input of a specific phrase. The voice recognition control device according to item 4. 9. A claim characterized in that, during audio playback, the selection playback order of audio for each item is automatically changed based on the frequency with which the user performs either a specific manual operation or vocal input of a specific word/phrase. 6. The voice recognition control device according to 6. 10. In a voice recognition control device that controls the operation of the main unit to be controlled based on the driving operation by the user's voice input, a voice input means for inputting the voice of the command word uttered by the user; a voice recognition means for extracting a feature amount of a voice and comparing it with a feature amount of a standard voice of the command word registered in advance to recognize the command word and outputting a recognition result; a control means for controlling the operation of the main unit; a first sound reproduction means for reproducing the standard sound based on an instruction signal from the control means; a manual operation section for manual operation by a user; a manual operation means for outputting a signal representing an operation to the control means, and when one driving operation is performed by the user's vocal input of command words in multiple stages, after the command word is vocalized in the previous stage, ,
A voice recognition control device characterized in that only when there is no voice input of a next-stage command within a certain period of time, an information voice is played that prompts a user to utter the next-stage command.