JPS59195739A - Audio response unit - Google Patents

Audio response unit

Info

Publication number
JPS59195739A
JPS59195739A JP58070289A JP7028983A JPS59195739A JP S59195739 A JPS59195739 A JP S59195739A JP 58070289 A JP58070289 A JP 58070289A JP 7028983 A JP7028983 A JP 7028983A JP S59195739 A JPS59195739 A JP S59195739A
Authority
JP
Japan
Prior art keywords
voice
signal
microphone
synthesized
information processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58070289A
Other languages
Japanese (ja)
Inventor
Hitoshi Takase
高瀬 均
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Electric Co Ltd
Sanyo Denki Co Ltd
Original Assignee
Sanyo Electric Co Ltd
Sanyo Denki Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Electric Co Ltd, Sanyo Denki Co Ltd filed Critical Sanyo Electric Co Ltd
Priority to JP58070289A priority Critical patent/JPS59195739A/en
Publication of JPS59195739A publication Critical patent/JPS59195739A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To input a voice to a microphone to recognize the voice even during speaking of a synthesized voice by recognizing the voice after the voice signal from the microphone is corrected with a synthesized voice signal from a voice synthesizing part. CONSTITUTION:By the request from an information processing part 1, a voice synthesizing part 6 synthesizes a desired voice signal S1 and outputs it from a speaker 8 through an amplifier 7. A voice signal V1 obtained from a microphone 3 through an amplifier 4 is corrected in a correcting means 5 to become a signal V2, and this signal is recognized by a voice recognizing part 2 and is inputted as an instruction and a data signal, which correspond to the signal V2, to the processing part 1. If the user speaks toward the microphone 8 when a voice S is outputted from the speaker 8, the voice S is superposed and inputted, and the signal V1 becomes a superposed voice signal. The signal S1 is converted to a tuning synthesized voice signal S2 tuned to the synthesized voice component S by a tuning circuit 51 in the correcting means 5, and this signal is subtracted from the superposed voice signal V1 in a subtractor 52 to obtain the voice signal V2 having no noise components.

Description

【発明の詳細な説明】 (イ)M業上の利用分野 本発明は音声の入出力を可能とした音声応答装置に関す
る。
DETAILED DESCRIPTION OF THE INVENTION (a) Field of use in M industry The present invention relates to a voice response device that allows input and output of voice.

(ロ)従来技術 この種従来の音声応答装置には、情報処刑部に人間の音
声を認識する音声認識部と音声を合成する音声合成部と
を組合せて、音声認識部にオペレータの音声を認識せし
めて情報処理部の操作及び情報処理部へのデータ入力を
行ない、音声合成部にて情報処理部の操作を促がすメツ
セージの出力及び情報処理部からの処理結果の出力を行
なうものがある。
(b) Prior art This type of conventional voice response device combines a voice recognition unit that recognizes human voice in the information processing unit and a voice synthesis unit that synthesizes voice, and the voice recognition unit recognizes the operator's voice. There is a device that at least operates the information processing unit and inputs data to the information processing unit, and the speech synthesis unit outputs a message prompting the operation of the information processing unit and outputs the processing results from the information processing unit. .

斯様な音声応答装W1は、その操作を行なうに際して、
音声合成部からの例えは、「ゝゝYES“又はゝゝNo
”を発声入力して)さい。」、「番号を発声入力して下
さい。」等のメツセージが出力され、それに従って、オ
ペレータが音声認識部に「YES」又は「NO」を発声
入力したり、番号を発声入力する事に依り、情報処理部
の操作及びデータ入力ができるので、断る装置の操作に
不慣れなオペレータにとって非常に便利ではある。しか
し力から、オペレータが多少とも斯様な操作に慣れて来
ると、上述の如きメツセージが不用となる場合が多い。
When such voice response device W1 performs its operation,
An example from the speech synthesis unit is “ゝゝYES” or “No”.
Messages such as "Please speak and input" or "Please input the number by voice" are output, and the operator can input ``YES'' or ``NO'' into the voice recognition unit according to the message. Since the information processing unit can be operated and data inputted by inputting the number aloud, it is very convenient for operators who are not accustomed to operating the refusal device. However, as the operator becomes more or less accustomed to such operations, such messages as described above are often unnecessary.

ところが、合成音声のメツセージの発声中には、オペレ
ータの入力音声にこのメツセージが雑音として重畳する
事になり、音声認識部での正確力認識動作が不可能とな
る慣れがあった。この為、従来装置に於いては合成音声
の発声終了まで、次の音声入力は受は入れられない構成
となっており、この為無駄な待ち時間を費やす欠点があ
った。
However, when a synthetic voice message is being uttered, the message is superimposed on the operator's input voice as noise, making it impossible for the voice recognition unit to perform accurate force recognition. For this reason, the conventional apparatus is configured such that the next voice input cannot be accepted until the synthetic voice is finished producing, which has the disadvantage of wasting waiting time.

(ハ)発明の目的 本発明は合成音声メツセージの発声中及び終了後にかか
わらず、オペレータの入力音声を常に受は入れ可能とし
て、操作時間の短縮を図り九音声応答装置を提供する事
を目的としたものである。
(c) Purpose of the Invention The object of the present invention is to provide a nine-voice response device that can always accept the operator's input voice, regardless of whether the synthesized voice message is being uttered or after it has been uttered, thereby shortening the operation time. This is what I did.

に)発明の構成 本発明の音声応答装置は、マイクロフォンと音声認識部
との間に、マイクロフォンからの音声信号を情報処理部
からの要求に依り音声合成部で合成された合成音声信号
にて補正する補正手段を介押し、音声合成部がスピーカ
から合成音声を発声中であっても、上記マイクロフォン
へ音声を入力せしめて音声認識部での認識動作を可能と
したものである。
B) Structure of the Invention The voice response device of the present invention corrects the voice signal from the microphone with a synthesized voice signal synthesized by the voice synthesis unit in response to a request from the information processing unit, between the microphone and the voice recognition unit. Even when the voice synthesizer is producing synthesized voice from the speaker, the voice is inputted to the microphone to enable the voice recognition unit to perform the recognition operation.

(!@実施例 第1図に本発明の音声応答装置の一実施例を示す。同図
に於いて、(1)は情報処理部でめ9、例えばセンサー
、記憶回路、タイマー回路等からなり、家庭での日常生
活に係る各種の電気機器を集中制徊1スるホームコント
ローラがこれに該当する。
(!@Embodiment Fig. 1 shows an embodiment of the voice response device of the present invention. In the figure, (1) is an information processing section consisting of a sensor, a memory circuit, a timer circuit, etc.). A home controller that centrally controls various electrical devices related to daily life at home falls under this category.

(2)は該情報処理部+1+に連なった音声認識部であ
り、マイクロフォン(3)から増巾器(4)を介して得
られる音声信号V1を補正手段(5)にて補正した音声
信号V2を認識する事に依って、この音声信号V2に対
応した命令信号及びデータ信号を上記情報処理部fl)
へ入力する。この音声認識部(2)としては、例えば三
洋電機斡)製の品番5RB−64なる音声認識ボードが
用いられる。(6)は上記情報処理部(1)に連なった
音声合成部であり、情報処理部(1)からの要求に依り
、必要な音声信号S1を合成して増巾器(7)を介して
スピーカ(8)から出力する。この音声合成部(6)と
しては、例えば三洋電機@製の品番LC8100なるL
SIが用いられる。(9)は上記補正手段(5)からの
音声信号v2を増巾クリップ回路(IO)にてクリップ
した信号をさらに積分して波形整形する積分器でめシ、
この積分器(10Vからの信号は音声検知信号■8とな
り、この信号V8にて上記情報処理部ft)への割込み
がかけれる。尚、上記情報処理部fi+はこの割込みに
依って、音声認識部(2)からの命令又はデータ信号を
受は入れ可能状態と2なる。
(2) is a voice recognition unit connected to the information processing unit +1+, and the voice signal V2 is obtained by correcting the voice signal V1 obtained from the microphone (3) via the amplifier (4) by the correction means (5). By recognizing this, the command signal and data signal corresponding to this audio signal V2 are transmitted to the information processing unit fl).
Enter. As this voice recognition unit (2), for example, a voice recognition board manufactured by Sanyo Electric Co., Ltd., product number 5RB-64 is used. (6) is a speech synthesis section connected to the information processing section (1), which synthesizes the necessary speech signal S1 and sends it to the amplifier (7) according to a request from the information processing section (1). Output from the speaker (8). This voice synthesis unit (6) is, for example, L made by Sanyo Electric @, product number LC8100.
SI is used. (9) is an integrator that further integrates and shapes the waveform of the signal obtained by clipping the audio signal v2 from the correction means (5) using the amplification clip circuit (IO);
The signal from this integrator (10V) becomes the audio detection signal 8, and this signal V8 causes an interrupt to the information processing section ft. Note that the information processing section fi+ becomes ready to receive commands or data signals from the voice recognition section (2) due to this interruption.

ここで本発明実施例装置の特徴とする補正手段(5)に
ついて詳述する。該補正手段(5)は音声合成部(6)
から得られる合成音声信号S1のゲイン調整及び位相調
整を行なう同調器ω1)と、該同調器韓)からの同調合
成音声信号S2を上記マイクロフォン(3)から増巾器
(4)を介して得られる音声信号v1から差し引く減算
器(−とからなり、この減算器■からの音声信号v2が
音声認識部(2)に入力されると共に、この音声信号v
2は増巾クリップ回路(10)及び積分器(9)にて音
声検知信号v8として情報処理部に割込みがかけられる
。第2図に各信号si、s2、Vl、V2.VBの波形
を示し、同図に基づいて本発明装置の動作を述べる。
Here, the correction means (5), which is a feature of the apparatus according to the embodiment of the present invention, will be described in detail. The correction means (5) is a speech synthesis section (6)
A tuner ω1) performs gain adjustment and phase adjustment of the synthesized speech signal S1 obtained from the above, and a tuned synthesized speech signal S2 from the tuner ω1) is obtained from the microphone (3) via the amplifier (4). A subtracter (-) is input to the voice recognition unit (2), and the voice signal v2 from this subtractor (2) is input to the voice recognition unit (2).
2, an amplification clip circuit (10) and an integrator (9) interrupt the information processing section as a voice detection signal v8. FIG. 2 shows each signal si, s2, Vl, V2. The waveform of VB is shown, and the operation of the device of the present invention will be described based on the figure.

情報処理部mからの要求に依り、音声合成部(6)から
例えばl’−”YES“又は1′Nθ″を発声入力して
下さい。」力るメツセージの第2図に示す如き合成音声
信号S1が出力され、これがスピーカ(8)にて発声さ
れている時、即ち、発声開始から11時間後、オペレー
タがマイクロフォン(3)に向って「NO」と発声する
と、このマイクロフォン(3)には、オペレータの音声
Vに上記スピーカ(8)から発声されている上記合成音
声Sが重畳されて入力され、増巾器(4)からは第2図
に示す如き重畳音声信号Vlが得られる。即ち、この重
畳音声信号v1は、それに含まれる合成音声S成分が音
声■に対する雑音となるので、音声認識不可能な信号と
なっている。従って、補正手段(5)に於いては、同r
回路(61)にてこの時の合成音声信号S1を上記重畳
音声信号Vlに含まれる合成音声S成分に同調せしめた
第2図に示す如き間開合成音声信号S2とし、減算器(
52)にてこの同調合成音声信号S2を上記重畳音声信
号v1から差し引いて、第2図に示す如き、オペレータ
の真の音声■に依る音声信号V2を得る。斯して得られ
た雑音成分のない音声信号v2は音声認識部(2)にて
認識されると共に、増巾クリップ回1iP1f101及
び積分器(9)にて第2図に示す如き音声検知信号v8
を得て上記情報処理部(1)に割込みをかけ、情報処F
l+1(x)を上記認識処理部(2)からの認識結果で
ある命令又はデータ信号の受は入れ可能状態とする。
In response to a request from the information processing unit m, please vocally input, for example, l'-"YES" or 1'Nθ" from the speech synthesis unit (6)." A synthesized speech signal as shown in FIG. When S1 is output and is being uttered by the speaker (8), that is, 11 hours after the start of utterance, when the operator utters "NO" into the microphone (3), this microphone (3) , the synthesized voice S uttered from the speaker (8) is superimposed on the operator's voice V, and a superimposed voice signal Vl as shown in FIG. 2 is obtained from the amplifier (4). That is, this superimposed voice signal v1 is a signal that cannot be recognized as a voice because the synthesized voice S component contained therein becomes noise for the voice (2). Therefore, in the correction means (5), the same r
A circuit (61) converts the synthesized speech signal S1 at this time into a gap synthesized speech signal S2 as shown in FIG.
At step 52), this tuned synthesized voice signal S2 is subtracted from the superimposed voice signal v1 to obtain a voice signal V2 based on the operator's true voice ■ as shown in FIG. The thus obtained voice signal v2 free of noise components is recognized by the voice recognition unit (2), and is converted into a voice detection signal v8 as shown in FIG. 2 by the amplification clip circuit 1iP1f101 and the integrator (9).
The information processing unit (1) is interrupted by the information processing unit F.
l+1(x) is placed in a state in which it is ready to receive a command or data signal which is the recognition result from the recognition processing section (2).

さらに、第2図の信号図には示していないが、情報処理
部(1)に割込みがかかり、認識処理部(2)での認識
動作が正確に行なわれた時点に於いて、上記音声合成部
(1)での合成動作を直ちに中止せしめ、スピーカ(8
)からの合成音声出力を中断せしめる構成としてもよい
。この後、マイクロフォン(3)にはオペレータの音声
Vのみが入力される事となるが上記補正手段(I′I)
への合成音声信号S1が無声状態と力るので、補正手段
(6)からの音声信号■2はやはり、オ°ベレータの音
声Vのみに依るものとなシ、認識処理部(2)での確実
な認識動作が行表われる。
Furthermore, although not shown in the signal diagram of FIG. Immediately stop the synthesis operation in section (1) and turn on the speaker (8).
) may be configured to interrupt the output of synthesized speech. After this, only the operator's voice V will be input to the microphone (3), but the correction means (I'I)
Since the synthesized speech signal S1 is in a voiceless state, the speech signal 2 from the correction means (6) depends only on the voice V of the operator. Reliable recognition behavior is performed.

従って、オペレータは、合成音声メツセージの先頭部分
、例えばPゝY E S ″又はゞ′NO“」を聞くだ
けで次の音声入力形式を知る事ができ、直ちに、NO“
であればrNOJと発声入力でき、このNO“力る入力
情報が情報処理部(1)に受は入れられる事となる。
Therefore, the operator can know the next voice input format just by listening to the first part of the synthesized voice message, such as ``PY E S'' or ゞNO'', and can immediately select NO.
If so, rNOJ can be inputted by speaking, and the input information inputted as "NO" will be accepted by the information processing section (1).

(へ)発明の効果 本発明の音声応答装置は、以上の説明から明らかな如く
、マイクロフォンと音声認識部との間に、音声合成部か
らの合成音声信号にてマイクロフォンからの音声信号を
補正する補正手段を介挿しているので、音声合成部から
スピーカを介して合成音声メツ七−ジが発声中でおって
も、この合成音声に影響される事なく、マイクロフォン
に入力されたオペレータの音声を音声認識部にて正確に
認識できる。従って、断る装置の掃作に慣れたオペレー
タにとって、合成音声のメッセージカ終了するのを待つ
事なく、次の音声入力ができ、操作時間の大巾な短縮が
図れる。
(F) Effects of the Invention As is clear from the above description, the voice response device of the present invention corrects the voice signal from the microphone with the synthesized voice signal from the voice synthesis unit between the microphone and the voice recognition unit. Since the correction means is inserted, even if the synthesized voice message is being uttered from the voice synthesis unit through the speaker, the operator's voice input to the microphone will not be affected by this synthesized voice. Accurate recognition is possible with the voice recognition unit. Therefore, an operator who is accustomed to cleaning the refusal device can input the next voice without waiting for the synthesized voice message to finish, thereby greatly reducing the operating time.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の音声応答装置の一実施例のブロック図
、第2図は本発明装置の信号波形図であり、(1)は情
報処理部、(2)は音声認識部、(3)はマイクロフォ
ン、(5)は補正手段、(6)は音声合成部、(8)C
′2>
FIG. 1 is a block diagram of an embodiment of the voice response device of the present invention, and FIG. 2 is a signal waveform diagram of the device of the present invention, in which (1) is an information processing section, (2) is a voice recognition section, and (3 ) is a microphone, (5) is a correction means, (6) is a speech synthesis unit, (8) C
'2>

Claims (1)

【特許請求の範囲】[Claims] (1)マイクロフォンと、該マイクロフォンカラ得られ
る人間の音声信号を認識する音声認識部と該音声認識部
での認識結果を入力情報として処理する情報処理部と、
該情報処理部からの要求に依シ音声信号を合成する音声
合成部と、該合成部からの合成音声信号に基づいて合成
音声を発声するスピーカと、を備えた音声応答装置に於
いて、上記マイクロフォンと音声認識部との間に、マイ
クロフォンからの音声信号を上記音声合成部の合成音声
信号にて補正する補正手段を介挿する事に依って、音声
合成部がスピーカから合成音声を発声中であっても、上
記マイクロフォンへ音声を入力せしめて上記音声認識部
での認識動作を可能とした事を特徴とする音声応答装置
(1) a microphone, a voice recognition unit that recognizes a human voice signal obtained from the microphone, and an information processing unit that processes the recognition result of the voice recognition unit as input information;
In the voice response device described above, the voice response device includes a voice synthesis unit that synthesizes a voice signal depending on a request from the information processing unit, and a speaker that emits a synthesized voice based on the synthesized voice signal from the synthesis unit. By inserting a correction means between the microphone and the speech recognition section, which corrects the speech signal from the microphone with the synthesized speech signal of the speech synthesis section, the speech synthesis section is producing synthesized speech from the speaker. A voice response device characterized in that the voice recognition unit can perform a recognition operation by inputting voice to the microphone.
JP58070289A 1983-04-20 1983-04-20 Audio response unit Pending JPS59195739A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58070289A JPS59195739A (en) 1983-04-20 1983-04-20 Audio response unit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58070289A JPS59195739A (en) 1983-04-20 1983-04-20 Audio response unit

Publications (1)

Publication Number Publication Date
JPS59195739A true JPS59195739A (en) 1984-11-06

Family

ID=13427168

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58070289A Pending JPS59195739A (en) 1983-04-20 1983-04-20 Audio response unit

Country Status (1)

Country Link
JP (1) JPS59195739A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62299997A (en) * 1986-06-20 1987-12-26 松下電器産業株式会社 Interactive type voice input/output unit
JPS63121096A (en) * 1986-11-10 1988-05-25 松下電器産業株式会社 Interactive type voice input/output device
JPH1195791A (en) * 1997-07-31 1999-04-09 Lucent Technol Inc Voice recognizing method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62299997A (en) * 1986-06-20 1987-12-26 松下電器産業株式会社 Interactive type voice input/output unit
JPH0756595B2 (en) * 1986-06-20 1995-06-14 松下電器産業株式会社 Interactive voice input / output device
JPS63121096A (en) * 1986-11-10 1988-05-25 松下電器産業株式会社 Interactive type voice input/output device
JPH1195791A (en) * 1997-07-31 1999-04-09 Lucent Technol Inc Voice recognizing method
USRE38649E1 (en) 1997-07-31 2004-11-09 Lucent Technologies Inc. Method and apparatus for word counting in continuous speech recognition useful for reliable barge-in and early end of speech detection

Similar Documents

Publication Publication Date Title
JPH096390A (en) Voice recognition interactive processing method and processor therefor
KR100974054B1 (en) Providing custom audio profile in wireless device
US4825384A (en) Speech recognizer
JPS60247697A (en) Voice recognition responder
JPS59195739A (en) Audio response unit
US7043427B1 (en) Apparatus and method for speech recognition
JPS63149699A (en) Voice input/output device
WO2015093013A1 (en) Speech recognition apparatus and computer program product for speech recognition
KR100194765B1 (en) Speech recognition system using echo cancellation and method
EP1091347A2 (en) Multi-stage speech recognition
JP2585547B2 (en) Method for correcting input voice in voice input / output device
JPH11126093A (en) Voice input adjusting method and voice input system
JPH0566793A (en) Speech input device
JPS59153238A (en) Voice input/output system
JPH11109987A (en) Speech recognition device
JP2975808B2 (en) Voice recognition device
JP2002244696A (en) Controller by speech recognition
JP2020085942A (en) Information processing apparatus, information processing method, and program
KR950000532B1 (en) Method of perceiving the phonetics of a language in the hand-free dialing system
US20060080097A1 (en) Voice acknowledgement independent of a speaker while dialling by name
JPH04181998A (en) Device and method for speech recognition
JPH02162399A (en) Voice recognition device
JP2000047689A (en) Speech recognition device
JPS6239747B2 (en)
JPH03188500A (en) Speech recognizing device