JP2020052145A - 音声認識装置、音声認識方法、及び音声認識プログラム - Google Patents
音声認識装置、音声認識方法、及び音声認識プログラム Download PDFInfo
- Publication number
- JP2020052145A JP2020052145A JP2018179407A JP2018179407A JP2020052145A JP 2020052145 A JP2020052145 A JP 2020052145A JP 2018179407 A JP2018179407 A JP 2018179407A JP 2018179407 A JP2018179407 A JP 2018179407A JP 2020052145 A JP2020052145 A JP 2020052145A
- Authority
- JP
- Japan
- Prior art keywords
- user
- voice recognition
- voice
- party
- output
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 61
- 230000004044 response Effects 0.000 abstract description 5
- 230000002452 interceptive effect Effects 0.000 description 9
- 238000004590 computer program Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04K—SECRET COMMUNICATION; JAMMING OF COMMUNICATION
- H04K1/00—Secret communication
- H04K1/02—Secret communication by adding a second signal to make the desired signal unintelligible
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04K—SECRET COMMUNICATION; JAMMING OF COMMUNICATION
- H04K3/00—Jamming of communication; Counter-measures
- H04K3/40—Jamming having variable characteristics
- H04K3/45—Jamming having variable characteristics characterized by including monitoring of the target or target signal, e.g. in reactive jammers or follower jammers for example by means of an alternation of jamming phases and monitoring phases, called "look-through mode"
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04K—SECRET COMMUNICATION; JAMMING OF COMMUNICATION
- H04K3/00—Jamming of communication; Counter-measures
- H04K3/80—Jamming or countermeasure characterized by its function
- H04K3/82—Jamming or countermeasure characterized by its function related to preventing surveillance, interception or detection
- H04K3/825—Jamming or countermeasure characterized by its function related to preventing surveillance, interception or detection by jamming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04K—SECRET COMMUNICATION; JAMMING OF COMMUNICATION
- H04K3/00—Jamming of communication; Counter-measures
- H04K3/80—Jamming or countermeasure characterized by its function
- H04K3/86—Jamming or countermeasure characterized by its function related to preventing deceptive jamming or unauthorized interrogation or access, e.g. WLAN access or RFID reading
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04K—SECRET COMMUNICATION; JAMMING OF COMMUNICATION
- H04K2203/00—Jamming of communication; Countermeasures
- H04K2203/10—Jamming or countermeasure used for a particular application
- H04K2203/12—Jamming or countermeasure used for a particular application for acoustic communication
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Computer Security & Cryptography (AREA)
- Electromagnetism (AREA)
- Microelectronics & Electronic Packaging (AREA)
- Quality & Reliability (AREA)
- Circuit For Audible Band Transducer (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
まず、図1を参照して、本発明の一実施形態である音声認識装置の構成について説明する。
図2は、本発明の一実施形態である音声認識処理の流れを示すフローチャートである。図2に示すフローチャートは、音声認識装置1がユーザP1に対して発話を要求する度毎に開始となり、音声認識処理はステップS1の処理に進む。
2 音声入力装置
3A,3B スピーカ
11 音声認識処理部
12 音データベース(音DB)
13 音声再生部
14 音量設定部
P1 ユーザ
P2 第三者
Claims (6)
- ユーザの発話音声を認識する音声認識装置であって、
前記ユーザに要求する発話内容が第三者に聞かれたくない内容であるか否かに応じて任意の妨害音の出力を制御すると共に、前記ユーザの発話が終了したことに応じて前記妨害音の出力を停止する制御部を備える
ことを特徴とする音声認識装置。 - 前記制御部は、音楽出力手段が音楽を出力している場合、該音楽の出力音量を前記発話内容の聞き取りを妨害するレベルに制御することを特徴とする請求項1に記載の音声認識装置。
- 前記制御部は、ユーザに発話を求める場面及び状況とユーザからの要求信号の有無に基づいて、前記ユーザに要求する発話内容が第三者に聞かれたくない内容であるか否かを判別することを特徴とする請求項1又は2に記載の音声認識装置。
- 前記制御部は、音声入力装置を介して取得した音声データから前記妨害音を除去することにより前記ユーザの発話音声を認識することを特徴とする請求項1〜3のうち、いずれか1項に記載の音声認識装置。
- ユーザの発話音声を認識する音声認識方法であって、
前記ユーザに要求する発話内容が第三者に聞かれたくない内容であるか否かに応じて任意の妨害音の出力を制御すると共に、前記ユーザの発話が終了したことに応じて前記妨害音の出力を停止するステップを含む
ことを特徴とする音声認識方法。 - ユーザの発話音声を認識する音声認識プログラムであって、
前記ユーザに要求する発話内容が第三者に聞かれたくない内容であるか否かに応じて任意の妨害音の出力を制御すると共に、前記ユーザの発話が終了したことに応じて前記妨害音の出力を停止する処理をコンピュータに実行させる
ことを特徴とする音声認識プログラム。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018179407A JP2020052145A (ja) | 2018-09-25 | 2018-09-25 | 音声認識装置、音声認識方法、及び音声認識プログラム |
US16/567,301 US11276404B2 (en) | 2018-09-25 | 2019-09-11 | Speech recognition device, speech recognition method, non-transitory computer-readable medium storing speech recognition program |
CN201910864279.0A CN110942770B (zh) | 2018-09-25 | 2019-09-12 | 音声识别装置、音声识别方法、存储音声识别程序的非暂时性计算机可读介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2018179407A JP2020052145A (ja) | 2018-09-25 | 2018-09-25 | 音声認識装置、音声認識方法、及び音声認識プログラム |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2020052145A true JP2020052145A (ja) | 2020-04-02 |
Family
ID=69883292
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2018179407A Pending JP2020052145A (ja) | 2018-09-25 | 2018-09-25 | 音声認識装置、音声認識方法、及び音声認識プログラム |
Country Status (3)
Country | Link |
---|---|
US (1) | US11276404B2 (ja) |
JP (1) | JP2020052145A (ja) |
CN (1) | CN110942770B (ja) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2020052145A (ja) * | 2018-09-25 | 2020-04-02 | トヨタ自動車株式会社 | 音声認識装置、音声認識方法、及び音声認識プログラム |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004096664A (ja) * | 2002-09-04 | 2004-03-25 | Matsushita Electric Ind Co Ltd | ハンズフリー通話装置および方法 |
JP2007006363A (ja) * | 2005-06-27 | 2007-01-11 | Fujitsu Ltd | 電話機 |
JP2007256606A (ja) * | 2006-03-23 | 2007-10-04 | Aruze Corp | 出音システム |
Family Cites Families (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3138370B2 (ja) * | 1993-09-09 | 2001-02-26 | 株式会社日立製作所 | 情報処理装置 |
US6963759B1 (en) * | 1999-10-05 | 2005-11-08 | Fastmobile, Inc. | Speech recognition technique based on local interrupt detection |
US6937977B2 (en) * | 1999-10-05 | 2005-08-30 | Fastmobile, Inc. | Method and apparatus for processing an input speech signal during presentation of an output audio signal |
US20010044786A1 (en) * | 2000-03-14 | 2001-11-22 | Yoshihito Ishibashi | Content usage management system and method, and program providing medium therefor |
CN1618203A (zh) * | 2001-12-15 | 2005-05-18 | 汤姆森特许公司 | 视频会议带宽选择机制 |
US20040125922A1 (en) * | 2002-09-12 | 2004-07-01 | Specht Jeffrey L. | Communications device with sound masking system |
WO2006021943A1 (en) * | 2004-08-09 | 2006-03-02 | Nice Systems Ltd. | Apparatus and method for multimedia content based |
US20060109983A1 (en) * | 2004-11-19 | 2006-05-25 | Young Randall K | Signal masking and method thereof |
JP2006215206A (ja) * | 2005-02-02 | 2006-08-17 | Canon Inc | 音声処理装置およびその制御方法 |
JP4765394B2 (ja) | 2005-05-10 | 2011-09-07 | トヨタ自動車株式会社 | 音声対話装置 |
KR100735557B1 (ko) * | 2005-10-12 | 2007-07-04 | 삼성전자주식회사 | 음성 신호를 감쇄하고 마스킹하여 음성 신호를 교란시키는방법 및 장치 |
US20070208806A1 (en) * | 2006-03-02 | 2007-09-06 | Sun Microsystems, Inc. | Network collaboration system with conference waiting room |
US8886537B2 (en) * | 2007-03-20 | 2014-11-11 | Nuance Communications, Inc. | Method and system for text-to-speech synthesis with personalized voice |
US7689421B2 (en) * | 2007-06-27 | 2010-03-30 | Microsoft Corporation | Voice persona service for embedding text-to-speech features into software programs |
KR20110042315A (ko) * | 2008-07-18 | 2011-04-26 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 공공 장소들에서 사적 대화들을 엿듣는 것을 방지하기 위한 방법 및 시스템 |
US8983845B1 (en) * | 2010-03-26 | 2015-03-17 | Google Inc. | Third-party audio subsystem enhancement |
WO2012063963A1 (ja) * | 2010-11-11 | 2012-05-18 | 日本電気株式会社 | 音声認識装置、音声認識方法、および音声認識プログラム |
JP2012113130A (ja) * | 2010-11-25 | 2012-06-14 | Yamaha Corp | サウンドマスキング装置 |
JP5695447B2 (ja) * | 2011-03-01 | 2015-04-08 | 株式会社東芝 | テレビジョン装置及び遠隔操作装置 |
US8972251B2 (en) * | 2011-06-07 | 2015-03-03 | Qualcomm Incorporated | Generating a masking signal on an electronic device |
JP2013019803A (ja) | 2011-07-12 | 2013-01-31 | Mitsubishi Motors Corp | 運転支援装置 |
US9230556B2 (en) * | 2012-06-05 | 2016-01-05 | Apple Inc. | Voice instructions during navigation |
US8670986B2 (en) * | 2012-10-04 | 2014-03-11 | Medical Privacy Solutions, Llc | Method and apparatus for masking speech in a private environment |
KR102069863B1 (ko) * | 2012-11-12 | 2020-01-23 | 삼성전자주식회사 | 입력 수단의 결제 기능을 제어하는 전자 장치 및 방법 |
JP2014130251A (ja) * | 2012-12-28 | 2014-07-10 | Glory Ltd | 会話保護システム及び会話保護方法 |
US9697831B2 (en) * | 2013-06-26 | 2017-07-04 | Cirrus Logic, Inc. | Speech recognition |
US20150117439A1 (en) * | 2013-10-24 | 2015-04-30 | Vonage Network, Llc | Systems and methods for controlling telephony communications |
US20150230022A1 (en) * | 2014-02-07 | 2015-08-13 | Samsung Electronics Co., Ltd. | Wearable electronic system |
WO2016051519A1 (ja) * | 2014-09-30 | 2016-04-07 | 三菱電機株式会社 | 音声認識システム |
US9489172B2 (en) * | 2015-02-26 | 2016-11-08 | Motorola Mobility Llc | Method and apparatus for voice control user interface with discreet operating mode |
US9715283B2 (en) * | 2015-02-26 | 2017-07-25 | Motorola Mobility Llc | Method and apparatus for gesture detection in an electronic device |
JP2016177204A (ja) * | 2015-03-20 | 2016-10-06 | ヤマハ株式会社 | サウンドマスキング装置 |
JP2016177205A (ja) * | 2015-03-20 | 2016-10-06 | ヤマハ株式会社 | サウンドマスキング装置 |
CN106657552B (zh) * | 2016-11-30 | 2019-08-06 | Oppo广东移动通信有限公司 | 防止监听的方法、装置及终端 |
JP2020052145A (ja) * | 2018-09-25 | 2020-04-02 | トヨタ自動車株式会社 | 音声認識装置、音声認識方法、及び音声認識プログラム |
US11915123B2 (en) * | 2019-11-14 | 2024-02-27 | International Business Machines Corporation | Fusing multimodal data using recurrent neural networks |
US11776557B2 (en) * | 2020-04-03 | 2023-10-03 | Electronics And Telecommunications Research Institute | Automatic interpretation server and method thereof |
-
2018
- 2018-09-25 JP JP2018179407A patent/JP2020052145A/ja active Pending
-
2019
- 2019-09-11 US US16/567,301 patent/US11276404B2/en active Active
- 2019-09-12 CN CN201910864279.0A patent/CN110942770B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004096664A (ja) * | 2002-09-04 | 2004-03-25 | Matsushita Electric Ind Co Ltd | ハンズフリー通話装置および方法 |
JP2007006363A (ja) * | 2005-06-27 | 2007-01-11 | Fujitsu Ltd | 電話機 |
JP2007256606A (ja) * | 2006-03-23 | 2007-10-04 | Aruze Corp | 出音システム |
Also Published As
Publication number | Publication date |
---|---|
CN110942770B (zh) | 2023-07-28 |
US11276404B2 (en) | 2022-03-15 |
CN110942770A (zh) | 2020-03-31 |
US20200098371A1 (en) | 2020-03-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11348595B2 (en) | Voice interface and vocal entertainment system | |
CN109714663B (zh) | 一种耳机的控制方法、耳机及存储介质 | |
US8705753B2 (en) | System for processing sound signals in a vehicle multimedia system | |
JP4260046B2 (ja) | 音声明瞭度改善装置及び音声明瞭度改善方法 | |
CN107995360B (zh) | 通话处理方法及相关产品 | |
US10140089B1 (en) | Synthetic speech for in vehicle communication | |
JP4209247B2 (ja) | 音声認識装置および方法 | |
JP2010156826A (ja) | 音響制御装置 | |
JP2013531273A (ja) | スピーカ及びマイクロホンを備える音声認識システムを調整する方法、及び音声認識システム | |
JP2012163692A (ja) | 音声信号処理システム、音声信号処理方法および音声信号処理方法プログラム | |
CN110942770B (zh) | 音声识别装置、音声识别方法、存储音声识别程序的非暂时性计算机可读介质 | |
JP5593759B2 (ja) | 通話音声処理装置、通話音声制御装置および方法 | |
JP2008167319A (ja) | ヘッドホンシステム、ヘッドホン駆動制御装置およびヘッドホン | |
JP2004013084A (ja) | 音量制御装置 | |
CN111464902A (zh) | 信息处理方法、装置及耳机和存储介质 | |
JP6995254B2 (ja) | 音場制御装置及び音場制御方法 | |
JP4765394B2 (ja) | 音声対話装置 | |
JP7474548B2 (ja) | オーディオデータの再生の制御 | |
KR20220091151A (ko) | 차량용 능동 소음 제어 장치 및 그 제어 방법 | |
JP2007219122A (ja) | 音響機器及びプログラム | |
JP4353084B2 (ja) | 映像再生方法及び装置及びプログラム | |
JP4493557B2 (ja) | 音声信号判断装置 | |
WO2021245871A1 (ja) | 通話環境生成方法、通話環境生成装置、プログラム | |
JP7105320B2 (ja) | 音声認識装置、音声認識装置の制御方法、コンテンツ再生装置、及びコンテンツ送受信システム | |
Schmidt et al. | Evaluation of in-car communication systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RD01 | Notification of change of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7426 Effective date: 20181002 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A821 Effective date: 20181002 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20210211 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20211215 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20211221 |
|
A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20220614 |