JP6786139B1 - Voice input device - Google Patents
Voice input device
- Publication number
- JP6786139B1 (application JP2020116321A)
- Authority
- JP
- Japan
- Prior art keywords
- sound
- unit
- voice
- wearer
- sound collecting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S5/00—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
- G01S5/18—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/033—Headphones for stereophonic communication
- H04R5/0335—Earpiece support, e.g. headbands or neckrests
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/02—Details casings, cabinets or mounting therein for transducers covered by H04R1/02 but not provided for in any of its subgroups
- H04R2201/028—Structural combinations of loudspeakers with built-in power amplifiers, e.g. in the same acoustic enclosure
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/40—Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
- H04R2201/401—2D or 3D arrays of transducers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2420/00—Details of connection covered by H04R, not provided for in its groups
- H04R2420/07—Applications of wireless loudspeakers or wireless microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2460/00—Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
- H04R2460/03—Aspects of the reduction of energy consumption in hearing devices
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2460/00—Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
- H04R2460/07—Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Otolaryngology (AREA)
- Quality & Reliability (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Circuit For Audible Band Transducer (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
- Details Of Audible-Bandwidth Transducers (AREA)
Abstract
Description
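12…tip surface 13…lower surface
14…upper surface 20…right arm (second arm)
21…flexible portion 22…tip surface
23…lower surface 24…upper surface
30…main body 31…hanging portion
32…main body housing 32a…transmissive portion
32b…grille 41…first sound collecting unit
42…second sound collecting unit 43…third sound collecting unit
44…fourth sound collecting unit 45…fifth sound collecting unit
46…sixth sound collecting unit 47…seventh sound collecting unit
50…operation unit 60…imaging unit
70…sensor unit 80…control unit
80a…voice analysis unit 80b…voice processing unit
80c…input analysis unit 80d…imaging control unit
80e…image analysis unit 81…storage unit
82…communication unit 83…proximity sensor
84…sound emitting unit 90…battery
100…neck-worn device (voice input device)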
Claims (6)
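- A voice input device comprising:
a first arm and a second arm that can be placed at positions on either side of a target sound source;
a plurality of sound collecting units provided at three or more locations on each of the first arm and the second arm; and
a voice analysis unit that identifies, based on the sound acquired by each sound collecting unit, the spatial position or direction of the sound source that emitted that sound,
wherein the voice analysis unit identifies the spatial positions or directions of separate sound sources, one from the sound acquired by the sound collecting units provided on the first arm and another from the sound acquired by the sound collecting units provided on the second arm.
- The voice input device according to claim 1, wherein
the voice input device is a neck-worn device, and
the target sound source is the mouth of the wearer of the voice input device.
- The voice input device according to claim 2, wherein
the voice analysis unit determines whether the sound source identified from the sound acquired by the sound collecting units provided on the first arm coincides with the mouth of a first interlocutor located on the first-arm side of the wearer, and determines whether the sound source identified from the sound acquired by the sound collecting units provided on the second arm coincides with the mouth of a second interlocutor located on the second-arm side of the wearer.
- The voice input device according to any one of claims 1 to 3, further comprising
a voice processing unit that, based on the position or direction of the sound source identified by the voice analysis unit, performs processing to enhance or suppress sound components contained in the voice data acquired by the sound collecting units.
- The voice input device according to claim 4, wherein
the voice processing unit, based on the position or direction of the sound source identified by the voice analysis unit, simultaneously performs processing to enhance and processing to suppress sound components contained in the voice data acquired by the sound collecting units.
- The voice input device according to any one of claims 1 to 5, wherein
the voice input device is a neck-worn device, and
further comprises one or more additional sound collecting units at a position corresponding to the back of the wearer's neck.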
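Claim 1 has each arm, with its three or more sound collecting units, identify the direction of its own sound source. The following is a minimal sketch of one common way to do that, not the method disclosed in the patent: it assumes a far-field (plane-wave) source, a 2D geometry, and arrival times that are already known (in practice they would have to be estimated, for example by cross-correlating the microphone signals). The microphone coordinates, the function names, and the speed-of-sound constant are all hypothetical choices made for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def solve_2x2(a11, a12, a21, a22, b1, b2):
    """Solve a 2x2 linear system with Cramer's rule."""
    det = a11 * a22 - a12 * a21
    if abs(det) < 1e-12:
        raise ValueError("microphones are collinear; direction is ambiguous")
    return ((b1 * a22 - a12 * b2) / det, (a11 * b2 - a21 * b1) / det)


def estimate_direction(mics, arrival_times, c=SPEED_OF_SOUND):
    """Return the unit vector pointing from the array toward a far-field source.

    mics: three (x, y) positions in metres; arrival_times: seconds.
    """
    (x0, y0), (x1, y1), (x2, y2) = mics
    t0, t1, t2 = arrival_times
    # Plane-wave model: t_i - t_0 = -((p_i - p_0) . u) / c,
    # which gives two linear equations in the two components of u.
    ux, uy = solve_2x2(x1 - x0, y1 - y0, x2 - x0, y2 - y0,
                       -c * (t1 - t0), -c * (t2 - t0))
    norm = math.hypot(ux, uy)
    return (ux / norm, uy / norm)


def simulate_arrivals(mics, angle_deg, c=SPEED_OF_SOUND):
    """Ideal arrival times for a plane wave coming from angle_deg."""
    ux = math.cos(math.radians(angle_deg))
    uy = math.sin(math.radians(angle_deg))
    return [-(x * ux + y * uy) / c for x, y in mics]


def bearing_deg(u):
    """Convert a direction vector to a bearing in degrees."""
    return math.degrees(math.atan2(u[1], u[0]))


# Hypothetical microphone layout: three non-collinear units per arm.
LEFT_ARM = [(-0.07, 0.00), (-0.07, 0.05), (-0.06, 0.10)]
RIGHT_ARM = [(0.07, 0.00), (0.07, 0.05), (0.06, 0.10)]

# Each arm localizes its own source independently of the other arm.
left_bearing = bearing_deg(
    estimate_direction(LEFT_ARM, simulate_arrivals(LEFT_ARM, 150.0)))
right_bearing = bearing_deg(
    estimate_direction(RIGHT_ARM, simulate_arrivals(RIGHT_ARM, 30.0)))
print(round(left_bearing, 1), round(right_bearing, 1))
```

Three non-collinear microphones are the minimum for an unambiguous 2D direction in this model, which is consistent with the claim's requirement of three or more units per arm. Estimating a position rather than a direction would need a near-field model, which this sketch does not attempt.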
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2020116321A JP6786139B1 (ja) | 2020-07-06 | 2020-07-06 | Voice input device |
EP21837976.6A EP4178220A1 (en) | 2020-07-06 | 2021-06-16 | Voice-input device |
CN202180049798.7A CN115868176A (zh) | 2020-07-06 | 2021-06-16 | Voice input device |
US18/014,752 US20230290369A1 (en) | 2020-07-06 | 2021-06-16 | Audio input device |
PCT/JP2021/022813 WO2022009626A1 (ja) | 2020-07-06 | 2021-06-16 | Voice input device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2020116321A JP6786139B1 (ja) | 2020-07-06 | 2020-07-06 | Voice input device |
Publications (2)
Publication Number | Publication Date |
---|---|
JP6786139B1 (ja) | 2020-11-18 |
JP2022014137A (ja) | 2022-01-19 |
Family
ID=73219996
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2020116321A Active JP6786139B1 (ja) | Voice input device | 2020-07-06 | 2020-07-06 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20230290369A1 (ja) |
EP (1) | EP4178220A1 (ja) |
JP (1) | JP6786139B1 (ja) |
CN (1) | CN115868176A (ja) |
WO (1) | WO2022009626A1 (ja) |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016063587A1 (ja) | 2014-10-20 | 2016-04-28 | Sony Corporation | Voice processing system |
JP6476938B2 (ja) * | 2015-02-04 | 2019-03-06 | Fuji Xerox Co., Ltd. | Voice analysis device, voice analysis system, and program |
CN108141654B (zh) * | 2015-10-13 | 2020-02-14 | Sony Corporation | Information processing device |
US20170303052A1 (en) * | 2016-04-18 | 2017-10-19 | Olive Devices LLC | Wearable auditory feedback device |
JP6947183B2 (ja) * | 2016-09-13 | 2021-10-13 | Sony Group Corporation | Sound source position estimation device and wearable device |
EP3518095A4 (en) * | 2016-09-23 | 2019-09-11 | Sony Corporation | INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD |
US20190138603A1 (en) * | 2017-11-06 | 2019-05-09 | Bose Corporation | Coordinating Translation Request Metadata between Devices |
JP2019122035A (ja) * | 2018-01-05 | 2019-07-22 | Onkyo Corporation | Audio input/output device |
2020
- 2020-07-06 JP JP2020116321A (JP6786139B1), Active
2021
- 2021-06-16 US US18/014,752 (US20230290369A1), Pending
- 2021-06-16 CN CN202180049798.7A (CN115868176A), Pending
- 2021-06-16 EP EP21837976.6A (EP4178220A1), Pending
- 2021-06-16 WO PCT/JP2021/022813 (WO2022009626A1), Unknown
Also Published As
Publication number | Publication date |
---|---|
EP4178220A1 (en) | 2023-05-10 |
JP2022014137A (ja) | 2022-01-19 |
US20230290369A1 (en) | 2023-09-14 |
CN115868176A (zh) | 2023-03-28 |
WO2022009626A1 (ja) | 2022-01-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9491553B2 (en) | | Method of audio signal processing and hearing aid system for implementing the same |
US10405081B2 (en) | | Intelligent wireless headset system |
US20160183014A1 (en) | | Hearing device with image capture capabilities |
CA3166345A1 (en) | | Hearing aid systems and methods |
EP3533237A1 (en) | | Facial recognition system |
US11432067B2 (en) | | Cancelling noise in an open ear system |
CN111935573A (zh) | | Audio enhancement method and apparatus, storage medium, and wearable device |
CN114697812A (zh) | | Sound collection method, electronic device, and *** |
JP2023511090A (ja) | | Stereo sound pickup method and apparatus, terminal device, and computer-readable storage medium |
CN113393856A (zh) | | Sound pickup method, apparatus, and electronic device |
CN108632695A (zh) | | An earphone |
JP2020113981A (ja) | | Hearing aid system |
CN109117819B (zh) | | Target object recognition method and apparatus, storage medium, and wearable device |
JP7095692B2 (ja) | | Information processing apparatus, control method therefor, and recording medium |
JP6290827B2 (ja) | | Method for processing an audio signal and hearing aid system |
JP6786139B1 (ja) | | Voice input device |
JP7118456B2 (ja) | | Neck-worn device |
JP6874437B2 (ja) | | Communication robot, program, and system |
WO2021095832A1 (ja) | | Neck-worn device |
KR101669463B1 (ko) | | Intelligent camera |
JP6853589B1 (ja) | | Neck-worn device |
JP2021082301A (ja) | | Neck-worn device |
US20230083358A1 (en) | | Earphone smartcase with audio processor |
US20220248131A1 (en) | | Sound acquisition apparatus and sound acquisition method |
US20240205614A1 (en) | | Integrated camera and hearing interface device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20200721 |
 | A871 | Explanation of circumstances concerning accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20200721 |
 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20200806 |
 | A975 | Report on accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A971005 Effective date: 20200817 |
 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20200825 |
 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20200826 |
 | TRDD | Decision of grant or rejection written | |
 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20200929 |
 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20201021 |
 | R150 | Certificate of patent or registration of utility model | Ref document number: 6786139 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |