EP1062487A1 - Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungen - Google Patents
Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungenInfo
- Publication number
- EP1062487A1 EP1062487A1 EP99914401A EP99914401A EP1062487A1 EP 1062487 A1 EP1062487 A1 EP 1062487A1 EP 99914401 A EP99914401 A EP 99914401A EP 99914401 A EP99914401 A EP 99914401A EP 1062487 A1 EP1062487 A1 EP 1062487A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- microphone
- transmission channel
- speech
- speaker
- correction unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000005540 biological transmission Effects 0.000 claims abstract description 31
- 238000012937 correction Methods 0.000 claims abstract description 16
- 238000000034 method Methods 0.000 claims abstract description 11
- 238000012549 training Methods 0.000 claims description 9
- 238000001514 detection method Methods 0.000 claims description 7
- 230000003044 adaptive effect Effects 0.000 claims description 5
- 230000005236 sound signal Effects 0.000 claims description 3
- 238000010183 spectrum analysis Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K15/00—Acoustics not otherwise provided for
- G10K15/08—Arrangements for producing a reverberation or echo sound
Definitions
- the invention relates to a device for speech recognition in which the speech is optionally converted into electrical signals by means of a microphone near the speaker and converted into electrical signals via a first transmission channel or into electrical signals by means of a microphone remote from the speaker and fed to the recognition system via a second transmission channel, and in which the recognition system compares the speech elements recorded by means of the respective microphone with speech elements previously learned in a training phase and, if they match, generates a recognition signal.
- the invention further relates to a method for recognizing speech.
- Speaker is held, and a microphone remote from the speaker, in which the microphone voices m in a hands-free state takes up a greater distance.
- the typical distance for a microphone close to the speaker is in the range from 0 to 30 cm, ie predominantly the direct sound is converted into electrical signals.
- the distance from the speaker is larger and sound elements mix due to ecno effects, wall reflections and direct sound. If the microphone close to the speaker is used during the training phase and the microphone remote from the speaker is used in later use, the detection rate already drops due to the different spatial transmission functions due to the different transmission paths.
- a correction unit is switched into the first transmission channel, which changes the electrical signal in such a way that it contains spatial transmission properties.
- the language which is input via a microphone close to the speaker is thus changed in the electrical signal in such a way that it has the same properties as the language which has been input via the microphone remote from the speaker.
- the correctness unit thus simulates the room acoustic influences for a relatively large voice transmission path.
- the correction unit simulates sound reflections on nearby objects and or the reverberation in rooms.
- Figure 2 shows a device according to Figure 1 with adaptive
- FIG. 1 shows a device for speech recognition, in which the speech is entered by a person 10 using a telephone.
- the speech is input through a microphone 14 close to the speaker, for example with the handset.
- the speech is converted into an electrical signal by the microphone 14 m and pre-amplified by an amplifier 16.
- a correction unit 15 changes the electrical signal in such a way that it has transmission properties of a room with a transmission path greater than the near range.
- this correction unit 15 simulates the reverberation in rooms and / or the sound reflections on nearby objects within the voice transmission path.
- Such sound reflections can originate, for example, from a table top, from a screen or from other objects.
- the reverberation in rooms comes from reflections on relatively distant objects, such as from the walls of the room.
- the electrical signal changed by the correction unit 15 passes through a compensation filter 16 which serves to compensate for varying microphone and amplifier frequency responses.
- the electrical signal is then fed to a data processing system 17 which carries out the further digital processing for speech recognition.
- the input of speech elements is shown via a hands-free system.
- the language of the person 10 is changed by a special room transmission function RUF, ie the speech elements arriving from the speaker 10 at the microphone 20 are reflections on nearby objects and through the reverberation _.r. Clear and if necessary superimposed by external noise.
- the electrical signal of the microphone 2-3 ir ⁇ remote from the speaker is pre-amplified by a preamplifier 22 and reaches a compensation filter 24 for compensation of the microphone and amplifier frequency response.
- the filtered electrical signal is fed to the data processing system 17 for speech recognition.
- speech samples are stored in the data processing system 17 during a training phase.
- a personal telephone book can be set up with the aid of such speech samples.
- the name of a participant is spoken at least twice during the training phase and with that for
- Name belonging phone number filed in a personal phone book After the end of the training phase, the name is re-entered in the use phase, with the data processing system 17 using recognition methods, for example spectral analysis or LPC ceptral analysis, trying to recognize this name on the basis of the previously stored names and, if the result is positive, the name output the stored telephone number and establish the telephone connection.
- recognition methods for example spectral analysis or LPC ceptral analysis
- the correction unit 14 After the correction unit 14 generates an electrical speech signal in the transmission channel 12, which has the same spatial characteristics as the speech signal of the second transmission channel 19, it does not matter for speech recognition whether the same microphone 14 or 20 is used during the training phase or during the recognition phase.
- the correction unit 15 therefore makes it possible to use the telephone both with the handset and in the hands-free mode.
- FIG. 2 shows a variant of the device according to FIG. 1.
- the correction unit 15 is designed as an adaptive filter, ie the filter parameters are dependent on the recorded audio signals varies. The detection rate can be increased in this way.
- the compensation filters 18 and 24 in the two transmission channels 12 and 19 are also designed as adaptive filters; their filter parameters are set depending on the recorded audio signals.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Circuit For Audible Band Transducer (AREA)
- Interconnected Communication Systems, Intercoms, And Interphones (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19811879 | 1998-03-18 | ||
DE19811879A DE19811879C1 (de) | 1998-03-18 | 1998-03-18 | Einrichtung und Verfahren zum Erkennen von Sprache |
PCT/DE1999/000289 WO1999048086A1 (de) | 1998-03-18 | 1999-02-03 | Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungen |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1062487A1 true EP1062487A1 (de) | 2000-12-27 |
EP1062487B1 EP1062487B1 (de) | 2003-06-11 |
Family
ID=7861400
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP99914401A Expired - Lifetime EP1062487B1 (de) | 1998-03-18 | 1999-02-03 | Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungen |
Country Status (6)
Country | Link |
---|---|
US (1) | US7043427B1 (de) |
EP (1) | EP1062487B1 (de) |
AT (1) | ATE242873T1 (de) |
DE (2) | DE19811879C1 (de) |
ES (1) | ES2201695T3 (de) |
WO (1) | WO1999048086A1 (de) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19963142A1 (de) * | 1999-12-24 | 2001-06-28 | Christoph Bueltemann | Verfahren und Vorrichtung zur sprachgesteuerten Bedienung eines Kiosksystems |
DE10052991A1 (de) * | 2000-10-19 | 2002-05-02 | Deutsche Telekom Ag | Verfahren zur Ermittlung raumakustischer und elektroakustischer Parameter |
US20070239441A1 (en) * | 2006-03-29 | 2007-10-11 | Jiri Navratil | System and method for addressing channel mismatch through class specific transforms |
US20090018826A1 (en) * | 2007-07-13 | 2009-01-15 | Berlin Andrew A | Methods, Systems and Devices for Speech Transduction |
US8696458B2 (en) * | 2008-02-15 | 2014-04-15 | Thales Visionix, Inc. | Motion tracking system and method using camera and non-camera sensors |
US7974841B2 (en) * | 2008-02-27 | 2011-07-05 | Sony Ericsson Mobile Communications Ab | Electronic devices and methods that adapt filtering of a microphone signal responsive to recognition of a targeted speaker's voice |
US11012732B2 (en) * | 2009-06-25 | 2021-05-18 | DISH Technologies L.L.C. | Voice enabled media presentation systems and methods |
US20150228274A1 (en) * | 2012-10-26 | 2015-08-13 | Nokia Technologies Oy | Multi-Device Speech Recognition |
US10229672B1 (en) | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5267323A (en) * | 1989-12-29 | 1993-11-30 | Pioneer Electronic Corporation | Voice-operated remote control system |
DE4312155A1 (de) * | 1993-04-14 | 1994-10-20 | Friedrich Dipl Ing Hiller | Verfahren und Vorrichtung zur Verbesserung der Erkennungsleistung und Erhöhung der Zuverlässigkeit bei der automatischen Spracherkennung in geräuschvoller Umgebung |
US5528731A (en) * | 1993-11-19 | 1996-06-18 | At&T Corp. | Method of accommodating for carbon/electret telephone set variability in automatic speaker verification |
US5515445A (en) * | 1994-06-30 | 1996-05-07 | At&T Corp. | Long-time balancing of omni microphones |
US5737485A (en) * | 1995-03-07 | 1998-04-07 | Rutgers The State University Of New Jersey | Method and apparatus including microphone arrays and neural networks for speech/speaker recognition systems |
US5765124A (en) * | 1995-12-29 | 1998-06-09 | Lucent Technologies Inc. | Time-varying feature space preprocessing procedure for telephone based speech recognition |
US6275800B1 (en) * | 1999-02-23 | 2001-08-14 | Motorola, Inc. | Voice recognition system and method |
US6219645B1 (en) * | 1999-12-02 | 2001-04-17 | Lucent Technologies, Inc. | Enhanced automatic speech recognition using multiple directional microphones |
-
1998
- 1998-03-18 DE DE19811879A patent/DE19811879C1/de not_active Expired - Fee Related
-
1999
- 1999-02-03 AT AT99914401T patent/ATE242873T1/de not_active IP Right Cessation
- 1999-02-03 ES ES99914401T patent/ES2201695T3/es not_active Expired - Lifetime
- 1999-02-03 US US09/646,315 patent/US7043427B1/en not_active Expired - Fee Related
- 1999-02-03 WO PCT/DE1999/000289 patent/WO1999048086A1/de active IP Right Grant
- 1999-02-03 DE DE59905927T patent/DE59905927D1/de not_active Expired - Lifetime
- 1999-02-03 EP EP99914401A patent/EP1062487B1/de not_active Expired - Lifetime
Non-Patent Citations (1)
Title |
---|
See references of WO9948086A1 * |
Also Published As
Publication number | Publication date |
---|---|
ATE242873T1 (de) | 2003-06-15 |
DE59905927D1 (de) | 2003-07-17 |
WO1999048086A1 (de) | 1999-09-23 |
US7043427B1 (en) | 2006-05-09 |
ES2201695T3 (es) | 2004-03-16 |
EP1062487B1 (de) | 2003-06-11 |
DE19811879C1 (de) | 1999-05-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE10002321C2 (de) | Sprachgesteuerte Vorrichtung und System mit einer derartigen sprachgesteuerten Vorrichtung | |
DE3887786T2 (de) | Elektronisches Fernmeldegerät mit Rauschunterdrückungsfunktion. | |
EP0747880B1 (de) | Spracherkennungssystem | |
DE2207141A1 (de) | Schaltungsanordnung zur unterdrueckung unerwuenschter sprachsignale mittels eines vorhersagenden filters | |
DE10251113A1 (de) | Verfahren zum Betrieb eines Spracherkennungssystems | |
DE10043064B4 (de) | Verfahren und Vorrichtung zur Elimination von Lautsprecherinterferenzen aus Mikrofonsignalen | |
EP1062487A1 (de) | Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungen | |
DE2510731A1 (de) | Elektrische schaltung fuer eine hoerhilfe | |
EP1920589A1 (de) | Vorrichtung zur lageabhängigen steuerung | |
DE102006001730A1 (de) | Beschallungsanlage, Verfahren zur Verbesserung der Sprachqualität und/oder Verständlichkeit von Sprachdurchsagen sowie Computerprogramm | |
DE19827197A1 (de) | Verfahren und Einrichtung zur Beeinflussung der Lautstärke von Audiowiedergabegeräten in Kraftfahrzeugen | |
DE102014210760B4 (de) | Betrieb einer Kommunikationsanlage | |
DE4229910A1 (de) | Verfahren zum Verbessern der akustischen Rückhördämpfung von elektroakustischen Anlagen | |
DE3734446C2 (de) | ||
DE3734447C1 (de) | Konferenz-Freisprecheinrichtung | |
EP1501190B1 (de) | Verfahren und Vorrichtung zum Entzerren eines mit einem äusseren Störsignal behafteten Audiosignals | |
WO1985001411A1 (en) | Telephone transmission installation | |
EP0309869B1 (de) | Verfahren zur Kompensation von mit Störungen behafteten Sprachsignalen für Spracherkennungssysteme | |
DE102021103310B4 (de) | Verfahren und vorrichtung zur verbesserung der sprachverständlichkeit in einem raum | |
EP3763144B1 (de) | Haupteinheit, system und verfahren für ein infotainmentsystem eines fahrzeugs | |
EP0898441A2 (de) | Verfahren zur Eingabe von akustischen Signalen in ein elektrisches Gerät und elektrisches Gerät | |
DE102005017338A1 (de) | Kommunikationsterminal für eine bidirektionale Kommunikation und Verfahren zur bidirektionalen Kommunikation | |
DE102004044387B4 (de) | Kommunikationssystem | |
DE19956572B4 (de) | Verfahren zur Telekommunikation und Kommunikationsterminal | |
DE10136491B4 (de) | Verfahren und Vorrichtung zur Verbesserung der Sprachqualität auf transparenten Telekommunikations-Übertragungswegen |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20000822 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE DE ES FR GB IT NL |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Designated state(s): AT BE DE ES FR GB IT NL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20030611 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REF | Corresponds to: |
Ref document number: 59905927 Country of ref document: DE Date of ref document: 20030717 Kind code of ref document: P |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) | ||
NLV1 | Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040203 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20040228 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2201695 Country of ref document: ES Kind code of ref document: T3 |
|
ET | Fr: translation filed | ||
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20040312 |
|
BERE | Be: lapsed |
Owner name: *SIEMENS A.G. Effective date: 20040228 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: 732E Free format text: REGISTERED BETWEEN 20121025 AND 20121031 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: TP Owner name: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. K, DE Effective date: 20130108 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: PC2A Owner name: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. KG Effective date: 20130228 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENT, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENTANWAELTE, DE Effective date: 20130313 Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENT, DE Effective date: 20130313 Ref country code: DE Ref legal event code: R081 Ref document number: 59905927 Country of ref document: DE Owner name: UNIFY GMBH & CO. KG, DE Free format text: FORMER OWNER: SIEMENS AKTIENGESELLSCHAFT, 80333 MUENCHEN, DE Effective date: 20130313 Ref country code: DE Ref legal event code: R081 Ref document number: 59905927 Country of ref document: DE Owner name: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. K, DE Free format text: FORMER OWNER: SIEMENS AKTIENGESELLSCHAFT, 80333 MUENCHEN, DE Effective date: 20130313 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENT, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENTANWAELTE, DE Effective date: 20131112 Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENT, DE Effective date: 20131112 Ref country code: DE Ref legal event code: R081 Ref document number: 59905927 Country of ref document: DE Owner name: UNIFY GMBH & CO. KG, DE Free format text: FORMER OWNER: SIEMENS ENTERPRISE COMMUNICATIONS GMBH & CO. KG, 81379 MUENCHEN, DE Effective date: 20131112 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: CD Owner name: UNIFY GMBH & CO.KG, DE Effective date: 20140429 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: PC2A Owner name: UNIFY GMBH & CO.KG Effective date: 20140708 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20160218 Year of fee payment: 18 Ref country code: IT Payment date: 20160222 Year of fee payment: 18 Ref country code: ES Payment date: 20160223 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20160222 Year of fee payment: 18 Ref country code: GB Payment date: 20160222 Year of fee payment: 18 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 59905927 Country of ref document: DE Representative=s name: FRITZSCHE PATENTANWAELTE, DE Ref country code: DE Ref legal event code: R081 Ref document number: 59905927 Country of ref document: DE Owner name: UNIFY GMBH & CO. KG, DE Free format text: FORMER OWNER: UNIFY GMBH & CO. KG, 81379 MUENCHEN, DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 59905927 Country of ref document: DE |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20170203 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20171031 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170901 Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170228 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170203 Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170203 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FD2A Effective date: 20180703 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: ES Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170204 |