EP0615226A2 - Noise reduction method for disturbed speech channels - Google Patents

Noise reduction method for disturbed speech channels

Info

Publication number
EP0615226A2
EP0615226A2 EP94102963A EP94102963A EP0615226A2 EP 0615226 A2 EP0615226 A2 EP 0615226A2 EP 94102963 A EP94102963 A EP 94102963A EP 94102963 A EP94102963 A EP 94102963A EP 0615226 A2 EP0615226 A2 EP 0615226A2
Authority
EP
European Patent Office
Prior art keywords
channels
speech
interference
noise reduction
individual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP94102963A
Other languages
German (de)
English (en)
Other versions
EP0615226B1 (fr)
EP0615226A3 (en)
Inventor
Klaus Dr. Ing. Linhard
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mercedes Benz Group AG
Original Assignee
Daimler Benz AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Daimler Benz AG
Publication of EP0615226A2
Publication of EP0615226A3
Application granted
Publication of EP0615226B1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming

Definitions

  • The invention relates to a method according to the preamble of patent claim 1.
  • Such a method is used in automatic speech recognition or in hands-free systems to improve speech quality, e.g. in offices or in a motor vehicle.
  • Disturbed speech is easier to understand if it is recorded on two or more channels. Speech and interference should be present in each channel.
  • The multi-channel signals are processed with digital signal processing.
  • The transit-time difference of the useful signal between the individual channels must first be determined, so that the individual channels can later be merged into one channel in the correct phase.
  • An acoustic directional lobe is thereby set.
  • The noise reduction is first carried out in each individual channel. Since the noise reduction does not work perfectly, distortions and artificial insertions (e.g. "musical tones") can occur. When the individually processed channels are merged, these errors are averaged and thus reduced.
  • The sum signal is then post-processed using the cross-correlation of the signals in the individual channels. It is assumed that interference and reverberation are less correlated between the channels than the useful signal.
  • A method for merging two disturbed speech channels is described in the publications "Multimicrophone signal-processing technique to remove room reverberation from speech signals" by Allen, Berkley and Blauert (J. Acoust. Soc. Am., Vol. 62, No. 4, October 1977) and "Noise Suppression Signal Processing Using 2-Point Received Signals" by Kaneda and Tohyama (Electronics and Communications in Japan, Vol. 67-A, No. 12, 1984).
  • The first method is intended for the dereverberation of speech signals. It does not use a true phase compensation of the useful signal, and the dereverberation with noise reduction is carried out only in a post-processing stage.
  • The second method uses a simple linear phase compensation of the channels, but here too the noise is reduced only in the post-processing stage.
  • The invention is therefore based on the object of specifying a method for noise reduction in which the noise reduction is carried out in several stages and a significant improvement in speech quality is achieved.
  • The microphone signals x and y are transformed into the frequency domain (FFT, Fast Fourier Transform).
  • The segments overlap by half and are weighted with a Hanning window.
  • The segments are each N values long and are padded with an additional N zeros.
  • This yields the transformed segments $X_l(i)$ and $Y_l(i)$.
  • The output signal z is obtained after inverse transformation and overlapping of the segments.
  • The sampling rate of the signals x and y is, for example, 12 kHz.
  • The long-term mean of the magnitude spectrum is subtracted (spectral subtraction $H_{SPS}$).
  • The short-term average K and the long-term average L are used to calculate a first adaptive smoothing constant $\alpha$.
  • The interference spectrum $S_{nn}(i)$ is estimated with $\alpha$.
  • This adaptive smoothing constant replaces the otherwise customary speech pause detector.
  • $\tilde{S}_{xx,l}(i) = (1 - \beta_l)\,\tilde{S}_{xx,l-1}(i) + \beta_l\,S_{xx,l}(i)$
  • Part of the background noise is deliberately retained to create a natural auditory impression and to mask part of the "musical tones".
  • A second adaptive smoothing with $\beta$ is used to reduce a further part of the "musical tones": the power density $S_{xx}$ is smoothed only slightly during speech and strongly during pauses.
  • The method specified in the unpublished patent application P 42 43 831 is used to calculate the linear phase shift between the useful signal components in the channels. This method fits seamlessly into the noise reduction method according to the invention.
  • The phase shift is estimated from a selected number of the maxima of the cross power density, and the phase correction is achieved by multiplication in the frequency domain with the all-pass function $H_{ALLP}$.
  • $\tilde{X}_l(i) := \tilde{X}_l(i)\,H_{ALLP}$
  • $\tilde{X}_l(i) := \tilde{X}_l(i)\,(\cos(i\tau) + j\,\sin(i\tau))$
  • The phase correction is carried out for the other channel.
  • The first channel serves as the reference.
  • The directional filters for the channels are calculated using a beamforming method. Various cases can be considered as noise, and different directional filters $H_R$ result depending on the noise situation. One set of these filters is selected; if the system status is known in later operation, however, it is possible to switch to a specific set, or the filters can be adapted continuously.
  • The beamforming method is the Frost gradient method ("An Algorithm for Linearly Constrained Adaptive Array Processing", Proc. IEEE, Vol. 60, No. 8, 1972) or the method according to Sondhi and Elko ("Adaptive Optimization of Microphone Arrays under a Nonlinear Constraint", Int. Conf. on ASSP, Tokyo, 1986, pp. 981-984).
  • The method according to the invention is not limited to systems with two channels, but can also be used for multi-channel systems (three or more channels).
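The segmentation described above (half-overlapping Hanning-weighted segments of N values, padded with N zeros, transformed by FFT and recombined by overlapping after the inverse transform) can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `analyze` and `synthesize`, the default N = 256 and the use of a periodic Hann window are assumptions made for the example.

```python
import numpy as np

def analyze(x, n=256):
    """Return the FFT of half-overlapping, Hann-weighted, zero-padded segments.

    x is one microphone signal (1-D float array with at least n samples).
    The result has shape (segments, n + 1): row l holds X_l(i).
    """
    hop = n // 2
    # Periodic Hann window (assumption): copies shifted by n/2 sum to exactly 1.
    window = np.hanning(n + 1)[:n]
    count = (len(x) - n) // hop + 1
    frames = np.stack([x[l * hop:l * hop + n] * window for l in range(count)])
    # Each segment is N values long and is extended by N zeros.
    padded = np.concatenate([frames, np.zeros_like(frames)], axis=1)
    return np.fft.rfft(padded, axis=1)

def synthesize(spectra, n=256):
    """Inverse-transform every segment and overlap-add them into the output z."""
    hop = n // 2
    segments = np.fft.irfft(spectra, n=2 * n, axis=1)
    z = np.zeros(hop * (len(spectra) - 1) + 2 * n)
    for l, segment in enumerate(segments):
        z[l * hop:l * hop + 2 * n] += segment
    return z
```

With unmodified spectra, `synthesize(analyze(x))` reproduces `x` everywhere except in the first and last half segment, where only one window contributes. The zero padding leaves room for the tail produced when the spectra are multiplied by filters before the inverse transform.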
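The single-channel stage described above (subtracting a long-term magnitude estimate, steering the estimate with an adaptive smoothing constant derived from a short-term average K and a long-term average L, and retaining part of the background noise) might look roughly like the sketch below. The text does not state how the smoothing constant is computed from K and L, so the rule used here is purely an illustrative assumption, as are the function name `spectral_subtraction` and all parameter values.

```python
import numpy as np

def spectral_subtraction(X, alpha_max=0.1, floor=0.1, k_smooth=0.5, l_smooth=0.02):
    """Apply a magnitude spectral-subtraction gain to spectra X (segments x bins)."""
    mag = np.abs(X)
    noise = mag[0].copy()          # noise magnitude estimate, seeded from segment 0 (assumption)
    K = L = mag[0].mean()          # short-term and long-term level averages
    out = np.empty_like(X)
    for l, frame in enumerate(mag):
        level = frame.mean()
        K += k_smooth * (level - K)    # fast-tracking average
        L += l_smooth * (level - L)    # slow-tracking average
        # ASSUMED rule, not taken from the patent: adapt quickly while the
        # short-term level does not exceed the long-term level (likely a
        # pause) and slowly while it does (likely speech).
        alpha = alpha_max * min(1.0, (L / max(K, 1e-12)) ** 2)
        noise += alpha * (frame - noise)   # first-order recursive smoothing
        # Subtraction expressed as a gain; the floor keeps part of the
        # background noise to mask "musical tones".
        gain = np.maximum(1.0 - noise / np.maximum(frame, 1e-12), floor)
        out[l] = gain * X[l]
    return out
```

Because the smoothing constant shrinks whenever the short-term level rises above the long-term level, the noise estimate is updated mainly during pauses, which is the role a speech pause detector would otherwise play.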
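The phase correction and merging of two channels can be illustrated with the sketch below. The estimator of application P 42 43 831 is not reproduced in the text, so `estimate_tau` uses a simple grid search over the strongest bins of the cross power density as a stand-in; that search, the function names and the parameter defaults are assumptions. Only `align` follows the all-pass multiplication with cos(i·τ) + j·sin(i·τ) given above.

```python
import numpy as np

def estimate_tau(X, Y, n_peaks=8, max_tau=0.5, steps=2001):
    """Estimate the linear phase slope tau between reference X and channel Y.

    Stand-in estimator (assumption): pick the n_peaks largest maxima of the
    cross power density and choose the tau that best aligns their phases.
    """
    cross = X * np.conj(Y)
    peaks = np.argsort(np.abs(cross))[-n_peaks:]
    taus = np.linspace(-max_tau, max_tau, steps)
    # score(tau) = Re sum_i cross(i) * exp(-j * i * tau) over the selected bins
    score = np.real(np.exp(-1j * np.outer(taus, peaks)) @ cross[peaks])
    return taus[np.argmax(score)]

def align(Y, tau):
    """Multiply by the all-pass function cos(i*tau) + j*sin(i*tau)."""
    i = np.arange(len(Y))
    return Y * (np.cos(i * tau) + 1j * np.sin(i * tau))

def merge(X, Y, tau):
    """Average the reference channel with the phase-corrected second channel."""
    return 0.5 * (X + align(Y, tau))
```

For more than two channels the same correction would be applied to every non-reference channel before averaging. Averaging phase-aligned channels reinforces the useful signal, while uncorrelated processing errors partly cancel.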

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
EP94102963A 1993-03-11 1994-02-28 Noise reduction method for disturbed speech channels Expired - Lifetime EP0615226B1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE4307688A DE4307688A1 (de) 1993-03-11 1993-03-11 Verfahren zur Geräuschreduktion für gestörte Sprachkanäle
DE4307688 1993-03-11

Publications (3)

Publication Number Publication Date
EP0615226A2 (fr) 1994-09-14
EP0615226A3 (en) 1995-08-23
EP0615226B1 (fr) 1999-05-06

Family

ID=6482502

Family Applications (1)

Application Number Title Priority Date Filing Date
EP94102963A Expired - Lifetime EP0615226B1 (fr) 1993-03-11 1994-02-28 Noise reduction method for disturbed speech channels

Country Status (2)

Country Link
EP (1) EP0615226B1 (fr)
DE (2) DE4307688A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998003965A1 (fr) * 1996-07-19 1998-01-29 Daimler-Benz Ag Procede pour reduire les parasites dans un signal vocal
EP1251493A2 (fr) * 2001-04-14 2002-10-23 DaimlerChrysler AG Procédé pour la réduction du bruit avec fréquence parasite auto-adaptative
EP1286333A1 (fr) * 2001-08-21 2003-02-26 Culturecom Technology (Macau) Ltd. Procédé et dispositif de traitements d'un signal de son
US11741973B2 (en) 2015-03-09 2023-08-29 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19942868A1 (de) * 1999-09-08 2001-03-15 Volkswagen Ag Verfahren zum Betrieb einer Mehrfachmikrofonanordnung in einem Kraftfahrzeug sowie Mehrfachmikrofonanordnung selbst
DE19955156A1 (de) * 1999-11-17 2001-06-21 Univ Karlsruhe Verfahren und Vorrichtung zur Unterdrückung eines Störsignalanteils im Ausgangssignal eines Schallwandlermittels
DE10120231A1 (de) * 2001-04-19 2002-10-24 Deutsche Telekom Ag Verfahren und Anordnung zur einkanaligen Geräuschreduktion für gestörte Sprachsignale

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4112430A (en) * 1977-06-01 1978-09-05 The United States Of America As Represented By The Secretary Of The Navy Beamformer for wideband signals
US4653102A (en) * 1985-11-05 1987-03-24 Position Orientation Systems Directional microphone system

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4066842A (en) * 1977-04-27 1978-01-03 Bell Telephone Laboratories, Incorporated Method and apparatus for cancelling room reverberation and noise pickup
JPS5715597A (en) * 1980-07-02 1982-01-26 Nippon Gakki Seizo Kk Microphone device
US4811404A (en) * 1987-10-01 1989-03-07 Motorola, Inc. Noise suppression system
JPH01118900A (ja) * 1987-11-01 1989-05-11 Ricoh Co Ltd 雑音抑圧装置
DE4012349A1 (de) * 1989-04-19 1990-10-25 Ricoh Kk Einrichtung zum beseitigen von geraeuschen
GB8911153D0 (en) * 1989-05-16 1989-09-20 Smiths Industries Plc Speech recognition apparatus and methods
US5267323A (en) * 1989-12-29 1993-11-30 Pioneer Electronic Corporation Voice-operated remote control system
DE4106405C2 (de) * 1990-03-23 1996-02-29 Ricoh Kk Geräuschunterdrückungseinrichtung für ein Spracherkennungsystem

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING 1991, Bd. 5, 14.Mai 1991 - 17.Mai 1991 TORONTO, CA, Seiten 3581-3584, KELLERMANN 'A self-steering digital microphone array' *
SIGNAL PROCESSING VI - THEORIES AND APPLICATIONS. PROCEEDINGS OF EUSIPCO-92, SIXTH EUROPEAN SIGNAL PROCESSING CONFERENCE, 24.August 1992 - 27.August 1992 BRUSSELS, BE, Seiten 1633-1636, LE BOUQUIN ET AL. 'Study of a noise cancellation system based on the coherence function' *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998003965A1 (fr) * 1996-07-19 1998-01-29 Daimler-Benz Ag Procede pour reduire les parasites dans un signal vocal
US6687669B1 (en) 1996-07-19 2004-02-03 Schroegmeier Peter Method of reducing voice signal interference
EP1251493A2 (fr) * 2001-04-14 2002-10-23 DaimlerChrysler AG Procédé pour la réduction du bruit avec fréquence parasite auto-adaptative
EP1251493A3 (fr) * 2001-04-14 2003-11-19 DaimlerChrysler AG Procédé pour la réduction du bruit avec fréquence parasite auto-adaptative
US7020291B2 (en) 2001-04-14 2006-03-28 Harman Becker Automotive Systems Gmbh Noise reduction method with self-controlling interference frequency
EP1286333A1 (fr) * 2001-08-21 2003-02-26 Culturecom Technology (Macau) Ltd. Procédé et dispositif de traitements d'un signal de son
US11741973B2 (en) 2015-03-09 2023-08-29 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
US11881225B2 (en) 2015-03-09 2024-01-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal

Also Published As

Publication number Publication date
DE4307688A1 (de) 1994-09-15
EP0615226B1 (fr) 1999-05-06
EP0615226A3 (en) 1995-08-23
DE59408194D1 (de) 1999-06-10

Similar Documents

Publication Publication Date Title
DE69409121T2 (de) Störreduktionssystem für ein binaurales hörgerät
DE69905035T2 (de) Rauschunterdrückung mittels spektraler subtraktion unter verwendung von linearem faltungsprodukt und kausaler filterung
DE69531136T2 (de) Verfahren und Einrichtung zur mehrkanaligen Kompensation eines akustischen Echos
US5400409A (en) Noise-reduction method for noise-affected voice channels
DE60316704T2 (de) Mehrkanalige spracherkennung in ungünstigen umgebungen
DE69932626T2 (de) Signalverarbeitungsvorrichtung und verfahren
DE102010023615B4 (de) Signalverarbeitungsvorrichtung und Signalverarbeitungsverfahren
EP0747880B1 (fr) Système de reconnaissance de la parole
DE4243831A1 (de) Verfahren zur Laufzeitschätzung an gestörten Sprachkanälen
EP1143416A2 (fr) Suppression de bruit dans le domaine temporel
DE102008027848A1 (de) Echounterdrücker, Echounterdrückungsverfahren und computerlesbares Speichermedium
EP3375204B1 (fr) Traitement de signal audio dans un véhicule
DE112007003625T5 (de) Echounterdrückungsvorrichtung, echounterdrückungssystem, Echounterdrückungsverfahren und Computerprogramm
DE69616724T2 (de) Verfahren und System für die Spracherkennung
EP1189419B1 (fr) Procede et appareil pour eliminer l'interference d'un haut-parleur sur de signaux de microphone
EP1155561B1 (fr) Dispositif et procede de suppression de bruit dans des installations telephoniques
EP3065417B1 (fr) Procede de suppression d'un bruit parasite dans un systeme acoustique
EP0615226B1 (fr) Procédé de réduction de bruit pour canaux de parole perturbés
DE69817461T2 (de) Verfahren und Vorrichtung zur optimierten Verarbeitung eines Störsignals während einer Tonaufnahme
DE602005000897T2 (de) Eingangsschallprozessor
DE102018117558A1 (de) Adaptives nachfiltern
DE10137348A1 (de) Verfahren und Schaltungsanordnung zur Geräuschreduktion bei der Sprachübertragung in Kommunikationssystemen
DE19729521B4 (de) Verfahren und Vorrichtung zur Störgeräusch- und Echounterdrückung
DE10025655B4 (de) Verfahren zum Entfernen einer unerwünschten Komponente aus einem Signal und System zum Unterscheiden zwischen unerwünschten und erwünschten Signalkomponenten
DE3230391C2 (fr)

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB IT

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB IT

17P Request for examination filed

Effective date: 19950920

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

17Q First examination report despatched

Effective date: 19980610

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: DAIMLERCHRYSLER AG

ITF It: translation for a ep patent filed

Owner name: BARZANO' E ZANARDO MILANO S.P.A.

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB IT

REF Corresponds to:

Ref document number: 59408194

Country of ref document: DE

Date of ref document: 19990610

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 19990702

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 59408194

Country of ref document: DE

Representative's name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 59408194

Country of ref document: DE

Representative's name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

Effective date: 20120411

Ref country code: DE

Ref legal event code: R081

Ref document number: 59408194

Country of ref document: DE

Owner name: NUANCE COMMUNICATIONS, INC. (N.D.GES.D. STAATE, US

Free format text: FORMER OWNER: HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH, 76307 KARLSBAD, DE

Effective date: 20120411

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20120224

Year of fee payment: 19

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

Owner name: NUANCE COMMUNICATIONS, INC., US

Effective date: 20120924

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20130228

Year of fee payment: 20

Ref country code: FR

Payment date: 20130301

Year of fee payment: 20

Ref country code: DE

Payment date: 20130220

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 59408194

Country of ref document: DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20140227

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140301

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20140227