EP2434781A1 - Procédé de reconstruction d'un signal vocal et dispositif auditif - Google Patents

Procédé de reconstruction d'un signal vocal et dispositif auditif Download PDF

Info

Publication number
EP2434781A1
EP2434781A1 EP11182407A EP11182407A EP2434781A1 EP 2434781 A1 EP2434781 A1 EP 2434781A1 EP 11182407 A EP11182407 A EP 11182407A EP 11182407 A EP11182407 A EP 11182407A EP 2434781 A1 EP2434781 A1 EP 2434781A1
Authority
EP
European Patent Office
Prior art keywords
amplitude spectrum
input signal
speech
spectrum
predetermined amplitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP11182407A
Other languages
German (de)
English (en)
Inventor
Ulrich Giese
Alexander Grafenberg
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sivantos Pte Ltd
Original Assignee
Siemens Medical Instruments Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Siemens Medical Instruments Pte Ltd filed Critical Siemens Medical Instruments Pte Ltd
Publication of EP2434781A1 publication Critical patent/EP2434781A1/fr
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/35Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception using translation techniques
    • H04R25/353Frequency, e.g. frequency shift or compression
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/35Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception using translation techniques
    • H04R25/356Amplitude, e.g. amplitude shift or compression
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L2021/065Aids for the handicapped in understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43Signal processing in hearing aids to enhance the speech intelligibility
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/03Synergistic effects of band splitting and sub-band processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50Customised settings for obtaining desired overall acoustical characteristics

Definitions

  • the present invention relates to a method for reconstructing a speech signal. Moreover, the present invention relates to a hearing device with which a speech signal can be reconstructed.
  • hearing device is here understood to mean any sound-emitting device which can be worn on or in the ear, in particular a hearing device, a headset, headphones and the like.
  • Hearing aids are portable hearing aids that are used to care for the hearing impaired.
  • different types of hearing aids such as behind-the-ear hearing aids (BTE), hearing aid with external receiver (RIC: receiver in the canal) and in-the-ear hearing aids (IDO), e.g. Concha hearing aids or canal hearing aids (ITE, CIC).
  • BTE behind-the-ear hearing aids
  • RIC hearing aid with external receiver
  • IDO in-the-ear hearing aids
  • ITE canal hearing aids
  • the hearing aids listed by way of example are worn on the outer ear or in the ear canal.
  • bone conduction hearing aids, implantable or vibrotactile hearing aids are also available on the market. The stimulation of the damaged hearing takes place either mechanically or electrically.
  • Hearing aids have in principle as essential components an input transducer, an amplifier and an output transducer.
  • the input transducer is usually a sound receiver, z. As a microphone, and / or an electromagnetic receiver, for. B. an induction coil.
  • the output transducer is usually used as an electroacoustic transducer, z. As miniature speaker, or as an electromechanical transducer, z. B. bone conduction, realized.
  • the amplifier is usually integrated in a signal processing unit. This basic structure is in FIG. 1 shown using the example of a behind-the-ear hearing aid. In a hearing aid housing 1 for carrying behind the ear are one or more microphones 2 to Built-in sound recording from the environment.
  • a signal processing unit 3 which is also integrated in the hearing aid housing 1, processes the microphone signals and amplifies them.
  • the output signal of the signal processing unit 3 is transmitted to a loudspeaker or earpiece 4, which outputs an acoustic signal.
  • the sound is optionally transmitted via a sound tube, which is fixed with an earmold in the ear canal, to the eardrum of the device carrier.
  • the power supply of the hearing device and in particular the signal processing unit 3 is effected by a likewise integrated into the hearing aid housing 1 battery. 5
  • Speech intelligibility An essential aspect in the care of the hearing-impaired with hearing aids is speech intelligibility. This means that a word or a word component should also be recognized as such or as such by the hearing aid wearer.
  • An essential role in speech intelligibility is played by the consonants, in particular, for example, the "S". In the listening situation "speech in noise”, consonants are often not clearly audible or are heard as other consonants. For example, the word “nonfiction" can be heard as a "textbook”.
  • noise reduction algorithms or speech enhancement algorithms are mostly used.
  • speech in broadband noise only a directional microphone increases speech intelligibility.
  • Directional microphones work only meaningful if the noise and the language come from different directions.
  • Other noise reduction algorithms e.g. Wiener filters, do not increase speech intelligibility in noise. At best they reduce the listening effort.
  • the object of the present invention is thus to propose a method and a hearing device with which increased speech intelligibility can be ensured.
  • this object is achieved by a method for reconstructing a speech signal by storing a predetermined amplitude spectrum of a speech component, detecting an amplitude spectrum of an input signal containing the speech signal, detecting at least a matching part and a mismatched part of the predetermined amplitude spectrum with the amplitude spectrum of the input signal and Changing a gain of the input signal in the mismatched portion of the amplitude spectrum such that a greater match than the original gain with the predetermined amplitude spectrum is achieved.
  • the invention provides a hearing device with which a speech signal is reconstructable, comprising a memory device for storing a predetermined amplitude spectrum of a speech component, a detection device for detecting an amplitude spectrum of an input signal containing the speech signal, a detection device for detecting at least one matching part and mismatched part of the predetermined amplitude spectrum with the amplitude spectrum of the input signal and an amplifying means, with which a gain of the speech signal in the mismatching part of the amplitude spectrum is variable such that a relation to an original gain further match with the predetermined amplitude spectrum is achieved.
  • the input signal containing the speech signal and any noise is examined for predetermined patterns in the amplitude spectrum. If certain patterns or pattern parts are detected in the amplitude spectrum of the input signal, the remaining amplitude spectrum can be adapted to the given pattern by changing the gain. Thus, for example, a predetermined Language component of an amplitude spectrum are "worked out".
  • the processing of the input signal takes place in a plurality of frequency channels, and each amplitude spectrum is characterized by one amplitude value per frequency channel. This is equivalent if the processing takes place in digital frequency values and an amplitude value is assigned to each frequency value in a specific amplitude spectrum.
  • the language component is a consonant. Consonants have a greater importance in speech intelligibility than vowels.
  • a predetermined amplitude spectrum of a plurality of speech components is stored, the amplitude spectrum of the input signal is checked for at least partial coincidence with each of the predetermined amplitude spectra, and the gain is changed in response to the at least partially coincident predetermined amplitude spectrum.
  • the reconstruction of several different consonants in an input signal can be achieved in a targeted manner if corresponding parts of amplitude spectra are detected.
  • the detection for matches can be limited to formants.
  • Formants are rapidly detectable in a spectrum and they carry the essential information for the distinctness of language components.
  • the gain may be changed to achieve full match with the predetermined amplitude spectrum. In this way, certain speech components can be made very clearly audible.
  • Detecting at least one matching part and a mismatched part of the predetermined amplitude spectrum with the amplitude spectrum of the input signal may include balancing the absolute values of the predetermined amplitude spectrum with the absolute values of the amplitude spectrum of the input signal.
  • the amplitude spectrum of the input signal it is not necessary for the amplitude spectrum of the input signal to be absolutely identical to the stored amplitude spectrum. Rather, a relative agreement of the spectral values is sufficient.
  • the input signal can be additionally amplified in its entirety or transferred to another frequency range. This further increases the audibility of the reconstructed speech component.
  • the inventive method for reconstructing a speech signal can be used particularly advantageously in signal processing in a hearing aid.
  • a corresponding time signal can be obtained, as in FIG. 2 symbolically indicated. From this time signal, a sample or snapshot sa with a certain time width can be obtained.
  • a short-term spectrum can be obtained from the temporal snapshot sa, which is described by way of example in FIG. 3 is shown.
  • the short-term spectrum of a consonant has a typical shape.
  • a consonant can be identified on the basis of its specific position of the formants 10, 11.
  • one or more consonants are now recorded in an undisturbed environment.
  • the spectrum of each consonant is, for example, sampled digitally and the individual samples 12 of the short-term spectrum 13 are stored in a memory device of the hearing device and in particular of the hearing device. In this way, for each consonant being recorded, a short-term spectrum can be stored in the hearing device.
  • the hearing device now permanently analyzes the input signal and searches for the spectral pattern of the consonant or the patterns of the stored consonants.
  • the consonant hereinafter the method is described by means of a single consonant
  • the background noise has the noise spectrum 14, while the consonant, ie the useful signal, has the spectrum 15.
  • the noise spectrum 14 predominates, while in a other section s, the signal spectrum 15, namely that of the spoken consonant predominates.
  • the consonant spectrum 15 is actually sampled by the signal processing.
  • the sampled spectrum 15 is compared with the stored spectrum 13.
  • this part of the spectrum 15 has, for example, a very characteristic form, it can be concluded that the stored consonant has been spoken. It is then assumed that the signal in the spectral range n is superimposed by a noise. The amplification is then lowered, for example channel by channel, so that the stored spectrum 13 of the consonant is also given in the spectral range n. This reduction in gain is in FIG. 4 symbolized by the arrows 16. The spectrum is thus reconstructed in the region n or extrapolated from the measured consonant spectrum 15 using the stored spectrum 13. The resulting spectrum corresponds to that of the stored spectrum, which was recorded without background noise. If the hearing aid wearer now reproduces the reconstructed spectrum, he hears the consonant more clearly, since the background noise was damped. He practically hears the consonant as if it were spoken calmly.
  • FIG. 4 Only a very small part of the short-term spectrum (area s) is detected as a prominent area. Whether this small area alone is sufficient to identify the corresponding consonant depends on the performance of the detection device in the hearing device. As a rule, a single peak or formant 10 will not be sufficient to correctly identify a consonant. Therefore be with FIG. 5 explains a second embodiment, in which an identification is easier to carry out.
  • the same consonant spectrum 15 is here superimposed with an interference spectrum 14 'of lower amplitude. Only in a very small area n 'dominates the noise. In the much larger area s' dominates the consonant spectrum 15. In particular, the formants protrude 10 and 11 on the interference spectrum 14 '.
  • the spoken consonant can be identified more easily by comparison with the stored spectrum 13 than in the case of FIG. 4 .
  • To reconstruct the entire spectrum only the gain in the region n 'according to the arrows 16' has to be reduced. The reconstructed spectrum then no longer has any interfering components. The hearing aid wearer perceives the spoken consonant as if it were spoken in peace.
  • the reconstructed consonants can then be further processed, e.g. through specific reinforcement. Likewise, the reconstructed consonants can be pushed, for example, by frequency shift / compression in an audible range for the hearing aid wearer.
  • Analogous to the reduction of noise components and the useful signal components of the speech component can be raised in terms of higher gain.
  • the entire spectrum is then raised uniformly, for example, in the regions s ', while in the region n' it is only channel-specifically raised in such a way that ultimately the pattern of the stored spectrum 13 is formed.
  • the present invention detect by statistical methods a spectral pattern of a speech component in a background noise.
  • the noisy pattern is then reconstructed from a known pattern by specific reduction of gain (in the relevant channels).
  • the reconstructed language component can be further processed.
  • the respective language component is thus suppressed, whereby an increased speech intelligibility in noise can be achieved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Otolaryngology (AREA)
  • Neurosurgery (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
EP11182407A 2010-09-27 2011-09-22 Procédé de reconstruction d'un signal vocal et dispositif auditif Withdrawn EP2434781A1 (fr)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE102010041435A DE102010041435A1 (de) 2010-09-27 2010-09-27 Verfahren zum Rekonstruieren eines Sprachsignals und Hörvorrichtung

Publications (1)

Publication Number Publication Date
EP2434781A1 true EP2434781A1 (fr) 2012-03-28

Family

ID=44674558

Family Applications (1)

Application Number Title Priority Date Filing Date
EP11182407A Withdrawn EP2434781A1 (fr) 2010-09-27 2011-09-22 Procédé de reconstruction d'un signal vocal et dispositif auditif

Country Status (3)

Country Link
US (1) US20120076331A1 (fr)
EP (1) EP2434781A1 (fr)
DE (1) DE102010041435A1 (fr)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10187112B2 (en) * 2015-02-04 2019-01-22 Viavi Solutions, Inc. Leakage detection in DOCSIS 3.1 environment
TWI662544B (zh) * 2018-05-28 2019-06-11 塞席爾商元鼎音訊股份有限公司 偵測環境噪音以改變播放語音頻率之方法及其聲音播放裝置
CN110570875A (zh) * 2018-06-05 2019-12-13 塞舌尔商元鼎音讯股份有限公司 检测环境噪音以改变播放语音频率的方法及声音播放装置
CN110648686B (zh) * 2018-06-27 2023-06-23 达发科技股份有限公司 调整语音频率的方法及其声音播放装置

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1901286A2 (fr) * 2006-09-13 2008-03-19 Fujitsu Limited Appareil d'amélioration de la parole, appareil d'enregistrement de la parole, programme d'amélioration de la parole, programme d'enregistrement de la parole, procédé d'amélioration de la parole et procédé d'enregistrement de la parole

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4454609A (en) * 1981-10-05 1984-06-12 Signatron, Inc. Speech intelligibility enhancement
GB8613327D0 (en) * 1986-06-02 1986-07-09 British Telecomm Speech processor
DE3733659A1 (de) * 1986-10-03 1988-04-21 Ricoh Kk Verfahren zum vergleichen von mustern
US5226084A (en) * 1990-12-05 1993-07-06 Digital Voice Systems, Inc. Methods for speech quantization and error correction
US5630011A (en) * 1990-12-05 1997-05-13 Digital Voice Systems, Inc. Quantization of harmonic amplitudes representing speech
US7065485B1 (en) * 2002-01-09 2006-06-20 At&T Corp Enhancing speech intelligibility using variable-rate time-scale modification
US7366656B2 (en) * 2003-02-20 2008-04-29 Ramot At Tel Aviv University Ltd. Method apparatus and system for processing acoustic signals
US7457741B2 (en) * 2004-03-30 2008-11-25 National Institute of Advnaced Industrial Science and Technology Device for transmitting speech information
US8983832B2 (en) * 2008-07-03 2015-03-17 The Board Of Trustees Of The University Of Illinois Systems and methods for identifying speech sound features

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1901286A2 (fr) * 2006-09-13 2008-03-19 Fujitsu Limited Appareil d'amélioration de la parole, appareil d'enregistrement de la parole, programme d'amélioration de la parole, programme d'enregistrement de la parole, procédé d'amélioration de la parole et procédé d'enregistrement de la parole

Also Published As

Publication number Publication date
DE102010041435A1 (de) 2012-03-29
US20120076331A1 (en) 2012-03-29

Similar Documents

Publication Publication Date Title
EP2603018B1 (fr) Dispositif auditif avec détection d'activité de locuteur et procédé de fonctionnement d'un dispositif auditif
EP3451705B1 (fr) Procédé et dispositif de reconnaissance rapide de voix propre
EP2229010B1 (fr) Appareil auditif et procédé de compensation du bruit dans un appareil auditif
EP1931172A1 (fr) Prothèse auditive avec suppression du bruit et procédé correspondant
DE102012200745A1 (de) Verfahren und Hörvorrichtung zum Schätzen eines Bestandteils der eigenen Stimme
DE102020207579A1 (de) Verfahren zur richtungsabhängigen Rauschunterdrückung für ein Hörsystem, welches eine Hörvorrichtung umfasst
EP2434781A1 (fr) Procédé de reconstruction d'un signal vocal et dispositif auditif
EP2495724B1 (fr) Procédé et dispositif destinés à l'évaluation d'un bruit parasite
DE102016221578B3 (de) Verfahren zur Takterkennung mittels eines Hörgerätes
DE102008046040A1 (de) Verfahren zum Betrieb einer Hörvorrichtung mit Richtwirkung und zugehörige Hörvorrichtung
EP1912470A2 (fr) Procédé pour la compression dynamique d'un signal audio et appareil auditif correspondant
EP1962554A2 (fr) Appareil auditif doté d'une séparation de signal d'erreur et procédé correspondant
EP1926087A1 (fr) Adaptation d'un dispositif auditif à un signal vocal
DE102017203947A1 (de) Verfahren zum Betreiben einer Hörhilfevorrichtung sowie Hörhilfevorrichtung
EP2793488B1 (fr) Adaptation de microphone binaurale au moyen de sa propre voix
DE102009009040A1 (de) Vorrichtung und Verfahren zur Störgeräuschschätzung bei einer binauralen Hörgeräteversorgung
DE102011006472B4 (de) Verfahren zur Verbesserung der Sprachverständlichkeit mit einem Hörhilfegerät sowie Hörhilfegerät
EP3048813B1 (fr) Procédé et dispositif de suppression du bruit basée sur l'inter-corrélation de bandes secondaires
EP2982136A1 (fr) Procédé d'estimation d'un signal utile et dispositif auditif
DE102020201615B3 (de) Hörsystem mit mindestens einem im oder am Ohr des Nutzers getragenen Hörinstrument sowie Verfahren zum Betrieb eines solchen Hörsystems
EP2635048A2 (fr) Amplification d'un signal vocal en fonction du niveau d'entrée
DE102023200412B3 (de) Verfahren zum Betrieb eines Hörgeräts
EP3985997B1 (fr) Système d'appareil auditif et son procédé de fonctionnement
DE102011089865B4 (de) Verfahren zum Anpassen einer Frequenzkompressionseinrichtung einer Hörvorrichtung und entsprechende Anpassungsvorrichtung
DE102007030067B4 (de) Hörgerät mit passiver, eingangspegelabhängiger Geräuschreduktion und Verfahren

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

17P Request for examination filed

Effective date: 20120411

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20140401