GB2341299A - Suppressing noise in a speech communications unit - Google Patents

Suppressing noise in a speech communications unit Download PDF

Info

Publication number
GB2341299A
GB2341299A GB9819224A GB9819224A GB2341299A GB 2341299 A GB2341299 A GB 2341299A GB 9819224 A GB9819224 A GB 9819224A GB 9819224 A GB9819224 A GB 9819224A GB 2341299 A GB2341299 A GB 2341299A
Authority
GB
United Kingdom
Prior art keywords
noise
speech
signal
periodic
gaussian noise
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB9819224A
Other versions
GB9819224D0 (en
Inventor
Dominic Sai Fan Chan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Motorola Solutions UK Ltd
Original Assignee
Motorola Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Ltd filed Critical Motorola Ltd
Priority to GB9819224A priority Critical patent/GB2341299A/en
Publication of GB9819224D0 publication Critical patent/GB9819224D0/en
Priority to EP99117227A priority patent/EP0984433A2/en
Publication of GB2341299A publication Critical patent/GB2341299A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02085Periodic noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Noise Elimination (AREA)

Abstract

In order to suppress periodic noise, such as from a siren on a public safety vehicle or from a car engine, a speech signal having a periodic noise interferer is input to a speech processor in which the amplitude of the periodic noise is determined 72, a known gaussian noise sequence having the same amplitude as the periodic noise is generated 74 and combined 76 with the (speech + periodic noise signal) so that a standard wide-band noise suppression algorithm can be used in a noise suppressor 78 to suppress both the periodic noise and the gaussian noise. The noise suppressor 78 also provides an input to a sub-band gain adjustment block 80 so that an appropriate amount of the gaussian noise can be subtracted 82 from the output of the noise suppressor 78 to give an output speech signal with a greatly reduced periodic noise content. The arrangement may be used after a speech decoder (50, Fig.3) or in front of a speech encoder 84 in a mobile communications unit.

Description

1 2341299 NOISE SUPPRESSER SPEECH COMMUNICATIONS UNIT AND MIETHOD OF
OPERATION
Field of the Invention
This invention relates to suppressing noise in communications systems and more particularly to suppressing periodic noise from car engines or police sirens in a mobile communications system.
BackLyround of the Invention Many voice communications systems, such as the TErrestrial Trunked RAdio (TETRA) system for private mobile radio users, use speech processing units to encode and decode speech patterns. In such voice communications systems the speech encoder converts the analogue speech pattern into a suitable digital format for transmission and the speech decoder converts a received digital speech signal into an appropriate analog speech pattern.
As spectrum for such voice communications systems is a valuable resource, it is desirable to limit the channel bandwidth used, to maximise the number of users per frequency band. Hence, the primary objective in the use of speech coding techniques is to reduce the occupied capacity of the speech patterns as much as possible, by use of compression techniques, without losing fidelity of speech signals.
Speech coding typically uses speech production modelling techniques to compress pulse code modulation WM) speech signals into bit-rates that are suitable for different kinds of bandwidth-limited applications such as speech communication systems or voice storage systems.
The basic speech production model, that is commonly used in speech coding algorithms, is shown in FIG. 1. The model in FIG. 1 was used in early linear predictive coding (LM based vocoders. The LPC:1ilter models the combined effect of the glottal pulse model, the vocal tract and the lip radiation. For voiced speech, the voiced excitation, which consists of a pulse train separated -2by the pitch duration T, is used as an input signal to the LPC filter. Alternatively, for unvoiced speech, a gaussian noise source is used as the LPC filter input excitation.
The advance of speech coding development led to the introduction of Analysis by Synthesis technique used in CELP (Code Excited Linear Prediction) such as (Algebraic Code Excited Linear Prediction). The improved speech production model or the synthesis model used in the ACELP case is shown in FIG. 2.
The excitation in the ACELP case is a weighted combination of the innovative codebook vector and the adaptive codebook vector. Typically research papers on the subject matter of CELP-based speech coding techniques refer to two codebooks, namely an "Innovative" codebook as the basic codebook for CELP, in order to distinguish the codebook from the "adaptive" codebook. The innovative codebook in the ACELP case consists of code-vectors each contains only a small number of pulses and zero value elsewhere. The periodicity of the excitation, which is needed for voiced speech, derives from the last frame total LPC filter input excitation based on the present frame pitch lag value.
The main customers of TETRA radios are public safety organisations. The noise level in the mobile operating environment is often higher than that in fixed telecommunication systems. There are mainly two kinds of noise that will affect the TETRA speech quality, namely wideband noise and periodic noise. Wideband noise comes from the various operating environments; such as car or wind noise, street noise, babble noise. Periodic noise mainly comes from repetitive motion in car engines, as well as the sirens of public safety vehicles. For car engine noise, the fundamental of the periodic noise is mainly concentrated at low frequencies, typically of the order of less than 250 Hz.
Siren signal suppression is sometimes necessary especially during interconnect duplex communication. It has been argued by J. R. Deller Jr., J. G. Proakis and J. H. L. Hansen in "Discrete-Time Processing of Speech signals", published by MacMillan in 1993 that narrow-band noise sources such as a varying sinusoidal signal or an artificial noise component fatigue the auditory system faster than wideband noise.
One problem with developing algorithms to suppress siren signals is that the siren signal spectrum fulfils the conditions which most noise suppression algorithm uses to decide whether the incoming signal is a speech signal. 5 Thus it is desirable to suppress periodic noise in speech codecs, particularly in the mobile communication environment.
Summarv of the Invention According to a first aspect of the invention, a speech communications unit is provided. The speech communications unit includes a speech processor for receiving an input speech signal having a periodic noise interferer. The speech processor is operably coupled to a noise determining means for determining an amplitude of the periodic noise interferer a gaussian noise generator for generating a known gaussian noise sequence and combining the speech signal with the known gaussian noise sequence to produce a resultant signal, and inputting the resultant signal into a noise suppression procedure to provide a noise suppressed speech signal, wherein the speech processor determines an amplitude level of the suppressed gaussian noise and subtracts a respective level of suppressed gaussian noise from the resultant signal thereby reducing the periodic noise content in the speech signal.
In this manner, the introduction of a gaussian noise signal to the periodic interferer, into the speech signal reduces the periodic noise content of the signal. Preferably, the gaussian noise signal generated is of substantially equal amplitude to the periodic interferer. Additionally, the speech communications unit further includes a noise suppresser function, coupled to the output of the summing junction, for further suppressing noise in the speech signal. The speech processor is either a speech post- processing function in a speech decoder or a speech pre-processing function in a speech encoder. The speech communications unit preferably includes a gain adjuster, operably coupled to the gaussian noise generator and the noise suppresser function for receiving the known gaussian noise sequence and the noise suppressed signal, the gain adjuster being operably coupled to a second summing junction for recombining the gain adjusted signal with the noise suppressed signal thereby suppressing the periodic noise interferer.
In accordance with a second aspect of the preferred embodiment of the invention, a method of reducing a periodic interferer in a speech signal is provided. The method includes the steps of determining an amplitude of the periodic interferer; generating a gaussian noise signal of substantially similar amplitude to the periodic interferer; and introducing the gaussian noise signal into the speech signal having the periodic interferer to produce a resultant signal, inputting the resultant signal into a noise suppression procedure to provide a noise suppressed speech signal, wherein the speech processor determines an amplitude level of the suppressed gaussian noise and subtracts a respective level of suppressed gaussian noise from the resultant signal thereby reducing the periodic noise content in the speech signal.
A preferred embodiment of the invention will now be described, by way of example only, with reference to the drawings.
Brief Descrir)tion of the DrawinEs FIG. 1 shows a block diagram of a synthesis functional model of a basic LPC codec.
FIG, 2 shows a block diagram of a synthesis functional model of a basic ACELP codec.
FIG. 3 shows a block diagram of a decoding siren suppression algorithm according to a preferred embodiment of the invention.
FIG. 4 shows a block diagram of an encoding siren suppression algorithm according to a preferred embodiment of the invention.
FIG. 5 shows experimental results of the siren suppression algorithm of the preferred embodiment of the invention.
Detailed DescriDtion of the Drawings Referring first to FIG. 1, a block diagram of a synthesis functional model of a basic LPC codec is shown. A voiced excitation source 10 provides a pulse train signal, of pitch duration T into a voiced gain element 12. The amplified pulse train signal from voiced gain element 12 is then selectively input, via a switch 14, to a Linear Predictive Coder (LPC) Filter 16. When no voice signal is present, an unvoiced excitation source 18 provides a gaussian noise signal into an unvoiced gain element 20. The amplified gaussian noise signal from unvoiced gain element 20 is selectively input, via switch 14, to the Linear Predictive Coder (LPC) Filter 16, when no voice is present. The output from the LPC filter 16 is synthetic speech.
In this manner, a series of amplified pulses from the voiced excitation source 10 are combined with amplified signals from an unvoiced excitation source 18, filtered with the resultant generated signal being representative of synthetic speech.
Referring next to FIG. 2, a block diagram of a synthesis functional model of a basic ACELP codec is shown. An excitation vector from the "Innovative" codebook 30 is chosen and input to the voice gain element 31. Another excitation vector from the "Adaptive" codebook 32 is also chosen according to the present frame pitch lag value T and input the gain element 33. The output of voice gain element 31 and the output of voice gain element 33 are input to a summation device 34. The output of the summation device 34 is input to the Linear Predictive Coder filter 35. The output of the summation device 34 is also used to update the "Adaptive" codebook for next frame speech synthesis. The output from the LPC filter 35 is then synthetic speech.
In this manner, a series of amplified excitation vectors from the "Adaptive" codebook 32, incorporating a feedback path from the excitation vector source, are combined with a variety of amplified pulses selected from an "Innovative" codebook 30 (unvoiced excitation source). The combined signal is then filtered with the resultant signal from the LPC filter being representative of synthetically generated speech. The particular vectors are chosen to best imitate the speech signal to be transmitted, or being received.
The arrangements used in FIG. 1 or FIG. 2 are implemented in the encoding functions of a speech codec. The corresponding functions are required in reverse when decoding received speech.
Referring now to FIG. 3, a block diagram of a decoding siren suppression algorithm, according to a preferred embodiment of the invention, is shown. A speech signal is received and decoded in a speech decoder 50. The decoded speech signal With a siren signal contained within it, is then input to a first summing junction 56. Additionally, the decoded speech signal, together with the siren signal is processed to determine the amplitude of the siren signal in processor 52. The amplitude of the siren signal is then used to generate a gaussian noise signal of that same amplitude in the noise generator 54. The output gaussian noise signal is also fed to the first summing junction 56 and input to a sub- band gain adjustment block 60. The output from the first summing junction 56 is effectively the decoded speech signal, plus the gaussian noise signal which hides the periodic noise from the siren. This speech and gaussian noise signal is then used in a noise suppression algorithm 58 to reduce the level of the gaussian noise. The improved speech signal, i.e. with a 20 reduced gaussian noise level, is then input to the sub-band gain adjustment block 60. The output from the sub-band gain adjustment block 60 is combined with the improved speech signal in a second summing junction 62. The resultant signal is the decoded speech signal with a greatly reduced siren noise effect. 25 Under high SNR condition, preliminary experimental results show that by injecting a known gaussian noise sequence into the incoming signal, at a level comparable to the siren, a standard wide-band noise suppression algorithm is able to suppress the siren plus the gaussian 30 noise signal by the level specified in the noise suppression algorithm, whilst maintaining good speech quality. As the gaussian noise sequence is known, it can be extracted from the noise suppression process output. This arrangement has the advantage of requiring just a single 35 microphone input and avoids the need for precise estimation of the siren harmonic frequency. The only requirements are the detection of the siren signal and determination of its amplitude.
Advantageously, suppression of siren signal under high signal to noise conditions is achieved using just a single microphone input.
Furthermore, the need for precise estimation of the siren harmonic frequency is avoided, by purely detecting the siren signal and estimating its amplitude.
Referring now to FIG. 4, a block diagram of an encoding siren suppression algorithm is shown. The encoding process deals with the situation where the siren is close to the transmitting unit, performing the speech encoding function. The encoding function is basically the decoding function in reverse, with the determination of the siren amplitude being calculated and used to generate a gaussian noise signal of similar amplitude.
A speech signal, having a siren signal contained within it, is input to a first summing junction 76. Additionally, the speech signal, together with the siren signal is processed to determine the amplitude of the siren signal in processor 72. The amplitude of the siren signal is then used to generate a gaussian noise signal of that same amplitude in the noise generator 74. The output gaussian noise signal is also fed to the first summing junction 76 and input to a sub-band gain adjustment block 80. The output from the first summing junction 76 is effectively the speech signal, minus the gaussian noise signal which hides the periodic noise from the siren. This speech and gaussian noise signal is then used in a noise suppression algorithm 78 to reduce the level of the gaussian noise. The improved speech signal, i.e. with a reduced gaussian noise level, is then input to the sub-band gain adjustment block 80. The output from the sub-band gain adjustment block 80 is combined with the improved speech signal in a second silmming junction 82. The resultant signal is the encoded speech signal with a greatly reduced siren noise content.
In such a manner, a speech signal is generated with an interfering periodic noise signal having a greatly reduced effect.
Referring now to FIG. 5, experimental results of the siren suppression algorithm, according to the preferred embodiment of the invention, is shown. The results are shown with regard to amplitude versus time of the speech waveforms. Three distinct waveforms are provided. The first waveform 90 shows the input speech signal with the effect of the periodic interference (siren signal). The periodic noise content can be clearly seen with the darkly shaded areas indicating a rapidly changing and relatively constant interfering source. The second waveform 92 shows the input speech signal after applying the standard wide-band noise suppression algorithm. It is clearly shown that the periodic interference (siren signal) is not affected by the standard wide-band noise suppression algorithm, With the darkly shaded areas showing little change from the original speech plus siren signal. The third waveform 94 shows the input speech signal after applying the gaussian noise suppression algorithm, together with the standard wide-band noise suppression algorithm. The third waveform clearly shows a significant reduction in the periodic interference (siren signal) content, with the darkly shaded areas showing approximately a 10 dB siren suppression.
Thus, the present invention transforms a periodic noise contaminated speech signal into a wideband gaussian noise contaminated speech signal such that a standard wideband noise suppression procedure can be used to suppress the periodic noise. The periodic noise is then detected and its average amplitude estimated. A known gausslan noise sequence with a comparable amplitude is then added to the periodic noise contaminated speech signal. The noise (periodic + gausslan) in the resultant signal is then suppressed using a standard wideband noise suppression procedure (for example the Motorola sub-band noise suppression algorithm). At the same time, the underlying periodic noise is also suppressed. Since the added gaussian noise sequence is known and the sub-band gains used by the noise suppression algorithm have already determined, the suppressed gaussian noise at the noise suppression procedure output is then calculated and subtracted. The resultant signal is the speech signal with the periodic noise suppressed.
Hence, an arrangement for suppressing periodic noise in a contaminated speech signal is provided.

Claims (7)

Claims
1. A speech communications unit comprising a speech processor for receiving an input speech signal having a periodic noise interferer, the speech processor being operably coupled to noise determining means for determining an amplitude of the periodic noise interferer, a gaussian noise generator for generating a known gaussian noise sequence and combining the input speech signal with the known gaussian noise sequence and inputting a resultant signal into a noise suppression procedure to provide a noise suppressed speech signal, wherein the speech processor determines an amplitude level of the suppressed gaussian noise and subtracts a respective level of suppressed gaussian noise from the resultant signal thereby reducing the periodic noise content in the speech signal.
2. The speech communications unit of claim 1 wherein the known gaussian noise sequence is generated at a substantially similar amplitude to the periodic noise interferer.
3. The speech communications unit of claims 1 or 2 wherein the speech processor is a speech encoder or speech decoder and the speech communications unit further comprises a noise suppresser function coupled to an output of a summing junction operably coupled to the gaussian noise generator for further suppressing noise in the speech signal.
4. The speech communications unit of claim 3, further comprising a gain adjuster, operably coupled to the gaussian noise generator and the noise suppresser function for receiving the known gaussian noise sequence and the noise suppressed signal, the gain adjuster being operably coupled to a second summing junction for recombining the gain adjusted signal with the noise suppressed signal thereby suppressing the periodic noise interferer.
5. The speech communications unit of any one of the preceding claims, wherein the speech processor is a speech post-processing or a speech pre processing function.
6. A method of reducing a periodic interferer in a speech signal, the method comprising the steps of determining an amplitude of the periodic interferer; generating a gaussian noise signal of substantially similar amplitude to the periodic interferer; introducing the gaussian noise signal into the speech signal having the periodic interferer to produce a resultant signal; inputting the resultant signal into a noise suppression procedure to provide a noise suppressed speech signal; determining an amplitude level of the suppressed gaussian noise; and subtracting a respective level of suppressed gaussian noise from the resultant signal thereby reducing the periodic noise content in the speech signal.
7. A speech communications unit as substantially described with reference to, and/or as illustrated by, FIG. 3 or FIG. 4 of the drawings.
GB9819224A 1998-09-04 1998-09-04 Suppressing noise in a speech communications unit Withdrawn GB2341299A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB9819224A GB2341299A (en) 1998-09-04 1998-09-04 Suppressing noise in a speech communications unit
EP99117227A EP0984433A2 (en) 1998-09-04 1999-09-02 Noise suppresser speech communications unit and method of operation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB9819224A GB2341299A (en) 1998-09-04 1998-09-04 Suppressing noise in a speech communications unit

Publications (2)

Publication Number Publication Date
GB9819224D0 GB9819224D0 (en) 1998-10-28
GB2341299A true GB2341299A (en) 2000-03-08

Family

ID=10838292

Family Applications (1)

Application Number Title Priority Date Filing Date
GB9819224A Withdrawn GB2341299A (en) 1998-09-04 1998-09-04 Suppressing noise in a speech communications unit

Country Status (2)

Country Link
EP (1) EP0984433A2 (en)
GB (1) GB2341299A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2350989A (en) * 1999-03-12 2000-12-13 Fulcrum Systems Ltd Reducing background-noise from a siren
US20150255085A1 (en) * 2014-03-07 2015-09-10 JVC Kenwood Corporation Noise reduction device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109767781A (en) * 2019-03-06 2019-05-17 哈尔滨工业大学(深圳) Speech separating method, system and storage medium based on super-Gaussian priori speech model and deep learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4932063A (en) * 1987-11-01 1990-06-05 Ricoh Company, Ltd. Noise suppression apparatus
WO1997034290A1 (en) * 1996-03-13 1997-09-18 Ericsson Inc. Noise suppressor circuit and associated method for suppressing periodic interference component portions of a communication signal
US5742927A (en) * 1993-02-12 1998-04-21 British Telecommunications Public Limited Company Noise reduction apparatus using spectral subtraction or scaling and signal attenuation between formant regions

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4932063A (en) * 1987-11-01 1990-06-05 Ricoh Company, Ltd. Noise suppression apparatus
US5742927A (en) * 1993-02-12 1998-04-21 British Telecommunications Public Limited Company Noise reduction apparatus using spectral subtraction or scaling and signal attenuation between formant regions
WO1997034290A1 (en) * 1996-03-13 1997-09-18 Ericsson Inc. Noise suppressor circuit and associated method for suppressing periodic interference component portions of a communication signal

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2350989A (en) * 1999-03-12 2000-12-13 Fulcrum Systems Ltd Reducing background-noise from a siren
GB2350989B (en) * 1999-03-12 2001-04-25 Fulcrum Systems Ltd Background-noise reduction
US20150255085A1 (en) * 2014-03-07 2015-09-10 JVC Kenwood Corporation Noise reduction device
US9384756B2 (en) * 2014-03-07 2016-07-05 JVC Kenwood Corporation Cyclic noise reduction for targeted frequency bands

Also Published As

Publication number Publication date
EP0984433A2 (en) 2000-03-08
GB9819224D0 (en) 1998-10-28

Similar Documents

Publication Publication Date Title
US6334105B1 (en) Multimode speech encoder and decoder apparatuses
CA2399706C (en) Background noise reduction in sinusoidal based speech coding systems
AU714752B2 (en) Speech coder
JP5202335B2 (en) Method for trained discrimination and attenuation of echoes of digital signals in decoders and corresponding devices
KR101039343B1 (en) Method and device for pitch enhancement of decoded speech
JP4662673B2 (en) Gain smoothing in wideband speech and audio signal decoders.
JP4166673B2 (en) Interoperable vocoder
KR20010102004A (en) Celp transcoding
KR20000075936A (en) A high resolution post processing method for a speech decoder
JPH08328591A (en) Method for adaptation of noise masking level to synthetic analytical voice coder using short-term perception weightingfilter
JP2004310088A (en) Half-rate vocoder
KR20130133777A (en) Coding generic audio signals at low bitrates and low delay
WO2014131260A1 (en) System and method for post excitation enhancement for low bit rate speech coding
CA2244008A1 (en) Nonlinear filter for noise suppression in linear prediction speech pr0cessing devices
US20030065507A1 (en) Network unit and a method for modifying a digital signal in the coded domain
GB2342829A (en) Postfilter
US10672411B2 (en) Method for adaptively encoding an audio signal in dependence on noise information for higher encoding accuracy
JP2010520504A (en) Post filter for layered codec
EP0984433A2 (en) Noise suppresser speech communications unit and method of operation
Lombard et al. Frequency-domain comfort noise generation for discontinuous transmission in evs
KR100718487B1 (en) Harmonic noise weighting in digital speech coders
Iao Mixed wideband speech and music coding using a speech/music discriminator
Drygajilo Speech Coding Techniques and Standards
EP1521243A1 (en) Speech coding method applying noise reduction by modifying the codebook gain
EP1521242A1 (en) Speech coding method applying noise reduction by modifying the codebook gain

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)