EP0308817A2 - Procédé pour transformer les paramètres d'un vocodeur à canaux en paramètres d'un vocodeur à prédiction linéaire - Google Patents

Procédé pour transformer les paramètres d'un vocodeur à canaux en paramètres d'un vocodeur à prédiction linéaire Download PDF

Info

Publication number: EP0308817A2
Authority: EP; European Patent Office
Prior art keywords: parameters; vocoder; channel; lpc; vocoder parameters
Prior art date: 1987-09-23
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Withdrawn

Application number

EP88115139A

Other languages

German (de)

English (en)

Other versions

EP0308817A3 (fr

Inventor

Hans Dipl.-Ing. Brandl

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Siemens AG

Original Assignee

Siemens AG

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1987-09-23

Filing date

1988-09-15

Publication date

1989-03-29

1988-09-15 Application filed by Siemens AG filed Critical Siemens AG

1989-03-29 Publication of EP0308817A2 publication Critical patent/EP0308817A2/fr

1990-04-18 Publication of EP0308817A3 publication Critical patent/EP0308817A3/fr

Status Withdrawn legal-status Critical Current

Links

238000000034 method Methods 0.000 title claims abstract description 14
238000001228 spectrum Methods 0.000 claims abstract description 20
239000011159 matrix material Substances 0.000 claims abstract description 14
230000009466 transformation Effects 0.000 claims abstract description 8
230000006870 function Effects 0.000 claims description 17
238000009499 grossing Methods 0.000 claims description 14
238000004458 analytical method Methods 0.000 claims description 6
238000005311 autocorrelation function Methods 0.000 claims description 4
230000015572 biosynthetic process Effects 0.000 claims description 4
238000003786 synthesis reaction Methods 0.000 claims description 4
239000013598 vector Substances 0.000 claims description 4
230000003595 spectral effect Effects 0.000 description 8
230000005284 excitation Effects 0.000 description 6
230000006399 behavior Effects 0.000 description 2
230000005540 biological transmission Effects 0.000 description 2
238000004891 communication Methods 0.000 description 2
238000007792 addition Methods 0.000 description 1
238000004364 calculation method Methods 0.000 description 1
238000007796 conventional method Methods 0.000 description 1
238000005314 correlation function Methods 0.000 description 1
230000006866 deterioration Effects 0.000 description 1
238000012067 mathematical method Methods 0.000 description 1
238000010606 normalization Methods 0.000 description 1
230000035807 sensation Effects 0.000 description 1
230000007704 transition Effects 0.000 description 1
230000001755 vocal effect Effects 0.000 description 1

Images

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Definitions

the invention relates to a method according to the preamble of patent claim 1.
Digital narrowband communication networks with low data transmission rates (1-2 kbit / s) are currently being planned.
the coding methods used are based either on the principle of the channel vocoder or the linear prediction (LPC vocoder). Communication between the vocoders is only possible if a suitable data transcoding takes place at their interface.
the converter required for this should be designed to be as inexpensive as possible and should not deteriorate the speech quality if possible.
One way to build a converter is to transform the speech data back into the speech signal and re-encode it.
This method is very complex since two analysis units and two synthesis units are required.
the analysis quality also deteriorates the speech quality.
the deterioration of the speech quality can be avoided by directly re-encoding the data of the different vocoders. This possibility results from the very similar synthesis principle, that of the channel vocoder and the LPC vocoder is applied.
the speech signal is generated by an excitation signal that is filtered by a variable filter.
the excitation signal consists of a pulse train for voiced sounds and white noise for unvoiced sounds. With the excitation parameters, the pulse frequency and the excitation mode - voiced or unvoiced - are determined.
the variable transmission behavior of the filter corresponds to the variable resonance behavior of the human vocal tract. This changes slowly and is reset by filter parameters every 10 to 20 ms.
the task of the speech signal analysis of a vocoder is to obtain the excitation parameters and the filter parameters from a speech signal.
the LPC vocoder and the channel vocoder differ essentially in the structure of the filter. LPC assumes an all-pole filter and the channel vocoder assumes a filter bank.
the analysis methods for determining the corresponding filter parameters differ and there are other filter parameters that are transmitted in the different networks. In contrast, the excitation parameters are basically the same.
a recoding process is therefore sought which converts the filter parameters of a filter bank of a channel vocoder into the filter parameters of an all-pole filter of an LPC vocoder.
the channel vocoder parameters usually represent a non-equidistantly scanned spectrum in terms of message theory.
the power spectrum is now calculated from the amplitude spectrum and transformed into the autocorrelation function (AKF) using the Fourier transformation.
the corresponding LPC vocoder parameter set can now be calculated from the AKF in a known manner using the usual methods (eg Levinson recursion) (see H. Hermansky, B. Hanson, H. Witka; "Perceptually based Predictive Analysis of Speech" on ICASSP 85, p. 13.10 conference proceedings).
the direct transformation is associated with high technical expenditure. Powerful real-time processors are required to calculate spectra and correlation functions.
the invention is based on the object of specifying a method for transcoding channel vocoder parameters into LPC vocoder parameters which requires relatively few arithmetic operations with high accuracy.
the starting point are the channel vocoder parameters, which are available, for example, as a power spectrum (see FIG. 1). This range of services is only available in a channel vocoder in a section-wise constant form b k with jumps at the transition points from b k to b k + 1 .
b k energy values e j shown where the value e j is the energy in the channel with the number j corresponds.
the channel energy corresponds to the power in a 20 ms interval (this is the interval after which new filter parameters are set in each case). This interval is also the transformation interval.
a smoothed spectrum a k (see FIG. 2) is formed by folding with a smoothing function g (i, s).
Gaussian bell curves or similar functions are suitable for this smoothing function g, for example.
the following function is given as an example for the Gaussian bell curve:
smoothing functions g are the low-pass functions known from filter theory and digital signal processing.
the spread s defines the corner frequencies of the respective low-pass filter.
the scatter s can be a function of the current spectral line.
a larger scatter s is selected for the smoothing function g (i, s) at higher frequencies and thus wider channels in b k than at lower frequencies. This makes it possible to adjust the smoothing to the sensation of tonality (Bark scale) of the human ear.
the "harmony" in speech synthesis can be empirically selected by the choice of the scatter (s).
the LPC coefficients are generally calculated from the short-term autocorrelation function (approx. 20 ms), AKF for short, of the speech signal. These AKF, ie their correlation coefficients r i , can also be determined from the power spectrum of the speech signal by the inverse, discrete Fourier transformation.
the N spectral lines b l of the raw spectrum can be derived from the channel energy values e j (see FIG. 1)
the number of channels and thus also the number of channel energy values e j is around 16-18.
the elements of matrix C are calculated only once for a certain vocoder combination in the method according to the invention. Subsequently, only matrix multiplications between the energy vectors E (which contains the parameters) and the matrix C have to be carried out in order to recode the respective speech parameters.
the smoothed channel vocoder parameters a p are present at an input 1 of a first memory 2. For example, it will A set of these parameters, in the case of 18 channels, ie 18 values, is written into the first memory 2.
the transformation coefficients c ip of the matrix C are calculated and stored in a coefficient memory 3.
the channel vocoder parameters a p in the first memory 2 are addressed in succession by a first counter 4.
the coefficients c ip are addressed in the coefficient memory 3 according to their index p.
the addressed channel vocoder parameters a p and the addressed coefficients c ip are multiplied in a multiplier 5 and added up in a downstream adder 6.
the index i of the coefficients c ip is kept constant until the index i has reached its greatest value, for example 17 in formula 8.
the sum formed is written into a second memory 7 as LPC parameter l i .
the index i is then increased by one by a second counter 8 and the next LPC parameter l i + 1 is calculated.
the second counter 8 addresses the coefficients c ip in the coefficient memory 3 on the one hand according to their index i, and on the other hand the LPC vocoder parameters in the second memory 7.
the two counters 4 and 8 are clocked by a clock controller 9.
a transformed or recoded set of LPC vocoder parameters can then be removed at an output 10 of the second memory 7.

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Audiology, Speech & Language Pathology (AREA)
Computational Linguistics (AREA)
Signal Processing (AREA)
Health & Medical Sciences (AREA)
Spectroscopy & Molecular Physics (AREA)
Human Computer Interaction (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Carbon Steel Or Casting Steel Manufacturing (AREA)

EP88115139A 1987-09-23 1988-09-15 Procédé pour transformer les paramètres d'un vocodeur à canaux en paramètres d'un vocodeur à prédiction linéaire Withdrawn EP0308817A3 (fr)

Applications Claiming Priority (2)

Application Number	Priority Date	Filing Date	Title
DE19873732047 DE3732047A1 (de)	1987-09-23	1987-09-23	Verfahren zur umcodierung von kanalvocoder-parameter in lpc-vocoder-parameter
DE3732047		1987-09-23

Publications (2)

Publication Number	Publication Date
EP0308817A2 true EP0308817A2 (fr)	1989-03-29
EP0308817A3 EP0308817A3 (fr)	1990-04-18

Family

ID=6336687

Family Applications (1)

Application Number	Title	Priority Date	Filing Date
EP88115139A Withdrawn EP0308817A3 (fr)	1987-09-23	1988-09-15	Procédé pour transformer les paramètres d'un vocodeur à canaux en paramètres d'un vocodeur à prédiction linéaire

Country Status (2)

Country	Link
EP (1)	EP0308817A3 (fr)
DE (1)	DE3732047A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP0626675A1 (fr) *	1993-05-28	1994-11-30	Motorola Inc.	Excitation synchrone du temps d'un vocodeur et méthode
WO1995022819A1 (fr) *	1994-02-16	1995-08-24	Qualcomm Incorporated	Vocodeur asic
WO1996031873A1 (fr) *	1995-04-03	1996-10-10	Universite De Sherbrooke	Quantification des parametres spectraux pour un codage efficace de la parole, utilisant une matrice de prediction scindee
AU725711B2 (en) *	1994-02-16	2000-10-19	Qualcomm Incorporated	Block normalisation processor

Citations (1)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP0138073A1 (fr) *	1983-09-29	1985-04-24	Siemens Aktiengesellschaft	Convertisseur pour le transfert de données entre vocodeurs à canaux et à prédiction linéaire pour la transmission de signaux digitaux de parole au moyen de systèmes de transmission à bande réduite

1987
- 1987-09-23 DE DE19873732047 patent/DE3732047A1/de active Granted
1988
- 1988-09-15 EP EP88115139A patent/EP0308817A3/fr not_active Withdrawn

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP0138073A1 (fr) *	1983-09-29	1985-04-24	Siemens Aktiengesellschaft	Convertisseur pour le transfert de données entre vocodeurs à canaux et à prédiction linéaire pour la transmission de signaux digitaux de parole au moyen de systèmes de transmission à bande réduite

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
EP0626675A1 (fr) *	1993-05-28	1994-11-30	Motorola Inc.	Excitation synchrone du temps d'un vocodeur et méthode
US5784532A (en) *	1994-02-16	1998-07-21	Qualcomm Incorporated	Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system
EP0758123A3 (fr) *	1994-02-16	1997-03-12	Qualcomm Incorporated	Circuit bouchon pour la normalisation
US5727123A (en) *	1994-02-16	1998-03-10	Qualcomm Incorporated	Block normalization processor
WO1995022819A1 (fr) *	1994-02-16	1995-08-24	Qualcomm Incorporated	Vocodeur asic
AU697822B2 (en) *	1994-02-16	1998-10-15	Qualcomm Incorporated	Vocoder asic
US5926786A (en) *	1994-02-16	1999-07-20	Qualcomm Incorporated	Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system
AU725711B2 (en) *	1994-02-16	2000-10-19	Qualcomm Incorporated	Block normalisation processor
SG87819A1 (en) *	1994-02-16	2002-04-16	John G Mcdonough	Vocoder asic
CN100397484C (zh) *	1994-02-16	2008-06-25	高通股份有限公司	数字信号处理器
WO1996031873A1 (fr) *	1995-04-03	1996-10-10	Universite De Sherbrooke	Quantification des parametres spectraux pour un codage efficace de la parole, utilisant une matrice de prediction scindee
US5664053A (en) *	1995-04-03	1997-09-02	Universite De Sherbrooke	Predictive split-matrix quantization of spectral parameters for efficient coding of speech
CN1112674C (zh) *	1995-04-03	2003-06-25	舍布鲁克大学	用于语音有效编码的谱参数预测分解矩阵量化

Also Published As

Publication number	Publication date
DE3732047C2 (fr)	1992-10-29
DE3732047A1 (de)	1989-04-06
EP0308817A3 (fr)	1990-04-18

Legal Events

Date	Code	Title	Description
1989-02-11	PUAI	Public reference made under article 153(3) epc to a published international application that has entered the european phase	Free format text: ORIGINAL CODE: 0009012
1989-03-29	AK	Designated contracting states	Kind code of ref document: A2 Designated state(s): DE FR GB IT NL
1990-03-02	PUAL	Search report despatched	Free format text: ORIGINAL CODE: 0009013
1990-04-18	AK	Designated contracting states	Kind code of ref document: A3 Designated state(s): DE FR GB IT NL
1990-05-02	17P	Request for examination filed	Effective date: 19900307
1990-12-13	STAA	Information on the status of an ep patent application or granted ep patent	Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN
1991-01-30	18W	Application withdrawn	Withdrawal date: 19901207

Publication	Publication Date	Title
DE60317722T2 (de)	2008-09-25	Verfahren zur Reduzierung von Aliasing-Störungen, die durch die Anpassung der spektralen Hüllkurve in Realwertfilterbanken verursacht werden
EP1979901B1 (fr)	2015-10-14	Procede et dispositifs pour le codage de signaux audio
DE69529356T2 (de)	2003-08-28	Wellenforminterpolation mittels Zerlegung in Rauschen und periodische Signalanteile
DE69608947T2 (de)	2001-02-01	Verfahren zur Analyse eines Audiofrequenzsignals durch lineare Prädiktion, und Anwendung auf ein Verfahren zur Kodierung und Dekodierung eines Audiofrequenzsignals
DE69518452T2 (de)	2001-04-12	Verfahren für die Transformationskodierung akustischer Signale
EP1825461B1 (fr)	2008-09-03	Procede et dispositif pour elargir artificiellement la largeur de bande de signaux vocaux
DE69916321T2 (de)	2005-03-17	Kodierung eines verbesserungsmerkmals zur leistungsverbesserung in der kodierung von kommunikationssignalen
DE69821089T2 (de)	2004-11-11	Verbesserung von quellenkodierung unter verwendung von spektralbandreplikation
DE69634645T2 (de)	2006-03-02	Verfahren und Vorrichtung zur Sprachkodierung
DE60226308T2 (de)	2009-05-14	Quantisierung der Anregung in einem Geräuschrückkopplungskodierungssytem mit allgemeiner Rauschformung
DE60218385T2 (de)	2007-06-14	Nachfilterung von kodierter Sprache im Frequenzbereich
DE69227650T2 (de)	1999-06-24	Digitaler Sprachkodierer
DE69230308T2 (de)	2000-05-31	Transformationsverarbeitungsgerät und -verfahren und Medium zum Speichern komprimierter Digitaldaten
DE3853916T2 (de)	1995-12-14	Digitaler-sprachkodierer mit verbesserter vertoranregungsquelle.
DE60029990T2 (de)	2006-12-07	Glättung des verstärkungsfaktors in breitbandsprach- und audio-signal dekodierer
DE69810361T2 (de)	2003-09-11	Verfahren und Vorrichtung zur mehrkanaligen akustischen Signalkodierung und -dekodierung
DE69317958T2 (de)	1998-09-17	Kodierer von Audiosignalen mit niedriger Verzögerung, unter Verwendung von Analyse-durch-Synthese-Techniken
DE60126149T2 (de)	2007-10-18	Verfahren, einrichtung und programm zum codieren und decodieren eines akustischen parameters und verfahren, einrichtung und programm zum codieren und decodieren von klängen
DE69729527T2 (de)	2005-06-23	Verfahren und Vorrichtung zur Kodierung von Sprachsignalen
DE69426860T2 (de)	2001-07-19	Sprachcodierer und Verfahren zum Suchen von Codebüchern
EP1525576B1 (fr)	2009-05-27	Dispositif et procede permettant de generer une representation spectrale complexe d'un signal a valeurs discretes en temps
DE2524497A1 (de)	1975-12-18	Phasenvocoder-sprachsynthesesystem
EP1016319B1 (fr)	2001-08-29	Procede et dispositif pour coder un signal stereo temporellement discret
DE69033510T2 (de)	2000-11-23	Numerischer sprachkodierer mit verbesserter langzeitvorhersage durch subabtastauflösung
DE69708191T2 (de)	2002-03-28	Vorrichtung zur Signalkodierung