EP0341271A1 - Coded communications system - Google Patents

Coded communications system

Info

Publication number
EP0341271A1
EP0341271A1 EP19880907742 EP88907742A EP0341271A1 EP 0341271 A1 EP0341271 A1 EP 0341271A1 EP 19880907742 EP19880907742 EP 19880907742 EP 88907742 A EP88907742 A EP 88907742A EP 0341271 A1 EP0341271 A1 EP 0341271A1
Authority
EP
European Patent Office
Prior art keywords
filter
decoder
parameters
actual error
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP19880907742
Other languages
German (de)
French (fr)
Inventor
Costas Xydeas
Nikolaos Gouvianakis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
British Telecommunications PLC
Original Assignee
British Telecommunications PLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB878720388A external-priority patent/GB8720388D0/en
Priority claimed from GB888801858A external-priority patent/GB8801858D0/en
Application filed by British Telecommunications PLC filed Critical British Telecommunications PLC
Publication of EP0341271A1 publication Critical patent/EP0341271A1/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters

Definitions

  • This invention relates to a system for transmitting speech signals in coded form. This invention relates also to a transmitter and to a receiver, for coding and decoding respectively such a signal.
  • coded speech signals derive the characteristics of a filter from the input speech samples at a coder, and transmit these to a decoder, where they are used to configure a decoder filter which is then excited by a suitable excitation signal source to produce a synthesised reproduction of the input speech signal.
  • LPC Linear Predictive Coding
  • a predictor stage is included in the coder to remove long-term periodicity corresponding to speaker pitch from the difference signal; this predictor may also be considered as a filter, and the corresponding filter parameters are also transmitted to the decoder.
  • the prediction error regardless of whether or not pitch prediction is included in the predicted speech signal, differs from the actual error because the prediction is based on a model of excitation, not on the excitation actually used at the decoder
  • Recent LPC systems include, for example, the Multipulse Excited (HP), the Regular Pulse Excited (RPE), and the Code Book Excited (CE) LPC Systems.
  • the excitation to be used at the decoder is selected or derived at the coder; in the above systems, the decoder includes a controllable excitation signal generator, and the coder must therefore transmit control signals to the decoder.
  • the decoder may thus be a two-stage filter (with a relatively short delay all-pole filter and a relatively long delay filter) and an excitation generator.
  • excitation control signals are themselves produced at the coder; in MPLPC, this has been achieved by using "analysis by synthesis” - ie synthesising speech locally using the unmodified excitation pulse sequence, subtracting this synthesis from the actual input speech sequence to form an actual error signal, perceptually weighting the error signal, and then using the weighted error signal as a control in a closed optimisation loop to select the excitation sequence for minimum (weighted) actual error.
  • a method of speech coding in which filter parameters are periodically derived from an input speech signal in a coder and transmitted to a decoder to update the response of a decoder filter, in which the excitation input to the decoder filter is derived in the decoder from the speech output of the decoder filter.
  • the filter parameters are derived so as to reduce the actual error which will be produced at the decoder, rather than the prediction error.
  • a synthesis filter may be provided at the coder and the actual error may be derived by subtracting locally synthesised speech from the input speech signal.
  • the parameter derivation is achieved by iteratively deriving the delay parameter of an all-zero stage of the filter by minimising the actual error.
  • at least some of the other parameters are calculated inside the loop.
  • all other parameters are jointly optimised, by calculating them inside the delay parameter iteration loop.
  • these optimisation methods may be applied to a system using previous excitations, rather than previous decoded speech, as excitation to a plurality of parallel feed-forward stages of the filter.
  • Use of the invention can give a significant signal to noise ratio advantage and/or an ability to work effectively with small frame sizes and thus allows, for a given signal to noise ratio, a shorter coding delay.
  • FIG. 1 shows schematically a generalised transmitter employing a coder according to the invention
  • FIG. 2 shows schematically a generalised receiver employing a decoder according to the invention
  • FIG. 2a shows a specific embodiment of a receiver employing a decoder according to the invention
  • FIG. 3 shows schematically the components of a single excitation synthesis filter of Figures 1 or 2
  • - Figures 4a and 4b provide six algorithms corresponding to methods of optimising the parameters of the synthesis filter of a coder according to the invention
  • FIG. 5 shows schematically the components of a multiple excitation synthesis filter of a coder according to another aspect of the invention
  • FIG. 6 illustrates examples of methods of optimising the parameters of such a multiple input synthesis filter
  • FIG. 7a and 7b show excitation waveforms occurring in embodiments of the invention.
  • a filter optimisation stage 1 derives filter parameters, using an 'analysis by synthesis' technique, as follows.
  • n is the size of the
  • the filter parameters are thus coefficients [a k ] and [b k ] and delay d 1 .
  • ê i y i - ⁇ i .
  • a better measure of the perceived error to the human ear is obtained by frequency-weighting the error signal in such a manner as to de-emphasise formant regions in the error spectrum.
  • the difference signal e. is preferably filtered by a weighting filter 5 with suitable transfer function, to form the perceptually weighted error signal ê w .
  • This signal is then minimised to optimise the parameters of synthesis filter 3 by optimisation calculator 6 in an iterative closed loop, which derives values for [a k ], [b k ] and d 1 , which are quantised by quantiser 7 and passed to a coder 8, which codes the parameters for transmission to a decoder station, and to a local decoder 10 which is functionally identical to the decoder at the receiver station.
  • the parameters are used to configure a local decoder synthesis filter 11, which is then driven from the excitation sequence source 2 by the same excitation sequence [x -i ] for which the parameters were optimised, to produce an output frame of synthesised speech [ ⁇ i ] .
  • This synthesised output is received and processed as necessary by an excitation adaptation calculator 12, which thus constructs a new excitation sequence [x -i ] from the output of the decoder synthesis filter 11.
  • the new excitation sequence is then passed to the excitation sequence source 2 for use on the next input speech frame, for which it will form part of the Past Decoded Speech (PDS) signal.
  • PDS Past Decoded Speech
  • the excitation adaptation calculator 12 and Local Decoder 10 are not necessary in all embodiments of the invention as shown in Figure 1a.
  • a frame of input speech [y i ] is received and optimised filter parameters are derived in a closed loop using the excitation sequence supplied by excitation sequence generator 2.
  • the synthesised speech signal [ ⁇ i ] produced using these optimised parameters is synthesised by the synthesis filter 3 (rather than by the local decoder synthesis filter 11 as described above with reference to Figure 1) and supplied to the excitation sequence generator 2 which is simply a store capable of storing the synthesised frame [ ⁇ i ] , and supplying this as the next excitation sequence [x -i ] (and also of storing the initial excitation sequence needed when the first frame of input speech is to be coded).
  • a decoder unit 28 performs the inverse operation to coder 8, to recover the filter parameters [a k ] , [b k ] and d 1 , which are passed to synthesis filter 21 to produce the configuration previously optimised at the transmitter.
  • Excitation sequence generator 23 drives the filter with an excitation sequence [x -i ] which will be the same as that used at the transmitter; for the first frame of speech it will be the initial excitation sequence identical to that used at the transmitter.
  • a frame of synthesised speech output [ ⁇ i ] is thus produced for the listener, and this output is also supplied to the excitation adaption calculator 22 to create a new excitation sequence [x -i ] in the excitation sequence generator 23 in the same manner as at the local decoder 10.
  • the excitation adaption calculator 22 is not necessary in single input embodiments, and the excitation sequence generator 23 may simply be a frame store holding the last decoded frame
  • the distant decoder 20 and the local decoder 10 perform exactly the same synthesis, using the same filter and the same initial excitation, the updated excitation sequences thus created are also the same and there is no need to transmit any excitation sequence data.
  • re-initialisation sequences occasionally so as to bring the distant decoder and the local decoder back into agreement; this may take the form of a periodic control bit transmitted to both decoders, instructing each excitation sequence generator 23 to re-start with the preprogrammed initial excitation sequence.
  • the optimisation process is performed by minimising the value of the prediction (residual) error, which is derived by comparing the current value of the speech signal and its predicted value based on past samples,
  • e i is the prediction error for the i th sample.
  • the weighting filter 5 is defined in the same way as in MPLPC or RPE-LPC, and will henceforth be omitted from the analysis of operation of the invention; the term "actual error” will be understood to encompass the perceptually weighted actual error.
  • the filter characteristics are selected by partly linearising the problem, by calculating initially some parameters for minimum prediction error energy (using a least mean square solution) which is straightforward. Using these values, the optimum values for the remaining parameters are derived by finding the actual error ê i (for example by synthesising the output speech signal ⁇ i in a loop for each possible delay parameter value and subtracting it from the input speech signal y i ), finding the minimum mean squared weighted actual error ê w , and selecting parameter values which gave rise to this minimum ê w
  • the values of the other parameters may be calculated either inside or outside the loop, as discussed below.
  • the synthesis filter 3 of the filter optimisation stage 1 comprises essentially two stages in series; a first filter element 31 comprising a finite impulse response filter with transfer characteristic B(z)z -n-d1 , and a second filter element 32 comprising an infinite impulse response filter with transfer function so that
  • the number of delay elements (and hence the delay length) d, employed is variable in operation up to a predetermined maximum number N-1 (which may be greater or smaller than the frame size n).
  • This first filter element 31 therefore receives the excitation signal [x -i ] and contains the most recent q 1 + d 1 + n samples of the [x -i ] sequence.
  • the second filter element comprises a recursive filter with an all-pole response, defined by filter coefficients
  • the filter optimisation calculator 6 For each received frame of input speech, the filter optimisation calculator 6 initially calculates values for the [a k ] coefficients for mean square prediction error energy
  • Y is an (nxp) matrix of y k samples from y -p to y n-2 .
  • ⁇ and b are computed jointly from (D) for minimum prediction error and used to synthesise ⁇ .
  • the actual error energy is also determined using H(z) is therefore determined within an 'analysis by synthesis' loop where d, varies over a range of N values i.e. 0 ⁇ d 1 ⁇ N-1 and the ⁇ ,b vectors are defined for the value of d 1 which minimises In this method only the d 1 filter parameter is optimised for a minimum actual error energy while both ⁇ and b are selected for minimum prediction error energy T This is referred to in Figure 4 as 'scheme 1'.
  • the approach for estimating b is a modification of Scheme 1 above.
  • ⁇ and b are defined by equation (D) for the value of d 1 which minimises prediction error .
  • m represents the output of the 1/A(z) filter when its output is zero, and its "memory" consists of the most recent samples of the previous synthesised speech frame;
  • Q is an (nxn) convolution matrix and q, is the kth value of the impulse response of the 1/A(z) filter, employing the a and d, values obtained from the previous analysis by synthesis search process. This is referred to in Figure 4 as 'scheme 2'.
  • a (and b) is calculated for minimum prediction error using equation (D).
  • the values of b are then (re-Calculated inside the loop for minimum actual error, using equation (F).
  • Scheme 3 The d 1 value, and corresponding ⁇ & b values, giving rise to the minimum actual error energy are then selected as optimal. This is referred to in Figure 4 as "Scheme 3". Schemes 2 and 3 above are preferred on accuracy grounds, as b is (re-)defined by minimising the actual error energy rather than the prediction error energy ⁇ .
  • is initially calculated for minimum prediction error energy outside the loop and independently of b, by assuming b is equal to zero, yielding
  • the b and d 1 parameters can be optimised by following schemes 1, 2 or 3 described above.
  • the value of b producing a minimum prediction error energy is given by
  • the parameters produced are then quantised by a quantiser 7, which allocates available bits between parameters.
  • excitation sequence source 2 is adapted to supply a number of different sequences [x -i ] , [u -i ] , [v -i ]...to synthesis filter 3.
  • One excitation sequence, [x -i ], may be the Past
  • the filter 3 (and, of course, filters 11 and 21 of Figures 1, 2 and 2a) comprises two stages in series, the first filter element 51 comprising a number j of filters 51a, 51b, ... 51j, each receiving an excitation sequence [x -i ], [u -i ], [v -i ] etc., and producing an output.
  • the filters 51a, 51b etc. have responses B (1) (z)z -n-d1 , B (2) (z)z -n-d2 , b (3) (z)z -n-d3 etc., respectively, and their combined output is passed to the second filter element 32 which is as before a recursive filter with response 1/A(z) .
  • each filter 51a, 51b, etc. corresponds to filter element 31, and thus has a delay of length d j , and coefficients
  • the parameters d 1 , d 2 , etc., [a k ] and m ... are optimised by minimising the actual error energy for some parameters, reducing computational complexity by partially linearising (i.e. calculating some parameters by solving for minimum prediction energy) using methods based on those described above in the first to sixth embodiments.
  • the filter parameters are sequentially optimised so that the filter "grows" - in other words, values for ⁇ , and d 1 , are derived first, then values for b ) and d 2 , and so on - and consequently improves its accuracy.
  • the filter optimisation means 6 in one embodiment is adapted to employ a strategy based on that of the first embodiment (Method I) of the single excitation case.
  • a, and d 1 are found by minimising actual error energy following scheme 1, 2 or 3 of Figure 4, with other parameters assumed to be zero.
  • these values of ⁇ , and d 1 are used to derive ) and d 2 values which, in a preferred embodiment, may then themselves be used to redefine ⁇ and in various ways, some of which are discussed below.
  • a first approach is based on scheme 1 (or scheme 2).
  • u-(n + d 2 + q 2 ) to u -(d 2 + 1) which corresponds to equation F in the first stage.
  • the second analysis by synthesis loop can calculate for each value of d 2 (0 ⁇ d 2 ⁇ N-1), minimum prediction error energy solutions for , or a or ⁇ respectively. Regardless of how these are calculated, however, for each of the N resulting [ ⁇ , d 1 , d 2 ) filters, [ ⁇ i ] is synthesised and the filter which offers minimum actual error energy e e, is selected.
  • Some of the filter parameters can be re-estimated
  • filter parameters for minimum prediction energy some or all filter parameters can be re-defined, for either a minimum prediction error energy or a minimum actual error energy, as follows:
  • (i) may be optimised, independently of and b -(1) , within an analysis by synthesis loop which is executed for N different values of d 2 .
  • (ii) may be calculated jointly with for minimum prediction error energy For each value of d 2 , ⁇ is retained and is re-estimated for minimum prediction error.
  • [a k ] coefficients being initially optimised for minimum prediction error energy using a conventional LPC solution.
  • the method used in deriving filter coefficients can be chosen from a wide class of algorithms based on one or other of the six schemes which could be used in stage 1. Inspection of the second stage equations show their formal similarity to the stage 1 equations, however, and it is preferable that the filter optimisation calculator 6 employs the same method for stage 2 and each subsequent stage as for stage 1, with necessary extensions to the equations, since this lends itself easily to iterative programming techniques and hence reduces system complexity.
  • the excitation adaption calculator 12 will now be described. From the foregoing, it is clear that the excitation used at the receiver station to drive the synthesis filter 21 must be that for which the filter is optimised; when the receiver filter 21 is identical to the optimisation filter 3 and local decoder filter 11 at the transmitter, the excitation must be the same as that for which the transmitted parameters were optimised.
  • this is arranged, as described above, by providing the excitation sequence generators at the transmitter 2 and the receiver 23 with a preprogrammed initial sequence - the exact nature of the sequence is not critical, but it may be for example a zero mean Gaussian random sequence.
  • This single sequence may be used to drive each input of a multiple-input synthesis filter 3 of the kind shown in Figure 5.
  • the values of the d 1 , d 2 and d- parameters define specific segments in the [x -i ] sequence which are then inserted into the FIR components 51a, 51b, 51c of the H(z) synthesis filter 3 to act as excitations thereto.
  • the adaptation calculator 12 is simply a link between the speech output and the excitation store.
  • Use of the PDS signal as an excitation sequence has significant advantages in increased signal to noise ratio for a given transmission rate, and/or low coding delay.
  • X(z) may be derived from the output of the first filter element 31 or 51, (ie an intermediate output of the whole filter H(z)).
  • the filter element may then be considered to be an infinite impulse response filter since there is thus a feedback path between its output and its input.
  • the input to the A(Z) filter may in one particular case be equivalent to that produced in the 'self-excited' vocoder described by Rose and Barnwell ('The self-excited vocoder - an alternative approach to Toll Quality at 4800 bps', IEEE Proc of ICASSP April 1986 pp 453 - 456).
  • a typical signal used as an input to the all-pole filter 32 is the PDS signal filtered by all-zero filter 31, which is essentially very similar to the speech waveforms shown.
  • the speech-like nature of the PDS excitation differs markedly from the PES excitations shown in Figure 7b; these latter waveforms are very much more similar to the excitations used in prior art systems such as MPLPC.
  • excitations generally resemble a train of glottal pulses like those produced by the human vocal cords.
  • the PDS excitation of the invention produces better results than the PES excitation. Since the PDS signal is 'speech-like' however it requires less filtering, as the filter merely needs to modify the previous speech frame to take account of the change between frames, rather than having to re-synthesise each speech frame from a random waveform as in the traditional source/filter model.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

In an LPC type coded communications system the excitation source is derived from previous filter outputs at the de­ coder; in one embodiment the speech output is used, in other embodiments, intermediate excitation outputs are used. To enable tracking the coder derives the filter parameters by using the same excitations, supplied by a local decoder, to syn­ thesise locally the actual error produced at the decoder; the parameters are optimised iteratively by varying the delay of an FIR stage and deriving the actual error in a loop, and selecting the delay for minimum actual error. The IIR parameters may be calculated jointly with the other FIR parameters inside the loop, either for minimum prediction error or minimum actual error. The FIR stage may comprise several parallel FIRs, separately excited.

Description

CODED COMMUNICATIONS SYSTEM
This invention relates to a system for transmitting speech signals in coded form. This invention relates also to a transmitter and to a receiver, for coding and decoding respectively such a signal.
Many known systems for transmitting coded speech signals derive the characteristics of a filter from the input speech samples at a coder, and transmit these to a decoder, where they are used to configure a decoder filter which is then excited by a suitable excitation signal source to produce a synthesised reproduction of the input speech signal.
In Linear Predictive Coding (LPC), an all-pole recursive filter is used to model the short-term spectral envelope of a speech signal, using a limited number of coefficients which are updated at periodic intervals, the coefficients usually being directly calculated by solving a set of linear equations devised to minimise the 'prediction error' (a measure of the difference between the input speech signal and the predicted speech signal).
In some LPC systems, a predictor stage is included in the coder to remove long-term periodicity corresponding to speaker pitch from the difference signal; this predictor may also be considered as a filter, and the corresponding filter parameters are also transmitted to the decoder.
The prediction error, regardless of whether or not pitch prediction is included in the predicted speech signal, differs from the actual error because the prediction is based on a model of excitation, not on the excitation actually used at the decoder Recent LPC systems include, for example, the Multipulse Excited (HP), the Regular Pulse Excited (RPE), and the Code Book Excited (CE) LPC Systems. The excitation to be used at the decoder is selected or derived at the coder; in the above systems, the decoder includes a controllable excitation signal generator, and the coder must therefore transmit control signals to the decoder.
The decoder may thus be a two-stage filter (with a relatively short delay all-pole filter and a relatively long delay filter) and an excitation generator.
The excitation control signals are themselves produced at the coder; in MPLPC, this has been achieved by using "analysis by synthesis" - ie synthesising speech locally using the unmodified excitation pulse sequence, subtracting this synthesis from the actual input speech sequence to form an actual error signal, perceptually weighting the error signal, and then using the weighted error signal as a control in a closed optimisation loop to select the excitation sequence for minimum (weighted) actual error.
According to the present invention there is provided a method of speech coding in which filter parameters are periodically derived from an input speech signal in a coder and transmitted to a decoder to update the response of a decoder filter, in which the excitation input to the decoder filter is derived in the decoder from the speech output of the decoder filter.
It is thus possible to reduce the bit rate by avoiding the transmission of data relating to the excitation sequence, as the excitation is simply derived from the previous output speech at the decoder. Preferably at least some of the filter parameters are derived so as to reduce the actual error which will be produced at the decoder, rather than the prediction error. A synthesis filter may be provided at the coder and the actual error may be derived by subtracting locally synthesised speech from the input speech signal.
Preferably the parameter derivation is achieved by iteratively deriving the delay parameter of an all-zero stage of the filter by minimising the actual error. Preferably at least some of the other parameters are calculated inside the loop. Preferably all other parameters are jointly optimised, by calculating them inside the delay parameter iteration loop. In another embodiment these optimisation methods may be applied to a system using previous excitations, rather than previous decoded speech, as excitation to a plurality of parallel feed-forward stages of the filter. Use of the invention can give a significant signal to noise ratio advantage and/or an ability to work effectively with small frame sizes and thus allows, for a given signal to noise ratio, a shorter coding delay.
Other aspects of the invention are recited in the claims .
The invention will now be described, by way of example only, with reference to the drawings in which;
- Figure 1 shows schematically a generalised transmitter employing a coder according to the invention,
- Figure la shows a specific embodiment of a transmitter employing a coder according to the invention,
- Figure 2 shows schematically a generalised receiver employing a decoder according to the invention,
- Figure 2a shows a specific embodiment of a receiver employing a decoder according to the invention,
- Figure 3 shows schematically the components of a single excitation synthesis filter of Figures 1 or 2, - Figures 4a and 4b provide six algorithms corresponding to methods of optimising the parameters of the synthesis filter of a coder according to the invention,
- Figure 5 shows schematically the components of a multiple excitation synthesis filter of a coder according to another aspect of the invention
- Figure 6 illustrates examples of methods of optimising the parameters of such a multiple input synthesis filter, and - Figures 7a and 7b show excitation waveforms occurring in embodiments of the invention.
Referring to Figure 1, at a transmitter, incoming speech is sampled to give frames [yi] (i=0,1,... n-1) of samples. From each frame, a filter optimisation stage 1 derives filter parameters, using an 'analysis by synthesis' technique, as follows. An excitation sequence source 2 generates an initial excitation sequence [x-i] (i=0,1,...n-1), which drives a two stage synthesis filter 3 corresponding to a finite impulse response filter B(z) and an infinite impulse response (all pole) filter 1/A(z) in series, having a response H(z):
where and n is the size of the
analysis frame from which H(z) is to be estimated.
The filter parameters are thus coefficients [ak] and [bk] and delay d1. A frame of synthesised speech output [ŷi ] is produced by the synthesis filter 3, and this is compared with the original input speech frame [yi] by subtractor 4 which produces a difference signal êi, where êi = yi - ŷi . A better measure of the perceived error to the human ear is obtained by frequency-weighting the error signal in such a manner as to de-emphasise formant regions in the error spectrum. Accordingly, the difference signal e. is preferably filtered by a weighting filter 5 with suitable transfer function, to form the perceptually weighted error signal êw. This signal is then minimised to optimise the parameters of synthesis filter 3 by optimisation calculator 6 in an iterative closed loop, which derives values for [ak], [bk] and d1, which are quantised by quantiser 7 and passed to a coder 8, which codes the parameters for transmission to a decoder station, and to a local decoder 10 which is functionally identical to the decoder at the receiver station.
At the local decoder 10, the parameters are used to configure a local decoder synthesis filter 11, which is then driven from the excitation sequence source 2 by the same excitation sequence [x-i] for which the parameters were optimised, to produce an output frame of synthesised speech [ŷi] .
This synthesised output is received and processed as necessary by an excitation adaptation calculator 12, which thus constructs a new excitation sequence [x-i] from the output of the decoder synthesis filter 11. The new excitation sequence is then passed to the excitation sequence source 2 for use on the next input speech frame, for which it will form part of the Past Decoded Speech (PDS) signal.
The excitation adaptation calculator 12 and Local Decoder 10 are not necessary in all embodiments of the invention as shown in Figure 1a. In this simple embodiment, a frame of input speech [yi] is received and optimised filter parameters are derived in a closed loop using the excitation sequence supplied by excitation sequence generator 2. Once this is done, the synthesised speech signal [ŷi] produced using these optimised parameters is synthesised by the synthesis filter 3 (rather than by the local decoder synthesis filter 11 as described above with reference to Figure 1) and supplied to the excitation sequence generator 2 which is simply a store capable of storing the synthesised frame [ŷi] , and supplying this as the next excitation sequence [x-i] (and also of storing the initial excitation sequence needed when the first frame of input speech is to be coded).
Referring to Figure 2, at the distant decoder 20 a decoder unit 28 performs the inverse operation to coder 8, to recover the filter parameters [ak] , [bk] and d1, which are passed to synthesis filter 21 to produce the configuration previously optimised at the transmitter. Excitation sequence generator 23 drives the filter with an excitation sequence [x-i] which will be the same as that used at the transmitter; for the first frame of speech it will be the initial excitation sequence identical to that used at the transmitter. A frame of synthesised speech output [ŷi] is thus produced for the listener, and this output is also supplied to the excitation adaption calculator 22 to create a new excitation sequence [x-i] in the excitation sequence generator 23 in the same manner as at the local decoder 10.
As shown in Figure 2a, the excitation adaption calculator 22 is not necessary in single input embodiments, and the excitation sequence generator 23 may simply be a frame store holding the last decoded frame
(and an initial excitation sequence) as discussed above.
Since the distant decoder 20 and the local decoder 10 perform exactly the same synthesis, using the same filter and the same initial excitation, the updated excitation sequences thus created are also the same and there is no need to transmit any excitation sequence data.
There is of course a tendency for the propagation of transmission errors, since a wrongly-received filter parameter will produce an "incorrect" excitation sequence which is, not that for which the next set of -filter parameters were optimised at the transmitter.
Accordingly, it is advantageous to provide, in a known manner, re-initialisation sequences occasionally so as to bring the distant decoder and the local decoder back into agreement; this may take the form of a periodic control bit transmitted to both decoders, instructing each excitation sequence generator 23 to re-start with the preprogrammed initial excitation sequence.
It will, of course, be understood that many of or all the above functions may be performed by a single digital processor and therefore need have no separate physical existence. The operation of the filter optimisation stage 1 of the transmitter of Figures 1 and 1a will now be described in greater detail.
In many prior art systems using LPC coding techniques, the optimisation process is performed by minimising the value of the prediction (residual) error, which is derived by comparing the current value of the speech signal and its predicted value based on past samples,
where ei is the prediction error for the ith sample.
This technique might also be used in implementing the invention, but preferably at least one filter characteristic is optimised by minimising the actual error êi = yi - ŷi between the synthesised speech output of the synthesis filter 3, or rather in a preferred embodiment the weighted actual error êw produced by passing êi through the perceptual weighting filter 5. The weighting filter 5 is defined in the same way as in MPLPC or RPE-LPC, and will henceforth be omitted from the analysis of operation of the invention; the term "actual error" will be understood to encompass the perceptually weighted actual error. Since minimising êw (the error vector) by calculation is a non-linear problem of high computational complexity, in preferred embodiments of the invention the filter characteristics are selected by partly linearising the problem, by calculating initially some parameters for minimum prediction error energy (using a least mean square solution) which is straightforward. Using these values, the optimum values for the remaining parameters are derived by finding the actual error êi (for example by synthesising the output speech signal ŷi in a loop for each possible delay parameter value and subtracting it from the input speech signal yi ), finding the minimum mean squared weighted actual error êw, and selecting parameter values which gave rise to this minimum êw The values of the other parameters may be calculated either inside or outside the loop, as discussed below.
Referring to Figure 3, the synthesis filter 3 of the filter optimisation stage 1 comprises essentially two stages in series; a first filter element 31 comprising a finite impulse response filter with transfer characteristic B(z)z -n-d1, and a second filter element 32 comprising an infinite impulse response filter with transfer function so that
the overall filter impulse response H(z) is
as required by equation A. The first filter element 31 consists of q1+1 coefficients [bi] (i=0,1,.....q1) and q1+d1 delay elements so that it comprises a delay of length (d1+n-1) followed by a q1+ 1 stage non-recursive filter; in practice, a small number (q1=0,1 or 2) of such coefficients are used.
The number of delay elements (and hence the delay length) d, employed is variable in operation up to a predetermined maximum number N-1 (which may be greater or smaller than the frame size n).
This first filter element 31 therefore receives the excitation signal [x-i] and contains the most recent q1+ d1+ n samples of the [x-i] sequence.
The second filter element comprises a recursive filter with an all-pole response, defined by filter coefficients
[ak] (k = 1, 2, ...p). In the following description, vector notation will be adopted whereby the vector ā denotes the set of [ak] coefficients and the vector b denotes the set of [bq] coefficients.
For each received frame of input speech, the filter optimisation calculator 6 initially calculates values for the [ak] coefficients for mean square prediction error energy
In preferred embodiments, (referred to generally as
Method 1 in Figures 4 and 5) the a vector is calculated jointly with the b vector inside a loop for each value of d1 between 0 and N-1, and d, is selected for minimum ew as discussed above,
For each specific value of d1, the ā and b vectors which minimise the predictor error energy are
(D)
where X is an nx(q1 + 1) matrix of excitation samples xk from
x-(n + d1 + q1 ) to X-(d1 + 1) and Y is an (nxp) matrix of yk samples from y-p to yn-2.
The actual error energy is minimised using one of the following three methods:
In a first embodiment, for each value of d1 in the loop, ā and b are computed jointly from (D) for minimum prediction error and used to synthesise ŷ. The actual error energy is also determined using H(z) is therefore determined within an 'analysis by synthesis' loop where d, varies over a range of N values i.e. 0≤ d1≤ N-1 and the ā,b vectors are defined for the value of d1 which minimises In this method only the d1 filter parameter is optimised for a minimum actual error energy while both ā and b are selected for minimum prediction error energy T This is referred to in Figure 4 as 'scheme 1'.
In a second embodiment, the approach for estimating b is a modification of Scheme 1 above. In this case, ā and b are defined by equation (D) for the value of d1 which minimises prediction error . Once d1 is optimised the b vector is re-estimated (outside the optimisation loop) for minimum actual error, using:
m represents the output of the 1/A(z) filter when its output is zero, and its "memory" consists of the most recent samples of the previous synthesised speech frame; where Q is an (nxn) convolution matrix and q, is the kth value of the impulse response of the 1/A(z) filter, employing the a and d, values obtained from the previous analysis by synthesis search process. This is referred to in Figure 4 as 'scheme 2'.
In a third embodiment, for each given value of d1, a (and b) is calculated for minimum prediction error using equation (D). The values of b are then (re-Calculated inside the loop for minimum actual error, using equation (F).
Thus within the analysis by synthesis loop, defined for each d, value (0≤ d1≤ N-1) ā is first calculated using equation (D) and 5 is then re-estimated in the loop. For each set of ā, b and d, values the actual error energy can be measured by synthesising and forming Alternatively can be calculated directly from the expression:
The d1 value, and corresponding ā & b values, giving rise to the minimum actual error energy are then selected as optimal. This is referred to in Figure 4 as "Scheme 3". Schemes 2 and 3 above are preferred on accuracy grounds, as b is (re-)defined by minimising the actual error energy rather than the prediction error energy ē.
In an alternative embodiment, referred to as Method II in Figures 4 and 5, ā is initially calculated for minimum prediction error energy outside the loop and independently of b, by assuming b is equal to zero, yielding
ā = (YTY)-1 (H)
Then given ā, the b and d1 parameters can be optimised by following schemes 1, 2 or 3 described above. In particular, the value of b producing a minimum prediction error energy is given by
b = (XTX)-1XT (J)
The corresponding approaches to schemes 1, 2 and 3 above are shown in Figure 4 as Schemes 4, 5 & 6 respectively and form fourth, fifth and sixth embodiments of the invention.
The parameters produced are then quantised by a quantiser 7, which allocates available bits between parameters.
Although the excitation to the synthesis filter has been described in terms of a single excitation sequence [x-i], it may be desirable to use several excitation sequences and accordingly, referring to Figure 1, excitation sequence source 2 is adapted to supply a number of different sequences [x-i] , [u-i] , [v-i]...to synthesis filter 3.
One excitation sequence, [x-i], may be the Past
Decoded Speech ŷi output of the synthesis filter 3; the other sequences are derived by the excitation adaptioncalculator 12 from intermediate outputs of the synthesis filter 3 (or all such sequences may be thus derived as discussed below).
Referring to Figure 5, the filter 3 (and, of course, filters 11 and 21 of Figures 1, 2 and 2a) comprises two stages in series, the first filter element 51 comprising a number j of filters 51a, 51b, ... 51j, each receiving an excitation sequence [x-i], [u-i], [v-i] etc., and producing an output. As shown, the filters 51a, 51b etc.; have responses B(1)(z)z-n-d1, B(2)(z)z-n-d2, b(3)(z)z-n-d3 etc., respectively, and their combined output is passed to the second filter element 32 which is as before a recursive filter with response 1/A(z) .
As shown, each filter 51a, 51b, etc., corresponds to filter element 31, and thus has a delay of length dj, and coefficients As with the case of a single excitation filter, the parameters d1, d2,.....dj, [ak] and m ... are optimised by minimising the actual error energy for some parameters, reducing computational complexity by partially linearising (i.e. calculating some parameters by solving for minimum prediction energy) using methods based on those described above in the first to sixth embodiments.
In general, the filter parameters are sequentially optimised so that the filter "grows" - in other words, values for ā, and d1, are derived first, then values for b ) and d2, and so on - and consequently improves its accuracy.
This lends itself to adaptive transmission schemes, wherein if a bit-rate reduction is required, only earlier coefficients (ā, d1 for example) need be transmitted.
Referring to Figure 6, the filter optimisation means 6 in one embodiment is adapted to employ a strategy based on that of the first embodiment (Method I) of the single excitation case. In a first stage of the process, a, and d1 are found by minimising actual error energy following scheme 1, 2 or 3 of Figure 4, with other parameters assumed to be zero. In a second stage these values of ā, and d1 are used to derive ) and d2 values which, in a preferred embodiment, may then themselves be used to redefine ā and in various ways, some of which are discussed below. A first approach is based on scheme 1 (or scheme 2).
If ) is calculated independently of a and a minimum predictor energy e e solution is given by
(F1) where U is an nx(q1 + 1) matrix of excitation samples uk from
u-(n + d2 + q2 ) to u-(d2 + 1) which corresponds to equation F in the first stage. may alternatively be calculated jointly with ā, or jointly with a and using straightforward matrix expressions corresponding to equation (D) for the first stage. Thus the second analysis by synthesis loop can calculate for each value of d2 (0≤ d2< N-1), minimum prediction error energy solutions for , or a or ā respectively. Regardless of how these are calculated, however, for each of the N resulting [ā, d1, d2) filters, [ŷi] is synthesised and the filter which offers minimum actual error energy e e, is selected.
Some of the filter parameters can be re-estimated
(outside the loop) for the specific values d1 = d1 and d2 = d2 which have been already determined by the process. Specifically, when the second analysis by synthesis process estimates filter parameters for minimum prediction energy, some or all filter parameters can be re-defined, for either a minimum prediction error energy or a minimum actual error energy, as follows:
(i) given d1 and d2, prediction error energy solutions for ā, and can be calculated.
(ii) given d1, d2 and a, the vectors can be re-calculated for minimum prediction error from:
Alternatively, approaches based on the algorithms of scheme 3 may be employed, for example;
(i) may be optimised, independently of and b-(1) , within an analysis by synthesis loop which is executed for N different values of d2. For each d2, the vector b is calculated for minimum actual error energy, so that is selected for the value of d2 = d which minimises
(ii) may be calculated jointly with for minimum prediction error energy For each value of d2, ā is retained and is re-estimated for minimum prediction error.
(iii) is calculated jointly with ā and for minimum predictor error. The new a vector is retained while and are re-estimated for minimum prediction error from equation (D3).
Now, given the optimised values of d1 and d2, i.e. and some of the filter parameters can be re-optimised in a single step (outside the analysis by synthesis loop) using equation (D3) to re-define and
The extension of the above calculations to a third stage for calculating and d3, and recalculating previous filter characteristics, is simple if the equations are appropriately modified to include higher terms, and the size of the filter, i.e. the number of summation terms used in producing increases progressively as the optimisation process evolves from one stage to the next.
In another embodiment of the invention, the approach of schemes 4,5 or 6 of Figure 4b may be followed, the
[ak] coefficients being initially optimised for minimum prediction error energy using a conventional LPC solution.
The method used in deriving filter coefficients can be chosen from a wide class of algorithms based on one or other of the six schemes which could be used in stage 1. Inspection of the second stage equations show their formal similarity to the stage 1 equations, however, and it is preferable that the filter optimisation calculator 6 employs the same method for stage 2 and each subsequent stage as for stage 1, with necessary extensions to the equations, since this lends itself easily to iterative programming techniques and hence reduces system complexity.
Although in the foregoing, the selection of some filter parameters was effected by calculating them for minimum prediction error energy e it will be understood that they could be calculated or estimated otherwise (for example by inverse filtering [yi] ).
It is important to realise that the adaptation of the H(Z) filter as described above is not only confined to the values of the coefficient sets but is also applied to the "structure" of the filter - ie the lengths of the delay lines d1, d2 etc - so that the operation of H(z) is re-defined on a frame-by-frame basis (in contradistinction to conventional adaptive filters).
The operation of the excitation adaption calculator 12 will now be described. From the foregoing, it is clear that the excitation used at the receiver station to drive the synthesis filter 21 must be that for which the filter is optimised; when the receiver filter 21 is identical to the optimisation filter 3 and local decoder filter 11 at the transmitter, the excitation must be the same as that for which the transmitted parameters were optimised.
Initially this is arranged, as described above, by providing the excitation sequence generators at the transmitter 2 and the receiver 23 with a preprogrammed initial sequence - the exact nature of the sequence is not critical, but it may be for example a zero mean Gaussian random sequence. This single sequence may be used to drive each input of a multiple-input synthesis filter 3 of the kind shown in Figure 5. Considering the first three filters 51a, 51b, 51c having responses B(1)(z)z-n-d1, B(2)(z)z-n-d2, B(3)(z)z-n-d3 as discussed above, where n is the frame size of the analysis frame, the values of the d1, d2 and d- parameters define specific segments in the [x-i] sequence which are then inserted into the FIR components 51a, 51b, 51c of the H(z) synthesis filter 3 to act as excitations thereto.
In the PDS embodiment of the invention, the adaptation calculator 12 is simply a link between the speech output and the excitation store. Use of the PDS signal as an excitation sequence has significant advantages in increased signal to noise ratio for a given transmission rate, and/or low coding delay.
As discussed above, it is of course possible to use other inputs for X(z): in particular, X(z) may be derived from the output of the first filter element 31 or 51, (ie an intermediate output of the whole filter H(z)). The filter element may then be considered to be an infinite impulse response filter since there is thus a feedback path between its output and its input. Where the excitation is derived only from such intermediate outputs, the input to the A(Z) filter may in one particular case be equivalent to that produced in the 'self-excited' vocoder described by Rose and Barnwell ('The self-excited vocoder - an alternative approach to Toll Quality at 4800 bps', IEEE Proc of ICASSP April 1986 pp 453 - 456).
The parameter optimization methods described above are equally applicable to such parallel filter structures.
Referring to Figure 7a a typical signal used as an input to the all-pole filter 32 is the PDS signal filtered by all-zero filter 31, which is essentially very similar to the speech waveforms shown. The speech-like nature of the PDS excitation differs markedly from the PES excitations shown in Figure 7b; these latter waveforms are very much more similar to the excitations used in prior art systems such as MPLPC.
These excitations generally resemble a train of glottal pulses like those produced by the human vocal cords.
At first sight, it is therefore surprising that the PDS excitation of the invention produces better results than the PES excitation. Since the PDS signal is 'speech-like' however it requires less filtering, as the filter merely needs to modify the previous speech frame to take account of the change between frames, rather than having to re-synthesise each speech frame from a random waveform as in the traditional source/filter model.
It has been found that SNR decreases with increasing frame size for each embodiment but also that the PDS embodiment using a single excitation sequence has an SNR advantage - especially at low frame sizes - which is very noticeable in subjective speech quality. Also the single excitation PDS embodiment is capable of operating on smaller frame sizes which reduces coding delay to considerable advantage.
Joint rather than separate evaluation of filter parameters has been found particularly effective.
It was found that increasing either the number of filter coefficients, or the maximum value of di, results in a higher SNR for both codecs. When the SNR improvement achieved is costed against the number of extra bits required to be transmitted per frame however, then it is clear that the allocation of the extra bits to the delay parameters pays off considerably better than the allocation of bits to new filter coefficients.

Claims

1. A method of transmitting speech in which filter parameters are periodically derived from an input speech signal in a coder and transmitted to a decoder to update the response of a decoder filter, in which the excitation input to the decoder filter is derived in the decoder from the speech output of the decoder filter.
2. A method according to claim 1 in which the derivation of the filter parameters is carried out in the manner such as to reduce the actual error between the input speech signal and the output of the decoder.
3. A method according to claim 2, in which the decoder includes a synthesis filter comprising a first filter and a second filter in series, the first filter having a variable delay parameter and the second filter being an infinite impulse response filter; and the filter parameters are periodically derived in the coder by an iterative process including the step of varying the delay parameter and deriving a corresponding estimate of the actual error between the input speech signal and the output of the decoder; and then selecting a value of the delay parameter which reduces this estimated actual error.
4. A method according to claim 3, in which the coder also includes such a synthesis filter, and the estimate of the actual error is derived at the coder by-- producing a synthetic speech output for each value of the delay parameter and comparing the synthetic speech output with the input speech signal.
5. A method according to claim 3 or claim 4 in which for each value of the delay parameter, values for other parameters of the first filter and for parameters of the second filter are jointly calculated and used to derive the estimated actual error.
6. A method according to claim 3 or claim 4, in which values for parameters of the second filter are initially estimated outside the iterative process, and values for the other parameters of the first filter are calculated for each value of the delay parameter and used to derive the estimated actual error.
7. A method of transmitting speech in which, in a coder, filter parameters are periodically derived from an input speech signal and transmitted to a decoder to update the response of a decoder filter comprising a first filter and a second filter in series, the first filter having a variable delay parameter and the second filter being an infinite impulse response filter; in which the excitation input to the decoder filter is derived at the decoder from one or more intermediate outputs of the decoder filter and in which the filter parameters are derived in the coder by an iterative process including the step of; varying the delay parameter, calculating the values of the other parameters of the first filter and of the second filter and using such values to derive an estimate of the actual error between the input speech signal and the output of the decoder; and then selecting the delay parameter value so as to reduce the estimated actual error.
8. A method according to any one of claims 5, 6 or 7, wherein values of the other parameters of the first filter are calculated, for each value of the delay parameter, so as to give rise to a low prediction error.
9. A method according to any one of claims 5, 6 or 7, wherein values of the other parameters of the first filter are calculated, for each value of the delay parameter, so as to give rise to a low actual error.
10. A method according to any one of claims 5, 6 or 7, wherein after selection of the delay parameter, values of the other parameters of the first filter are recalculated so as to give rise to a low actual error.
11. A method according to any preceding claim, in which the actual error is weighted so as to reduce its dependence upon perceptually less important spectral regions.
12. A speech coder comprising means arranged to periodically derive filter characteristics from an input speech signal, in such a manner as to reduce an estimate of the actual error between the input speech signal and 4he synthetic speech output which would be produced by a filter having those characteristics when excited by a signal derived from its previous synthetic speech output.
13. A coder according to claim 12 including a synthesis filter adapted to produce a synthetic speech output, the estimate of the actual error being the error formed between the input speech signal and the synthetic speech output of the synthesis filter.
14. A speech coder including means arranged to periodically derive filter characteristics from an input speech signal, the filter characteristics being derived in such a manner as to reduce an estimate of the actual error between the input speech signal and the output which would be produced by a synthesis filter comprising a first filter and a second filter in series, the first filter comprising a plurality of parallel feed forward filters, at least one of which is connected to receive one of a plurality of different excitation sequences derived from combinations of the outputs of the parallel feed forward filters, and the second filter being an infinite impulse responser filter, having those characteristics, when thus excited.
15. A coder according to claim 14, including means arranged to derive the said filter characteristics by a method comprising the steps of: varying the value of a delay parameter of a first said parallel feed forward filter, deriving a corresponding estimate of the actual error for each such value, and selecting a value of that delay parameter which reduces the said actual error; and then repeating the step for the delay parameter of each further said parallel feed forward filter, each said estimate being produced using all previously selected said delay parameter values.
16. A coder according to any one of claims 12 to 14, including also a local decoder adapted to produce a synthetic speech output.
17. A decoder for coded speech which includes a filter, the decoder being arranged upon receipt of coded speech signals to update the characteristics of the filter therefrom, the filter having an excitation input and a speech output connected thereto so that in use the filter is. excited from its previous speech output.
18. A decoder for coded speech which includes a filter, the decoder being arranged upon receipt of coded speech signals to update the characteristics of the filter therefrom, the filter comprising a first filter and a second filter in series, the first filter comprising a plurality of parallel feed forward filters, at least one of which is connected to receive one of a plurality of different excitation sequences derived from combinations of the outputs of the parallel feed forward filters, and the second filter being an infinite impulse response filter.
19. A decoder according to claim 18, in which at least one such feed forward filter is connected to receive an excitation sequence derived from the output of the second filter.
20. A receiver including a decoder according to any one of claims 17 to 19.
21. A receiver substantially as herein described with reference to the accompanying Figures 2 and 2a.
22. A transmitter including a coder according to any one of claims 12 to 16.
23. A transmitter substantially as herein described with reference to the accompanying Figures 1 and la.
24. A speech transmission system comprising a transmitter and a receiver according to any preceeding claim.
24. A method of speech transmission substantially as herein described, with reference to the accompanying Figures.
EP19880907742 1987-08-28 1988-08-26 Coded communications system Withdrawn EP0341271A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
GB8720388 1987-08-28
GB878720388A GB8720388D0 (en) 1987-08-28 1987-08-28 Speech coding
GB8801858 1988-01-28
GB888801858A GB8801858D0 (en) 1988-01-28 1988-01-28 Coded communications system

Publications (1)

Publication Number Publication Date
EP0341271A1 true EP0341271A1 (en) 1989-11-15

Family

ID=26292659

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19880907742 Withdrawn EP0341271A1 (en) 1987-08-28 1988-08-26 Coded communications system

Country Status (3)

Country Link
EP (1) EP0341271A1 (en)
JP (1) JPH02500620A (en)
WO (1) WO1989002148A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI90477C (en) * 1992-03-23 1994-02-10 Nokia Mobile Phones Ltd A method for improving the quality of a coding system that uses linear forecasting
FI96248C (en) * 1993-05-06 1996-05-27 Nokia Mobile Phones Ltd Method for providing a synthetic filter for long-term interval and synthesis filter for speech coder
FI98164C (en) * 1994-01-24 1997-04-25 Nokia Mobile Phones Ltd Processing of speech coder parameters in a telecommunication system receiver
JPH0830299A (en) * 1994-07-19 1996-02-02 Nec Corp Voice coder
TW317051B (en) * 1996-02-15 1997-10-01 Philips Electronics Nv

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4038495A (en) * 1975-11-14 1977-07-26 Rockwell International Corporation Speech analyzer/synthesizer using recursive filters
US4301329A (en) * 1978-01-09 1981-11-17 Nippon Electric Co., Ltd. Speech analysis and synthesis apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO8902148A1 *

Also Published As

Publication number Publication date
WO1989002148A1 (en) 1989-03-09
JPH02500620A (en) 1990-03-01

Similar Documents

Publication Publication Date Title
EP0422232B1 (en) Voice encoder
EP0409239B1 (en) Speech coding/decoding method
US5701392A (en) Depth-first algebraic-codebook search for fast coding of speech
US6594626B2 (en) Voice encoding and voice decoding using an adaptive codebook and an algebraic codebook
CA1218745A (en) Speech signal processing system
US5826224A (en) Method of storing reflection coeffients in a vector quantizer for a speech coder to provide reduced storage requirements
US5359696A (en) Digital speech coder having improved sub-sample resolution long-term predictor
US5457783A (en) Adaptive speech coder having code excited linear prediction
EP1008982B1 (en) Voice encoder, voice decoder, voice encoder/decoder, voice encoding method, voice decoding method and voice encoding/decoding method
US6202045B1 (en) Speech coding with variable model order linear prediction
US5953697A (en) Gain estimation scheme for LPC vocoders with a shape index based on signal envelopes
JPH0683400A (en) Speech-message processing method
US5659659A (en) Speech compressor using trellis encoding and linear prediction
EP0450064B1 (en) Digital speech coder having improved sub-sample resolution long-term predictor
US5570453A (en) Method for generating a spectral noise weighting filter for use in a speech coder
US5797119A (en) Comb filter speech coding with preselected excitation code vectors
US6295520B1 (en) Multi-pulse synthesis simplification in analysis-by-synthesis coders
EP0341271A1 (en) Coded communications system
US5692101A (en) Speech coding method and apparatus using mean squared error modifier for selected speech coder parameters using VSELP techniques
US7337110B2 (en) Structured VSELP codebook for low complexity search
EP0855699B1 (en) Multipulse-excited speech coder/decoder
Gouvianakis et al. Advances in analysis by synthesis LPC speech coders
EP0539103A2 (en) Generalized analysis-by-synthesis speech coding method and apparatus
KR950001437B1 (en) Method of voice decoding
JP3270146B2 (en) Audio coding device

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19890426

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE FR GB IT LI LU NL SE

18W Application withdrawn

Withdrawal date: 19910802

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

R18W Application withdrawn (corrected)

Effective date: 19910802