CN101030376A - Combined method for searching energy parameter gain shape quantization of 0.6kb/s voice coder - Google Patents


Info

Publication number
CN101030376A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA200710065402XA
Other languages
Chinese (zh)
Inventor
崔慧娟
唐昆
李晔
洪侃
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2007-04-13
Filing date
2007-04-13
Publication date
2007-09-05
Application filed by Tsinghua University
Priority to CNA200710065402XA
Publication of CN101030376A
Legal status: Pending

Abstract

A joint search method for gain-shape quantization of a vocoder energy parameter. After the energy parameters of three consecutive speech frames are obtained, their mean is computed and uniformly scalar quantized to give a quantization index. A candidate interval centered on that index is determined, and each index in the interval is inverse scalar quantized. For each inverse-quantized value, the energy parameters of the three frames are normalized by that value to give a normalized vector, which is then vector quantized. Over all values in the interval, the scalar index and codebook index that give the minimum error are selected.

Description

Joint search method for gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder
Technical field
The invention belongs to the field of speech coding, in particular low-rate parametric speech coding.
Background
Speech coding is widely used in communication systems, voice storage and playback, and consumer products with voice functions. In recent years the International Telecommunication Union (ITU), some regional organizations, and some countries have issued a series of speech compression coding standards that achieve satisfactory speech quality at coding rates from 1.2 kb/s to 16 kb/s. Current research at home and abroad concentrates on high-quality speech compression coding below 1.2 kb/s, mainly for wireless communication, secure communication, and high-capacity voice storage and playback. The 0.6 kb/s vocoder algorithm is one focus of this work. The invention proposes a joint search algorithm for the joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder, which reduces the quantization error of the energy parameter and improves the quality of the synthesized speech.
Summary of the invention
The object of the invention is to provide a joint search method for gain-shape quantization of the energy parameter in a 600 b/s vocoder that reduces the quantization error of the energy parameter and improves the quality of the synthesized speech.
The method is implemented in a digital integrated circuit and comprises the following steps in order:
Step (1): Divide the input speech samples into subframes, in time order, at a set time interval. The speech samples are obtained by sampling at a set frequency and high-pass filtering to remove power-line (mains-frequency) interference.
Step (2): Group every three adjacent subframes obtained in step (1) into a superframe, and compute the energy parameter $g_{n,i}$ of each subframe in the current superframe:
$$g_{n,i} = \log\left(\frac{1}{L_i}\sum_{L_i} s(n)^2\right)$$
where n is the index of the current superframe; i is the index of the subframe within the current superframe, i = 1, 2, 3; $L_i$ is the length of the window over which the energy parameter of subframe i is computed; and s(n) is the speech signal within that window.
Step (3): Compute the mean $\bar{g}_n$ of the subframe energy parameters in the current superframe:
$$\bar{g}_n = \frac{1}{3}\sum_{i=1}^{3} g_{n,i}$$
Step (4): Apply uniform scalar quantization to the mean energy parameter $\bar{g}_n$ from step (3). The quantization range is 10 dB to 77 dB and 5 bits are used, which gives the scalar quantization index l:
[Equation image A20071006540200052 not reproduced in the source text]
where $\lfloor\cdot\rfloor$ denotes rounding down.
Step (5): Taking the index l from step (4) as the center, determine the interval ψ of candidate scalar quantization indices:
ψ=[l-2,l+2]
Step (6): For each index $l_m$ in the candidate interval from step (5), compute its inverse scalar quantized value $\hat{g}_{n,l_m}$:
$$\hat{g}_{n,l_m} = \frac{77-10}{32}\, l_m + 10$$
where $l_m$ is a value in the index interval ψ.
Step (7): Divide each subframe energy parameter $g_{n,i}$ of the current superframe from step (2) by the value $\hat{g}_{n,l_m}$ from step (6) to obtain the normalized vector $(g'_{n,1}, g'_{n,2}, g'_{n,3})$, where
$$g'_{n,i} = g_{n,i} / \hat{g}_{n,l_m}$$
Step (8): Vector quantize the vector from step (7) with a preset vector quantization codebook. A full search is used to find the codebook index k that minimizes the error
$$E_{l_m,k} = \sum_{i=1}^{3}\left(g_{n,i} - \hat{g}_{n,l_m}\, g''_{k,i}\right)^2, \qquad k = \arg\min_k E_{l_m,k}$$
where $g''_{k,i}$ is the i-th component of the three-dimensional codeword vector with index k in the preset quantization codebook, i = 1, 2, 3.
Step (9): Carry out the calculations of steps (7) and (8) for every $l_m$ in the candidate interval ψ, and take the $l_m$ and k that minimize $E_{l_m,k}$ as the final quantization result for the energy parameter.
The invention is characterized by applying a joint search to the joint gain-shape quantization of the energy parameter. The prior technique does not use a joint search algorithm; it searches step by step and so cannot effectively reduce the quantization error of the energy parameter. With the joint search, the quantization error of the energy parameter can be further reduced by about 1%.
The method reduces the quantization error of the energy parameter and improves the naturalness of the synthesized speech. It is best suited to 600 b/s low-rate parametric speech coding and is intended to be implemented on a digital signal processor (DSP) chip.
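A short inequality shows why a joint search cannot do worse than a stepwise one. As an assumption about the stepwise scheme (the text does not spell it out), suppose it fixes the scalar index at the value l from step (4) and then picks the best codeword for that single index. Because l lies in ψ, that pair is one of the candidates the joint search examines, so

$$\min_{l_m \in \psi,\; k} E_{l_m,k} \;\le\; \min_{k} E_{l,k}$$

The inequality only states that the joint result is never worse; how much it improves depends on the signal and the codebook.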
Description of drawings
Fig. 1 is a flow chart of the joint search method for joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder proposed by the invention.
Embodiment
The joint search method for joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder proposed by the invention is further described below with reference to the drawing and an embodiment.
The method flow is shown in Fig. 1 and comprises the nine steps set out in the Summary of the invention.
The specific embodiment of each step of the method is described in detail below.
In the embodiment of step (1), the input speech is sampled at 8 kHz and high-pass filtered to remove power-line interference, and the resulting samples are divided into subframes in time order. Every 25 ms, that is, every 200 speech samples, forms one subframe.
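The framing described here can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name is hypothetical, the high-pass filter is assumed to have been applied already, and dropping trailing samples that do not fill a whole superframe is an assumption.

```python
SUBFRAME_LEN = 200            # 25 ms at 8 kHz, as stated in the text
SUBFRAMES_PER_SUPERFRAME = 3  # three subframes form one superframe


def split_into_superframes(samples):
    """Group samples into superframes of three 200-sample subframes.

    Assumption: samples are already high-pass filtered, and any trailing
    samples that do not fill a whole superframe are discarded.
    """
    superframe_len = SUBFRAME_LEN * SUBFRAMES_PER_SUPERFRAME
    superframes = []
    for n in range(len(samples) // superframe_len):
        start = n * superframe_len
        superframes.append([
            samples[start + i * SUBFRAME_LEN: start + (i + 1) * SUBFRAME_LEN]
            for i in range(SUBFRAMES_PER_SUPERFRAME)
        ])
    return superframes
```

Each returned superframe is a list of three subframes, so one superframe covers 75 ms of speech.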
In the embodiment of step (2), the gain parameter $g_{n,i}$ of each subframe of the superframe, which is formed from three speech frames, is extracted as:
$$g_{n,i} = \log\left(\frac{1}{L_i}\sum_{L_i} s(n)^2\right)$$
where n is the index of the current superframe, i is the index of the subframe within the current superframe, $L_i$ is the window length, set to the 200 samples of the current subframe, and s(n) is the speech signal sampled at 8 kHz and high-pass filtered to remove power-line interference, as described above.
In the embodiment of step (3), the mean energy $\bar{g}_n$ of the current superframe is computed as:
$$\bar{g}_n = \left(\sum_{i=1}^{3} g_{n,i}\right) / 3$$
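Steps (2) and (3) can be sketched as below. The text writes only "log", while the quantization range is given in dB, so this sketch assumes the energy is 10·log10 of the mean square; that scaling, the small floor that avoids log(0), and the function names are all assumptions.

```python
import math


def subframe_energy_db(subframe, floor=1e-10):
    """Log energy of one subframe.

    Assumption: dB scale, i.e. 10*log10(mean square); `floor` guards
    against log(0) on silent input and is not part of the source text.
    """
    mean_square = sum(x * x for x in subframe) / len(subframe)
    return 10.0 * math.log10(max(mean_square, floor))


def superframe_mean_energy(subframes):
    """Return the per-subframe energies and their mean over the superframe."""
    energies = [subframe_energy_db(sf) for sf in subframes]
    return energies, sum(energies) / len(energies)
```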
In the embodiment of step (4), $\bar{g}_n$ is uniformly scalar quantized. To improve quantization performance, the quantization range is limited to 10 dB to 77 dB, and overflow and underflow are handled by clipping. The parameter is represented with 5 bits, and the quantization index is obtained from the following formula:
[Equation image A20071006540200081 not reproduced in the source text]
where $\lfloor\cdot\rfloor$ is the rounding symbol.
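The quantization formula itself is an image that did not survive in this text. A plausible reading, assumed here because it is the direct inverse of the inverse-quantization formula in step (6), is l = floor((ḡ − 10) / ((77 − 10)/32)) with clipping to the 32 levels that 5 bits allow. The constant and function names are hypothetical.

```python
import math

G_MIN_DB = 10.0
G_MAX_DB = 77.0
N_LEVELS = 32                                 # 5 bits
STEP_DB = (G_MAX_DB - G_MIN_DB) / N_LEVELS    # 2.09375 dB per level


def quantize_mean_energy(mean_db):
    """Uniform scalar quantizer with clipping (assumed form, see lead-in)."""
    clipped = min(max(mean_db, G_MIN_DB), G_MAX_DB)
    index = math.floor((clipped - G_MIN_DB) / STEP_DB)
    # An input of exactly 77 dB would floor to 32, so cap at the top index.
    return min(index, N_LEVELS - 1)
```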
In the embodiment of step (5), the candidate index interval ψ is selected with the value l obtained in step (4) as its center, where
ψ=[l-2,l+2]
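A small sketch of the candidate interval follows. The text does not say what happens when l is within two steps of either end of the 5-bit range; clipping the interval to valid indices 0 to 31 is an assumption, as is the function name.

```python
def candidate_indices(center, radius=2, n_levels=32):
    """Indices in [center - radius, center + radius], clipped to 0..n_levels-1.

    Assumption: indices outside the 5-bit range are simply dropped.
    """
    low = max(center - radius, 0)
    high = min(center + radius, n_levels - 1)
    return list(range(low, high + 1))
```

Away from the edges this yields five candidates, so the joint search evaluates at most 5 × 8 = 40 index pairs per superframe with the eight-entry codebook of step (8).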
In the embodiment of step (6), each index $l_m$ in the interval ψ is inverse scalar quantized. The inverse-quantized value $\hat{g}_{n,l_m}$ is obtained from:
$$\hat{g}_{n,l_m} = \frac{77-10}{32}\, l_m + 10$$
In the embodiment of step (7), the energy parameter of each subframe of the current superframe obtained in step (2) is divided by the value $\hat{g}_{n,l_m}$ from step (6) to obtain the normalized vector $(g'_{n,1}, g'_{n,2}, g'_{n,3})$, where
$$g'_{n,i} = g_{n,i} / \hat{g}_{n,l_m}$$
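Steps (6) and (7) translate almost directly into code. The sketch below uses the inverse-quantization formula exactly as given; the function names are hypothetical.

```python
def dequantize_index(index):
    """Inverse scalar quantization: (77 - 10) / 32 * l_m + 10, in dB."""
    return (77.0 - 10.0) / 32.0 * index + 10.0


def normalize_energies(energies, index):
    """Divide each subframe energy by the inverse-quantized gain for `index`."""
    gain = dequantize_index(index)
    return [g / gain for g in energies]
```

For index 16 the gain is 43.5 dB, so subframe energies of 43.5, 87.0 and 21.75 normalize to 1.0, 2.0 and 0.5.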
In the embodiment of step (8), the normalized vector obtained in step (7) is vector quantized. The vector quantization codebook is given in the following table:
Quantization index k    Codeword vector g_k
0                       0.770791, 0.856724, 1.373817
1                       0.726911, 1.055533, 1.217957
2                       1.216932, 0.911708, 0.873279
3                       0.881003, 1.042607, 1.076917
4                       1.159534, 1.037779, 0.803366
5                       1.061736, 1.003369, 0.935548
6                       0.939219, 0.911485, 1.149099
7                       0.998837, 0.999830, 1.002373
The search criterion for vector quantization is to minimize the error $E_{l_m,k}$, which is computed as:
$$E_{l_m,k} = \sum_{i=1}^{3}\left(g_{n,i} - \hat{g}_{n,l_m}\, g''_{k,i}\right)^2$$
where $g''_{k,i}$ is the i-th component of the codeword vector with index k in the quantization codebook, k = 0, ..., 7 (the eight entries of the table above), i = 1, ..., 3.
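For one fixed candidate gain, the full codebook search can be sketched as follows. The codebook values are copied from the table in step (8); the function name and the choice to return the error alongside the index are illustrative assumptions.

```python
# Codebook from the table in step (8); the list position is the index k.
CODEBOOK = [
    (0.770791, 0.856724, 1.373817),
    (0.726911, 1.055533, 1.217957),
    (1.216932, 0.911708, 0.873279),
    (0.881003, 1.042607, 1.076917),
    (1.159534, 1.037779, 0.803366),
    (1.061736, 1.003369, 0.935548),
    (0.939219, 0.911485, 1.149099),
    (0.998837, 0.999830, 1.002373),
]


def vq_full_search(energies, gain):
    """Return (k, error) minimizing sum_i (g_i - gain * codeword[i])**2."""
    best_k, best_err = None, float("inf")
    for k, codeword in enumerate(CODEBOOK):
        err = sum((g - gain * c) ** 2 for g, c in zip(energies, codeword))
        if err < best_err:
            best_k, best_err = k, err
    return best_k, best_err
```

Note that the error is measured on the unnormalized energies, as in the formula for $E_{l_m,k}$, so errors obtained for different candidate gains are directly comparable.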
In the embodiment of step (9), the calculations of steps (7) and (8) are carried out for every value in the candidate index interval ψ, and the $l_m$ and k that minimize $E_{l_m,k}$ are taken as the final quantization result for the energy parameter.
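Putting steps (3) to (9) together gives the joint search below. It is a self-contained sketch under stated assumptions: the forward quantizer is the assumed inverse of the step (6) formula, the candidate interval is clipped to indices 0 to 31, and all names are hypothetical. The input is the list of three subframe energies in dB.

```python
import math

# Codebook from the table in step (8); the list position is the index k.
CODEBOOK = [
    (0.770791, 0.856724, 1.373817),
    (0.726911, 1.055533, 1.217957),
    (1.216932, 0.911708, 0.873279),
    (0.881003, 1.042607, 1.076917),
    (1.159534, 1.037779, 0.803366),
    (1.061736, 1.003369, 0.935548),
    (0.939219, 0.911485, 1.149099),
    (0.998837, 0.999830, 1.002373),
]
G_MIN_DB, G_MAX_DB, N_LEVELS = 10.0, 77.0, 32
STEP_DB = (G_MAX_DB - G_MIN_DB) / N_LEVELS


def dequantize_index(index):
    """Inverse scalar quantization from step (6)."""
    return STEP_DB * index + G_MIN_DB


def joint_search(energies):
    """Return (center, l_m, k, error) for one superframe's energies."""
    # Steps (3)-(4): mean energy and its (assumed) uniform quantization index.
    mean_db = sum(energies) / len(energies)
    clipped = min(max(mean_db, G_MIN_DB), G_MAX_DB)
    center = min(math.floor((clipped - G_MIN_DB) / STEP_DB), N_LEVELS - 1)
    # Steps (5)-(9): try every candidate gain with every codeword.
    best_l, best_k, best_err = None, None, float("inf")
    for l_m in range(max(center - 2, 0), min(center + 2, N_LEVELS - 1) + 1):
        gain = dequantize_index(l_m)
        for k, codeword in enumerate(CODEBOOK):
            err = sum((g - gain * c) ** 2 for g, c in zip(energies, codeword))
            if err < best_err:
                best_l, best_k, best_err = l_m, k, err
    return center, best_l, best_k, best_err
```

Under these assumptions, three equal energies of 51.8 dB give a center index of 19, yet the search selects l_m = 20 with k = 7, because the reconstruction level for index 20 (51.875 dB) lies closer to the input than the level for index 19.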

Claims (3)

1. A joint search method for joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder, characterized in that the method is implemented in a digital integrated circuit and comprises the following steps in order:
Step (1): Divide the input speech samples into subframes, in time order, at a set time interval, wherein the speech samples are obtained by sampling at a set frequency and high-pass filtering to remove power-line interference;
Step (2): Group every three adjacent subframes obtained in step (1) into a superframe, and compute the energy parameter $g_{n,i}$ of each subframe in the current superframe:
$$g_{n,i} = \log\left(\frac{1}{L_i}\sum_{L_i} s(n)^2\right)$$
where n is the index of the current superframe; i is the index of the subframe within the current superframe, i = 1, 2, 3; $L_i$ is the length of the window over which the energy parameter of subframe i is computed; and s(n) is the speech signal within that window;
Step (3): Compute the mean $\bar{g}_n$ of the subframe energy parameters in the current superframe:
$$\bar{g}_n = \frac{1}{3}\sum_{i=1}^{3} g_{n,i}$$
Step (4): Apply uniform scalar quantization to the mean energy parameter $\bar{g}_n$ from step (3), with a quantization range of 10 dB to 77 dB and 5 bits, to obtain the scalar quantization index l:
[Equation image A2007100654020002C3 not reproduced in the source text]
where $\lfloor\cdot\rfloor$ denotes rounding down;
Step (5): Taking the index l from step (4) as the center, determine the interval ψ of candidate scalar quantization indices:
ψ=[l-2,l+2];
Step (6): For each index $l_m$ in the candidate interval from step (5), compute its inverse scalar quantized value $\hat{g}_{n,l_m}$:
$$\hat{g}_{n,l_m} = \frac{77-10}{32}\, l_m + 10$$
where $l_m$ is a value in the index interval ψ;
Step (7): Divide each subframe energy parameter $g_{n,i}$ of the current superframe from step (2) by the value $\hat{g}_{n,l_m}$ from step (6) to obtain the normalized vector $(g'_{n,1}, g'_{n,2}, g'_{n,3})$, where
$$g'_{n,i} = g_{n,i} / \hat{g}_{n,l_m}$$
Step (8): Vector quantize the vector from step (7) with a preset vector quantization codebook, using a full search to find the codebook index k that minimizes the error
$$E_{l_m,k} = \sum_{i=1}^{3}\left(g_{n,i} - \hat{g}_{n,l_m}\, g''_{k,i}\right)^2, \qquad k = \arg\min_k E_{l_m,k}$$
where $g''_{k,i}$ is the i-th component of the three-dimensional codeword vector with index k in the preset quantization codebook, i = 1, 2, 3;
Step (9): Carry out the calculations of steps (7) and (8) for every $l_m$ in the candidate interval ψ, and take the $l_m$ and k that minimize $E_{l_m,k}$ as the final quantization result for the energy parameter.
2. The joint search method for joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder according to claim 1, characterized in that the number of subframes in the superframe is greater than or equal to 2.
3. The joint search method for joint gain-shape quantization of the energy parameter in a 0.6 kb/s vocoder according to claim 2, characterized in that, in step (5), a candidate interval ψ is chosen centered on the index initially obtained by uniform quantization.
CNA200710065402XA 2007-04-13 2007-04-13 Combined method for searching energy parameter gain shape quantization of 0.6kb/s voice coder Pending CN101030376A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA200710065402XA CN101030376A (en) 2007-04-13 2007-04-13 Combined method for searching energy parameter gain shape quantization of 0.6kb/s voice coder


Publications (1)

Publication Number Publication Date
CN101030376A true CN101030376A (en) 2007-09-05

Family

ID=38715678

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA200710065402XA Pending CN101030376A (en) 2007-04-13 2007-04-13 Combined method for searching energy parameter gain shape quantization of 0.6kb/s voice coder

Country Status (1)

Country Link
CN (1) CN101030376A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009140877A1 (en) * 2008-05-19 2009-11-26 华为技术有限公司 Method and apparatus of vector normalizing and decoding method and apparatus of vector normalizing
CN104620315A (en) * 2012-07-12 2015-05-13 诺基亚公司 Vector quantization
CN106847296A (en) * 2012-07-12 2017-06-13 诺基亚技术有限公司 Vector quantization
CN104620315B (en) * 2012-07-12 2018-04-13 诺基亚技术有限公司 A kind of method and device of vector quantization
US10665247B2 (en) 2012-07-12 2020-05-26 Nokia Technologies Oy Vector quantization
CN106847296B (en) * 2012-07-12 2021-01-22 诺基亚技术有限公司 Vector quantization

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20070905