KR100550002B1

KR100550002B1 - Apparatus for searching an adaptive codebook in speech coder and method thereof

Info

Publication number: KR100550002B1
Application number: KR1020030047453A
Authority: KR
Inventors: 성종모; 김현우; 김도영; 정성교; 박영철; 윤성완; 김경태; 윤대희
Original assignee: 학교법인연세대학교; 한국전자통신연구원
Priority date: 2003-07-11
Filing date: 2003-07-11
Publication date: 2006-02-08
Also published as: KR20050007852A

Abstract

음성부호화기에서 적응 코드북 검색기 및 그 방법이 개시된다. 상기 과제를 이루기 위해, G.723.1 음성 부호화기에서 본 발명에 따른 적응 코드북 검색기는 1차 피치 예측을 통해 적응 코드북 목표신호와의 오차가 최소인 폐-루프 피치 지연과 이득을 검색하는 1단계 검색기 및 1단계 검색기에서 검색된 이득을 이용하여 이득 테이블을 선택하고, 5차 피치 예측을 통해 적응 코드북 목표신호와의 오차를 최소로 하는 이득을 선택된 이득 테이블에서 검색하는 2단계 검색기를 포함하며, 1단계 검색기에서 검색된 폐-루프 피치 지연과 2단계 검색기에서 검색된 피치 이득을 적응 코드북 검색 결과로 하는 것을 특징으로 하며, 종래와 비슷한 성능을 가지면서 계산량을 줄임으로써, 효율적인 음성 부호화기를 구현 가능할 수 있다. 또한, 단일 채널 시스템뿐만 아니라 다채널 음성 저장 시스템이나 다채널 통화를 지원하는 게이트웨이와 같은 시스템을 보다 효과적으로 구현할 수 있다. An adaptive codebook searcher and its method in a speech encoder are disclosed. In order to achieve the above object, in the G.723.1 speech coder, the adaptive codebook searcher according to the present invention is a first-stage searcher for searching the closed-loop pitch delay and the gain having the minimum error with the adaptive codebook target signal through the first pitch prediction. A first stage searcher that selects a gain table using the gains retrieved from the first stage searcher, and searches the selected gain table for a gain that minimizes the error with the adaptive codebook target signal through the fifth-order pitch prediction; It is characterized in that the closed-loop pitch delay retrieved from and the pitch gain retrieved from the two-stage searcher are adaptive codebook search results, and the efficient speech coder can be implemented by reducing the calculation amount while having a similar performance as in the related art. In addition, it is possible to implement not only a single channel system but also a system such as a multichannel voice storage system or a gateway supporting multichannel calls.

Description

음성부호화기에서 적응 코드북 검색기 및 그 방법{Apparatus for searching an adaptive codebook in speech coder and method thereof}Apparatus for searching an adaptive codebook in speech coder and method

도 1은 종래 방식에 따른 G.723.1 음성 코덱에서 적응 코드북 검색기를 나타내는 블록도이다. 1 is a block diagram illustrating an adaptive codebook searcher in a conventional G.723.1 speech codec.

도 2는 G.723.1 음성 부호화기에서 본 발명에 따른 고속 적응 코드북 검색기의 일실시예를 나타내는 블록도이다. 2 is a block diagram illustrating an embodiment of a fast adaptive codebook searcher according to the present invention in a G.723.1 speech coder.

도 3은 도 2의 장치에서 수행되는 G.723.1 부호화기의 적응 코드북 검색 방법을 나타낸 순서도이다. FIG. 3 is a flowchart illustrating an adaptive codebook search method of a G.723.1 encoder performed in the apparatus of FIG. 2.

본 발명은 음성 부호화기에 관한 것으로, 특히, 음성 부호화기에서 적응 코드북 검색 방법에 관한 것이다. The present invention relates to a speech coder, and more particularly, to a method for adaptive codebook retrieval in a speech coder.

ITU-T G.723.1 음성 부호화기는 낮은 전송률로 멀티미디어 통신을 위한 음성부호화 표준안이며, 최근 주목을 받고 있는 VoIP(Voice over Internet Protocol) 응용에서 많이 사용되고 있다. ITU-T G.723.1 음성 부호화기는 시스템 구현의 유연성을 위해 5.3 kbps와 6.3 kbps의 두 가지 전송률을 지원하며, 낮은 전송률 음성 부호화 알고리즘의 주된 방식인 CELP (Code-Excited Linear Prediction) 계열의 음성부호화기이다. 부호화 과정은 음성 스펙트럼의 포락선을 모델링하는 선형예측분석 과정과 피치 정보를 모델링하기 위한 적응 코드북 과정 및 그 잔여 신호를 위한 고정 코드북 검색 과정으로 이루어진다. The ITU-T G.723.1 speech coder is a voice coding standard for multimedia communication at low data rates, and is widely used in voice over internet protocol (VoIP) applications, which are recently attracting attention. The ITU-T G.723.1 speech coder supports two rates of 5.3 kbps and 6.3 kbps for flexibility in system implementation, and is a code-excited linear prediction (CELP) family of speech coders that is the main method of low rate speech coding algorithms. . The encoding process consists of a linear predictive analysis process for modeling the envelope of the speech spectrum, an adaptive codebook process for modeling pitch information, and a fixed codebook search process for the residual signal.

적응 코드북 검색 과정은 일반적인 CELP 알고리즘과 같이 합성에 의한 분석(analysis-by-synthesis)에 기반하며, 일반적으로 부호화기가 제공하는 음질에 가장 큰 영향을 미치는 것으로 알려져 있다. 상기 G.723.1 표준안에서 음질의 향상을 위해 5차 피치 예측기를 사용하며, 효율적인 적응 코드북 검색을 위해 개회로 검색과 폐회로 검색의 두 단계로 구성되며, 개회로 검색은 일반적으로 널리 이용되는 자기상관 함수를 이용한다. The adaptive codebook retrieval process is based on analysis-by-synthesis like the general CELP algorithm and is generally known to have the greatest influence on the sound quality provided by the encoder. In the G.723.1 standard, the fifth-order pitch predictor is used to improve the sound quality, and it consists of two stages of open loop search and closed loop search for efficient adaptive codebook search. Use

도 1은 종래 방식에 따른 G.723.1 음성 코덱에서 적응 코드북 검색기를 나타내는 블록도로서, 적응 코드북 검색기는 5차의 피치 예측기(101), LPC(Linear Predictive Coding) 음성 합성기(102), 지각가중필터(103) 및 오류 계산기(104)로 구성된다. FIG. 1 is a block diagram illustrating an adaptive codebook searcher in a conventional G.723.1 speech codec. The adaptive codebook searcher is a fifth order pitch predictor 101, a linear predictive coding (LPC) speech synthesizer 102, and a perceptual weight filter. 103 and an error calculator 104.

도 1의 적응 코드북 검색기는 추정된 개회로 피치에 의해서 결정된 검색 범위에 대해서, 5차의 피치 예측기(101)를 이용하여 폐회로 피치를 검색한다. 5차 피치 예측기(101)는 "0"으로 구성된 여기신호를 입력으로 해서 피치 정보를 포함하는 여기신호를 합성하는 기능을 수행하며, 5차 피치 예측기(101)은 다음 수학식 1의 형태를 갖는다.The adaptive codebook retrieval of Fig. 1 uses the fifth-order pitch predictor 101 to retrieve the closed loop pitch for the search range determined by the estimated open loop pitch. The fifth-order pitch predictor 101 performs a function of synthesizing an excitation signal including pitch information by inputting an excitation signal composed of "0", and the fifth-order pitch predictor 101 has the form of the following equation (1). .

여기서, G_i는 이득이고, L은 피치 지연이다. Where G _i is the gain and L is the pitch delay.

LPC 음성 합성기(102)는 다음 수학식 2와 같은 형태를 가지며, 5차 피치 예측기(101)의 출력인 여기신호를 입력으로 해서 음성을 합성하는 기능을 수행한다.The LPC speech synthesizer 102 has a form as shown in Equation 2 below, and performs a function of synthesizing speech by using an excitation signal, which is the output of the fifth-order pitch predictor 101, as an input.

지각가중필터(103)는 다음 수학식 3과 같은 형태를 가지며, LPC 음성 합성기(102)의 출력인 합성된 음성과 목표 신호를 비교할 때, 사람의 청각기관의 특징을 이용하도록 하는 기능을 수행한다. The perceptual weight filter 103 has a form as shown in Equation 3 below, and performs a function of using the characteristics of the human auditory organ when comparing the synthesized speech output from the LPC speech synthesizer 102 with a target signal. .

여기서, a_j는 LPC 계수이고, r₁과 r₂는 가중치 값들이다.Where a _j is an LPC coefficient and r ₁ and r ₂ are weight values.

마지막으로, 오류 계산기(104)는 지각가중된 음성과 적응코드북 목표 신호간의 오류를 계산하여 최적의 적응 코드북 지연과 해당 이득을 검색하는 기능을 수행한다. G.723.1의 한 프레임은 4 개의 부-프레임으로 구성되며, 각 부-프레임에 대해서 적응 코드북 검색 과정을 수행한다. 표준안에 따라서, 첫 번째와 세 번째 부- 프레임은 개회로 피치의 -1, 0, 1 샘플 범위에서 폐회로 피치를 찾으며 두 번째와 네 번째 부-프레임에서 이전 부프레임의 양자화된 피치와 -1, 0, 1, 2 샘플 차이가 나는 값들 중에서 찾는다. 적응 코드북 검색은 5차 피치 예측기(101)의 각 지연과 해당 지연에 대한 이득 값을 구하는 과정으로, 다음 수학식 4와 같이 목적 신호 t(n)와 적응 코드북에 의해 합성된 신호

의 차이가 최소가 되는 각 이득과 지연을 구하게 된다. Finally, the error calculator 104 calculates an error between the perceptually weighted speech and the adaptive codebook target signal to retrieve the optimal adaptive codebook delay and the corresponding gain. One frame of G.723.1 consists of four sub-frames, and performs an adaptive codebook search process for each sub-frame. According to the standard, the first and third sub-frames find closed-loop pitch in the range of -1, 0, 1 samples of the open loop pitch, and the quantized pitch of the previous subframe and -1, in the second and fourth sub-frames. Look for values that differ by 0, 1, or 2 samples. The adaptive codebook search is a process of obtaining each delay of the fifth-order pitch predictor 101 and a gain value for the corresponding delay. A signal synthesized by the objective signal t (n) and the adaptive codebook as shown in Equation 4 below.

Each gain and delay that minimizes the difference between is obtained.

여기서, N은 부-프레임의 크기이고, l은 폐-루프 피치 검색을 위한 후보 값이며, 해당 범위는 상기에서 기술된 바와 같이 부-프레임에 따라 개회로 피치 주변 혹은 이전 부-프레임의 폐-루프 피치 주변에 해당한다. Gi는 5차 피치 예측기의 각 이득 값들이다. 수학식 4를 최소화시키는 과정은 다음 수학식 5를 최대화시키는 지연과 이득을 찾는 과정과 동일하다. Where N is the size of the sub-frame, l is the candidate value for the closed-loop pitch search, and the range is around the open circuit pitch or according to the sub-frame as described above. Corresponds to loop pitch. Gi are the respective gain values of the fifth-order pitch predictor. The process of minimizing Equation 4 is the same as finding a delay and gain that maximizes Equation 5 below.

G.723.1의 종래기술에 의한 구현에서는, 피치 예측기의 이득 Gi 5개(수학식 5의 첫 번째 항에서 i=-2,-1,0,1,2 각각에 대한 이득)를 비롯하여 Gi·Gi 5개(수학식 5에서 두 번째 항에 대응) 및 Gi·Gj 10개(수학식 5의 마지막 항에 대응되며, i=-1, 0, 1, 2와 j=-2, ..., i-1을 조합하면 모두 10가지 경우가 나옴) 등, 20개의 이득 값들을 이득 테이블에 미리 저장된 값 중에서 검색한다. 높은 전송률(6.3 kbps)인 경우에는 85 혹은 170 개의 항목을 갖는 이득 테이블을 검색하고, 낮은 전송률(5.3 kbps)에서는 170개의 항목을 갖는 이득 테이블만을 검색한다. 이러한 검색 과정은 모두 부-프레임 단위로 이루어지며 부-프레임의 위치에 따라 3 혹은 4 개의 피치 후보값에 대해서 이득 테이블 검색이 수행된다. 따라서, 30ms 크기의 G.723.1 프레임에 대해서 수학식 2를 구하기 위해 170 혹은 85개의 항목을 갖는 이득 테이블을 14회(= 3+4+3+4) 검색해야한다. 따라서, 종래 방식에 따라 상기 수학식 5와 같은 범주(criterion)를 이용하여 피치와 해당 이득을 계산하고자 할 때 과도한 이득 테이블 검색량으로 인해, 음성 부호화기 구현 시에서 단점이 된다. In the prior art implementation of G.723.1, Gi · Gi including the gain Gi of the pitch predictor (gain for each of i = -2, -1,0,1,2 in the first term of Equation 5). 5 (corresponding to the second term in Equation 5) and 10 Gi · Gj (corresponding to the last term in Equation 5, i = -1, 0, 1, 2 and j = -2, ..., 20 combinations of i-1 are retrieved from the values previously stored in the gain table. At high data rates (6.3 kbps) we search for a gain table with 85 or 170 entries, and at low data rates (5.3 kbps) we only search for gain tables with 170 entries. All of these search processes are performed in sub-frame units, and a gain table search is performed on three or four pitch candidate values according to the positions of the sub-frames. Therefore, in order to obtain Equation 2 for a 30ms G.723.1 frame, a gain table having 170 or 85 items must be searched 14 times (= 3 + 4 + 3 + 4). Therefore, when the pitch and the corresponding gain are to be calculated by using a criterion as shown in Equation 5 according to the conventional method, it is a disadvantage in implementing the voice coder.

본 발명이 이루고자 하는 기술적 과제는 G.723.1 음성 부호화기에서 음질을 저하시키지 않으면서도 계산량을 감소하여 효과적으로 음성 부호화기를 구성할 수 있도록 적응 코드북을 검색할 수 있는 적응 코드북 검색기 및 그 방법을 제공하는 데 있다.SUMMARY OF THE INVENTION The present invention provides an adaptive codebook finder and a method for searching for adaptive codebooks so that the G.723.1 speech coder can effectively construct a speech coder by reducing the amount of computation without degrading sound quality. have.

본 발명이 이루고자 하는 다른 기술적 과제는 상기 적응 코드북 검색 방법을 컴퓨터에서 실행 가능한 프로그램 코드로 기록된 기록 매체를 제공하는 데 있다. Another object of the present invention is to provide a recording medium in which the adaptive codebook retrieval method is recorded as program code executable on a computer.

상기 과제를 이루기 위해, G.723.1 음성 부호화기에서 본 발명에 따른 적응 코드북 검색기는 1차 피치 예측을 통해 적응 코드북 목표신호와의 오차가 최소인 폐-루프 피치 지연과 이득을 검색하는 1단계 검색기 및 1단계 검색기에서 검색된 이득을 이용하여 이득 테이블을 선택하고, 5차 피치 예측을 통해 적응 코드북 목표신호와의 오차를 최소로 하는 이득을 선택된 이득 테이블에서 검색하는 2단계 검색기를 포함하며, 1단계 검색기에서 검색된 폐-루프 피치 지연과 2단계 검색기에서 검색된 피치 이득을 적응 코드북 검색 결과로 하는 것이 바람직하다. In order to achieve the above object, in the G.723.1 speech coder, the adaptive codebook searcher according to the present invention is a first-stage searcher for searching the closed-loop pitch delay and the gain having the minimum error with the adaptive codebook target signal through the first pitch prediction. A first stage searcher that selects a gain table using the gains retrieved from the first stage searcher, and searches the selected gain table for a gain that minimizes the error with the adaptive codebook target signal through the fifth-order pitch prediction; It is preferable to use the closed-loop pitch delay found in and the pitch gain searched in the two-stage searcher as adaptive codebook search results.

상기 과제를 이루기 위해, G.723.1 음성 부호화기에서 본 발명에 따른 적응 코드북 검색 방법은 개-루프 피치에 의해서 결정되는 폐-루프 피치 지연 검색 범위를 결정하는 (a)단계, 상기 (a)단계에서 결정된 폐-루프 피치 검색 범위에 해당하는 폐-루프 피치 지연 후보들 각각에 대해서, 다음 수학식

을 계산하는 (b)단계, (b)단계의 수학식 계산값이 최대가 되는 폐-루프 지연을 선택하 고, 해당 이득을 계산하는 (c)단계, (c)단계에서 계산된 이득에 따라, 이득 테이블을 선택하는 (d)단계, (d)단계에서 선택된 항목들 각각에 대해서 다음 수학식In order to achieve the above object, in the G.723.1 speech coder, the adaptive codebook retrieval method according to the present invention comprises the steps of (a) and (a) determining a closed-loop pitch delay retrieval range determined by an open-loop pitch. For each of the closed-loop pitch delay candidates corresponding to the determined closed-loop pitch search range, the following equation

In accordance with the gains calculated in steps (c) and (c), the method selects the closed-loop delay that maximizes the equation calculated in steps (b) and (b). For each of the items selected in (d) and (d) of selecting a gain table,

을 계산하는 (e)단계 및 상기 (e)단계에서 계산된 값을 최소로 하는 이득 테이블의 항목을 피치 이득으로서 선택하는 (f)단계를 포함하는 것이 바람직하다. It is preferable to include the step (e) of calculating a and a step (f) of selecting an item of a gain table that minimizes the value calculated in the step (e) as the pitch gain.

이하, 본 발명에 따른 음성 부호화기의 적응 코드북 검색기 및 그 방법을 첨부한 도면들을 참조하여 다음과 같이 설명한다. Hereinafter, an adaptive codebook searcher and a method thereof of a speech coder according to the present invention will be described with reference to the accompanying drawings.

도 2는 G.723.1 음성 부호화기에서 본 발명에 따른 적응 코드북 검색기의 일실시예를 나타내는 블록도이다. 본 발명에 따른 적응 코드북 검색기는 1차 피치 예측기(201), LPC 음성 합성기(202), 지각가중필터(203) 및 오류 계산기(204)를 포함하는 1단계 검색기(210)와, 5차 피치 예측기(205), LPC 음성 합성기(206), 지각가중필터(207) 및 오류 계산기(208)를 포함하는 2단계 검색기(220)를 포함하여 구성된다. 2 is a block diagram illustrating an embodiment of an adaptive codebook searcher according to the present invention in a G.723.1 speech coder. The adaptive codebook searcher according to the present invention comprises a first-stage searcher 210 including a first-order pitch predictor 201, an LPC speech synthesizer 202, a perceptual weight filter 203, and an error calculator 204, and a fifth-order pitch predictor. 205, LPC speech synthesizer 206, perceptual weighting filter 207, and error calculator 208.

도 2를 참조하여, 1단계 검색기(210)는 1차 피치 예측을 통해 적응 코드북 목표신호(t)와의 오차가 최소인 폐-루프 피치 지연과 이득을 선택한다. 구체적으로, 1차 피치 예측기(201)는 다음 수학식 6과 같은 형태를 가진다.Referring to FIG. 2, the first stage searcher 210 selects a closed-loop pitch delay and a gain having a minimum error with the adaptive codebook target signal t through first-order pitch prediction. Specifically, the first order pitch predictor 201 has a form as shown in Equation 6 below.

여기서, g는 이득이고, L은 피치 지연을 의미한다. 1차 피치 예측기(201)는 "0"으로 구성된 신호를 입력으로 하고(zero-input), 검색 범위내의 피치 후보에 의해서 과거 프레임의 여기신호로부터 현재 부-프레임에 해당하는 여기신호를 생성한다. LPC 음성 합성기(202)는 1차 피치 예측기(201)에서 생성된 여기신호를 입력으로 해서 음성을 합성한다. 지각가중필터는(203)은 LPC 음성 합성기(202)의 출력인 합성된 음성과 목표 신호를 비교할 때, 사람의 청각기관의 특징을 이용하도록 하는 기능을 수행한다. 마지막으로, 오류 계산기(204)는 지각가중된 음성과 적응 코드북 목표 신호간의 오류를 계산하여 최적의 적응 코드북 지연과 해당 이득을 검색하는 기능을 수행한다. Where g is gain and L means pitch delay. The primary pitch predictor 201 inputs a signal composed of "0" (zero-input) and generates an excitation signal corresponding to the current sub-frame from the excitation signal of the past frame by the pitch candidate within the search range. The LPC speech synthesizer 202 synthesizes speech by using the excitation signal generated by the primary pitch predictor 201 as an input. The perceptual weighting filter 203 performs a function of using the characteristics of the human auditory organ when comparing the synthesized speech, which is the output of the LPC speech synthesizer 202, with the target signal. Finally, the error calculator 204 calculates the error between the perceptually weighted speech and the adaptive codebook target signal to retrieve the optimal adaptive codebook delay and the corresponding gain.

2단계 검색기(220)는 1단계 검색기(210)에서 계산된 폐-루프 피치와 1차 피치 예측기(201)의 이득을 이용해서 5차 피치 예측기(205)의 이득을 검색한다. 구체적으로, 5차 피치 예측기(205)는 최초에는 1단계 검색기(210)에서 계산된 폐-루프 피치를 이용하여 과거 프레임의 여기 신호로부터 현재 부-프레임에 해당하는 여기신호를 생성한다. LPC 음성 합성기(206)는 5차 피치 예측기(205)에서 생성된 여기신호를 입력으로 해서 음성을 합성한다. 지각가중필터는(207)은 LPC 음성 합성기(206)의 출력인 합성된 음성과 목표 신호를 비교할 때, 사람의 청각기관의 특징을 이용하도록 하는 기능을 수행한다. 마지막으로, 오류 계산기(208)는 지각가중된 음성과 적응 코드북 목표 신호간의 오류를 계산하여 오류를 최소화하는 5차 피치 예측기(205)의 이득을 검색한다. The second stage searcher 220 retrieves the gain of the fifth-order pitch predictor 205 using the closed-loop pitch calculated by the first stage searcher 210 and the gain of the first-order pitch predictor 201. Specifically, the fifth-order pitch predictor 205 initially generates the excitation signal corresponding to the current sub-frame from the excitation signal of the past frame using the closed-loop pitch calculated by the first stage searcher 210. The LPC speech synthesizer 206 synthesizes speech using an excitation signal generated by the fifth-order pitch predictor 205 as an input. The perceptual weighting filter 207 performs a function of using the characteristics of the human auditory organ when comparing the target speech with the synthesized speech output of the LPC speech synthesizer 206. Finally, the error calculator 208 calculates the error between the perceptually weighted speech and the adaptive codebook target signal to retrieve the gain of the fifth order predictor 205 that minimizes the error.

다시 설명하면, 본 발명에서는 효율적인 피치 지연 검색을 위해 1차 피치 예측기(201)를 적용하여 검색한다. 이러한 1차 피치 예측기(201)를 이용한 지연 검색은 다음 수학식 7를 최소화하는 지연과 이득을 찾는 것이다.In other words, in the present invention, the first-order pitch predictor 201 is applied to search for an efficient pitch delay search. Delay search using the first-order pitch predictor 201 is to find the delay and gain to minimize the following equation (7).

여기서, l은 폐-루프 피치 지연 후보이며, g는 1차 피치 예측기의 이득이다. 수학식 7을 최소화시키는 폐-루프 피치는 다음 수학식 8을 최대화시키는 지연이다. 지연 검색 범위는 기존의 방법과 동일하다. 즉, 첫 번째와 세 번째 부-프레임에서는 개회로 피치 주변의 3개의 후보에 대해서 검색이 이루어지며 두 번째와 네 번째 부-프레임에 대해서는 이전 부-프레임의 폐회로 피치 주변의 4개의 후보에 대해 검색이 이루어진다. 그리고 1차 피치 예측기의 이득 g는 다음 수학식 9과 같이 구해진다. Where l is a closed-loop pitch delay candidate and g is the gain of the first order predictor. The closed-loop pitch that minimizes equation (7) is the delay that maximizes the following equation (8). The delay search range is the same as the conventional method. That is, the first and third sub-frames are searched for three candidates around the open loop pitch, and the second and fourth sub-frames are searched for four candidates around the closed loop pitch of the previous sub-frame. This is done. The gain g of the first-order pitch predictor is obtained as in Equation 9 below.

여기서, L _clp 는 수학식 8을 최대화시키는 폐-루프 피치이다. Where L _clp is the closed-loop pitch that maximizes equation (8).

이상에서와 같은 검색 과정을 통해, 1차 피치 예측기에 대한 폐-루프 예측기의 폐-루프 피치 지연이 결정되면, 이득 검색을 한다. 이득 검색은 기존의 5차 피치 예측기와 동일한 방법으로 검색한다. 보다 효율적인 검색을 위해 수학식 9의 1차 피치 예측 이득인 g에 따라 종래의 170개의 항목을 갖는 이득 테이블 중에서 미리 선택된 85개의 항목을 갖는 이득 테이블을 사용한다. 즉, 수학식 9에서 구해진 이득 값에 따라 이득 테이블 검색 범위를 종래 방법의 1/2로 제한한다. 그리고, 다음 수학식 10을 최소화시키는 이득 테이블 인덱스를 검색한다. When the closed-loop pitch delay of the closed-loop predictor with respect to the first-order pitch predictor is determined through the search process as described above, a gain search is performed. Gain search is performed in the same manner as the conventional fifth-order pitch predictor. For a more efficient search, a gain table having 85 items selected in advance from the conventional 170 gain table having a primary pitch prediction gain of equation 9 is used. That is, the gain table search range is limited to 1/2 of the conventional method according to the gain value obtained in Equation (9). Then, the gain table index for minimizing the following expression (10) is searched.

적응 코드북의 연산량은 수학식 5 혹은 수학식 10의 계산과정과 비례하게 된다. 즉, 종래에는 N_ACB(=170)개의 항목을 갖는 이득 코드북 검색이 부-프레임마다 3회 혹은 4회 이루어지므로 프레임마다 총 14회 검색을 수행해야 한다. 반면, 본 발 명에 따르면 미리 선택된

(=85)개의 항목을 갖는 이득 테이블을 부-프레임마다 1단계 검색기에서(210) 구해진 폐-루프 피치 지연에 대해서 1회만 검색하므로 G.723.1 프레임마다 총 4회의 검색 과정만을 요구한다. 결과적으로, 본 발명에서는 종래 기술과 대비하여, 이득 코드북 검색 회수를 1/7로(= (4 / 14) * (1 / 2)) 감소시킬 수 있다. The computation amount of the adaptive codebook is proportional to the calculation process of Equation 5 or Equation 10. That is, since a gain codebook search having N _ACB (= 170) items is conventionally performed three or four times for each sub-frame, a total of 14 searches must be performed for each frame. On the other hand, according to the present invention, the preselected

Since the gain table with (= 85) items is searched only once for the closed-loop pitch delay obtained by the one-step searcher 210 for each sub-frame, only four search processes are required for each G.723.1 frame. As a result, in the present invention, the number of gain codebook searches can be reduced to 1/7 (= (4/14) * (1/2)) as compared with the prior art.

도 2 및 도 3을 참조하여, 먼저 개-루프 피치(L_olp) 혹은 이전 부-프레임의 폐-루프 피치(L_clp)에 의해서 결정되는 폐-루프 피치 검색 범위를 [L_olp-1,L _olp+1](첫 번째 혹은 세 번째 부-프레임의 경우) 또는 [L_clp-1,L_clp+2]로(두 번째 혹은 네 번째 부-프레임의 경우) 결정한다(제301단계). Referring to FIGS. 2 and 3, first, the closed-loop pitch search range determined by the open-loop pitch L _olp or the closed-loop pitch L _clp of the previous sub-frame is determined by [L _olp -1, L]. _olp +1] (for the first or third sub-frame) or [L _clp -1, L _clp +2] (for the second or fourth sub-frame) (step 301).

제301단계에서 결정된 폐-루프 피치 검색 범위에 해당하는 폐-루프 피치 후보들 각각에 대해서 오류 계산기(204)에서 수학식 8을 반복 계산한다(제302 내지 305단계). Equation 8 is repeatedly calculated by the error calculator 204 for each of the closed-loop pitch candidates corresponding to the closed-loop pitch search range determined in step 301 (steps 302 to 305).

폐-루프 피치 후보들 각각에 대해 제302 내지 305단계를 반복 수행한 결과, 수학식 8의 값이 최대가 되는 폐-루프 지연을 검색한다(제306단계). As a result of repeating steps 302 to 305 for each of the closed-loop pitch candidates, a closed-loop delay at which the value of Equation 8 is maximized is searched (step 306).

제306단계에서 검색된 폐-루프 지연에 대해서 1차 피치 예측기(201)의 이득을 계산한다(제307단계). The gain of the first-order pitch predictor 201 is calculated with respect to the closed-loop delay found in step 306 (step 307).

제307단계에서 계산된 이득에 따라, 5차 피치 예측기(205)의 85항목들(entries)의 이득 테이블을 선택한다(제311단계). 전술된 바와 같이, 5차 피치 예측기(205)는 1차 피치 예측기(201)의 이득에 따라 종래의 170개의 항목을 갖는 이득 테이블 중 85개의 항목을 갖는 이득 테이블을 선택한다. In accordance with the gain calculated in step 307, a gain table of 85 entries of the fifth-order pitch predictor 205 is selected (step 311). As described above, the fifth-order pitch predictor 205 selects a gain table having 85 items from the conventional 170 gain tables with gains of the primary pitch predictor 201.

오류 계산기(208)는 제311단계에서 선택된 85항목들 각각에 대해서 반복적으로 수학식 10을 계산한다(제312 내지 315단계). The error calculator 208 repeatedly calculates Equation 10 for each of the 85 items selected in step 311 (steps 312 to 315).

선택된 85항목들 각각에 대해 제312 내지 315단계를 반복 수행한 결과, 수학식 10을 최소로 하는 이득 테이블의 항목을 선택하며, 이를 5차 피치 예측기(205)의 이득으로 선택한다(제316단계). As a result of repeating steps 312 to 315 for each of the selected 85 items, an item of a gain table that minimizes Equation 10 is selected, and this is selected as a gain of the fifth-order pitch predictor 205 (step 316). ).

이상에서와 같이, 본 발명에서는 G.723.1 음성 부호화기의 적응 코드북 검색 시, 1차 피치 예측기를 이용하여 피치 지연을 먼저 계산하고, 5차 피치 예측기를 이용하여 이득을 순차적으로 검색함으로써, 종래와 유사한 음질을 제공하면서 계산량을 줄일 수 있다. 즉, 우선적으로 5차 피치 예측기 대신에 1차 피치 예측기를 사용하여 폐회로 지연을 검색하고, 구해진 지연에 대해 170개의 항목을 갖는 이득 테이블 중에서 1차 피치 예측 이득에 의해 미리 선택된 85개의 항목을 갖는 이득 테이블을 사용하여 이득을 순차적으로 검색함으로써 계산량을 효과적으로 줄일 수 있다. As described above, in the present invention, when searching for an adaptive codebook of a G.723.1 speech coder, a pitch delay is first calculated using a first-order pitch predictor, and a gain is sequentially searched using a fifth-order pitch predictor, thereby similar to the conventional method. You can reduce the amount of computation while providing sound quality. That is, a first-order pitch predictor is used instead of the fifth-order pitch predictor to retrieve the closed loop delay, and a gain having 85 items previously selected by the first-order pitch prediction gain from the gain table having 170 items for the obtained delay. By using tables to retrieve the gains sequentially, we can effectively reduce the amount of computation.

본 발명은 또한 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한 다. 컴퓨터가 읽을 수 있는 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플라피디스크, 광데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드가 저장되고 실행될 수 있다.The invention can also be embodied as computer readable code on a computer readable recording medium. Computer-readable recording media include any type of recording device that stores data that can be read by a computer system. Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage, and the like, which are also implemented in the form of a carrier wave (for example, transmission over the Internet). It also includes. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.

이상 도면과 명세서에서 최적 실시예들이 개시되었다. 여기서 특정한 용어들이 사용되었으나, 이는 단지 본 발명을 설명하기 위한 목적에서 사용된 것이지 의미 한정이나 특허청구범위에 기재된 본 발명의 범위를 제한하기 위하여 사용된 것은 아니다. 그러므로 본 기술 분야의 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 타 실시예가 가능하다는 점을 이해할 것이다. 따라서, 본 발명의 진정한 기술적 보호 범위는 첨부된 특허청구범위의 기술적 사상에 의해 정해져야 할 것이다.The best embodiments have been disclosed in the drawings and specification above. Although specific terms have been used herein, they are used only for the purpose of describing the present invention and are not used to limit the scope of the present invention as defined in the meaning or claims. Therefore, those skilled in the art will understand that various modifications and equivalent other embodiments are possible from this. Therefore, the true technical protection scope of the present invention will be defined by the technical spirit of the appended claims.

상술한 바와 같이, 본 발명에 따른 적응 코드북 검색기 및 그 방법에 따르면 종래와 비슷한 성능을 가지면서 계산량을 줄임으로써, 효율적인 음성 부호화기를 구현 가능할 수 있다. 또한, 단일 채널 시스템뿐만 아니라 다채널 음성 저장 시스템이나 다채널 통화를 지원하는 게이트웨이와 같은 시스템을 보다 효과적으로 구현할 수 있다. As described above, according to the adaptive codebook searcher and the method according to the present invention, it is possible to implement an efficient speech coder by reducing the amount of computation while having a similar performance as in the related art. In addition, it is possible to implement not only a single channel system but also a system such as a multichannel voice storage system or a gateway supporting multichannel calls.

Claims

G.723.1 음성 부호화기의 적응 코드북 검색기에 있어서, In an adaptive codebook searcher of a G.723.1 speech coder,

1차 피치 예측을 통해 적응 코드북 목표신호와의 오차가 최소인 폐-루프 피치 지연과 이득을 검색하는 1단계 검색기; 및A first stage searcher for searching for a closed-loop pitch delay and a gain having a minimum error with an adaptive codebook target signal through first-order pitch prediction; And

상기 1단계 검색기에서 검색된 이득을 이용하여 이득 테이블을 선택하고, 5차 피치 예측을 통해 상기 적응 코드북 목표신호와의 오차를 최소로 하는 피치 이득을 선택된 이득 테이블에서 검색하는 2단계 검색기를 포함하며, Selecting a gain table using the gain retrieved by the first stage searcher, and a second stage searcher for searching the selected gain table for the pitch gain that minimizes the error with the adaptive codebook target signal through 5th pitch prediction,

상기 1단계 검색기에서 검색된 폐-루프 피치 지연과 상기 2단계 검색기에서 검색된 피치 이득을 적응 코드북 검색 결과로 하는 것을 특징으로 하는 적응 코드북 검색기. And a closed-loop pitch delay found by the first stage searcher and a pitch gain found by the second stage searcher as adaptive codebook search results.

제1항에 있어서, 상기 1단계 검색기는 The method of claim 1, wherein the first stage searcher

검색 범위내의 피치 후보에 의해서 과거 프레임의 여기신호로부터 현재 부-프레임에 해당하는 여기신호를 생성하는 1차 피치 예측기;A first-order pitch predictor for generating excitation signals corresponding to current sub-frames from excitation signals of past frames by pitch candidates within a search range;

상기 1차 피치 예측기에서 합성된 여기신호를 입력으로 해서 음성을 합성하는 LPC 음성 합성기; An LPC speech synthesizer configured to synthesize speech by inputting an excitation signal synthesized by the first pitch predictor;

상기 LPC 음성 합성기에서 합성된 음성을 사람의 청각기관의 특징이 반영되도록 필터링하는 지각가중필터; 및 A perceptual weighting filter for filtering the speech synthesized by the LPC speech synthesizer to reflect the characteristics of human hearing organs; And

지각가중된 음성과 상기 적응 코드북 목표 신호간의 오류를 계산하여 최적의 상기 폐-루프 피치 지연 및 해당 이득을 검색하는 오류 계산기를 포함하는 것을 특징으로 하는 적응 코드북 검색기. And an error calculator for calculating an error between the perceptually weighted speech and the adaptive codebook target signal to retrieve the optimal closed-loop pitch delay and corresponding gain.

제2항에 있어서, The method of claim 2,

상기 1단계 검색기는 다음 수학식 1을 최소로 하는 폐-루프 피치 지연과 이득을 검색하는 것을 특징으로 하는 적응 코드북 검색기. And the first stage searcher searches for a closed-loop pitch delay and a gain that minimizes Equation 1 below.

[수학식 1][Equation 1]

(여기서, l은 폐-루프 피치 지연 후보이며, N은 서브프레임의 크기이고, g는 1차 피치 예측기의 이득을 각각 나타낸다) (Where l is a closed-loop pitch delay candidate, N is the size of the subframe, and g is the gain of the primary pitch predictor, respectively)

제3항에 있어서, 상기 수학식 1을 최소로 하는 폐-루프 피치 지연은 다음 수학식 2를 최대로 하는 것임을 특징으로 하는 적응 코드북 검색기. 4. The adaptive codebook finder of claim 3, wherein a closed-loop pitch delay that minimizes Equation 1 maximizes Equation 2 below.

[수학식 2][Equation 2]

(여기서, N은 서브프레임의 크기이고, l은 폐-루프 피치 검색을 위한 후보값을 각각 나타낸다) (Where N is the size of the subframe and l represents the candidate value for closed-loop pitch search, respectively)

제4항에 있어서, 상기 수학식 1을 최소로 하는 상기 1차 피치 예측기의 이득은 다음 수학식 3에 의해 구해지는 것임을 특징으로 하는 적응 코드북 검색기. 5. The adaptive codebook finder of claim 4, wherein the gain of the first-order pitch predictor which minimizes the equation (1) is obtained by the following equation (3).

[수학식 3][Equation 3]

(여기서, L_clp는 상기 수학식 2를 최대로 하는 폐-루프 피치 지연을 나타냄)Where L _clp represents a closed-loop pitch delay that maximizes Equation 2 above

제1항에 있어서, 상기 2단계 검색기는 The method of claim 1, wherein the second stage searcher

상기 1단계 검색기에서 계산된 폐-루프 피치를 이용하여 과거 프레임의 여기 신호로부터 현재 부-프레임에 해당하는 여기신호를 생성하는 5차 피치 예측기;A fifth-order pitch predictor for generating an excitation signal corresponding to the current sub-frame from the excitation signal of the past frame by using the closed-loop pitch calculated by the first stage searcher;

상기 5차 피치 예측기에서 합성된 여기신호를 입력해서 음성을 합성하는 LPC 음성 합성기; An LPC speech synthesizer configured to synthesize speech by inputting an excitation signal synthesized by the fifth-order pitch predictor;

상기 LPC 음성 합성기에서 합성된 음성을 사람의 청각기관의 특징을 반영하여 필터링하는 지각가중필터; 및 A perceptual weighting filter for filtering the speech synthesized by the LPC speech synthesizer to reflect the characteristics of human hearing organs; And

지각가중된 음성과 적응 코드북 목표 신호간의 오류를 계산하여 오류를 최소화하는 상기 5차 피치 예측기의 이득을 검색하여 상기 5차 피치 예측기로 제공하는 오류 계산기를 포함하는 것을 특징으로 하는 적응 코드북 검색기. And an error calculator that calculates an error between the perceptually weighted speech and the adaptive codebook target signal and retrieves a gain of the fifth order predictor to minimize the error and provides the gain to the fifth order predictor.

제6항에 있어서, The method of claim 6,

상기 2단계 검색기는 다음 수학식을 최소로하는 이득 테이블의 항목을 피치 이득으로서 선택하는 것을 특징으로 하는 적응 코드 검색기.And the step 2 searcher selects an item of a gain table that minimizes the following equation as pitch gain.

(여기서, L_clp는 상기 (c)단계에서 선택된 폐-루프 지연을 나타내고, i와 j는 각 이득에 대한 인덱스이고, G_i는 이득을 각각 나타낸다)Where L _clp represents the closed-loop delay selected in step (c), i and j are indices for each gain, and G _i represents the gain, respectively.

G.723.1 음성 부호화기의 적응 코드북 검색 방법에 있어서, In the G.723.1 speech coder adaptive codebook search method,

(a)개-루프 피치에 의해서 결정되는 폐-루프 피치 지연 검색 범위를 결정하는 단계;(a) determining a closed-loop pitch delay search range determined by the open-loop pitch;

(b)상기 (a)단계에서 결정된 폐-루프 피치 검색 범위에 해당하는 폐-루프 피치 지연 후보들 각각에 대해서, 다음 수학식을 계산하는 단계;(b) calculating the following equation for each of the closed-loop pitch delay candidates corresponding to the closed-loop pitch search range determined in step (a);

(여기서, N은 서브프레임의 크기이고, l은 폐-루프 피치 검색을 위한 후보값이며, g는 1차 피치 예측의 이득을 각각 나타낸다) Where N is the size of the subframe, l is the candidate for closed-loop pitch search, and g is the gain of the first-order pitch prediction, respectively.

(c)상기 (b)단계의 수학식 계산값이 최대가 되는 폐-루프 지연을 선택하고, 해당 이득을 계산하는 단계;(c) selecting a closed-loop delay at which the mathematical calculation value of step (b) is maximized and calculating a corresponding gain;

(d)상기 (c)단계에서 계산된 이득에 따라, 이득 테이블을 선택하는 단계; (d) selecting a gain table according to the gain calculated in step (c);

(e)상기 (d)단계에서 선택된 항목들 각각에 대해서 다음 수학식을 계산하는 단계; 및 (e) calculating the following equation for each of the items selected in step (d); And

(f)상기 (e)단계에서 계산된 값을 최소로 하는 이득 테이블의 항목을 피치 이득으로서 선택하는 단계를 포함하는 것을 특징으로 하는 적응 코드북 검색 방법. (f) selecting an item of a gain table that minimizes the value calculated in step (e) as pitch gain.

제8항에 있어서, 상기 (a)단계의 폐-루프 피치 지연 검색 범위는The method of claim 8, wherein the closed-loop pitch delay search range of step (a) is

첫 번째와 세 번째 부-프레임에서는 개회로 피치 주변의 3개의 후보를 상기 폐-루프 피치 지연 검색 범위로 하며,In the first and third sub-frames, the three candidates around the open loop pitch are the closed-loop pitch delay search ranges,

두 번째와 네 번째 부-프레임에서는 이전 부-프레임의 폐회로 피치 주변의 4개의 후보를 상기 폐-루프 피치 지연 검색 범위로 각각 설정하는 것을 특징으로 하는 적응 코드북 검색 방법.And in the second and fourth sub-frames, four candidates around the closed loop pitch of the previous sub-frame are set to the closed-loop pitch delay search range, respectively.

제8항에 있어서, 상기 (c)단계의 이득 계산은 다음 수학식에 의해 계산되는 것을 특징으로 하는 적응 코드북 검색 방법.10. The method of claim 8, wherein the gain calculation of step (c) is calculated by the following equation.

제8항에 있어서, 상기 (d)단계에서 선택되는 이득 테이블은 85개의 항목을 갖는 이득 테이블인 것을 특징으로 하는 적응 코드북 검색 방법.9. The method of claim 8, wherein the gain table selected in step (d) is a gain table having 85 items.

제8항의 적응 코드북 검색 방법을 컴퓨터에서 실행 가능한 프로그램 코드로 기록된 기록 매체. A recording medium recorded with program code executable on a computer for the adaptive codebook retrieval method of claim 8.