CN100485337C - 用于对音频信号进行编码的编码模型的选择 - Google Patents
用于对音频信号进行编码的编码模型的选择 Download PDFInfo
- Publication number
- CN100485337C CN100485337C CNB200580015656XA CN200580015656A CN100485337C CN 100485337 C CN100485337 C CN 100485337C CN B200580015656X A CNB200580015656X A CN B200580015656XA CN 200580015656 A CN200580015656 A CN 200580015656A CN 100485337 C CN100485337 C CN 100485337C
- Authority
- CN
- China
- Prior art keywords
- encoding model
- audio content
- sound signal
- model
- encoding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 84
- 238000000034 method Methods 0.000 claims abstract description 42
- 238000011156 evaluation Methods 0.000 claims description 54
- 238000005457 optimization Methods 0.000 claims description 19
- 230000007704 transition Effects 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 10
- 238000010972 statistical evaluation Methods 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000005284 excitation Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007635 classification algorithm Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
Description
Claims (23)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/847,651 | 2004-05-17 | ||
US10/847,651 US7739120B2 (en) | 2004-05-17 | 2004-05-17 | Selection of coding models for encoding an audio signal |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101091108A CN101091108A (zh) | 2007-12-19 |
CN100485337C true CN100485337C (zh) | 2009-05-06 |
Family
ID=34962977
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB200580015656XA Active CN100485337C (zh) | 2004-05-17 | 2005-04-06 | 用于对音频信号进行编码的编码模型的选择 |
Country Status (17)
Country | Link |
---|---|
US (1) | US7739120B2 (zh) |
EP (1) | EP1747442B1 (zh) |
JP (1) | JP2008503783A (zh) |
KR (1) | KR20080083719A (zh) |
CN (1) | CN100485337C (zh) |
AT (1) | ATE479885T1 (zh) |
AU (1) | AU2005242993A1 (zh) |
BR (1) | BRPI0511150A (zh) |
CA (1) | CA2566353A1 (zh) |
DE (1) | DE602005023295D1 (zh) |
HK (1) | HK1110111A1 (zh) |
MX (1) | MXPA06012579A (zh) |
PE (1) | PE20060385A1 (zh) |
RU (1) | RU2006139795A (zh) |
TW (1) | TW200606815A (zh) |
WO (1) | WO2005111567A1 (zh) |
ZA (1) | ZA200609479B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107077858A (zh) * | 2014-07-28 | 2017-08-18 | 弗劳恩霍夫应用研究促进协会 | 使用具有全带隙填充的频域处理器以及时域处理器的音频编码器和解码器 |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE409937T1 (de) * | 2005-06-20 | 2008-10-15 | Telecom Italia Spa | Verfahren und vorrichtung zum senden von sprachdaten zu einer fernen einrichtung in einem verteilten spracherkennungssystem |
EP1984911A4 (en) * | 2006-01-18 | 2012-03-14 | Lg Electronics Inc | DEVICE AND METHOD FOR SIGNAL CODING AND DECODING |
RU2420816C2 (ru) * | 2006-02-24 | 2011-06-10 | Франс Телеком | Способ двоичного кодирования показателей квантования огибающей сигнала, способ декодирования огибающей сигнала и соответствующие модули кодирования и декодирования |
US9159333B2 (en) | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
KR101434198B1 (ko) * | 2006-11-17 | 2014-08-26 | 삼성전자주식회사 | 신호 복호화 방법 |
KR100964402B1 (ko) | 2006-12-14 | 2010-06-17 | 삼성전자주식회사 | 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치 |
US20080202042A1 (en) * | 2007-02-22 | 2008-08-28 | Azad Mesrobian | Drawworks and motor |
CA2691993C (en) * | 2007-06-11 | 2015-01-27 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding an audio signal having an impulse-like portion and stationary portion, encoding methods, decoder, decoding method, and encoded audio signal |
US9653088B2 (en) * | 2007-06-13 | 2017-05-16 | Qualcomm Incorporated | Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding |
RU2454736C2 (ru) * | 2007-10-15 | 2012-06-27 | ЭлДжи ЭЛЕКТРОНИКС ИНК. | Способ и устройство обработки сигнала |
CN101221766B (zh) * | 2008-01-23 | 2011-01-05 | 清华大学 | 音频编码器切换的方法 |
CA2729751C (en) | 2008-07-10 | 2017-10-24 | Voiceage Corporation | Device and method for quantizing and inverse quantizing lpc filters in a super-frame |
EP2144230A1 (en) | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
CA2871498C (en) * | 2008-07-11 | 2017-10-17 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder and decoder for encoding and decoding audio samples |
CN101615910B (zh) | 2009-05-31 | 2010-12-22 | 华为技术有限公司 | 压缩编码的方法、装置和设备以及压缩解码方法 |
PL2473995T3 (pl) * | 2009-10-20 | 2015-06-30 | Fraunhofer Ges Forschung | Koder sygnału audio, dekoder sygnału audio, sposób dostarczania zakodowanej reprezentacji treści audio, sposób dostarczania dekodowanej reprezentacji treści audio oraz program komputerowy do wykorzystania w zastosowaniach z małym opóźnieniem |
US8442837B2 (en) * | 2009-12-31 | 2013-05-14 | Motorola Mobility Llc | Embedded speech and audio coding using a switchable model core |
IL205394A (en) * | 2010-04-28 | 2016-09-29 | Verint Systems Ltd | A system and method for automatically identifying a speech encoding scheme |
IL295473B2 (en) | 2010-07-02 | 2023-10-01 | Dolby Int Ab | After–selective bass filter |
JP5753540B2 (ja) * | 2010-11-17 | 2015-07-22 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | ステレオ信号符号化装置、ステレオ信号復号装置、ステレオ信号符号化方法及びステレオ信号復号方法 |
RU2656681C1 (ru) * | 2012-11-13 | 2018-06-06 | Самсунг Электроникс Ко., Лтд. | Способ и устройство для определения режима кодирования, способ и устройство для кодирования аудиосигналов и способ, и устройство для декодирования аудиосигналов |
PL2951820T3 (pl) | 2013-01-29 | 2017-06-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Urządzenie i sposób wyboru jednego spośród pierwszego algorytmu kodowania i drugiego algorytmu kodowania |
CN107452391B (zh) | 2014-04-29 | 2020-08-25 | 华为技术有限公司 | 音频编码方法及相关装置 |
CN107424622B (zh) | 2014-06-24 | 2020-12-25 | 华为技术有限公司 | 音频编码方法和装置 |
EP2980795A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor |
AU2015258241B2 (en) | 2014-07-28 | 2016-09-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6134518A (en) * | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
ATE302991T1 (de) | 1998-01-22 | 2005-09-15 | Deutsche Telekom Ag | Verfahren zur signalgesteuerten schaltung zwischen verschiedenen audiokodierungssystemen |
US6633841B1 (en) * | 1999-07-29 | 2003-10-14 | Mindspeed Technologies, Inc. | Voice activity detection speech coding to accommodate music signals |
ES2269112T3 (es) | 2000-02-29 | 2007-04-01 | Qualcomm Incorporated | Codificador de voz multimodal en bucle cerrado de dominio mixto. |
WO2002023530A2 (en) * | 2000-09-11 | 2002-03-21 | Matsushita Electric Industrial Co., Ltd. | Quantization of spectral sequences for audio signal coding |
US6658383B2 (en) | 2001-06-26 | 2003-12-02 | Microsoft Corporation | Method for coding speech and music signals |
US6785645B2 (en) * | 2001-11-29 | 2004-08-31 | Microsoft Corporation | Real-time speech and music classifier |
US7613606B2 (en) | 2003-10-02 | 2009-11-03 | Nokia Corporation | Speech codecs |
-
2004
- 2004-05-17 US US10/847,651 patent/US7739120B2/en active Active
-
2005
- 2005-04-06 DE DE602005023295T patent/DE602005023295D1/de active Active
- 2005-04-06 KR KR1020087021059A patent/KR20080083719A/ko not_active Application Discontinuation
- 2005-04-06 MX MXPA06012579A patent/MXPA06012579A/es not_active Application Discontinuation
- 2005-04-06 EP EP05718394A patent/EP1747442B1/en active Active
- 2005-04-06 JP JP2007517472A patent/JP2008503783A/ja not_active Withdrawn
- 2005-04-06 CN CNB200580015656XA patent/CN100485337C/zh active Active
- 2005-04-06 CA CA002566353A patent/CA2566353A1/en not_active Abandoned
- 2005-04-06 BR BRPI0511150-1A patent/BRPI0511150A/pt not_active IP Right Cessation
- 2005-04-06 WO PCT/IB2005/000924 patent/WO2005111567A1/en active Application Filing
- 2005-04-06 RU RU2006139795/28A patent/RU2006139795A/ru not_active Application Discontinuation
- 2005-04-06 AU AU2005242993A patent/AU2005242993A1/en not_active Abandoned
- 2005-04-06 AT AT05718394T patent/ATE479885T1/de not_active IP Right Cessation
- 2005-05-12 PE PE2005000527A patent/PE20060385A1/es not_active Application Discontinuation
- 2005-05-13 TW TW094115502A patent/TW200606815A/zh unknown
-
2006
- 2006-11-15 ZA ZA200609479A patent/ZA200609479B/xx unknown
-
2008
- 2008-04-21 HK HK08104429.5A patent/HK1110111A1/xx unknown
Non-Patent Citations (4)
Title |
---|
"Source signal based rate adaptation for GSM ASR speechcodec". MAKINEN J ET AL.INFORMATION TECHNOLOG,Vol.2 . 2004 |
"Source signal based rate adaptation for GSM ASR speechcodec". MAKINEN J ET AL.INFORMATION TECHNOLOG,Vol.2 . 2004 * |
A wideband speech and audio codec at 16/24/32kbit/susing hybrid ACELP/TCX techniques. BESSETTE B ET AL.SPEECH CODEING PROCEEDINGS. 1999 |
A wideband speech and audio codec at 16/24/32kbit/susing hybrid ACELP/TCX techniques. BESSETTE B ET AL.SPEECH CODEING PROCEEDINGS. 1999 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107077858A (zh) * | 2014-07-28 | 2017-08-18 | 弗劳恩霍夫应用研究促进协会 | 使用具有全带隙填充的频域处理器以及时域处理器的音频编码器和解码器 |
CN107077858B (zh) * | 2014-07-28 | 2021-10-26 | 弗劳恩霍夫应用研究促进协会 | 使用具有全带隙填充的频域处理器以及时域处理器的音频编码器和解码器 |
Also Published As
Publication number | Publication date |
---|---|
AU2005242993A1 (en) | 2005-11-24 |
BRPI0511150A (pt) | 2007-11-27 |
ATE479885T1 (de) | 2010-09-15 |
US20050256701A1 (en) | 2005-11-17 |
HK1110111A1 (en) | 2008-07-04 |
RU2006139795A (ru) | 2008-06-27 |
CA2566353A1 (en) | 2005-11-24 |
TW200606815A (en) | 2006-02-16 |
DE602005023295D1 (de) | 2010-10-14 |
EP1747442B1 (en) | 2010-09-01 |
PE20060385A1 (es) | 2006-05-19 |
KR20080083719A (ko) | 2008-09-18 |
JP2008503783A (ja) | 2008-02-07 |
EP1747442A1 (en) | 2007-01-31 |
US7739120B2 (en) | 2010-06-15 |
MXPA06012579A (es) | 2006-12-15 |
CN101091108A (zh) | 2007-12-19 |
WO2005111567A1 (en) | 2005-11-24 |
ZA200609479B (en) | 2008-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100485337C (zh) | 用于对音频信号进行编码的编码模型的选择 | |
CN1954365B (zh) | 使用不同编码模型的音频编码 | |
CN1954367B (zh) | 支持音频编码器模式间的转换 | |
CN1954364A (zh) | 带有不同编码帧长度的音频编码 | |
CN101681627B (zh) | 使用音调规则化及非音调规则化译码的信号编码方法及设备 | |
CN101320563B (zh) | 一种背景噪声编码/解码装置、方法和通信设备 | |
CN1957399B (zh) | 语音/音频解码装置以及语音/音频解码方法 | |
FI118834B (fi) | Audiosignaalien luokittelu | |
CN101622666B (zh) | 非因果后置滤波器 | |
CN101615396A (zh) | 音频编码设备、音频解码设备及其方法 | |
CN101494055A (zh) | 用于码分多址无线***的方法和装置 | |
CN104517612B (zh) | 基于amr-nb语音信号的可变码率编码器和解码器及其编码和解码方法 | |
CN102760441B (zh) | 一种背景噪声编码/解码装置、方法和通信设备 | |
KR20070017379A (ko) | 오디오 신호를 부호화하기 위한 부호화 모델들의 선택 | |
KR20070017378A (ko) | 서로 다른 코딩 모델들을 통한 오디오 인코딩 | |
KR20070017380A (ko) | 서로 다른 코딩 프레임 길이의 오디오 인코딩 | |
ZA200609478B (en) | Audio encoding with different coding frame lengths |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1110111 Country of ref document: HK |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1110111 Country of ref document: HK |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160206 Address after: Espoo, Finland Patentee after: Technology Co., Ltd. of Nokia Address before: Espoo, Finland Patentee before: Nokia Oyj |