WO2012070866A3 - Speech signal encoding method and speech signal decoding method - Google Patents

Speech signal encoding method and speech signal decoding method

Info

Publication number
WO2012070866A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech signal
analysis frame
encoding method
signal encoding
modified input
Prior art date
Application number
PCT/KR2011/008981
Other languages
English (en)
French (fr)
Other versions
WO2012070866A2 (ko)
Inventor
정규혁
임종하
전혜정
강인규
김락용
Original Assignee
엘지전자 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엘지전자 주식회사
Priority to KR1020137013582A (KR101418227B1)
Priority to EP11842721.0A (EP2645365B1)
Priority to CN201180056646.6A (CN103229235B)
Priority to US13/989,196 (US9177562B2)
Publication of WO2012070866A2
Publication of WO2012070866A3

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • G10L19/022: Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

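The present invention relates to a speech signal encoding method and a speech signal decoding method. The speech signal encoding method according to the present invention comprises the steps of: specifying an analysis frame in an input signal; generating a modified input based on the analysis frame; applying a window to the modified input; generating transform coefficients by performing an MDCT (Modified Discrete Cosine Transform) on the windowed modified input; and encoding the transform coefficients, wherein the modified input may include the analysis frame and a self-replication of all or a part of the analysis frame.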
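The steps listed in the abstract can be sketched roughly as follows. This is a minimal illustration, not the method as claimed: the function names (`make_modified_input`, `encode_frame`, and so on) are hypothetical, the sine window is just one common choice, the modified input is assumed to be the analysis frame followed by one full copy of itself, and the final coefficient encoding (quantization and bitstream packing) is left out because the abstract does not specify it.

```python
import math


def sine_window(length):
    """Sine window of the given length (an assumed window choice)."""
    return [math.sin(math.pi * (n + 0.5) / length) for n in range(length)]


def mdct(block):
    """Direct O(N^2) MDCT: 2N input samples -> N transform coefficients."""
    two_n = len(block)
    n_half = two_n // 2
    return [
        sum(
            block[n]
            * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(two_n)
        )
        for k in range(n_half)
    ]


def make_modified_input(analysis_frame):
    """Assumption: the modified input is the frame plus a full self-replication."""
    return list(analysis_frame) + list(analysis_frame)


def encode_frame(signal, start, frame_len):
    """Return MDCT coefficients for one analysis frame (no quantization)."""
    # Step 1: specify the analysis frame within the input signal.
    analysis_frame = signal[start:start + frame_len]
    # Step 2: generate the modified input from the analysis frame.
    modified = make_modified_input(analysis_frame)
    # Step 3: apply a window to the modified input.
    window = sine_window(len(modified))
    windowed = [w * x for w, x in zip(window, modified)]
    # Step 4: MDCT of the windowed modified input.
    return mdct(windowed)


if __name__ == "__main__":
    signal = [math.sin(0.3 * n) for n in range(64)]
    coeffs = encode_frame(signal, start=16, frame_len=8)
    print(len(coeffs))
```

Because the modified input here is twice the frame length, the MDCT yields as many coefficients as there are samples in the analysis frame. Replicating only part of the frame, as the abstract also allows, would change the length of the modified input and therefore the window and transform sizes.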
PCT/KR2011/008981 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method WO2012070866A2 (ko)

Priority Applications (4)

Application Number Priority Date Filing Date Title
KR1020137013582A KR101418227B1 (ko) 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method
EP11842721.0A EP2645365B1 (en) 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method
CN201180056646.6A CN103229235B (zh) 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method
US13/989,196 US9177562B2 (en) 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US41721410P 2010-11-24 2010-11-24
US61/417,214 2010-11-24
US201161531582P 2011-09-06 2011-09-06
US61/531,582 2011-09-06

Publications (2)

Publication Number Publication Date
WO2012070866A2 WO2012070866A2 (ko) 2012-05-31
WO2012070866A3 WO2012070866A3 (ko) 2012-09-27

Family

ID=46146303

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2011/008981 WO2012070866A2 (ko) 2010-11-24 2011-11-23 Speech signal encoding method and speech signal decoding method

Country Status (5)

Country Link
US (1) US9177562B2 (ko)
EP (1) EP2645365B1 (ko)
KR (1) KR101418227B1 (ko)
CN (1) CN103229235B (ko)
WO (1) WO2012070866A2 (ko)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL294836A (en) 2013-04-05 2022-09-01 Dolby Int Ab Audio encoder and decoder
US10424305B2 (en) * 2014-12-09 2019-09-24 Dolby International Ab MDCT-domain error concealment
EP3483879A1 (en) * 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation
WO2020050665A1 (ko) * 2018-09-05 2020-03-12 엘지전자 주식회사 Method for encoding/decoding a video signal, and apparatus therefor
US20220232255A1 (en) * 2019-05-30 2022-07-21 Sharp Kabushiki Kaisha Image decoding apparatus
CN114007176B (zh) * 2020-10-09 2023-12-19 上海又为智能科技有限公司 Audio signal processing method, apparatus, and storage medium for reducing signal delay

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5848391A (en) * 1996-07-11 1998-12-08 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method subband of coding and decoding audio signals using variable length windows
US20020007273A1 (en) * 1998-03-30 2002-01-17 Juin-Hwey Chen Low-complexity, low-delay, scalable and embedded speech and audio coding with adaptive frame loss concealment
US20080065373A1 (en) * 2004-10-26 2008-03-13 Matsushita Electric Industrial Co., Ltd. Sound Encoding Device And Sound Encoding Method
US20090094038A1 (en) * 2007-09-19 2009-04-09 Qualcomm Incorporated Efficient design of mdct / imdct filterbanks for speech and audio coding applications

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0944038B1 (en) * 1995-01-17 2001-09-12 Nec Corporation Speech encoder with features extracted from current and previous frames
KR0154387B1 (ko) 1995-04-01 1998-11-16 김주용 Digital audio encoder applying a sound multiplex system
US6009386A (en) * 1997-11-28 1999-12-28 Nortel Networks Corporation Speech playback speed change using wavelet coding, preferably sub-band coding
US6330533B2 (en) * 1998-08-24 2001-12-11 Conexant Systems, Inc. Speech encoder adaptively applying pitch preprocessing with warping of target signal
US20030028386A1 (en) * 2001-04-02 2003-02-06 Zinser Richard L. Compressed domain universal transcoder
DE10129240A1 (de) * 2001-06-18 2003-01-02 Fraunhofer Ges Forschung Method and device for processing time-discrete audio sample values
US20040064308A1 (en) * 2002-09-30 2004-04-01 Intel Corporation Method and apparatus for speech packet loss recovery
EP1604352A4 (en) * 2003-03-15 2007-12-19 Mindspeed Tech Inc SINGLE NOISE DELETION MODEL
DE10321983A1 (de) * 2003-05-15 2004-12-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Device and method for embedding binary payload information in a carrier signal
US7325023B2 (en) * 2003-09-29 2008-01-29 Sony Corporation Method of making a window type decision based on MDCT data in audio encoding
DE10345996A1 (de) * 2003-10-02 2005-04-28 Fraunhofer Ges Forschung Device and method for processing at least two input values
JP4398416B2 (ja) 2005-10-07 2010-01-13 株式会社エヌ・ティ・ティ・ドコモ Modulation device, modulation method, demodulation device, and demodulation method
US8069035B2 (en) * 2005-10-14 2011-11-29 Panasonic Corporation Scalable encoding apparatus, scalable decoding apparatus, and methods of them
JP5185254B2 (ja) * 2006-04-04 2013-04-17 ドルビー ラボラトリーズ ライセンシング コーポレイション Audio signal loudness measurement and modification in the MDCT domain
US7987089B2 (en) * 2006-07-31 2011-07-26 Qualcomm Incorporated Systems and methods for modifying a zero pad region of a windowed frame of an audio signal
US20080103765A1 (en) * 2006-11-01 2008-05-01 Nokia Corporation Encoder Delay Adjustment
KR101291193B1 (ko) * 2006-11-30 2013-07-31 삼성전자주식회사 Frame error concealment method
EP2015293A1 (en) * 2007-06-14 2009-01-14 Deutsche Thomson OHG Method and apparatus for encoding and decoding an audio signal using adaptively switched temporal resolution in the spectral domain
CN101437009B (zh) * 2007-11-15 2011-02-02 华为技术有限公司 Packet loss concealment method and its ***
US8457975B2 (en) * 2009-01-28 2013-06-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio decoder, audio encoder, methods for decoding and encoding an audio signal and computer program
KR101410312B1 (ko) * 2009-07-27 2014-06-27 연세대학교 산학협력단 Audio signal processing method and apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2645365A4 *

Also Published As

Publication number Publication date
EP2645365A4 (en) 2015-01-07
WO2012070866A2 (ko) 2012-05-31
EP2645365B1 (en) 2018-01-17
US20130246054A1 (en) 2013-09-19
EP2645365A2 (en) 2013-10-02
KR20130086619A (ko) 2013-08-02
CN103229235A (zh) 2013-07-31
US9177562B2 (en) 2015-11-03
CN103229235B (zh) 2015-12-09
KR101418227B1 (ko) 2014-07-09

Similar Documents

Publication Publication Date Title
WO2012108680A3 (ko) Bandwidth extension method and apparatus
WO2008016945A3 (en) Systems and methods for modifying a window with a frame associated with an audio signal
WO2012070866A3 (ko) Speech signal encoding method and speech signal decoding method
WO2011059254A3 (en) An apparatus for processing a signal and method thereof
WO2009128667A3 (ko) Method and apparatus for encoding/decoding an audio signal using audio semantic information
WO2008016925A3 (en) Systems, methods, and apparatus for wideband encoding and decoding of active frames
MY164393A (en) Mdct-based complex prediction stereo coding
MY162251A (en) Audio signal encoder, audio signal decoder, method for providing an encoded representation of an audio content, method for providing a decoded representation of an audio content and computer program for use in low delay applications
PH12012501116A1 (en) Speech encoding device, speech decoding device, speech encoding method, speech decoding method, speech encoding program, and speech decoding program
WO2012055016A8 (en) Coding generic audio signals at low bitrates and low delay
MX2016005542A (es) Audio decoder and method for providing decoded audio information using an error concealment that modifies a time domain excitation signal.
CA2645911A1 (en) Method for encoding and decoding object-based audio signal and apparatus thereof
MY178139A (en) Audio decoder and method for providing a decoded audio information using an error concealment based on a time domain excitation signal
MY160467A (en) Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
MY169354A (en) Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
WO2008016935A3 (en) Systems, methods, and apparatus for wideband encoding and decoding of inactive frames
JP2011522472A5 (ko)
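WO2009001874A1 (ja) Audio encoding method, audio decoding method, audio encoding device, audio decoding device, program, and audio encoding/decoding system
WO2009096713A3 (ko) Method and apparatus for encoding and decoding an audio signal using adaptive LPC coefficient interpolation
MX363348B (es) Encoder, decoder, and method for encoding and decoding.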
WO2010104300A3 (en) An apparatus for processing an audio signal and method thereof
WO2009096715A3 (ko) Method and apparatus for encoding and decoding an audio signal
WO2010008185A3 (en) Method and apparatus to encode and decode an audio/speech signal
WO2010008175A3 (ko) Apparatus for encoding/decoding an integrated speech/audio signal
ATE537537T1 (de) Signal compression method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application. Ref document number: 11842721; Country of ref document: EP; Kind code of ref document: A2
WWE Wipo information: entry into national phase. Ref document number: 13989196; Country of ref document: US
NENP Non-entry into the national phase. Ref country code: DE
ENP Entry into the national phase. Ref document number: 20137013582; Country of ref document: KR; Kind code of ref document: A
WWE Wipo information: entry into national phase. Ref document number: 2011842721; Country of ref document: EP