CN105229732B - 包括音频对象的音频场景的高效编码 - Google Patents

包括音频对象的音频场景的高效编码 Download PDF

Info

Publication number
CN105229732B
CN105229732B CN201480029540.0A CN201480029540A CN105229732B CN 105229732 B CN105229732 B CN 105229732B CN 201480029540 A CN201480029540 A CN 201480029540A CN 105229732 B CN105229732 B CN 105229732B
Authority
CN
China
Prior art keywords
audio object
mixed signal
audio
metadata
under
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480029540.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN105229732A (zh
Inventor
H·普恩哈根
K·克约尔林
T·赫冯恩
L·维勒莫斯
D·J·布瑞巴特
L·J·萨米尔森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby International AB
Original Assignee
Dolby International AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International AB filed Critical Dolby International AB
Publication of CN105229732A publication Critical patent/CN105229732A/zh
Application granted granted Critical
Publication of CN105229732B publication Critical patent/CN105229732B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN201480029540.0A 2013-05-24 2014-05-23 包括音频对象的音频场景的高效编码 Active CN105229732B (zh)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US201361827246P 2013-05-24 2013-05-24
US61/827,246 2013-05-24
US201361893770P 2013-10-21 2013-10-21
US61/893,770 2013-10-21
US201461973623P 2014-04-01 2014-04-01
US61/973,623 2014-04-01
PCT/EP2014/060733 WO2014187990A1 (en) 2013-05-24 2014-05-23 Efficient coding of audio scenes comprising audio objects

Publications (2)

Publication Number Publication Date
CN105229732A CN105229732A (zh) 2016-01-06
CN105229732B true CN105229732B (zh) 2018-09-04

Family

ID=50943284

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480029540.0A Active CN105229732B (zh) 2013-05-24 2014-05-23 包括音频对象的音频场景的高效编码

Country Status (10)

Country Link
US (1) US9892737B2 (pt)
EP (1) EP3005356B1 (pt)
JP (1) JP6190947B2 (pt)
KR (1) KR101760248B1 (pt)
CN (1) CN105229732B (pt)
BR (2) BR112015029129B1 (pt)
ES (1) ES2640815T3 (pt)
HK (1) HK1213685A1 (pt)
RU (1) RU2630754C2 (pt)
WO (1) WO2014187990A1 (pt)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3203469A4 (en) * 2014-09-30 2018-06-27 Sony Corporation Transmitting device, transmission method, receiving device, and receiving method
CA2963771A1 (en) * 2014-10-16 2016-04-21 Sony Corporation Transmission device, transmission method, reception device, and reception method
US10475463B2 (en) * 2015-02-10 2019-11-12 Sony Corporation Transmission device, transmission method, reception device, and reception method for audio streams
CN111586533B (zh) * 2015-04-08 2023-01-03 杜比实验室特许公司 音频内容的呈现
JPWO2016194563A1 (ja) 2015-06-02 2018-03-22 ソニー株式会社 送信装置、送信方法、メディア処理装置、メディア処理方法および受信装置
WO2017027308A1 (en) * 2015-08-07 2017-02-16 Dolby Laboratories Licensing Corporation Processing object-based audio signals
US10278000B2 (en) 2015-12-14 2019-04-30 Dolby Laboratories Licensing Corporation Audio object clustering with single channel quality preservation
US10779106B2 (en) 2016-07-20 2020-09-15 Dolby Laboratories Licensing Corporation Audio object clustering based on renderer-aware perceptual difference
EP4054213A1 (en) 2017-03-06 2022-09-07 Dolby International AB Rendering in dependence on the number of loudspeaker channels
WO2019069710A1 (ja) * 2017-10-05 2019-04-11 ソニー株式会社 符号化装置および方法、復号装置および方法、並びにプログラム
CN108733342B (zh) * 2018-05-22 2021-03-26 Oppo(重庆)智能科技有限公司 音量调节方法、移动终端及计算机可读存储介质
JP7504091B2 (ja) 2018-11-02 2024-06-21 ドルビー・インターナショナル・アーベー オーディオ・エンコーダおよびオーディオ・デコーダ
BR112021009306A2 (pt) * 2018-11-20 2021-08-10 Sony Group Corporation dispositivo e método de processamento de informações, e, programa.
CN113490980A (zh) * 2019-01-21 2021-10-08 弗劳恩霍夫应用研究促进协会 用于编码空间音频表示的装置和方法以及用于使用传输元数据来解码经编码的音频信号的装置和方法,以及相关的计算机程序
JPWO2021140959A1 (pt) * 2020-01-10 2021-07-15
US20240187807A1 (en) * 2021-02-20 2024-06-06 Dolby Laboratories Licensing Corporation Clustering audio objects

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101490744A (zh) * 2006-11-24 2009-07-22 Lg电子株式会社 用于编码和解码基于对象的音频信号的方法和装置
CN101517637A (zh) * 2006-09-18 2009-08-26 皇家飞利浦电子股份有限公司 音频对象的编码与解码
CN101529501A (zh) * 2006-10-16 2009-09-09 杜比瑞典公司 多声道下混对象编码的增强编码和参数表示
CN102576532A (zh) * 2009-04-28 2012-07-11 弗兰霍菲尔运输应用研究公司 用以基于下混信号表示型态针对上混信号表示型态的供应来提供一个或多个经调整参数的装置、音频信号译码器、音频信号转码器、音频信号编码器、音频位串流、使用对象相关参数信息的方法与计算机程序

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7567675B2 (en) 2002-06-21 2009-07-28 Audyssey Laboratories, Inc. System and method for automatic multiple listener room acoustic correction with low filter orders
DE10344638A1 (de) 2003-08-04 2005-03-10 Fraunhofer Ges Forschung Vorrichtung und Verfahren zum Erzeugen, Speichern oder Bearbeiten einer Audiodarstellung einer Audioszene
FR2862799B1 (fr) * 2003-11-26 2006-02-24 Inst Nat Rech Inf Automat Dispositif et methode perfectionnes de spatialisation du son
US7394903B2 (en) 2004-01-20 2008-07-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal
WO2005098824A1 (en) 2004-04-05 2005-10-20 Koninklijke Philips Electronics N.V. Multi-channel encoder
GB2415639B (en) 2004-06-29 2008-09-17 Sony Comp Entertainment Europe Control of data processing
DE602006015294D1 (de) 2005-03-30 2010-08-19 Dolby Int Ab Mehrkanal-audiocodierung
WO2007055463A1 (en) * 2005-08-30 2007-05-18 Lg Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
JP5154538B2 (ja) 2006-03-29 2013-02-27 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ オーディオ復号
US8379868B2 (en) 2006-05-17 2013-02-19 Creative Technology Ltd Spatial audio coding based on universal spatial cues
RU2407072C1 (ru) * 2006-09-29 2010-12-20 ЭлДжи ЭЛЕКТРОНИКС ИНК. Способы и устройства кодирования и декодирования объектно-ориентированных аудиосигналов
KR20090013178A (ko) 2006-09-29 2009-02-04 엘지전자 주식회사 오브젝트 기반 오디오 신호를 인코딩 및 디코딩하는 방법 및 장치
US8620465B2 (en) 2006-10-13 2013-12-31 Auro Technologies Method and encoder for combining digital data sets, a decoding method and decoder for such combined digital data sets and a record carrier for storing such combined digital data set
RU2431940C2 (ru) 2006-10-16 2011-10-20 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. Аппаратура и метод многоканального параметрического преобразования
KR101055739B1 (ko) 2006-11-24 2011-08-11 엘지전자 주식회사 오브젝트 기반 오디오 신호의 부호화 및 복호화 방법과 그 장치
US8290167B2 (en) 2007-03-21 2012-10-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and apparatus for conversion between multi-channel audio formats
EP2076900A1 (en) 2007-10-17 2009-07-08 Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. Audio coding using upmix
CA2710562C (en) 2008-01-01 2014-07-22 Lg Electronics Inc. A method and an apparatus for processing an audio signal
KR101461685B1 (ko) 2008-03-31 2014-11-19 한국전자통신연구원 다객체 오디오 신호의 부가정보 비트스트림 생성 방법 및 장치
US8311810B2 (en) 2008-07-29 2012-11-13 Panasonic Corporation Reduced delay spatial coding and decoding apparatus and teleconferencing system
EP2214161A1 (en) 2009-01-28 2010-08-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method and computer program for upmixing a downmix audio signal
US9105264B2 (en) 2009-07-31 2015-08-11 Panasonic Intellectual Property Management Co., Ltd. Coding apparatus and decoding apparatus
KR101805212B1 (ko) 2009-08-14 2017-12-05 디티에스 엘엘씨 객체-지향 오디오 스트리밍 시스템
US9432790B2 (en) 2009-10-05 2016-08-30 Microsoft Technology Licensing, Llc Real-time sound propagation for dynamic sources
WO2011048067A1 (en) 2009-10-20 2011-04-28 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E. V. Apparatus for providing an upmix signal representation on the basis of a downmix signal representation, apparatus for providing a bitstream representing a multichannel audio signal, methods, computer program and bitstream using a distortion control signaling
AU2010321013B2 (en) 2009-11-20 2014-05-29 Dolby International Ab Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
TWI444989B (zh) 2010-01-22 2014-07-11 Dolby Lab Licensing Corp 針對改良多通道上混使用多通道解相關之技術
DK2556502T3 (en) 2010-04-09 2019-03-04 Dolby Int Ab MDCT-BASED COMPLEX PREVIEW Stereo Decoding
GB2485979A (en) 2010-11-26 2012-06-06 Univ Surrey Spatial audio coding
JP2012151663A (ja) 2011-01-19 2012-08-09 Toshiba Corp 立体音響生成装置及び立体音響生成方法
US9026450B2 (en) * 2011-03-09 2015-05-05 Dts Llc System for dynamically creating and rendering audio objects
WO2013142657A1 (en) 2012-03-23 2013-09-26 Dolby Laboratories Licensing Corporation System and method of speaker cluster design and rendering
US9516446B2 (en) 2012-07-20 2016-12-06 Qualcomm Incorporated Scalable downmix design for object-based surround codec with cluster analysis by synthesis
US9761229B2 (en) * 2012-07-20 2017-09-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for audio object clustering
CN104520924B (zh) 2012-08-07 2017-06-23 杜比实验室特许公司 指示游戏音频内容的基于对象的音频的编码和呈现
CN104885151B (zh) 2012-12-21 2017-12-22 杜比实验室特许公司 用于基于感知准则呈现基于对象的音频内容的对象群集
WO2014161993A1 (en) 2013-04-05 2014-10-09 Dolby International Ab Stereo audio encoder and decoder
EP3270375B1 (en) 2013-05-24 2020-01-15 Dolby International AB Reconstruction of audio scenes from a downmix
CA2990261C (en) 2013-05-24 2020-06-16 Dolby International Ab Audio encoder and decoder
CN105247611B (zh) 2013-05-24 2019-02-15 杜比国际公司 对音频场景的编码

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101517637A (zh) * 2006-09-18 2009-08-26 皇家飞利浦电子股份有限公司 音频对象的编码与解码
CN101529501A (zh) * 2006-10-16 2009-09-09 杜比瑞典公司 多声道下混对象编码的增强编码和参数表示
CN101490744A (zh) * 2006-11-24 2009-07-22 Lg电子株式会社 用于编码和解码基于对象的音频信号的方法和装置
CN102576532A (zh) * 2009-04-28 2012-07-11 弗兰霍菲尔运输应用研究公司 用以基于下混信号表示型态针对上混信号表示型态的供应来提供一个或多个经调整参数的装置、音频信号译码器、音频信号转码器、音频信号编码器、音频位串流、使用对象相关参数信息的方法与计算机程序

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
《Perceptual Audio Rendering of Complex Virtual Environments》;Nicolas Tsingos et al.;《ACM Transactions on Graphics(TOG)》;20040831;第23卷(第3期);第249-258页 *
《Spatial Audio Object Coding(SAOC)-The Upcoming MPEG Standard on Parametric Object Based Audio Coding》;Jonas Engdegard et al.;《AES 124th Convention》;20080520;第1-15页 *

Also Published As

Publication number Publication date
BR112015029129B1 (pt) 2022-05-31
WO2014187990A1 (en) 2014-11-27
BR122020017144B1 (pt) 2022-05-03
EP3005356B1 (en) 2017-08-09
US9892737B2 (en) 2018-02-13
BR112015029129A2 (pt) 2017-07-25
KR20160003058A (ko) 2016-01-08
US20160125887A1 (en) 2016-05-05
RU2630754C2 (ru) 2017-09-12
JP6190947B2 (ja) 2017-08-30
JP2016522911A (ja) 2016-08-04
ES2640815T3 (es) 2017-11-06
EP3005356A1 (en) 2016-04-13
RU2015150055A (ru) 2017-05-26
KR101760248B1 (ko) 2017-07-21
HK1213685A1 (zh) 2016-07-08
CN105229732A (zh) 2016-01-06

Similar Documents

Publication Publication Date Title
CN105229732B (zh) 包括音频对象的音频场景的高效编码
CN105229733B (zh) 包括音频对象的音频场景的高效编码
EP3127109B1 (en) Efficient coding of audio scenes comprising audio objects
CN105981411A (zh) 用于高声道计数的多声道音频的基于多元组的矩阵混合

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1213685

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant