WO2009046223A3 - Spatial audio analysis and synthesis for binaural reproduction and format conversion - Google Patents

Spatial audio analysis and synthesis for binaural reproduction and format conversion Download PDF

Info

Publication number
WO2009046223A3
WO2009046223A3 PCT/US2008/078632 US2008078632W WO2009046223A3 WO 2009046223 A3 WO2009046223 A3 WO 2009046223A3 US 2008078632 W US2008078632 W US 2008078632W WO 2009046223 A3 WO2009046223 A3 WO 2009046223A3
Authority
WO
WIPO (PCT)
Prior art keywords
format conversion
synthesis
spatial audio
audio analysis
reproduction
Prior art date
Application number
PCT/US2008/078632
Other languages
French (fr)
Other versions
WO2009046223A2 (en
Inventor
Michael M Goodwin
Jean-Marc Jot
Mark Dolson
Original Assignee
Creative Tech Ltd
Michael M Goodwin
Jean-Marc Jot
Mark Dolson
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/243,963 external-priority patent/US8374365B2/en
Application filed by Creative Tech Ltd, Michael M Goodwin, Jean-Marc Jot, Mark Dolson filed Critical Creative Tech Ltd
Priority to GB1006665A priority Critical patent/GB2467668B/en
Priority to CN200880119120.6A priority patent/CN101884065B/en
Publication of WO2009046223A2 publication Critical patent/WO2009046223A2/en
Publication of WO2009046223A3 publication Critical patent/WO2009046223A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)

Abstract

A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.
PCT/US2008/078632 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion WO2009046223A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB1006665A GB2467668B (en) 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion
CN200880119120.6A CN101884065B (en) 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US97734507P 2007-10-03 2007-10-03
US60/977,345 2007-10-03
US97743207P 2007-10-04 2007-10-04
US60/977,432 2007-10-04
US10200208P 2008-10-01 2008-10-01
US12/243,963 2008-10-01
US61/102,002 2008-10-01
US12/243,963 US8374365B2 (en) 2006-05-17 2008-10-01 Spatial audio analysis and synthesis for binaural reproduction and format conversion

Publications (2)

Publication Number Publication Date
WO2009046223A2 WO2009046223A2 (en) 2009-04-09
WO2009046223A3 true WO2009046223A3 (en) 2009-06-11

Family

ID=40526952

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/078632 WO2009046223A2 (en) 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion

Country Status (3)

Country Link
CN (1) CN101884065B (en)
GB (1) GB2467668B (en)
WO (1) WO2009046223A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104285390B (en) * 2012-05-14 2017-06-09 杜比国际公司 The method and device that compression and decompression high-order ambisonics signal are represented

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011072729A1 (en) 2009-12-16 2011-06-23 Nokia Corporation Multi-channel audio processing
EP2532178A1 (en) 2010-02-02 2012-12-12 Koninklijke Philips Electronics N.V. Spatial sound reproduction
KR20120004909A (en) * 2010-07-07 2012-01-13 삼성전자주식회사 Method and apparatus for 3d sound reproducing
RU2573774C2 (en) 2010-08-25 2016-01-27 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Device for decoding signal, comprising transient processes, using combiner and mixer
ES2643163T3 (en) * 2010-12-03 2017-11-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and procedure for spatial audio coding based on geometry
DE102012200512B4 (en) * 2012-01-13 2013-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for calculating loudspeaker signals for a plurality of loudspeakers using a delay in the frequency domain
EP2733964A1 (en) * 2012-11-15 2014-05-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
EP2738962A1 (en) * 2012-11-29 2014-06-04 Thomson Licensing Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
EP2743922A1 (en) * 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
WO2014177202A1 (en) * 2013-04-30 2014-11-06 Huawei Technologies Co., Ltd. Audio signal processing apparatus
US9674632B2 (en) 2013-05-29 2017-06-06 Qualcomm Incorporated Filtering with binaural room impulse responses
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
US9466305B2 (en) * 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
WO2015041477A1 (en) 2013-09-17 2015-03-26 주식회사 윌러스표준기술연구소 Method and device for audio signal processing
US10204630B2 (en) 2013-10-22 2019-02-12 Electronics And Telecommunications Research Instit Ute Method for generating filter for audio signal and parameterizing device therefor
EP2866475A1 (en) 2013-10-23 2015-04-29 Thomson Licensing Method for and apparatus for decoding an audio soundfield representation for audio playback using 2D setups
BR112016014892B1 (en) 2013-12-23 2022-05-03 Gcoa Co., Ltd. Method and apparatus for audio signal processing
KR102160254B1 (en) 2014-01-10 2020-09-25 삼성전자주식회사 Method and apparatus for 3D sound reproducing using active downmix
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
EP4294055A1 (en) 2014-03-19 2023-12-20 Wilus Institute of Standards and Technology Inc. Audio signal processing method and apparatus
CN108966111B (en) 2014-04-02 2021-10-26 韦勒斯标准与技术协会公司 Audio signal processing method and device
KR102216657B1 (en) * 2014-04-02 2021-02-17 주식회사 윌러스표준기술연구소 A method and an apparatus for processing an audio signal
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US9875745B2 (en) * 2014-10-07 2018-01-23 Qualcomm Incorporated Normalization of ambient higher order ambisonic audio data
US9826297B2 (en) 2014-10-29 2017-11-21 At&T Intellectual Property I, L.P. Accessory device that provides sensor input to a media device
US9794721B2 (en) * 2015-01-30 2017-10-17 Dts, Inc. System and method for capturing, encoding, distributing, and decoding immersive audio
PL3550859T3 (en) * 2015-02-12 2022-01-10 Dolby Laboratories Licensing Corporation Headphone virtualization
EP3121814A1 (en) * 2015-07-24 2017-01-25 Sound object techology S.A. in organization A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use
EP3157268B1 (en) * 2015-10-12 2021-06-30 Oticon A/s A hearing device and a hearing system configured to localize a sound source
CN105376690A (en) * 2015-11-04 2016-03-02 北京时代拓灵科技有限公司 Method and device of generating virtual surround sound
US10225657B2 (en) 2016-01-18 2019-03-05 Boomcloud 360, Inc. Subband spatial and crosstalk cancellation for audio reproduction
WO2017127271A1 (en) * 2016-01-18 2017-07-27 Boomcloud 360, Inc. Subband spatial and crosstalk cancellation for audio reproduction
CA3011694C (en) 2016-01-19 2019-04-02 Boomcloud 360, Inc. Audio enhancement for head-mounted speakers
CN105792090B (en) * 2016-04-27 2018-06-26 华为技术有限公司 A kind of method and apparatus for increasing reverberation
CN107358960B (en) * 2016-05-10 2021-10-26 华为技术有限公司 Coding method and coder for multi-channel signal
US10231073B2 (en) 2016-06-17 2019-03-12 Dts, Inc. Ambisonic audio rendering with depth decoding
CN112954582A (en) 2016-06-21 2021-06-11 杜比实验室特许公司 Head tracking for pre-rendered binaural audio
MC200185B1 (en) 2016-09-16 2017-10-04 Coronal Audio Device and method for capturing and processing a three-dimensional acoustic field
MC200186B1 (en) 2016-09-30 2017-10-18 Coronal Encoding Method for conversion, stereo encoding, decoding and transcoding of a three-dimensional audio signal
CN107968984B (en) * 2016-10-20 2019-08-20 中国科学院声学研究所 A kind of 5-2 channel audio conversion optimization method
CN107182003B (en) * 2017-06-01 2019-09-27 西南电子技术研究所(中国电子科技集团公司第十研究所) Airborne three-dimensional call virtual auditory processing method
US10313820B2 (en) 2017-07-11 2019-06-04 Boomcloud 360, Inc. Sub-band spatial audio enhancement
CN107920303B (en) * 2017-11-21 2019-12-24 北京时代拓灵科技有限公司 Audio acquisition method and device
US10764704B2 (en) 2018-03-22 2020-09-01 Boomcloud 360, Inc. Multi-channel subband spatial processing for loudspeakers
EP3777244A4 (en) 2018-04-08 2021-12-08 DTS, Inc. Ambisonic depth extraction
WO2020084170A1 (en) * 2018-10-26 2020-04-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Directional loudness map based audio processing
CN111757240B (en) * 2019-03-26 2021-08-20 瑞昱半导体股份有限公司 Audio processing method and audio processing system
CN111757239B (en) * 2019-03-28 2021-11-19 瑞昱半导体股份有限公司 Audio processing method and audio processing system
EP4011094A1 (en) * 2019-08-08 2022-06-15 GN Hearing A/S A bilateral hearing aid system and method of enhancing speech of one or more desired speakers
EP4026123A4 (en) * 2019-09-03 2023-09-27 Dolby Laboratories Licensing Corporation Audio filterbank with decorrelating components
US10841728B1 (en) 2019-10-10 2020-11-17 Boomcloud 360, Inc. Multi-channel crosstalk processing
GB2598960A (en) * 2020-09-22 2022-03-23 Nokia Technologies Oy Parametric spatial audio rendering with near-field effect
CN114173256B (en) * 2021-12-10 2024-04-19 中国电影科学技术研究所 Method, device and equipment for restoring sound field space and posture tracking

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006072270A1 (en) * 2005-01-10 2006-07-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Compact side information for parametric coding of spatial audio
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4637725B2 (en) * 2005-11-11 2011-02-23 ソニー株式会社 Audio signal processing apparatus, audio signal processing method, and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006072270A1 (en) * 2005-01-10 2006-07-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Compact side information for parametric coding of spatial audio
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FALLER C.: "Proc. of the 7th Int. Conf. DAFx'04, Napoles, Italy, October 5-8, 2004", article "Parametric coding of spatial audio" *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104285390B (en) * 2012-05-14 2017-06-09 杜比国际公司 The method and device that compression and decompression high-order ambisonics signal are represented

Also Published As

Publication number Publication date
GB2467668B (en) 2011-12-07
CN101884065A (en) 2010-11-10
CN101884065B (en) 2013-07-10
GB201006665D0 (en) 2010-06-09
GB2467668A (en) 2010-08-11
WO2009046223A2 (en) 2009-04-09

Similar Documents

Publication Publication Date Title
WO2009046223A3 (en) Spatial audio analysis and synthesis for binaural reproduction and format conversion
TW200727729A (en) Decoding of binaural audio signals
EP1735775B8 (en) Method for representing multi-channel audio signals
WO2009046460A3 (en) Phase-amplitude 3-d stereo encoder and decoder
ATE543343T1 (en) SOUND SIGNAL PROCESSING
ATE476732T1 (en) CONTROLLING BINAURAL AUDIO SIGNALS DECODING
WO2011085096A3 (en) Dj mixing headphones
EP2285139A3 (en) Device and method for converting spatial audio signal
WO2009031871A3 (en) A method and an apparatus of decoding an audio signal
PL2491551T3 (en) Apparatus for providing an upmix signal representation on the basis of a downmix signal representation, apparatus for providing a bitstream representing a multichannel audio signal, methods, computer program and bitstream using a distortion control signaling
WO2012033942A3 (en) Dynamic compensation of audio signals for improved perceived spectral imbalances
WO2013111034A3 (en) Audio rendering system and method therefor
WO2009044357A3 (en) Low frequency management for multichannel sound reproduction systems
SG171604A1 (en) A multi-mode sound reproduction system and a corresponding method thereof
TW200701822A (en) Multi-channel hierarchical audio coding with compact side-information
PL2489038T3 (en) Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
WO2006126844A8 (en) Method and apparatus for decoding an audio signal
WO2011020992A3 (en) Method, system and item for selective sound cancelling
WO2009142465A3 (en) A method and an apparatus for processing a signal
HK1128548A1 (en) Apparatus and method for multi -channel parameter transformation
WO2010038075A3 (en) Apparatus and method for reproducing a sound field with a loudspeaker array controlled via a control volume
DE602006018718D1 (en) Multichannel bass management
WO2005101898A3 (en) A method and system for sound source separation
WO2011139090A3 (en) Method and apparatus for reproducing stereophonic sound
WO2009075085A1 (en) Sound collecting device, sound collecting method, sound collecting program, and integrated circuit

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880119120.6

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08836621

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 1006665

Country of ref document: GB

Kind code of ref document: A

Free format text: PCT FILING DATE = 20081002

WWE Wipo information: entry into national phase

Ref document number: 1006665.2

Country of ref document: GB

122 Ep: pct application non-entry in european phase

Ref document number: 08836621

Country of ref document: EP

Kind code of ref document: A2