GB2467668A - Spatial audio analysis and synthesis for binaural reproduction and format conversion - Google Patents

Spatial audio analysis and synthesis for binaural reproduction and format conversion Download PDF

Info

Publication number
GB2467668A
GB2467668A GB1006665A GB201006665A GB2467668A GB 2467668 A GB2467668 A GB 2467668A GB 1006665 A GB1006665 A GB 1006665A GB 201006665 A GB201006665 A GB 201006665A GB 2467668 A GB2467668 A GB 2467668A
Authority
GB
United Kingdom
Prior art keywords
format conversion
synthesis
spatial audio
audio analysis
reproduction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB1006665A
Other versions
GB201006665D0 (en
GB2467668B (en
Inventor
Michael M Goodwin
Jean-Marc Jot
Mark Dolson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Creative Technology Ltd
Original Assignee
Creative Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/243,963 external-priority patent/US8374365B2/en
Application filed by Creative Technology Ltd filed Critical Creative Technology Ltd
Publication of GB201006665D0 publication Critical patent/GB201006665D0/en
Publication of GB2467668A publication Critical patent/GB2467668A/en
Application granted granted Critical
Publication of GB2467668B publication Critical patent/GB2467668B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)

Abstract

A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.
GB1006665A 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion Active GB2467668B (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US97734507P 2007-10-03 2007-10-03
US97743207P 2007-10-04 2007-10-04
US10200208P 2008-10-01 2008-10-01
US12/243,963 US8374365B2 (en) 2006-05-17 2008-10-01 Spatial audio analysis and synthesis for binaural reproduction and format conversion
PCT/US2008/078632 WO2009046223A2 (en) 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion

Publications (3)

Publication Number Publication Date
GB201006665D0 GB201006665D0 (en) 2010-06-09
GB2467668A true GB2467668A (en) 2010-08-11
GB2467668B GB2467668B (en) 2011-12-07

Family

ID=40526952

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1006665A Active GB2467668B (en) 2007-10-03 2008-10-02 Spatial audio analysis and synthesis for binaural reproduction and format conversion

Country Status (3)

Country Link
CN (1) CN101884065B (en)
GB (1) GB2467668B (en)
WO (1) WO2009046223A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9826297B2 (en) 2014-10-29 2017-11-21 At&T Intellectual Property I, L.P. Accessory device that provides sensor input to a media device

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101450414B1 (en) 2009-12-16 2014-10-14 노키아 코포레이션 Multi-channel audio processing
JP6013918B2 (en) 2010-02-02 2016-10-25 コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. Spatial audio playback
KR20120004909A (en) 2010-07-07 2012-01-13 삼성전자주식회사 Method and apparatus for 3d sound reproducing
RU2573774C2 (en) 2010-08-25 2016-01-27 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Device for decoding signal, comprising transient processes, using combiner and mixer
CA2819394C (en) * 2010-12-03 2016-07-05 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Sound acquisition via the extraction of geometrical information from direction of arrival estimates
DE102012200512B4 (en) * 2012-01-13 2013-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for calculating loudspeaker signals for a plurality of loudspeakers using a delay in the frequency domain
EP2665208A1 (en) * 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
EP2733964A1 (en) * 2012-11-15 2014-05-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup
EP2738962A1 (en) * 2012-11-29 2014-06-04 Thomson Licensing Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field
EP2743922A1 (en) * 2012-12-12 2014-06-18 Thomson Licensing Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field
WO2014177202A1 (en) * 2013-04-30 2014-11-06 Huawei Technologies Co., Ltd. Audio signal processing apparatus
US9384741B2 (en) * 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9420393B2 (en) * 2013-05-29 2016-08-16 Qualcomm Incorporated Binaural rendering of spherical harmonic coefficients
KR102215129B1 (en) 2013-09-17 2021-02-10 주식회사 윌러스표준기술연구소 Method and apparatus for processing audio signals
CN108449704B (en) 2013-10-22 2021-01-01 韩国电子通信研究院 Method for generating a filter for an audio signal and parameterization device therefor
EP2866475A1 (en) 2013-10-23 2015-04-29 Thomson Licensing Method for and apparatus for decoding an audio soundfield representation for audio playback using 2D setups
EP4246513A3 (en) 2013-12-23 2023-12-13 Wilus Institute of Standards and Technology Inc. Audio signal processing method and parameterization device for same
KR102160254B1 (en) 2014-01-10 2020-09-25 삼성전자주식회사 Method and apparatus for 3D sound reproducing using active downmix
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9832585B2 (en) 2014-03-19 2017-11-28 Wilus Institute Of Standards And Technology Inc. Audio signal processing method and apparatus
KR101856127B1 (en) 2014-04-02 2018-05-09 주식회사 윌러스표준기술연구소 Audio signal processing method and device
KR102216657B1 (en) * 2014-04-02 2021-02-17 주식회사 윌러스표준기술연구소 A method and an apparatus for processing an audio signal
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US9875745B2 (en) * 2014-10-07 2018-01-23 Qualcomm Incorporated Normalization of ambient higher order ambisonic audio data
EP3251116A4 (en) * 2015-01-30 2018-07-25 DTS, Inc. System and method for capturing, encoding, distributing, and decoding immersive audio
JP2018509864A (en) 2015-02-12 2018-04-05 ドルビー ラボラトリーズ ライセンシング コーポレイション Reverberation generation for headphone virtualization
EP3121814A1 (en) * 2015-07-24 2017-01-25 Sound object techology S.A. in organization A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use
EP3157268B1 (en) * 2015-10-12 2021-06-30 Oticon A/s A hearing device and a hearing system configured to localize a sound source
CN105376690A (en) * 2015-11-04 2016-03-02 北京时代拓灵科技有限公司 Method and device of generating virtual surround sound
CA3011628C (en) * 2016-01-18 2019-04-09 Boomcloud 360, Inc. Subband spatial and crosstalk cancellation for audio reproduction
US10225657B2 (en) 2016-01-18 2019-03-05 Boomcloud 360, Inc. Subband spatial and crosstalk cancellation for audio reproduction
NZ745422A (en) 2016-01-19 2019-09-27 Boomcloud 360 Inc Audio enhancement for head-mounted speakers
CN105792090B (en) * 2016-04-27 2018-06-26 华为技术有限公司 A kind of method and apparatus for increasing reverberation
CN107358960B (en) * 2016-05-10 2021-10-26 华为技术有限公司 Coding method and coder for multi-channel signal
KR102483042B1 (en) * 2016-06-17 2022-12-29 디티에스, 인코포레이티드 Distance panning using near/far rendering
EP3852394A1 (en) 2016-06-21 2021-07-21 Dolby Laboratories Licensing Corporation Headtracking for pre-rendered binaural audio
MC200185B1 (en) 2016-09-16 2017-10-04 Coronal Audio Device and method for capturing and processing a three-dimensional acoustic field
MC200186B1 (en) 2016-09-30 2017-10-18 Coronal Encoding Method for conversion, stereo encoding, decoding and transcoding of a three-dimensional audio signal
CN107968984B (en) * 2016-10-20 2019-08-20 中国科学院声学研究所 A kind of 5-2 channel audio conversion optimization method
CN107182003B (en) * 2017-06-01 2019-09-27 西南电子技术研究所(中国电子科技集团公司第十研究所) Airborne three-dimensional call virtual auditory processing method
US10313820B2 (en) 2017-07-11 2019-06-04 Boomcloud 360, Inc. Sub-band spatial audio enhancement
CN107920303B (en) * 2017-11-21 2019-12-24 北京时代拓灵科技有限公司 Audio acquisition method and device
US10764704B2 (en) 2018-03-22 2020-09-01 Boomcloud 360, Inc. Multi-channel subband spatial processing for loudspeakers
WO2019199359A1 (en) 2018-04-08 2019-10-17 Dts, Inc. Ambisonic depth extraction
JP2022505964A (en) * 2018-10-26 2022-01-14 フラウンホーファー-ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Directional volume map based audio processing
CN111757240B (en) * 2019-03-26 2021-08-20 瑞昱半导体股份有限公司 Audio processing method and audio processing system
CN111757239B (en) * 2019-03-28 2021-11-19 瑞昱半导体股份有限公司 Audio processing method and audio processing system
JP2022543121A (en) * 2019-08-08 2022-10-07 ジーエヌ ヒアリング エー/エス Bilateral hearing aid system and method for enhancing speech of one or more desired speakers
JP2022546552A (en) * 2019-09-03 2022-11-04 ドルビー ラボラトリーズ ライセンシング コーポレイション Audio filter bank with decorrelating component
US10841728B1 (en) 2019-10-10 2020-11-17 Boomcloud 360, Inc. Multi-channel crosstalk processing
GB2598960A (en) * 2020-09-22 2022-03-23 Nokia Technologies Oy Parametric spatial audio rendering with near-field effect
CN114173256B (en) * 2021-12-10 2024-04-19 中国电影科学技术研究所 Method, device and equipment for restoring sound field space and posture tracking

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006072270A1 (en) * 2005-01-10 2006-07-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Compact side information for parametric coding of spatial audio
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4637725B2 (en) * 2005-11-11 2011-02-23 ソニー株式会社 Audio signal processing apparatus, audio signal processing method, and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006072270A1 (en) * 2005-01-10 2006-07-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Compact side information for parametric coding of spatial audio
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Parametric coding of spatial audio. (CHRISTOF FALLER) Proc of the 17th Int Conf DAFx'04, Napoles, Italy. October 5-8 2004 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9826297B2 (en) 2014-10-29 2017-11-21 At&T Intellectual Property I, L.P. Accessory device that provides sensor input to a media device
US10609462B2 (en) 2014-10-29 2020-03-31 At&T Intellectual Property I, L.P. Accessory device that provides sensor input to a media device

Also Published As

Publication number Publication date
GB201006665D0 (en) 2010-06-09
WO2009046223A3 (en) 2009-06-11
GB2467668B (en) 2011-12-07
CN101884065B (en) 2013-07-10
CN101884065A (en) 2010-11-10
WO2009046223A2 (en) 2009-04-09

Similar Documents

Publication Publication Date Title
GB2467668A (en) Spatial audio analysis and synthesis for binaural reproduction and format conversion
TW200727729A (en) Decoding of binaural audio signals
GB2467247A (en) Phase-amplitude 3-D stereo encoder and decoder
EP2005787A4 (en) Audio signal processing
ATE476732T1 (en) CONTROLLING BINAURAL AUDIO SIGNALS DECODING
EP1735775B8 (en) Method for representing multi-channel audio signals
EP2285139A3 (en) Device and method for converting spatial audio signal
WO2011085096A3 (en) Dj mixing headphones
WO2009031871A3 (en) A method and an apparatus of decoding an audio signal
HK1128548A1 (en) Apparatus and method for multi -channel parameter transformation
TW200701822A (en) Multi-channel hierarchical audio coding with compact side-information
WO2006126855A3 (en) Method and apparatus for decoding audio signal
PL2489038T3 (en) Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
WO2013111034A3 (en) Audio rendering system and method therefor
SG152998A1 (en) A multi-mode sound reproduction system and a corresponding method thereof
DE602006018718D1 (en) Multichannel bass management
TW200721111A (en) Audio coding
GB2485743A (en) Method, system and item for selective sound cancelling
MY178697A (en) Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
GB2511441A (en) Electronically orbited speaker system
WO2005101898A3 (en) A method and system for sound source separation
MX2012012858A (en) Method and apparatus for reproducing stereophonic sound.
WO2009050903A1 (en) Audio mixing device
GB2443593A (en) Apparatus and method of reproduction virtual sound of two channels
MX2016004750A (en) Stereophonic sound reproduction method and apparatus.