GB2467668A - Spatial audio analysis and synthesis for binaural reproduction and format conversion - Google Patents
Spatial audio analysis and synthesis for binaural reproduction and format conversion Download PDFInfo
- Publication number
- GB2467668A GB2467668A GB1006665A GB201006665A GB2467668A GB 2467668 A GB2467668 A GB 2467668A GB 1006665 A GB1006665 A GB 1006665A GB 201006665 A GB201006665 A GB 201006665A GB 2467668 A GB2467668 A GB 2467668A
- Authority
- GB
- United Kingdom
- Prior art keywords
- format conversion
- synthesis
- spatial audio
- audio analysis
- reproduction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000006243 chemical reaction Methods 0.000 title abstract 3
- 230000015572 biosynthetic process Effects 0.000 title 1
- 238000003786 synthesis reaction Methods 0.000 title 1
- 230000005236 sound signal Effects 0.000 abstract 2
- 238000000034 method Methods 0.000 abstract 1
- 238000012732 spatial analysis Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Stereophonic System (AREA)
Abstract
A frequency-domain method for format conversion or reproduction of 2-channel or multi-channel audio signals such as recordings is described. The reproduction is based on spatial analysis of directional cues in the input audio signal and conversion of these cues into audio output signal cues for two or more channels in the frequency domain.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US97734507P | 2007-10-03 | 2007-10-03 | |
US97743207P | 2007-10-04 | 2007-10-04 | |
US10200208P | 2008-10-01 | 2008-10-01 | |
US12/243,963 US8374365B2 (en) | 2006-05-17 | 2008-10-01 | Spatial audio analysis and synthesis for binaural reproduction and format conversion |
PCT/US2008/078632 WO2009046223A2 (en) | 2007-10-03 | 2008-10-02 | Spatial audio analysis and synthesis for binaural reproduction and format conversion |
Publications (3)
Publication Number | Publication Date |
---|---|
GB201006665D0 GB201006665D0 (en) | 2010-06-09 |
GB2467668A true GB2467668A (en) | 2010-08-11 |
GB2467668B GB2467668B (en) | 2011-12-07 |
Family
ID=40526952
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB1006665A Active GB2467668B (en) | 2007-10-03 | 2008-10-02 | Spatial audio analysis and synthesis for binaural reproduction and format conversion |
Country Status (3)
Country | Link |
---|---|
CN (1) | CN101884065B (en) |
GB (1) | GB2467668B (en) |
WO (1) | WO2009046223A2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9826297B2 (en) | 2014-10-29 | 2017-11-21 | At&T Intellectual Property I, L.P. | Accessory device that provides sensor input to a media device |
Families Citing this family (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101450414B1 (en) | 2009-12-16 | 2014-10-14 | 노키아 코포레이션 | Multi-channel audio processing |
JP6013918B2 (en) | 2010-02-02 | 2016-10-25 | コーニンクレッカ フィリップス エヌ ヴェKoninklijke Philips N.V. | Spatial audio playback |
KR20120004909A (en) | 2010-07-07 | 2012-01-13 | 삼성전자주식회사 | Method and apparatus for 3d sound reproducing |
RU2573774C2 (en) | 2010-08-25 | 2016-01-27 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Device for decoding signal, comprising transient processes, using combiner and mixer |
CA2819394C (en) * | 2010-12-03 | 2016-07-05 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Sound acquisition via the extraction of geometrical information from direction of arrival estimates |
DE102012200512B4 (en) * | 2012-01-13 | 2013-11-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for calculating loudspeaker signals for a plurality of loudspeakers using a delay in the frequency domain |
EP2665208A1 (en) * | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
EP2733964A1 (en) * | 2012-11-15 | 2014-05-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Segment-wise adjustment of spatial audio signal to different playback loudspeaker setup |
EP2738962A1 (en) * | 2012-11-29 | 2014-06-04 | Thomson Licensing | Method and apparatus for determining dominant sound source directions in a higher order ambisonics representation of a sound field |
EP2743922A1 (en) * | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
WO2014177202A1 (en) * | 2013-04-30 | 2014-11-06 | Huawei Technologies Co., Ltd. | Audio signal processing apparatus |
US9384741B2 (en) * | 2013-05-29 | 2016-07-05 | Qualcomm Incorporated | Binauralization of rotated higher order ambisonics |
US9769586B2 (en) | 2013-05-29 | 2017-09-19 | Qualcomm Incorporated | Performing order reduction with respect to higher order ambisonic coefficients |
US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
US9420393B2 (en) * | 2013-05-29 | 2016-08-16 | Qualcomm Incorporated | Binaural rendering of spherical harmonic coefficients |
KR102215129B1 (en) | 2013-09-17 | 2021-02-10 | 주식회사 윌러스표준기술연구소 | Method and apparatus for processing audio signals |
CN108449704B (en) | 2013-10-22 | 2021-01-01 | 韩国电子通信研究院 | Method for generating a filter for an audio signal and parameterization device therefor |
EP2866475A1 (en) | 2013-10-23 | 2015-04-29 | Thomson Licensing | Method for and apparatus for decoding an audio soundfield representation for audio playback using 2D setups |
EP4246513A3 (en) | 2013-12-23 | 2023-12-13 | Wilus Institute of Standards and Technology Inc. | Audio signal processing method and parameterization device for same |
KR102160254B1 (en) | 2014-01-10 | 2020-09-25 | 삼성전자주식회사 | Method and apparatus for 3D sound reproducing using active downmix |
US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
US9832585B2 (en) | 2014-03-19 | 2017-11-28 | Wilus Institute Of Standards And Technology Inc. | Audio signal processing method and apparatus |
KR101856127B1 (en) | 2014-04-02 | 2018-05-09 | 주식회사 윌러스표준기술연구소 | Audio signal processing method and device |
KR102216657B1 (en) * | 2014-04-02 | 2021-02-17 | 주식회사 윌러스표준기술연구소 | A method and an apparatus for processing an audio signal |
US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
US9875745B2 (en) * | 2014-10-07 | 2018-01-23 | Qualcomm Incorporated | Normalization of ambient higher order ambisonic audio data |
EP3251116A4 (en) * | 2015-01-30 | 2018-07-25 | DTS, Inc. | System and method for capturing, encoding, distributing, and decoding immersive audio |
JP2018509864A (en) | 2015-02-12 | 2018-04-05 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Reverberation generation for headphone virtualization |
EP3121814A1 (en) * | 2015-07-24 | 2017-01-25 | Sound object techology S.A. in organization | A method and a system for decomposition of acoustic signal into sound objects, a sound object and its use |
EP3157268B1 (en) * | 2015-10-12 | 2021-06-30 | Oticon A/s | A hearing device and a hearing system configured to localize a sound source |
CN105376690A (en) * | 2015-11-04 | 2016-03-02 | 北京时代拓灵科技有限公司 | Method and device of generating virtual surround sound |
CA3011628C (en) * | 2016-01-18 | 2019-04-09 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reproduction |
US10225657B2 (en) | 2016-01-18 | 2019-03-05 | Boomcloud 360, Inc. | Subband spatial and crosstalk cancellation for audio reproduction |
NZ745422A (en) | 2016-01-19 | 2019-09-27 | Boomcloud 360 Inc | Audio enhancement for head-mounted speakers |
CN105792090B (en) * | 2016-04-27 | 2018-06-26 | 华为技术有限公司 | A kind of method and apparatus for increasing reverberation |
CN107358960B (en) * | 2016-05-10 | 2021-10-26 | 华为技术有限公司 | Coding method and coder for multi-channel signal |
KR102483042B1 (en) * | 2016-06-17 | 2022-12-29 | 디티에스, 인코포레이티드 | Distance panning using near/far rendering |
EP3852394A1 (en) | 2016-06-21 | 2021-07-21 | Dolby Laboratories Licensing Corporation | Headtracking for pre-rendered binaural audio |
MC200185B1 (en) | 2016-09-16 | 2017-10-04 | Coronal Audio | Device and method for capturing and processing a three-dimensional acoustic field |
MC200186B1 (en) | 2016-09-30 | 2017-10-18 | Coronal Encoding | Method for conversion, stereo encoding, decoding and transcoding of a three-dimensional audio signal |
CN107968984B (en) * | 2016-10-20 | 2019-08-20 | 中国科学院声学研究所 | A kind of 5-2 channel audio conversion optimization method |
CN107182003B (en) * | 2017-06-01 | 2019-09-27 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | Airborne three-dimensional call virtual auditory processing method |
US10313820B2 (en) | 2017-07-11 | 2019-06-04 | Boomcloud 360, Inc. | Sub-band spatial audio enhancement |
CN107920303B (en) * | 2017-11-21 | 2019-12-24 | 北京时代拓灵科技有限公司 | Audio acquisition method and device |
US10764704B2 (en) | 2018-03-22 | 2020-09-01 | Boomcloud 360, Inc. | Multi-channel subband spatial processing for loudspeakers |
WO2019199359A1 (en) | 2018-04-08 | 2019-10-17 | Dts, Inc. | Ambisonic depth extraction |
JP2022505964A (en) * | 2018-10-26 | 2022-01-14 | フラウンホーファー-ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | Directional volume map based audio processing |
CN111757240B (en) * | 2019-03-26 | 2021-08-20 | 瑞昱半导体股份有限公司 | Audio processing method and audio processing system |
CN111757239B (en) * | 2019-03-28 | 2021-11-19 | 瑞昱半导体股份有限公司 | Audio processing method and audio processing system |
JP2022543121A (en) * | 2019-08-08 | 2022-10-07 | ジーエヌ ヒアリング エー/エス | Bilateral hearing aid system and method for enhancing speech of one or more desired speakers |
JP2022546552A (en) * | 2019-09-03 | 2022-11-04 | ドルビー ラボラトリーズ ライセンシング コーポレイション | Audio filter bank with decorrelating component |
US10841728B1 (en) | 2019-10-10 | 2020-11-17 | Boomcloud 360, Inc. | Multi-channel crosstalk processing |
GB2598960A (en) * | 2020-09-22 | 2022-03-23 | Nokia Technologies Oy | Parametric spatial audio rendering with near-field effect |
CN114173256B (en) * | 2021-12-10 | 2024-04-19 | 中国电影科学技术研究所 | Method, device and equipment for restoring sound field space and posture tracking |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006072270A1 (en) * | 2005-01-10 | 2006-07-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Compact side information for parametric coding of spatial audio |
WO2007031896A1 (en) * | 2005-09-13 | 2007-03-22 | Koninklijke Philips Electronics N.V. | Audio coding |
WO2007096808A1 (en) * | 2006-02-21 | 2007-08-30 | Koninklijke Philips Electronics N.V. | Audio encoding and decoding |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4637725B2 (en) * | 2005-11-11 | 2011-02-23 | ソニー株式会社 | Audio signal processing apparatus, audio signal processing method, and program |
-
2008
- 2008-10-02 GB GB1006665A patent/GB2467668B/en active Active
- 2008-10-02 CN CN200880119120.6A patent/CN101884065B/en active Active
- 2008-10-02 WO PCT/US2008/078632 patent/WO2009046223A2/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006072270A1 (en) * | 2005-01-10 | 2006-07-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Compact side information for parametric coding of spatial audio |
WO2007031896A1 (en) * | 2005-09-13 | 2007-03-22 | Koninklijke Philips Electronics N.V. | Audio coding |
WO2007096808A1 (en) * | 2006-02-21 | 2007-08-30 | Koninklijke Philips Electronics N.V. | Audio encoding and decoding |
Non-Patent Citations (1)
Title |
---|
Parametric coding of spatial audio. (CHRISTOF FALLER) Proc of the 17th Int Conf DAFx'04, Napoles, Italy. October 5-8 2004 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9826297B2 (en) | 2014-10-29 | 2017-11-21 | At&T Intellectual Property I, L.P. | Accessory device that provides sensor input to a media device |
US10609462B2 (en) | 2014-10-29 | 2020-03-31 | At&T Intellectual Property I, L.P. | Accessory device that provides sensor input to a media device |
Also Published As
Publication number | Publication date |
---|---|
GB201006665D0 (en) | 2010-06-09 |
WO2009046223A3 (en) | 2009-06-11 |
GB2467668B (en) | 2011-12-07 |
CN101884065B (en) | 2013-07-10 |
CN101884065A (en) | 2010-11-10 |
WO2009046223A2 (en) | 2009-04-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2467668A (en) | Spatial audio analysis and synthesis for binaural reproduction and format conversion | |
TW200727729A (en) | Decoding of binaural audio signals | |
GB2467247A (en) | Phase-amplitude 3-D stereo encoder and decoder | |
EP2005787A4 (en) | Audio signal processing | |
ATE476732T1 (en) | CONTROLLING BINAURAL AUDIO SIGNALS DECODING | |
EP1735775B8 (en) | Method for representing multi-channel audio signals | |
EP2285139A3 (en) | Device and method for converting spatial audio signal | |
WO2011085096A3 (en) | Dj mixing headphones | |
WO2009031871A3 (en) | A method and an apparatus of decoding an audio signal | |
HK1128548A1 (en) | Apparatus and method for multi -channel parameter transformation | |
TW200701822A (en) | Multi-channel hierarchical audio coding with compact side-information | |
WO2006126855A3 (en) | Method and apparatus for decoding audio signal | |
PL2489038T3 (en) | Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter | |
WO2013111034A3 (en) | Audio rendering system and method therefor | |
SG152998A1 (en) | A multi-mode sound reproduction system and a corresponding method thereof | |
DE602006018718D1 (en) | Multichannel bass management | |
TW200721111A (en) | Audio coding | |
GB2485743A (en) | Method, system and item for selective sound cancelling | |
MY178697A (en) | Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding | |
GB2511441A (en) | Electronically orbited speaker system | |
WO2005101898A3 (en) | A method and system for sound source separation | |
MX2012012858A (en) | Method and apparatus for reproducing stereophonic sound. | |
WO2009050903A1 (en) | Audio mixing device | |
GB2443593A (en) | Apparatus and method of reproduction virtual sound of two channels | |
MX2016004750A (en) | Stereophonic sound reproduction method and apparatus. |